For: Gentleman R, Huber W. Making the most of high-throughput protein-interaction data. Genome Biol 2008;8:112. [PMID: 18001486 PMCID: PMC2246275 DOI: 10.1186/gb-2007-8-10-112] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Indexed: 01/27/2023]

Cited by Other Article(s)

Fakhar AZ, Liu J, Pajerowska-Mukhtar KM. Dynamic Enrichment for Evaluation of Protein Networks (DEEPN): A High Throughput Yeast Two-Hybrid (Y2H) Protocol to Evaluate Networks. Methods Mol Biol 2023;2690:179-192. [PMID: 37450148 DOI: 10.1007/978-1-0716-3327-4_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/18/2023]

Paul M, Anand A. A New Family of Similarity Measures for Scoring Confidence of Protein Interactions Using Gene Ontology. IEEE/ACM Trans Comput Biol Bioinform 2022;19:19-30. [PMID: 34029194 DOI: 10.1109/tcbb.2021.3083150] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 06/12/2023]

Massoud TF, Paulmurugan R. Molecular Imaging of Protein–Protein Interactions and Protein Folding. Mol Imaging 2021. [DOI: 10.1016/b978-0-12-816386-3.00071-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2022]

Paul M, Anand A. Impact of low-confidence interactions on computational identification of protein complexes. J Bioinform Comput Biol 2020;18:2050025. [PMID: 32757809 DOI: 10.1142/s0219720020500250] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/18/2022]

Kutzera J, Smilde AK, Wilderjans TF, Hoefsloot HCJ. Towards a Hierarchical Strategy to Explore Multi-Scale IP/MS Data for Protein Complexes. PLoS One 2015;10:e0139704. [PMID: 26448546 PMCID: PMC4598013 DOI: 10.1371/journal.pone.0139704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/30/2015] [Accepted: 09/16/2015] [Indexed: 11/24/2022]

Abstract

Protein interaction in cells can be described at different levels. At a low interaction level, proteins function together in small, stable complexes and at a higher level, in sets of interacting complexes. All interaction levels are crucial for the living organism, and one of the challenges in proteomics is to measure the proteins at their different interaction levels. One common method for such measurements is immunoprecipitation followed by mass spectrometry (IP/MS), which has the potential to probe the different protein interaction forms. However, IP/MS data are complex because proteins, in their diverse interaction forms, manifest themselves in different ways in the data. Numerous bioinformatic tools for finding protein complexes in IP/MS data are currently available, but most tools do not provide information about the interaction level of the discovered complexes, and no tool is geared specifically to unraveling and visualizing these different levels. We present a new bioinformatic tool to explore IP/MS datasets for protein complexes at different interaction levels and show its performance on several real-life datasets. Our tool creates clusters that represent protein complexes, but unlike previous methods, it arranges them in a tree-shaped structure, reporting why specific proteins are predicted to build a complex and where it can be divided into smaller complexes. In every data analysis method, parameters have to be chosen. Our method can suggest values for its parameters and comes with adapted visualization tools that display the effect of the parameters on the result. The tools provide fast graphical feedback and allow the user to interact with the data by changing the parameters and examining the result. The tools also allow for exploring the different organizational levels of the protein complexes in a given dataset. Our method is available as GNU-R source code and includes examples at www.bdagroup.nl.

Pan A, Lahiri C, Rajendiran A, Shanmugham B. Computational analysis of protein interaction networks for infectious diseases. Brief Bioinform 2015;17:517-26. [PMID: 26261187 PMCID: PMC7110031 DOI: 10.1093/bib/bbv059] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Received: 04/01/2015] [Indexed: 12/13/2022]

Integration strategy is a key step in network-based analysis and dramatically affects network topological properties and inferring outcomes. Biomed Res Int 2014;2014:296349. [PMID: 25243127 PMCID: PMC4163410 DOI: 10.1155/2014/296349] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Received: 05/22/2014] [Revised: 07/14/2014] [Accepted: 07/17/2014] [Indexed: 01/17/2023]

Kutzera J, Hoefsloot HCJ, Malovannaya A, Smit AB, Van Mechelen I, Smilde AK. Inferring protein-protein interaction complexes from immunoprecipitation data. BMC Res Notes 2013;6:468. [PMID: 24237943 PMCID: PMC3874675 DOI: 10.1186/1756-0500-6-468] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Received: 08/12/2013] [Accepted: 10/31/2013] [Indexed: 11/26/2022]

Abstract

Background

Protein–protein interactions in cells are widely explored using small-scale experiments. However, the search for protein complexes and their interactions in data from high-throughput experiments such as immunoprecipitation is still a challenge. We present "4N", a novel method for detecting protein complexes in such data. Our method is a heuristic algorithm based on Near Neighbor Network (3N) clustering. It is written in R, is faster than model-based methods, and has only a small number of tuning parameters. We explain the application of our new method to real immunoprecipitation results and two artificial datasets. We show that the method can infer protein complexes from protein immunoprecipitation datasets of different densities and sizes.

Findings

4N was applied to the immunoprecipitation dataset that was presented by the authors of the original 3N in Cell 145:787–799, 2011. The test with our method shows that it can reproduce the original clustering results with fewer manually adapted parameters and, in addition, gives direct insight into the complex–complex interactions. We also tested 4N on the human "Tip49a/b" dataset. We conclude that 4N can handle the contaminants and can correctly infer complexes from this very dense dataset. Further tests were performed on two artificial datasets of different sizes. We proved that the method predicts the reference complexes in the two artificial datasets with high accuracy, even when the number of samples is reduced.

Conclusions

4N has been implemented in R. We provide the source code of 4N and a user-friendly toolbox including two example calculations. Biologists can use this 4N-toolbox even if they have a limited knowledge of R. There are only a few tuning parameters to set, and each of these parameters has a biological interpretation. The run times for medium scale datasets are in the order of minutes on a standard desktop PC. Large datasets can typically be analyzed within a few hours.

Zoraghi R, Reiner NE. Protein interaction networks as starting points to identify novel antimicrobial drug targets. Curr Opin Microbiol 2013;16:566-72. [PMID: 23938265 DOI: 10.1016/j.mib.2013.07.010] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Received: 05/31/2013] [Revised: 07/12/2013] [Accepted: 07/16/2013] [Indexed: 01/17/2023]

Quantitative real-time PCR as a sensitive protein–protein interaction quantification method and a partial solution for non-accessible autoactivator and false-negative molecule analysis in the yeast two-hybrid system. Methods 2012;58:376-84. [DOI: 10.1016/j.ymeth.2012.09.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 05/02/2011] [Revised: 09/03/2012] [Accepted: 09/06/2012] [Indexed: 12/15/2022]

Maier CJ, Maier RH, Virok DP, Maass M, Hintner H, Bauer JW, Onder K. Construction of a highly flexible and comprehensive gene collection representing the ORFeome of the human pathogen Chlamydia pneumoniae. BMC Genomics 2012;13:632. [PMID: 23157390 PMCID: PMC3534531 DOI: 10.1186/1471-2164-13-632] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Received: 07/11/2012] [Accepted: 11/11/2012] [Indexed: 12/02/2022]

Abstract

Background

The Gram-negative bacterium Chlamydia pneumoniae (Cpn) is the leading intracellular human pathogen responsible for respiratory infections such as pneumonia and bronchitis. Basic and applied research in pathogen biology, especially the elaboration of new mechanism-based anti-pathogen strategies, target discovery and drug development, rely heavily on the availability of the entire set of pathogen open reading frames, the ORFeome. The ORFeome of Cpn will enable genome- and proteome-wide systematic analysis of Cpn, which will improve our understanding of the molecular networks and mechanisms underlying and governing its pathogenesis.

Results

Here we report the construction of a comprehensive gene collection covering 98.5% of the 1052 predicted and verified ORFs of Cpn (Chlamydia pneumoniae strain CWL029) in Gateway® ‘entry’ vectors. Based on genomic DNA isolated from the vascular chlamydial strain CV-6, we constructed an ORFeome library that contains 869 unique Gateway® entry clones (83% coverage) and an additional 168 PCR-verified ‘pooled’ entry clones, reaching an overall coverage of ~98.5% of the predicted CWL029 ORFs. The high quality of the ORFeome library was verified by PCR-gel electrophoresis and DNA sequencing, and its functionality was demonstrated by expressing panels of recombinant proteins in Escherichia coli and by genome-wide protein interaction analysis for a test set of three Cpn virulence factors in a yeast 2-hybrid system. The ORFeome is available in different configurations of resource stocks, PCR-products, purified plasmid DNA, and living cultures of E. coli harboring the desired entry clone or pooled entry clones. All resources are available in 96-well microtiter plates.

Conclusion

This first ORFeome library for Cpn provides an essential new tool for this important pathogen. The high coverage of entry clones will enable a systems biology approach for Cpn or host–pathogen analysis. The high yield of recombinant proteins and the promising interactors for Cpn virulence factors described here demonstrate the possibilities for proteome-wide studies.

Fujimori S, Hirai N, Ohashi H, Masuoka K, Nishikimi A, Fukui Y, Washio T, Oshikubo T, Yamashita T, Miyamoto-Sato E. Next-generation sequencing coupled with a cell-free display technology for high-throughput production of reliable interactome data. Sci Rep 2012;2:691. [PMID: 23056904 PMCID: PMC3466446 DOI: 10.1038/srep00691] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Received: 07/30/2012] [Accepted: 09/07/2012] [Indexed: 11/09/2022]

Le Meur N, Gentleman R. Analyzing biological data using R: methods for graphs and networks. Methods Mol Biol 2012;804:343-73. [PMID: 22144163 DOI: 10.1007/978-1-61779-361-5_19] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Indexed: 01/15/2023]

Teyra J, Samsonov SA, Schreiber S, Pisabarro MT. SCOWLP update: 3D classification of protein-protein, -peptide, -saccharide and -nucleic acid interactions, and structure-based binding inferences across folds. BMC Bioinformatics 2011;12:398. [PMID: 21992011 PMCID: PMC3210135 DOI: 10.1186/1471-2105-12-398] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Received: 03/09/2011] [Accepted: 10/13/2011] [Indexed: 11/10/2022]

Abstract

Background

Protein interactions are essential for coordinating cellular functions. Proteomic studies have already elucidated a huge number of protein-protein interactions that require detailed functional analysis. Understanding the structural basis of each individual interaction through structure determination is necessary, yet unfeasible. Therefore, computational tools able to predict protein binding regions and recognition modes are required to rationalize putative molecular functions for proteins. With this aim, we previously created SCOWLP, a structural classification of protein binding regions at protein family level, based on the information obtained from high-resolution 3D protein-protein and protein-peptide complexes.

Description

We present here a new version of SCOWLP that has been enhanced by the inclusion of protein-nucleic acid and protein-saccharide interactions. SCOWLP takes interfacial solvent into account for a detailed characterization of protein interactions. In addition, the binding regions obtained per protein family have been enriched by the inclusion of predicted binding regions, which have been inferred from structurally related proteins across all existing folds. These inferences might become very useful to suggest novel recognition regions and compare structurally similar interfaces from different families.

Conclusions

The updated SCOWLP has new functionalities that allow both detection and comparison of protein regions recognizing different types of ligands, which include other proteins, peptides, nucleic acids and saccharides, within a solvated environment. Currently, SCOWLP allows the analysis of predicted protein binding regions based on structure-based inferences across fold space. These predictions may have a unique potential in assisting protein docking, in providing insights into protein interaction networks, and in guiding rational engineering of protein ligands. The newly designed SCOWLP web application has an improved user-friendly interface that facilitates its usage, and is available at http://www.scowlp.org.

Yu X, Ivanic J, Memisević V, Wallqvist A, Reifman J. Categorizing biases in high-confidence high-throughput protein-protein interaction data sets. Mol Cell Proteomics 2011;10:M111.012500. [PMID: 21876202 DOI: 10.1074/mcp.m111.012500] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Indexed: 11/06/2022]

Abstract

We characterized and evaluated the functional attributes of three yeast high-confidence protein-protein interaction data sets derived from affinity purification/mass spectrometry, protein-fragment complementation assay, and yeast two-hybrid experiments. The interacting proteins retrieved from these data sets formed distinct, partially overlapping sets with different protein-protein interaction characteristics. These differences were primarily a function of the deployed experimental technologies used to recover these interactions. This affected the total coverage of interactions and was especially evident in the recovery of interactions among different functional classes of proteins. We found that the interaction data obtained by the yeast two-hybrid method was the least biased toward any particular functional characterization. In contrast, interacting proteins in the affinity purification/mass spectrometry and protein-fragment complementation assay data sets were over- and under-represented among distinct and different functional categories. We delineated how these differences affected protein complex organization in the network of interactions, in particular for strongly interacting complexes (e.g. RNA and protein synthesis) versus weak and transient interacting complexes (e.g. protein transport). We quantified methodological differences in detecting protein interactions from larger protein complexes, in the correlation of protein abundance among interacting proteins, and in their connectivity of essential proteins. In the latter case, we showed that minimizing inherent methodology biases removed many of the ambiguous conclusions about protein essentiality and protein connectivity. We used these findings to rationalize how biological insights obtained by analyzing data sets originating from different sources sometimes do not agree or may even contradict each other. An important corollary of this work was that discrepancies in biological insights did not necessarily imply that one detection methodology was better or worse, but rather that, to a large extent, the insights reflected the methodological biases themselves. Consequently, interpreting the protein interaction data within their experimental or cellular context provided the best avenue for overcoming biases and inferring biological knowledge.

Assessing coverage of protein interaction data using capture-recapture models. Bull Math Biol 2011;74:356-74. [PMID: 21870201 DOI: 10.1007/s11538-011-9680-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 08/27/2010] [Accepted: 07/14/2011] [Indexed: 01/08/2023]

Towards a rigorous network of protein-protein interactions of the model sulfate reducer Desulfovibrio vulgaris Hildenborough. PLoS One 2011;6:e21470. [PMID: 21738675 PMCID: PMC3125180 DOI: 10.1371/journal.pone.0021470] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Received: 02/23/2011] [Accepted: 06/01/2011] [Indexed: 11/19/2022]

Terradot L, Noirot-Gros MF. Bacterial protein interaction networks: puzzle stones from solved complex structures add to a clearer picture. Integr Biol (Camb) 2011;3:645-52. [PMID: 21584322 DOI: 10.1039/c0ib00023j] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Indexed: 12/18/2022]

Lindén RO, Eronen VP, Aittokallio T. Quantitative maps of genetic interactions in yeast - comparative evaluation and integrative analysis. BMC Syst Biol 2011;5:45. [PMID: 21435228 PMCID: PMC3079637 DOI: 10.1186/1752-0509-5-45] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Received: 07/30/2010] [Accepted: 03/24/2011] [Indexed: 01/08/2023]

Abstract

Background

High-throughput genetic screening approaches have enabled systematic means to study how interactions among gene mutations contribute to quantitative fitness phenotypes, with the aim of providing insights into the functional wiring diagrams of genetic interaction networks on a global scale. However, it is poorly understood how well these quantitative interaction measurements agree across the screening approaches, which hinders their integrated use toward improving the coverage and quality of the genetic interaction maps in yeast and other organisms.

Results

Using large-scale data matrices from epistatic miniarray profiling (E-MAP), genetic interaction mapping (GIM), and synthetic genetic array (SGA) approaches, we carried out here a systematic comparative evaluation among these quantitative maps of genetic interactions in yeast. The relatively low association between the original interaction measurements or their customized scores could be improved using a matrix-based modelling framework, which enables the use of single- and double-mutant fitness estimates and measurements, respectively, when scoring genetic interactions. Toward an integrative analysis, we show how the detections from the different screening approaches can be combined to suggest novel positive and negative interactions which are complementary to those obtained using any single screening approach alone. The matrix approximation procedure has been made available to support the design and analysis of the future screening studies.

Conclusions

We have shown here that even if the correlation between the currently available quantitative genetic interaction maps in yeast is relatively low, their comparability can be improved by means of our computational matrix approximation procedure, which will enable integrative analysis and detection of a wider spectrum of genetic interactions using data from the complementary screening approaches.

Lim YH, Charette JM, Baserga SJ. Assembling a protein-protein interaction map of the SSU processome from existing datasets. PLoS One 2011;6:e17701. [PMID: 21423703 PMCID: PMC3053386 DOI: 10.1371/journal.pone.0017701] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Received: 12/11/2010] [Accepted: 02/08/2011] [Indexed: 01/12/2023]

Rossin EJ, Lage K, Raychaudhuri S, Xavier RJ, Tatar D, Benita Y, Cotsapas C, Daly MJ. Proteins encoded in genomic regions associated with immune-mediated disease physically interact and suggest underlying biology. PLoS Genet 2011;7:e1001273. [PMID: 21249183 PMCID: PMC3020935 DOI: 10.1371/journal.pgen.1001273] [Citation(s) in RCA: 407] [Impact Index Per Article: 31.3] [Received: 05/26/2010] [Accepted: 12/09/2010] [Indexed: 12/14/2022]

Abstract

Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein-protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in line with observations in Mendelian disease.
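As a rough illustration of the idea of testing whether a gene set is "more densely connected than chance expectation", the Python sketch below counts interactions inside a gene set and compares that count with randomly drawn gene sets of the same size. It is a deliberately simplified stand-in: the abstract does not describe the authors' permutation schemes, and the gene names, edge list, and function names here are all hypothetical.

```python
import random
from itertools import combinations

def count_internal_edges(genes, edges):
    """Count interactions whose two partners both lie in the gene set."""
    gene_set = set(genes)
    return sum(1 for a, b in edges if a in gene_set and b in gene_set)

def permutation_p_value(genes, all_genes, edges, n_permutations=1000, seed=0):
    """Estimate how often a random gene set is at least as connected as `genes`."""
    rng = random.Random(seed)
    observed = count_internal_edges(genes, edges)
    hits = 0
    for _ in range(n_permutations):
        sample = rng.sample(all_genes, len(genes))
        if count_internal_edges(sample, edges) >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (hits + 1) / (n_permutations + 1)

# Hypothetical toy data: 20 genes, a fully connected group of 4, a few other links.
all_genes = [f"G{i}" for i in range(20)]
associated = ["G0", "G1", "G2", "G3"]
edges = list(combinations(associated, 2)) + [("G4", "G5"), ("G6", "G7"), ("G8", "G9")]

observed, p_value = permutation_p_value(associated, all_genes, edges)
print(observed, p_value)
```

Sampling gene sets uniformly ignores properties such as how many interaction partners each protein has, so a real analysis would likely need a null model that controls for them.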

Affiliation(s)

Elizabeth J. Rossin: Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America; Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America; Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America; Health Science and Technology MD Program, Harvard University and Massachusetts Institute of Technology, Boston, Massachusetts, United States of America; Harvard Biological and Biomedical Sciences Program, Harvard University, Boston, Massachusetts, United States of America
Kasper Lage: Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America; Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America; Pediatric Surgical Research Laboratories, Massachusetts General Hospital, Boston, Massachusetts, United States of America; Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark
Soumya Raychaudhuri: Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America; Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America; Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, Massachusetts, United States of America
Ramnik J. Xavier: Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America; Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America; Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
Diana Tatar: Pediatric Surgical Research Laboratories, Massachusetts General Hospital, Boston, Massachusetts, United States of America
Yair Benita: Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
International Inflammatory Bowel Disease Genetics Consortium
Chris Cotsapas: Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America; Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America
Mark J. Daly: Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America; Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America; Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America; Health Science and Technology MD Program, Harvard University and Massachusetts Institute of Technology, Boston, Massachusetts, United States of America; Harvard Biological and Biomedical Sciences Program, Harvard University, Boston, Massachusetts, United States of America

Wodak SJ, Vlasblom J, Pu S. High-throughput analyses and curation of protein interactions in yeast. Methods Mol Biol 2011;759:381-406. [PMID: 21863499 DOI: 10.1007/978-1-61779-173-4_22] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Indexed: 05/31/2023]

Wu M, Li X, Chua HN, Kwoh CK, Ng SK. Integrating diverse biological and computational sources for reliable protein-protein interactions. BMC Bioinformatics 2010;11 Suppl 7:S8. [PMID: 21106130 PMCID: PMC2957691 DOI: 10.1186/1471-2105-11-s7-s8] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Indexed: 11/23/2022]

Ooi HS, Schneider G, Chan YL, Lim TT, Eisenhaber B, Eisenhaber F. Databases of protein-protein interactions and complexes. Methods Mol Biol 2010;609:145-59. [PMID: 20221918 DOI: 10.1007/978-1-60327-241-4_9] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Indexed: 02/12/2023]

Li X, Wu M, Kwoh CK, Ng SK. Computational approaches for detecting protein complexes from protein interaction networks: a survey. BMC Genomics 2010;11 Suppl 1:S3. [PMID: 20158874 PMCID: PMC2822531 DOI: 10.1186/1471-2164-11-s1-s3] [Citation(s) in RCA: 167] [Impact Index Per Article: 11.9] [Indexed: 11/10/2022]

Abstract

Background

Most proteins form macromolecular complexes to perform their biological functions. However, experimentally determined protein complex data, especially of those involving more than two protein partners, are relatively limited in the current state-of-the-art high-throughput experimental techniques. Nevertheless, many techniques (such as yeast-two-hybrid) have enabled systematic screening of pairwise protein-protein interactions en masse. Thus computational approaches for detecting protein complexes from protein interaction data are useful complements to the limited experimental methods. They can be used together with the experimental methods for mapping the interactions of proteins to understand how different proteins are organized into higher-level substructures to perform various cellular functions.

Results

Given the abundance of pairwise protein interaction data from high-throughput genome-wide experimental screenings, a protein interaction network can be constructed from protein interaction data by considering individual proteins as the nodes, and the existence of a physical interaction between a pair of proteins as a link. This binary protein interaction graph can then be used for detecting protein complexes using graph clustering techniques. In this paper, we review and evaluate the state-of-the-art techniques for computational detection of protein complexes, and discuss some promising research directions in this field.
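The graph construction described above can be sketched in a few lines of Python. This is a minimal illustration, not one of the surveyed methods: the protein names and interaction pairs are made up, and connected components serve only as a crude stand-in for the far more selective graph clustering techniques that the review evaluates.

```python
from collections import defaultdict

def build_network(interactions):
    """Build an undirected adjacency map: proteins are nodes, interactions are links."""
    network = defaultdict(set)
    for a, b in interactions:
        network[a].add(b)
        network[b].add(a)
    return dict(network)

def connected_components(network):
    """Group proteins that can reach one another through chains of interactions."""
    seen = set()
    components = []
    for start in sorted(network):
        if start in seen:
            continue
        stack = [start]
        component = set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(network[node] - component)
        seen |= component
        components.append(component)
    return components

# Hypothetical pairwise interactions, e.g. as reported by a yeast two-hybrid screen.
pairs = [("P1", "P2"), ("P2", "P3"), ("P1", "P3"), ("P4", "P5")]
network = build_network(pairs)
candidates = connected_components(network)
print(candidates)
```

Connected components treat any chain of links as one group, so a single spurious interaction can merge two unrelated complexes; density-based clustering methods are designed to be less sensitive to such links.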

Conclusions

Experimental results with yeast protein interaction data show that the interaction subgraphs discovered by various computational methods matched well with actual protein complexes. In addition, the computational approaches have also improved in performance over the years. Further improvements could be achieved if the quality of the underlying protein interaction data can be considered adequately to minimize the undesirable effects from the irrelevant and noisy sources, and the various biological evidences can be better incorporated into the detection process to maximize the exploitation of the increasing wealth of biological knowledge available.

Ratmann O, Wiuf C, Pinney JW. From evidence to inference: probing the evolution of protein interaction networks. HFSP J 2009;3:290-306. [PMID: 20357887 DOI: 10.2976/1.3167215] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Received: 03/16/2009] [Revised: 05/30/2009] [Indexed: 01/06/2023]

Tun K, Rao RK, Samavedham L, Tanaka H, Dhar PK. Rich can get poor: conversion of hub to non-hub proteins. Syst Synth Biol 2009;2:75-82. [PMID: 19399641 PMCID: PMC2735643 DOI: 10.1007/s11693-009-9024-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Received: 02/20/2009] [Revised: 04/06/2009] [Accepted: 04/07/2009] [Indexed: 11/26/2022]

Cusick ME, Yu H, Smolyar A, Venkatesan K, Carvunis AR, Simonis N, Rual JF, Borick H, Braun P, Dreze M, Vandenhaute J, Galli M, Yazaki J, Hill DE, Ecker JR, Roth FP, Vidal M. Literature-curated protein interaction datasets. Nat Methods 2009;6:39-46. [PMID: 19116613 PMCID: PMC2683745 DOI: 10.1038/nmeth.1284] [Citation(s) in RCA: 234] [Impact Index Per Article: 15.6] [Indexed: 12/14/2022]

Wodak SJ, Pu S, Vlasblom J, Séraphin B. Challenges and Rewards of Interaction Proteomics. Mol Cell Proteomics 2009;8:3-18. [DOI: 10.1074/mcp.r800014-mcp200] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.0] [Indexed: 11/06/2022]

Kelly W, Stumpf M. Protein-protein interactions: from global to local analyses. Curr Opin Biotechnol 2008;19:396-403. [PMID: 18644446 DOI: 10.1016/j.copbio.2008.06.010] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Received: 06/04/2008] [Revised: 06/25/2008] [Accepted: 06/25/2008] [Indexed: 12/26/2022]