1
|
Fakhar AZ, Liu J, Pajerowska-Mukhtar KM. Dynamic Enrichment for Evaluation of Protein Networks (DEEPN): A High Throughput Yeast Two-Hybrid (Y2H) Protocol to Evaluate Networks. Methods Mol Biol 2023; 2690:179-192. [PMID: 37450148 DOI: 10.1007/978-1-0716-3327-4_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/18/2023]
Abstract
Proteins are the building blocks of life, and a vast array of cellular processes is handled by protein-protein interactions (PPIs). The protein complexes formed via PPIs lead to tangled networks that, with their continuous remodeling, build up systematic functional units. Over the years, PPIs have become an area of interest for many researchers, leading to the development of multiple in vitro and in vivo methods to reveal these interactions. The yeast-two-hybrid (Y2H) system is a potent genetic way to map PPIs in both a micro- and high-throughput manner. Y2H is a technique that involves using modified yeast cells to identify protein-protein interactions. For Y2H, the yeast cells are engineered only to grow when there is a significant interaction between a specific protein with its interacting partner. PPIs are identified in the Y2H system by stimulating reporter genes in response to a restored transcription factor. However, Y2H results may be constrained by stringency requirements, as the limited number of colony screenings through this technique could result in the possible elimination of numerous genuine interactions. Therefore, DEEPN (dynamic enrichment for evaluation of protein networks) can be used, offering the potential to study the multiple static and transient protein interactions in a single Y2H experiment. DEEPN utilizes next-generation DNA sequencing (NGS) data in a high-throughput manner and subsequently applies computational analysis and statistical modeling to identify interacting partners. This protocol describes customized reagents and protocols through which DEEPN analysis can be utilized efficiently and cost-effectively.
Collapse
Affiliation(s)
| | - Jinbao Liu
- Department of Biology at University of Alabama, Birmingham, AL, USA
| | | |
Collapse
|
2
|
Paul M, Anand A. A New Family of Similarity Measures for Scoring Confidence of Protein Interactions Using Gene Ontology. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:19-30. [PMID: 34029194 DOI: 10.1109/tcbb.2021.3083150] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The large-scale protein-protein interaction (PPI) data has the potential to play a significant role in the endeavor of understanding cellular processes. However, the presence of a considerable fraction of false positives is a bottleneck in realizing this potential. There have been continuous efforts to utilize complementary resources for scoring confidence of PPIs in a manner that false positive interactions get a low confidence score. Gene Ontology (GO), a taxonomy of biological terms to represent the properties of gene products and their relations, has been widely used for this purpose. We utilize GO to introduce a new set of specificity measures: Relative Depth Specificity (RDS), Relative Node-based Specificity (RNS), and Relative Edge-based Specificity (RES), leading to a new family of similarity measures. We use these similarity measures to obtain a confidence score for each PPI. We evaluate the new measures using four different benchmarks. We show that all the three measures are quite effective. Notably, RNS and RES more effectively distinguish true PPIs from false positives than the existing alternatives. RES also shows a robust set-discriminating power and can be useful for protein functional clustering as well.
Collapse
|
3
|
Massoud TF, Paulmurugan R. Molecular Imaging of Protein–Protein Interactions and Protein Folding. Mol Imaging 2021. [DOI: 10.1016/b978-0-12-816386-3.00071-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|
4
|
Paul M, Anand A. Impact of low-confidence interactions on computational identification of protein complexes. J Bioinform Comput Biol 2020; 18:2050025. [PMID: 32757809 DOI: 10.1142/s0219720020500250] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Protein complexes are the cornerstones of most of the biological processes. Identifying protein complexes is crucial in understanding the principles of cellular organization with several important applications, including in disease diagnosis. Several computational techniques have been developed to identify protein complexes from protein-protein interaction (PPI) data (equivalently, from PPI networks). These PPI data have a significant amount of false positives, which is a bottleneck in identifying protein complexes correctly. Gene ontology (GO)-based semantic similarity measures can be used to assign a confidence score to PPIs. Consequently, low-confidence PPIs are highly likely to be false positives. In this paper, we systematically study the impact of low-confidence PPIs on the performance of complex detection methods using GO-based semantic similarity measures. We consider five state-of-the-art complex detection algorithms and nine GO-based similarity measures in the evaluation. We find that each complex detection algorithm significantly improves its performance after the filtration of low-similarity scored PPIs. It is also observed that the percentage improvement and the filtration percentage (of low-confidence PPIs) are highly correlated.
Collapse
Affiliation(s)
- Madhusudan Paul
- Department of Computer Science and Engineering, Indian Institute of Technology Guwahati, Guwahati 781039, Assam, India.,Department of Computer and System Sciences, Visva-Bharati, Santiniketan 731235, West Bengal, India
| | - Ashish Anand
- Department of Computer Science and Engineering, Indian Institute of Technology Guwahati, Guwahati 781039, Assam, India
| |
Collapse
|
5
|
Kutzera J, Smilde AK, Wilderjans TF, Hoefsloot HCJ. Towards a Hierarchical Strategy to Explore Multi-Scale IP/MS Data for Protein Complexes. PLoS One 2015; 10:e0139704. [PMID: 26448546 PMCID: PMC4598013 DOI: 10.1371/journal.pone.0139704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Accepted: 09/16/2015] [Indexed: 11/24/2022] Open
Abstract
Protein interaction in cells can be described at different levels. At a low interaction level, proteins function together in small, stable complexes and at a higher level, in sets of interacting complexes. All interaction levels are crucial for the living organism, and one of the challenges in proteomics is to measure the proteins at their different interaction levels. One common method for such measurements is immunoprecipitation followed by mass spectrometry (IP/MS), which has the potential to probe the different protein interaction forms. However, IP/MS data are complex because proteins, in their diverse interaction forms, manifest themselves in different ways in the data. Numerous bioinformatic tools for finding protein complexes in IP/MS data are currently available, but most tools do not provide information about the interaction level of the discovered complexes, and no tool is geared specifically to unraveling and visualizing these different levels. We present a new bioinformatic tool to explore IP/MS datasets for protein complexes at different interaction levels and show its performance on several real–life datasets. Our tool creates clusters that represent protein complexes, but unlike previous methods, it arranges them in a tree–shaped structure, reporting why specific proteins are predicted to build a complex and where it can be divided into smaller complexes. In every data analysis method, parameters have to be chosen. Our method can suggest values for its parameters and comes with adapted visualization tools that display the effect of the parameters on the result. The tools provide fast graphical feedback and allow the user to interact with the data by changing the parameters and examining the result. The tools also allow for exploring the different organizational levels of the protein complexes in a given dataset. Our method is available as GNU-R source code and includes examples at www.bdagroup.nl.
Collapse
Affiliation(s)
- Joachim Kutzera
- Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, The Netherlands
- Netherlands Institute for Systems Biology, University of Amsterdam, Amsterdam, The Netherlands
- * E-mail:
| | - Age K. Smilde
- Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, The Netherlands
- Netherlands Institute for Systems Biology, University of Amsterdam, Amsterdam, The Netherlands
| | - Tom F. Wilderjans
- Faculty of Psychology and Educational Sciences, KU Leuven, Leuven, Belgium
- Faculty of Social and Behavioural Sciences, Leiden University, Leiden, The Netherlands
| | - Huub C. J. Hoefsloot
- Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, The Netherlands
- Netherlands Institute for Systems Biology, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
6
|
Pan A, Lahiri C, Rajendiran A, Shanmugham B. Computational analysis of protein interaction networks for infectious diseases. Brief Bioinform 2015; 17:517-26. [PMID: 26261187 PMCID: PMC7110031 DOI: 10.1093/bib/bbv059] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Indexed: 12/13/2022] Open
Abstract
Infectious diseases caused by pathogens, including viruses, bacteria and parasites, pose a serious threat to human health worldwide. Frequent changes in the pattern of infection mechanisms and the emergence of multidrug-resistant strains among pathogens have weakened the current treatment regimen. This necessitates the development of new therapeutic interventions to prevent and control such diseases. To cater to the need, analysis of protein interaction networks (PINs) has gained importance as one of the promising strategies. The present review aims to discuss various computational approaches to analyse the PINs in context to infectious diseases. Topology and modularity analysis of the network with their biological relevance, and the scenario till date about host–pathogen and intra-pathogenic protein interaction studies were delineated. This would provide useful insights to the research community, thereby enabling them to design novel biomedicine against such infectious diseases.
Collapse
|
7
|
Integration strategy is a key step in network-based analysis and dramatically affects network topological properties and inferring outcomes. BIOMED RESEARCH INTERNATIONAL 2014; 2014:296349. [PMID: 25243127 PMCID: PMC4163410 DOI: 10.1155/2014/296349] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/22/2014] [Revised: 07/14/2014] [Accepted: 07/17/2014] [Indexed: 01/17/2023]
Abstract
An increasing number of experiments have been designed to detect intracellular and intercellular molecular interactions. Based on these molecular interactions (especially protein interactions), molecular networks have been built for using in several typical applications, such as the discovery of new disease genes and the identification of drug targets and molecular complexes. Because the data are incomplete and a considerable number of false-positive interactions exist, protein interactions from different sources are commonly integrated in network analyses to build a stable molecular network. Although various types of integration strategies are being applied in current studies, the topological properties of the networks from these different integration strategies, especially typical applications based on these network integration strategies, have not been rigorously evaluated. In this paper, systematic analyses were performed to evaluate 11 frequently used methods using two types of integration strategies: empirical and machine learning methods. The topological properties of the networks of these different integration strategies were found to significantly differ. Moreover, these networks were found to dramatically affect the outcomes of typical applications, such as disease gene predictions, drug target detections, and molecular complex identifications. The analysis presented in this paper could provide an important basis for future network-based biological researches.
Collapse
|
8
|
Kutzera J, Hoefsloot HCJ, Malovannaya A, Smit AB, Van Mechelen I, Smilde AK. Inferring protein-protein interaction complexes from immunoprecipitation data. BMC Res Notes 2013; 6:468. [PMID: 24237943 PMCID: PMC3874675 DOI: 10.1186/1756-0500-6-468] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2013] [Accepted: 10/31/2013] [Indexed: 11/26/2022] Open
Abstract
Background Protein–protein interactions in cells are widely explored using small–scale experiments. However, the search for protein complexes and their interactions in data from high throughput experiments such as immunoprecipitation is still a challenge. We present "4N", a novel method for detecting protein complexes in such data. Our method is a heuristic algorithm based on Near Neighbor Network (3N) clustering. It is written in R, it is faster than model-based methods, and has only a small number of tuning parameters. We explain the application of our new method to real immunoprecipitation results and two artificial datasets. We show that the method can infer protein complexes from protein immunoprecipitation datasets of different densities and sizes. Findings 4N was applied on the immunoprecipitation dataset that was presented by the authors of the original 3N in Cell 145:787–799, 2011. The test with our method shows that it can reproduce the original clustering results with fewer manually adapted parameters and, in addition, gives direct insight into the complex–complex interactions. We also tested 4N on the human "Tip49a/b" dataset. We conclude that 4N can handle the contaminants and can correctly infer complexes from this very dense dataset. Further tests were performed on two artificial datasets of different sizes. We proved that the method predicts the reference complexes in the two artificial datasets with high accuracy, even when the number of samples is reduced. Conclusions 4N has been implemented in R. We provide the sourcecode of 4N and a user-friendly toolbox including two example calculations. Biologists can use this 4N-toolbox even if they have a limited knowledge of R. There are only a few tuning parameters to set, and each of these parameters has a biological interpretation. The run times for medium scale datasets are in the order of minutes on a standard desktop PC. Large datasets can typically be analyzed within a few hours.
Collapse
Affiliation(s)
- Joachim Kutzera
- Biosystems Data Analysis, Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, The Netherlands.
| | | | | | | | | | | |
Collapse
|
9
|
Zoraghi R, Reiner NE. Protein interaction networks as starting points to identify novel antimicrobial drug targets. Curr Opin Microbiol 2013; 16:566-72. [PMID: 23938265 DOI: 10.1016/j.mib.2013.07.010] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Revised: 07/12/2013] [Accepted: 07/16/2013] [Indexed: 01/17/2023]
Abstract
Novel classes of antimicrobials are needed to address the challenge of multidrug-resistant bacteria. Current bacterial drug targets mainly consist of specific proteins or subsets of proteins without regard for either how these targets are integrated in cellular networks or how they may interact with host proteins. However, proteins rarely act in isolation, and the majority of biological processes are dependent on interactions with other proteins. Consequently, protein-protein interaction (PPI) networks offer a realm of unexplored potential for next-generation drug targets. In this review, we argue that the architecture of bacterial or host-pathogen protein interactomes can provide invaluable insights for the identification of novel antibacterial drug targets.
Collapse
Affiliation(s)
- Roya Zoraghi
- Division of Infectious Diseases, Department of Medicine, University of British Columbia, Vancouver, Canada
| | | |
Collapse
|
10
|
Quantitative real-time PCR as a sensitive protein–protein interaction quantification method and a partial solution for non-accessible autoactivator and false-negative molecule analysis in the yeast two-hybrid system. Methods 2012; 58:376-84. [DOI: 10.1016/j.ymeth.2012.09.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2011] [Revised: 09/03/2012] [Accepted: 09/06/2012] [Indexed: 12/15/2022] Open
|
11
|
Maier CJ, Maier RH, Virok DP, Maass M, Hintner H, Bauer JW, Onder K. Construction of a highly flexible and comprehensive gene collection representing the ORFeome of the human pathogen Chlamydia pneumoniae. BMC Genomics 2012; 13:632. [PMID: 23157390 PMCID: PMC3534531 DOI: 10.1186/1471-2164-13-632] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2012] [Accepted: 11/11/2012] [Indexed: 12/02/2022] Open
Abstract
Background The Gram-negative bacterium Chlamydia pneumoniae (Cpn) is the leading intracellular human pathogen responsible for respiratory infections such as pneumonia and bronchitis. Basic and applied research in pathogen biology, especially the elaboration of new mechanism-based anti-pathogen strategies, target discovery and drug development, rely heavily on the availability of the entire set of pathogen open reading frames, the ORFeome. The ORFeome of Cpn will enable genome- and proteome-wide systematic analysis of Cpn, which will improve our understanding of the molecular networks and mechanisms underlying and governing its pathogenesis. Results Here we report the construction of a comprehensive gene collection covering 98.5% of the 1052 predicted and verified ORFs of Cpn (Chlamydia pneumoniae strain CWL029) in Gateway® ‘entry’ vectors. Based on genomic DNA isolated from the vascular chlamydial strain CV-6, we constructed an ORFeome library that contains 869 unique Gateway® entry clones (83% coverage) and an additional 168 PCR-verified ‘pooled’ entry clones, reaching an overall coverage of ~98.5% of the predicted CWL029 ORFs. The high quality of the ORFeome library was verified by PCR-gel electrophoresis and DNA sequencing, and its functionality was demonstrated by expressing panels of recombinant proteins in Escherichia coli and by genome-wide protein interaction analysis for a test set of three Cpn virulence factors in a yeast 2-hybrid system. The ORFeome is available in different configurations of resource stocks, PCR-products, purified plasmid DNA, and living cultures of E. coli harboring the desired entry clone or pooled entry clones. All resources are available in 96-well microtiterplates. Conclusion This first ORFeome library for Cpn provides an essential new tool for this important pathogen. The high coverage of entry clones will enable a systems biology approach for Cpn or host–pathogen analysis. The high yield of recombinant proteins and the promising interactors for Cpn virulence factors described here demonstrate the possibilities for proteome-wide studies.
Collapse
Affiliation(s)
- Christina J Maier
- Department of Dermatology, Paracelsus Medical University, Salzburg, Austria
| | | | | | | | | | | | | |
Collapse
|
12
|
Fujimori S, Hirai N, Ohashi H, Masuoka K, Nishikimi A, Fukui Y, Washio T, Oshikubo T, Yamashita T, Miyamoto-Sato E. Next-generation sequencing coupled with a cell-free display technology for high-throughput production of reliable interactome data. Sci Rep 2012; 2:691. [PMID: 23056904 PMCID: PMC3466446 DOI: 10.1038/srep00691] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2012] [Accepted: 09/07/2012] [Indexed: 11/09/2022] Open
Abstract
Next-generation sequencing (NGS) has been applied to various kinds of omics studies, resulting in many biological and medical discoveries. However, high-throughput protein-protein interactome datasets derived from detection by sequencing are scarce, because protein-protein interaction analysis requires many cell manipulations to examine the interactions. The low reliability of the high-throughput data is also a problem. Here, we describe a cell-free display technology combined with NGS that can improve both the coverage and reliability of interactome datasets. The completely cell-free method gives a high-throughput and a large detection space, testing the interactions without using clones. The quantitative information provided by NGS reduces the number of false positives. The method is suitable for the in vitro detection of proteins that interact not only with the bait protein, but also with DNA, RNA and chemical compounds. Thus, it could become a universal approach for exploring the large space of protein sequences and interactome networks.
Collapse
Affiliation(s)
- Shigeo Fujimori
- Division of Interactome Medical Sciences, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Le Meur N, Gentleman R. Analyzing biological data using R: methods for graphs and networks. Methods Mol Biol 2012; 804:343-73. [PMID: 22144163 DOI: 10.1007/978-1-61779-361-5_19] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
R is a powerful language and widely used software tool for the analysis and visualization of data. Its core capabilities can be extended through many different add-on packages. Among the many packages are some which offer a broad range of facilities for analyzing statistical properties of graphs. This chapter provides a practical tutorial covering the use of R methods for graphs and networks to examine biological data and analyze their topological and statistical properties.
Collapse
Affiliation(s)
- Nolwenn Le Meur
- IRISA, Equipe Symbiose, Université de Rennes I, Rennes, France.
| | | |
Collapse
|
14
|
Teyra J, Samsonov SA, Schreiber S, Pisabarro MT. SCOWLP update: 3D classification of protein-protein, -peptide, -saccharide and -nucleic acid interactions, and structure-based binding inferences across folds. BMC Bioinformatics 2011; 12:398. [PMID: 21992011 PMCID: PMC3210135 DOI: 10.1186/1471-2105-12-398] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2011] [Accepted: 10/13/2011] [Indexed: 11/10/2022] Open
Abstract
Background Protein interactions are essential for coordinating cellular functions. Proteomic studies have already elucidated a huge amount of protein-protein interactions that require detailed functional analysis. Understanding the structural basis of each individual interaction through their structural determination is necessary, yet an unfeasible task. Therefore, computational tools able to predict protein binding regions and recognition modes are required to rationalize putative molecular functions for proteins. With this aim, we previously created SCOWLP, a structural classification of protein binding regions at protein family level, based on the information obtained from high-resolution 3D protein-protein and protein-peptide complexes. Description We present here a new version of SCOWLP that has been enhanced by the inclusion of protein-nucleic acid and protein-saccharide interactions. SCOWLP takes interfacial solvent into account for a detailed characterization of protein interactions. In addition, the binding regions obtained per protein family have been enriched by the inclusion of predicted binding regions, which have been inferred from structurally related proteins across all existing folds. These inferences might become very useful to suggest novel recognition regions and compare structurally similar interfaces from different families. Conclusions The updated SCOWLP has new functionalities that allow both, detection and comparison of protein regions recognizing different types of ligands, which include other proteins, peptides, nucleic acids and saccharides, within a solvated environment. Currently, SCOWLP allows the analysis of predicted protein binding regions based on structure-based inferences across fold space. These predictions may have a unique potential in assisting protein docking, in providing insights into protein interaction networks, and in guiding rational engineering of protein ligands. The newly designed SCOWLP web application has an improved user-friendly interface that facilitates its usage, and is available at http://www.scowlp.org.
Collapse
Affiliation(s)
- Joan Teyra
- Structural Bioinformatics BIOTEC TU Dresden, Tatzberg 47-51 01037 Dresden, Germany.
| | | | | | | |
Collapse
|
15
|
Yu X, Ivanic J, Memisević V, Wallqvist A, Reifman J. Categorizing biases in high-confidence high-throughput protein-protein interaction data sets. Mol Cell Proteomics 2011; 10:M111.012500. [PMID: 21876202 DOI: 10.1074/mcp.m111.012500] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
We characterized and evaluated the functional attributes of three yeast high-confidence protein-protein interaction data sets derived from affinity purification/mass spectrometry, protein-fragment complementation assay, and yeast two-hybrid experiments. The interacting proteins retrieved from these data sets formed distinct, partially overlapping sets with different protein-protein interaction characteristics. These differences were primarily a function of the deployed experimental technologies used to recover these interactions. This affected the total coverage of interactions and was especially evident in the recovery of interactions among different functional classes of proteins. We found that the interaction data obtained by the yeast two-hybrid method was the least biased toward any particular functional characterization. In contrast, interacting proteins in the affinity purification/mass spectrometry and protein-fragment complementation assay data sets were over- and under-represented among distinct and different functional categories. We delineated how these differences affected protein complex organization in the network of interactions, in particular for strongly interacting complexes (e.g. RNA and protein synthesis) versus weak and transient interacting complexes (e.g. protein transport). We quantified methodological differences in detecting protein interactions from larger protein complexes, in the correlation of protein abundance among interacting proteins, and in their connectivity of essential proteins. In the latter case, we showed that minimizing inherent methodology biases removed many of the ambiguous conclusions about protein essentiality and protein connectivity. We used these findings to rationalize how biological insights obtained by analyzing data sets originating from different sources sometimes do not agree or may even contradict each other. An important corollary of this work was that discrepancies in biological insights did not necessarily imply that one detection methodology was better or worse, but rather that, to a large extent, the insights reflected the methodological biases themselves. Consequently, interpreting the protein interaction data within their experimental or cellular context provided the best avenue for overcoming biases and inferring biological knowledge.
Collapse
Affiliation(s)
- Xueping Yu
- Biotechnology HPC Software Applications Institute, Telemedicine and Advanced Technology Research Center, US Army Medical Research and Materiel Command, Ft. Detrick, MD 21702, USA
| | | | | | | | | |
Collapse
|
16
|
Assessing coverage of protein interaction data using capture-recapture models. Bull Math Biol 2011; 74:356-74. [PMID: 21870201 DOI: 10.1007/s11538-011-9680-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2010] [Accepted: 07/14/2011] [Indexed: 01/08/2023]
Abstract
Protein interaction networks comprise thousands of individual binary links between distinct proteins. Whilst these data have attracted considerable attention and been the focus of many different studies, the networks, their structure, function, and how they change over time are still not fully known. More importantly, there is still considerable uncertainty regarding their size, and the quality of the available data continues to be questioned. Here, we employ statistical models of the experimental sampling process, in particular capture-recapture methods, in order to assess the false discovery rate and size of protein interaction networks. We uses these methods to gauge the ability of different experimental systems to find the true binary interactome. Our model allows us to obtain estimates for the size and false-discovery rate from simple considerations regarding the number of repeatedly interactions, and provides suggestions as to how we can exploit this information in order to reduce the effects of noise in such data. In particular our approach does not require a reference dataset. We estimate that approximately more than half of the true physical interactome has now been sampled in yeast.
Collapse
|
17
|
Towards a rigorous network of protein-protein interactions of the model sulfate reducer Desulfovibrio vulgaris Hildenborough. PLoS One 2011; 6:e21470. [PMID: 21738675 PMCID: PMC3125180 DOI: 10.1371/journal.pone.0021470] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2011] [Accepted: 06/01/2011] [Indexed: 11/19/2022] Open
Abstract
Protein-protein interactions offer an insight into cellular processes beyond what may be obtained by the quantitative functional genomics tools of proteomics and transcriptomics. The aforementioned tools have been extensively applied to study Escherichia coli and other aerobes and more recently to study the stress response behavior of Desulfovibrio vulgaris Hildenborough, a model obligate anaerobe and sulfate reducer and the subject of this study. Here we carried out affinity purification followed by mass spectrometry to reconstruct an interaction network among 12 chromosomally encoded bait and 90 prey proteins based on 134 bait-prey interactions identified to be of high confidence. Protein-protein interaction data are often plagued by the lack of adequate controls and replication analyses necessary to assess confidence in the results, including identification of potential false positives. We addressed these issues through the use of biological replication, exponentially modified protein abundance indices, results from an experimental negative control, and a statistical test to assign confidence to each putative interacting pair applicable to small interaction data studies. We discuss the biological significance of metabolic features of D. vulgaris revealed by these protein-protein interaction data and the observed protein modifications. These include the distinct role of the putative carbon monoxide-induced hydrogenase, unique electron transfer routes associated with different oxidoreductases, and the possible role of methylation in regulating sulfate reduction.
Collapse
|
18
|
Terradot L, Noirot-Gros MF. Bacterial protein interaction networks: puzzle stones from solved complex structures add to a clearer picture. Integr Biol (Camb) 2011; 3:645-52. [PMID: 21584322 DOI: 10.1039/c0ib00023j] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Global scale studies of protein-protein interaction (PPI) networks have considerably expanded our view of how proteins act in the cell. In particular, bacterial "interactome" surveys have revealed that proteins can sometimes interact with a large number of protein partners and connect different cellular processes. More targeted, pathway-orientated PPI studies have also helped to propose functions for unknown proteins based on the "guilty by association" principle. However, given the immense repertoire of PPIs generated and the variability of PPI networks, more studies are required to understand the role(s) of these interactions in the cell. With the availability of bioinformatic analysis tools, transcriptomics and co-expression experiments for a given interaction, interactomes are being deciphered. More recently, functional and structural studies have been derived from these PPI networks. In this review, we will give a number of examples of how combining functional and structural studies into PPI networks has contributed to understanding the functions of some of these interactions. We discuss how interactomes now represent a unique opportunity to determine the structures of bacterial protein complexes on a large scale by the integration of multiple technologies.
Collapse
Affiliation(s)
- Laurent Terradot
- Institut de Biologie et Chimie des Protéines, UMR 5086 CNRS Université de Lyon, IFR128, Biologie Structurale des Complexes Macromoléculaires Bactériens, 7 Passage du Vercors, F-69367, Lyon Cedex 07, France.
| | | |
Collapse
|
19
|
Lindén RO, Eronen VP, Aittokallio T. Quantitative maps of genetic interactions in yeast - comparative evaluation and integrative analysis. BMC SYSTEMS BIOLOGY 2011; 5:45. [PMID: 21435228 PMCID: PMC3079637 DOI: 10.1186/1752-0509-5-45] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2010] [Accepted: 03/24/2011] [Indexed: 01/08/2023]
Abstract
Background High-throughput genetic screening approaches have enabled systematic means to study how interactions among gene mutations contribute to quantitative fitness phenotypes, with the aim of providing insights into the functional wiring diagrams of genetic interaction networks on a global scale. However, it is poorly known how well these quantitative interaction measurements agree across the screening approaches, which hinders their integrated use toward improving the coverage and quality of the genetic interaction maps in yeast and other organisms. Results Using large-scale data matrices from epistatic miniarray profiling (E-MAP), genetic interaction mapping (GIM), and synthetic genetic array (SGA) approaches, we carried out here a systematic comparative evaluation among these quantitative maps of genetic interactions in yeast. The relatively low association between the original interaction measurements or their customized scores could be improved using a matrix-based modelling framework, which enables the use of single- and double-mutant fitness estimates and measurements, respectively, when scoring genetic interactions. Toward an integrative analysis, we show how the detections from the different screening approaches can be combined to suggest novel positive and negative interactions which are complementary to those obtained using any single screening approach alone. The matrix approximation procedure has been made available to support the design and analysis of the future screening studies. Conclusions We have shown here that even if the correlation between the currently available quantitative genetic interaction maps in yeast is relatively low, their comparability can be improved by means of our computational matrix approximation procedure, which will enable integrative analysis and detection of a wider spectrum of genetic interactions using data from the complementary screening approaches.
Collapse
Affiliation(s)
- Rolf O Lindén
- Biomathematics Research Group, Department of Mathematics, University of Turku, Turku, Finland
| | | | | |
Collapse
|
20
|
Lim YH, Charette JM, Baserga SJ. Assembling a protein-protein interaction map of the SSU processome from existing datasets. PLoS One 2011; 6:e17701. [PMID: 21423703 PMCID: PMC3053386 DOI: 10.1371/journal.pone.0017701] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2010] [Accepted: 02/08/2011] [Indexed: 01/12/2023] Open
Abstract
Background The small subunit (SSU) processome is a large ribonucleoprotein complex involved in small ribosomal subunit assembly. It consists of the U3 snoRNA and ∼72 proteins. While most of its components have been identified, the protein-protein interactions (PPIs) among them remain largely unknown, and thus the assembly, architecture and function of the SSU processome remains unclear. Methodology We queried PPI databases for SSU processome proteins to quantify the degree to which the three genome-wide high-throughput yeast two-hybrid (HT-Y2H) studies, the genome-wide protein fragment complementation assay (PCA) and the literature-curated (LC) datasets cover the SSU processome interactome. Conclusions We find that coverage of the SSU processome PPI network is remarkably sparse. Two of the three HT-Y2H studies each account for four and six PPIs between only six of the 72 proteins, while the third study accounts for as little as one PPI and two proteins. The PCA dataset has the highest coverage among the genome-wide studies with 27 PPIs between 25 proteins. The LC dataset was the most extensive, accounting for 34 proteins and 38 PPIs, many of which were validated by independent methods, thereby further increasing their reliability. When the collected data were merged, we found that at least 70% of the predicted PPIs have yet to be determined and 26 proteins (36%) have no known partners. Since the SSU processome is conserved in all Eukaryotes, we also queried HT-Y2H datasets from six additional model organisms, but only four orthologues and three previously known interologous interactions were found. This provides a starting point for further work on SSU processome assembly, and spotlights the need for a more complete genome-wide Y2H analysis.
Collapse
Affiliation(s)
- Young H. Lim
- Department of Molecular Biophysics and Biochemistry, Yale University School of Medicine, New Haven, Connecticut, United States of America
| | - J. Michael Charette
- Department of Molecular Biophysics and Biochemistry, Yale University School of Medicine, New Haven, Connecticut, United States of America
- Department of Therapeutic Radiology, Yale University School of Medicine, New Haven, Connecticut, United States of America
| | - Susan J. Baserga
- Department of Molecular Biophysics and Biochemistry, Yale University School of Medicine, New Haven, Connecticut, United States of America
- Department of Therapeutic Radiology, Yale University School of Medicine, New Haven, Connecticut, United States of America
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, United States of America
- * E-mail:
| |
Collapse
|
21
|
Rossin EJ, Lage K, Raychaudhuri S, Xavier RJ, Tatar D, Benita Y, Cotsapas C, Daly MJ. Proteins encoded in genomic regions associated with immune-mediated disease physically interact and suggest underlying biology. PLoS Genet 2011; 7:e1001273. [PMID: 21249183 PMCID: PMC3020935 DOI: 10.1371/journal.pgen.1001273] [Citation(s) in RCA: 407] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2010] [Accepted: 12/09/2010] [Indexed: 12/14/2022] Open
Abstract
Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein-protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in line with observations in Mendelian disease.
Collapse
Affiliation(s)
- Elizabeth J. Rossin
- Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
- Health Science and Technology MD Program, Harvard University and Massachusetts Institute of Technology, Boston, Massachusetts, United States of America
- Harvard Biological and Biomedical Sciences Program, Harvard University, Boston, Massachusetts, United States of America
| | - Kasper Lage
- Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
- Pediatric Surgical Research Laboratories, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark
| | - Soumya Raychaudhuri
- Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America
- Division of Rheumatology, Immunology, and Allergy, Brigham and Women's Hospital, Boston, Massachusetts, United States of America
| | - Ramnik J. Xavier
- Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Diana Tatar
- Pediatric Surgical Research Laboratories, Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | - Yair Benita
- Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | | | - Chris Cotsapas
- Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America
| | - Mark J. Daly
- Center for Human Genetics Research and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, The Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
- Health Science and Technology MD Program, Harvard University and Massachusetts Institute of Technology, Boston, Massachusetts, United States of America
- Harvard Biological and Biomedical Sciences Program, Harvard University, Boston, Massachusetts, United States of America
| |
Collapse
|
22
|
Wodak SJ, Vlasblom J, Pu S. High-throughput analyses and curation of protein interactions in yeast. Methods Mol Biol 2011; 759:381-406. [PMID: 21863499 DOI: 10.1007/978-1-61779-173-4_22] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The yeast Saccharomyces cerevisiae is the model organism in which protein interactions have been most extensively analyzed. The vast majority of these interactions have been characterized by a variety of sophisticated high-throughput techniques probing different aspects of protein association. This chapter summarizes the major techniques, highlights their complementary nature, discusses the data they produce, and highlights some of the biases from which they suffer. A main focus is the key role played by computational methods for processing, analyzing, and validating the large body of noisy data produced by the experimental procedures. It also describes how computational methods are used to extend the coverage and reliability of protein interaction data by integrating information from heterogeneous sources and reviews the current status of literature-curated data on yeast protein interactions stored in specialized databases.
Collapse
Affiliation(s)
- Shoshana J Wodak
- Molecular Structure and Function Program, Hospital for Sick Children, Toronto, ON, Canada.
| | | | | |
Collapse
|
23
|
Wu M, Li X, Chua HN, Kwoh CK, Ng SK. Integrating diverse biological and computational sources for reliable protein-protein interactions. BMC Bioinformatics 2010; 11 Suppl 7:S8. [PMID: 21106130 PMCID: PMC2957691 DOI: 10.1186/1471-2105-11-s7-s8] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Background Protein-protein interactions (PPIs) play important roles in various cellular processes. However, the low quality of current PPI data detected from high-throughput screening techniques has diminished the potential usefulness of the data. We need to develop a method to address the high data noise and incompleteness of PPI data, namely, to filter out inaccurate protein interactions (false positives) and predict putative protein interactions (false negatives). Results In this paper, we proposed a novel two-step method to integrate diverse biological and computational sources of supporting evidence for reliable PPIs. The first step, interaction binning or InterBIN, groups PPIs together to more accurately estimate the likelihood (Bin-Confidence score) that the protein pairs interact for each biological or computational evidence source. The second step, interaction classification or InterCLASS, integrates the collected Bin-Confidence scores to build classifiers and identify reliable interactions. Conclusions We performed comprehensive experiments on two benchmark yeast PPI datasets. The experimental results showed that our proposed method can effectively eliminate false positives in detected PPIs and identify false negatives by predicting novel yet reliable PPIs. Our proposed method also performed significantly better than merely using each of individual evidence sources, illustrating the importance of integrating various biological and computational sources of data and evidence.
Collapse
Affiliation(s)
- Min Wu
- School of Computer Engineering, Nanyang Technological University, Singapore.
| | | | | | | | | |
Collapse
|
24
|
Ooi HS, Schneider G, Chan YL, Lim TT, Eisenhaber B, Eisenhaber F. Databases of protein-protein interactions and complexes. Methods Mol Biol 2010; 609:145-59. [PMID: 20221918 DOI: 10.1007/978-1-60327-241-4_9] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
Abstract
In the current understanding, translation of genomic sequences into proteins is the most important path for realization of genome information. In exercising their intended function, proteins work together through various forms of direct (physical) or indirect interaction mechanisms. For a variety of basic functions, many proteins form a large complex representing a molecular machine or a macromolecular super-structural building block. After several high-throughput techniques for detection of protein-protein interactions had matured, protein interaction data became available in a large scale and curated databases for protein-protein interactions (PPIs) are a new necessity for efficient research. Here, their scope, annotation quality, and retrieval tools are reviewed. In addition, attention is paid to portals that provide unified access to a variety of such databases with added annotation value.
Collapse
Affiliation(s)
- Hong Sain Ooi
- Bioinformatics Institute, Agency for science, Technology, and Research, Singapore
| | | | | | | | | | | |
Collapse
|
25
|
Li X, Wu M, Kwoh CK, Ng SK. Computational approaches for detecting protein complexes from protein interaction networks: a survey. BMC Genomics 2010; 11 Suppl 1:S3. [PMID: 20158874 PMCID: PMC2822531 DOI: 10.1186/1471-2164-11-s1-s3] [Citation(s) in RCA: 167] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Most proteins form macromolecular complexes to perform their biological functions. However, experimentally determined protein complex data, especially of those involving more than two protein partners, are relatively limited in the current state-of-the-art high-throughput experimental techniques. Nevertheless, many techniques (such as yeast-two-hybrid) have enabled systematic screening of pairwise protein-protein interactions en masse. Thus computational approaches for detecting protein complexes from protein interaction data are useful complements to the limited experimental methods. They can be used together with the experimental methods for mapping the interactions of proteins to understand how different proteins are organized into higher-level substructures to perform various cellular functions. Results Given the abundance of pairwise protein interaction data from high-throughput genome-wide experimental screenings, a protein interaction network can be constructed from protein interaction data by considering individual proteins as the nodes, and the existence of a physical interaction between a pair of proteins as a link. This binary protein interaction graph can then be used for detecting protein complexes using graph clustering techniques. In this paper, we review and evaluate the state-of-the-art techniques for computational detection of protein complexes, and discuss some promising research directions in this field. Conclusions Experimental results with yeast protein interaction data show that the interaction subgraphs discovered by various computational methods matched well with actual protein complexes. In addition, the computational approaches have also improved in performance over the years. Further improvements could be achieved if the quality of the underlying protein interaction data can be considered adequately to minimize the undesirable effects from the irrelevant and noisy sources, and the various biological evidences can be better incorporated into the detection process to maximize the exploitation of the increasing wealth of biological knowledge available.
Collapse
Affiliation(s)
- Xiaoli Li
- Institute for Infocomm Research, 1 Fusionopolis Way, Singapore.
| | | | | | | |
Collapse
|
26
|
Ratmann O, Wiuf C, Pinney JW. From evidence to inference: probing the evolution of protein interaction networks. HFSP JOURNAL 2009; 3:290-306. [PMID: 20357887 DOI: 10.2976/1.3167215] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2009] [Revised: 05/30/2009] [Indexed: 01/06/2023]
Abstract
The evolutionary mechanisms by which protein interaction networks grow and change are beginning to be appreciated as a major factor shaping their present-day structures and properties. Starting with a consideration of the biases and errors inherent in our current views of these networks, we discuss the dangers of constructing evolutionary arguments from naïve analyses of network topology. We argue that progress in understanding the processes of network evolution is only possible when hypotheses are formulated as plausible evolutionary models and compared against the observed data within the framework of probabilistic modeling. The value of such models is expected to be greatly enhanced as they incorporate more of the details of the biophysical properties of interacting proteins, gene phylogeny, and measurement error and as more advanced methodologies emerge for model comparison and the inference of ancestral network states.
Collapse
|
27
|
Tun K, Rao RK, Samavedham L, Tanaka H, Dhar PK. Rich can get poor: conversion of hub to non-hub proteins. SYSTEMS AND SYNTHETIC BIOLOGY 2009; 2:75-82. [PMID: 19399641 PMCID: PMC2735643 DOI: 10.1007/s11693-009-9024-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/20/2009] [Revised: 04/06/2009] [Accepted: 04/07/2009] [Indexed: 11/26/2022]
Abstract
Hubs are ubiquitous network elements with high connectivity. One of the common observations about hub proteins is their preferential attachment leading to scale-free network topology. Here we examine the question: does rich protein always get richer, or can it get poor too? To answer this question, we compared similar and well-annotated hub proteins in six organisms, from prokaryotes to eukaryotes. Our findings indicate that hub proteins retain, gain or lose connectivity based on the context. Furthermore, the loss or gain of connectivity appears to correlate with the functional role of the protein in a given system.
Collapse
Affiliation(s)
- Kyaw Tun
- Synthetic Biology Lab, RIKEN Advanced Sciences Institute, 1-7-22, Suehiro-cho, Tsurumi, 230-0045 Yokohama Japan
- Department of Systems Biology, School of Biomedical Sciences, Tokyo Medical and Dental University, Yushima, 1-5-45, Bunkyo, 113-8510 Tokyo Japan
| | - Raghuraj Keshava Rao
- Department of Chemical and Biomolecular Engineering, Engineering Drive, National University of Singapore, Singapore, 117576 Singapore
| | - Lakshminarayanan Samavedham
- Department of Chemical and Biomolecular Engineering, Engineering Drive, National University of Singapore, Singapore, 117576 Singapore
| | - Hiroshi Tanaka
- Department of Systems Biology, School of Biomedical Sciences, Tokyo Medical and Dental University, Yushima, 1-5-45, Bunkyo, 113-8510 Tokyo Japan
| | - Pawan K. Dhar
- Synthetic Biology Lab, RIKEN Advanced Sciences Institute, 1-7-22, Suehiro-cho, Tsurumi, 230-0045 Yokohama Japan
| |
Collapse
|
28
|
Cusick ME, Yu H, Smolyar A, Venkatesan K, Carvunis AR, Simonis N, Rual JF, Borick H, Braun P, Dreze M, Vandenhaute J, Galli M, Yazaki J, Hill DE, Ecker JR, Roth FP, Vidal M. Literature-curated protein interaction datasets. Nat Methods 2009; 6:39-46. [PMID: 19116613 PMCID: PMC2683745 DOI: 10.1038/nmeth.1284] [Citation(s) in RCA: 234] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
High quality datasets are needed to understand how global and local properties of protein-protein interaction, or “interactome”, networks relate to biological mechanisms, and to guide research on individual proteins. Evaluations of existing curation of protein interaction experiments reported in the literature find that curation can be error prone and possibly of lower quality than commonly assumed.
Collapse
Affiliation(s)
- Michael E Cusick
- Center for Cancer Systems Biology and Department of Cancer Biology, Dana-Farber Cancer Institute, 44 Binney Street, Boston, Massachusetts 02115, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Wodak SJ, Pu S, Vlasblom J, Seéraphin B. Challenges and Rewards of Interaction Proteomics. Mol Cell Proteomics 2009; 8:3-18. [DOI: 10.1074/mcp.r800014-mcp200] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
30
|
Kelly W, Stumpf M. Protein-protein interactions: from global to local analyses. Curr Opin Biotechnol 2008; 19:396-403. [PMID: 18644446 DOI: 10.1016/j.copbio.2008.06.010] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2008] [Revised: 06/25/2008] [Accepted: 06/25/2008] [Indexed: 12/26/2022]
Abstract
For the increasing number of species with complete genome sequences, the task of elucidating their complete proteomes and interactomes has attracted much recent interest. Although the proteome describes the complete repertoire of proteins expressed, the interactome comprises the pairwise protein-protein interactions that occur, or could occur, within an organism, and forms a large-scale sparse network. Here we discuss the challenges provided by present data, and outline a route from global analysis to more detailed and focused studies of protein-protein interactions. Carefully using protein-interaction data allows us to explore its potential fully alongside the evaluation of mechanistic hypotheses about biological systems.
Collapse
Affiliation(s)
- Wp Kelly
- Centre for Bioinformatics, Imperial College London, London, United Kingdom.
| | | |
Collapse
|