1
|
Shtetinska MM, González-Sánchez JC, Beyer T, Boldt K, Ueffing M, Russell R. WeSA: a web server for improving analysis of affinity proteomics data. Nucleic Acids Res 2024; 52:W333-W340. [PMID: 38795065 PMCID: PMC11223876 DOI: 10.1093/nar/gkae423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Revised: 04/23/2024] [Accepted: 05/14/2024] [Indexed: 05/27/2024] Open
Abstract
Protein-protein interaction experiments still yield many false positive interactions. The socioaffinity metric can distinguish true protein-protein interactions from noise based on available data. Here, we present WeSA (Weighted SocioAffinity), which considers large datasets of interaction proteomics data (IntAct, BioGRID, the BioPlex) to score human protein interactions and, in a statistically robust way, flag those (even from a single experiment) that are likely to be false positives. ROC analysis (using CORUM-PDB positives and Negatome negatives) shows that WeSA improves over other measures of interaction confidence. WeSA shows consistently good results over all datasets (up to: AUC = 0.93 and at best threshold: TPR = 0.84, FPR = 0.11, Precision = 0.98). WeSA is freely available without login (wesa.russelllab.org). Users can submit their own data or look for organized information on human protein interactions using the web server. Users can either retrieve available information for a list of proteins of interest or calculate scores for new experiments. The server outputs either pre-computed or updated WeSA scores for the input enriched with information from databases. The summary is presented as a table and a network-based visualization allowing the user to remove those nodes/edges that the method considers spurious.
Collapse
Affiliation(s)
- Magdalena M Shtetinska
- BioQuant, Heidelberg University, 69120 Heidelberg, Germany
- Biochemistry Center (BZH), Heidelberg University, 69120 Heidelberg, Germany
| | - Juan-Carlos González-Sánchez
- BioQuant, Heidelberg University, 69120 Heidelberg, Germany
- Biochemistry Center (BZH), Heidelberg University, 69120 Heidelberg, Germany
| | - Tina Beyer
- Institute for Ophthalmic Research, Center for Ophthalmology, University of Tübingen, 72076 Tübingen, Germany
| | - Karsten Boldt
- Institute for Ophthalmic Research, Center for Ophthalmology, University of Tübingen, 72076 Tübingen, Germany
| | - Marius Ueffing
- Institute for Ophthalmic Research, Center for Ophthalmology, University of Tübingen, 72076 Tübingen, Germany
| | - Robert B Russell
- BioQuant, Heidelberg University, 69120 Heidelberg, Germany
- Biochemistry Center (BZH), Heidelberg University, 69120 Heidelberg, Germany
| |
Collapse
|
2
|
Khazen G, Gyulkhandanian A, Issa T, Maroun RC. Getting to know each other: PPIMem, a novel approach for predicting transmembrane protein-protein complexes. Comput Struct Biotechnol J 2021; 19:5184-5197. [PMID: 34630938 PMCID: PMC8476896 DOI: 10.1016/j.csbj.2021.09.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 08/23/2021] [Accepted: 09/12/2021] [Indexed: 02/03/2023] Open
Abstract
Because of their considerable number and diversity, membrane proteins and their macromolecular complexes represent the functional units of cells. Their quaternary structure may be stabilized by interactions between the α-helices of different proteins in the hydrophobic region of the cell membrane. Membrane proteins equally represent potential pharmacological targets par excellence for various diseases. Unfortunately, their experimental 3D structure and that of their complexes with other intramembrane protein partners are scarce due to technical difficulties. To overcome this key problem, we devised PPIMem, a computational approach for the specific prediction of higher-order structures of α-helical transmembrane proteins. The novel approach involves proper identification of the amino acid residues at the interface of molecular complexes with a 3D structure. The identified residues compose then nonlinear interaction motifs that are conveniently expressed as mathematical regular expressions. These are efficiently implemented for motif search in amino acid sequence databases, and for the accurate prediction of intramembrane protein-protein complexes. Our template interface-based approach predicted 21,544 binary complexes between 1,504 eukaryotic plasma membrane proteins across 39 species. We compare our predictions to experimental datasets of protein-protein interactions as a first validation method. The online database that results from the PPIMem algorithm with the annotated predicted interactions are implemented as a web server and can be accessed directly at https://transint.univ-evry.fr.
Collapse
Affiliation(s)
- Georges Khazen
- Computer Science and Mathematics Department, Lebanese American University, Byblos, Lebanon
| | - Aram Gyulkhandanian
- Inserm U1204/Université d'Evry/Université Paris-Saclay, Structure-Activité des Biomolécules Normales et Pathologiques, 91025 Evry, France
| | - Tina Issa
- Computer Science and Mathematics Department, Lebanese American University, Byblos, Lebanon
| | - Rachid C Maroun
- Inserm U1204/Université d'Evry/Université Paris-Saclay, Structure-Activité des Biomolécules Normales et Pathologiques, 91025 Evry, France
| |
Collapse
|
3
|
Poverennaya EV, Kiseleva OI, Ivanov AS, Ponomarenko EA. Methods of Computational Interactomics for Investigating Interactions of Human Proteoforms. BIOCHEMISTRY (MOSCOW) 2020; 85:68-79. [PMID: 32079518 DOI: 10.1134/s000629792001006x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Human genome contains ca. 20,000 protein-coding genes that could be translated into millions of unique protein species (proteoforms). Proteoforms coded by a single gene often have different functions, which implies different protein partners. By interacting with each other, proteoforms create a network reflecting the dynamics of cellular processes in an organism. Perturbations of protein-protein interactions change the network topology, which often triggers pathological processes. Studying proteoforms is a relatively new research area in proteomics, and this is why there are comparatively few experimental studies on the interaction of proteoforms. Bioinformatics tools can facilitate such studies by providing valuable complementary information to the experimental data and, in particular, expanding the possibilities of the studies of proteoform interactions.
Collapse
Affiliation(s)
| | - O I Kiseleva
- Institute of Biomedical Chemistry, Moscow, 119121, Russia
| | - A S Ivanov
- Institute of Biomedical Chemistry, Moscow, 119121, Russia
| | | |
Collapse
|
4
|
Theodosiou T, Papanikolaou N, Savvaki M, Bonetto G, Maxouri S, Fakoureli E, Eliopoulos AG, Tavernarakis N, Amoutzias GD, Pavlopoulos GA, Aivaliotis M, Nikoletopoulou V, Tzamarias D, Karagogeos D, Iliopoulos I. UniProt-Related Documents (UniReD): assisting wet lab biologists in their quest on finding novel counterparts in a protein network. NAR Genom Bioinform 2020; 2:lqaa005. [PMID: 33575553 PMCID: PMC7671407 DOI: 10.1093/nargab/lqaa005] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Revised: 01/20/2020] [Accepted: 01/31/2020] [Indexed: 02/04/2023] Open
Abstract
The in-depth study of protein–protein interactions (PPIs) is of key importance for understanding how cells operate. Therefore, in the past few years, many experimental as well as computational approaches have been developed for the identification and discovery of such interactions. Here, we present UniReD, a user-friendly, computational prediction tool which analyses biomedical literature in order to extract known protein associations and suggest undocumented ones. As a proof of concept, we demonstrate its usefulness by experimentally validating six predicted interactions and by benchmarking it against public databases of experimentally validated PPIs succeeding a high coverage. We believe that UniReD can become an important and intuitive resource for experimental biologists in their quest for finding novel associations within a protein network and a useful tool to complement experimental approaches (e.g. mass spectrometry) by producing sorted lists of candidate proteins for further experimental validation. UniReD is available at http://bioinformatics.med.uoc.gr/unired/
Collapse
Affiliation(s)
- Theodosios Theodosiou
- University of Crete, School of Medicine, Department of Basic Sciences, Heraklion 71003, Crete, Greece
| | - Nikolaos Papanikolaou
- University of Crete, School of Medicine, Department of Basic Sciences, Heraklion 71003, Crete, Greece
| | - Maria Savvaki
- University of Crete, School of Medicine, Department of Basic Sciences, Heraklion 71003, Crete, Greece.,Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, Nikolaou Plastira 100, 70013 Heraklion, Crete, Greece
| | - Giulia Bonetto
- University of Crete, School of Medicine, Department of Basic Sciences, Heraklion 71003, Crete, Greece
| | - Stella Maxouri
- University of Crete, School of Medicine, Department of Basic Sciences, Heraklion 71003, Crete, Greece.,Medical School of Patras University, Laboratory of General Biology, Asklipiou 1, 26500 Rio Patras, Greece
| | - Eirini Fakoureli
- University of Crete, School of Medicine, Department of Basic Sciences, Heraklion 71003, Crete, Greece
| | - Aristides G Eliopoulos
- Department of Biology, Medical School, National and Kapodistrian University of Athens, Mikras Asias 75, 11527 Athens, Greece
| | - Nektarios Tavernarakis
- University of Crete, School of Medicine, Department of Basic Sciences, Heraklion 71003, Crete, Greece.,Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, Nikolaou Plastira 100, 70013 Heraklion, Crete, Greece
| | - Grigoris D Amoutzias
- Bioinformatics Laboratory, Department of Biochemistry and Biotechnology, University of Thessaly, Larisa 41500, Greece
| | - Georgios A Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", 34 Fleming Street, 16672 Vari, Greece
| | - Michalis Aivaliotis
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, Nikolaou Plastira 100, 70013 Heraklion, Crete, Greece.,Laboratory of Biological Chemistry, Faculty of Health Sciences, School of Medicine, Aristotle University of Thessaloniki, GR-54124, Thessaloniki, Greece.,Functional Proteomics and Systems Biology (FunPATh), Center for Interdisciplinary Research and Innovation (CIRI-AUTH), Balkan Center, Thessaloniki, 10th km Thessaloniki-Thermi Rd, P.O.Box 8318, GR 57001, Greece
| | - Vasiliki Nikoletopoulou
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, Nikolaou Plastira 100, 70013 Heraklion, Crete, Greece
| | - Dimitris Tzamarias
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, Nikolaou Plastira 100, 70013 Heraklion, Crete, Greece
| | - Domna Karagogeos
- University of Crete, School of Medicine, Department of Basic Sciences, Heraklion 71003, Crete, Greece.,Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, Nikolaou Plastira 100, 70013 Heraklion, Crete, Greece
| | - Ioannis Iliopoulos
- University of Crete, School of Medicine, Department of Basic Sciences, Heraklion 71003, Crete, Greece
| |
Collapse
|
5
|
Lazar T, Guharoy M, Schad E, Tompa P. Unique Physicochemical Patterns of Residues in Protein–Protein Interfaces. J Chem Inf Model 2018; 58:2164-2173. [DOI: 10.1021/acs.jcim.8b00270] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- Tamas Lazar
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Mainak Guharoy
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Eva Schad
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudosok korutja 2, 1117 Budapest, Hungary
| | - Peter Tompa
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, Magyar tudosok korutja 2, 1117 Budapest, Hungary
| |
Collapse
|
6
|
Computational Resources for Predicting Protein-Protein Interactions. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2017; 110:251-275. [PMID: 29412998 DOI: 10.1016/bs.apcsb.2017.07.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Proteins are the essential building blocks and functional components of a cell. They account for the vital functions of an organism. Proteins interact with each other and form protein interaction networks. These protein interactions play a major role in all the biological processes and pathways. The previous methods of predicting protein interactions were experimental which focused on a small set of proteins or a particular protein. However, these experimental approaches are low-throughput as they are time-consuming and require a significant amount of human effort. This led to the development of computational techniques that uses high-throughput experimental data for analyzing protein-protein interactions. The main purpose of this review is to provide an overview on the computational advancements and tools for the prediction of protein interactions. The major databases for the deposition of these interactions are also described. The advantages, as well as the specific limitations of these tools, are highlighted which will shed light on the computational aspects that can help the biologist and researchers in their research.
Collapse
|
7
|
Li M, Lu Y, Wang J, Wu FX, Pan Y. A Topology Potential-Based Method for Identifying Essential Proteins from PPI Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:372-383. [PMID: 26357224 DOI: 10.1109/tcbb.2014.2361350] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Essential proteins are indispensable for cellular life. It is of great significance to identify essential proteins that can help us understand the minimal requirements for cellular life and is also very important for drug design. However, identification of essential proteins based on experimental approaches are typically time-consuming and expensive. With the development of high-throughput technology in the post-genomic era, more and more protein-protein interaction data can be obtained, which make it possible to study essential proteins from the network level. There have been a series of computational approaches proposed for predicting essential proteins based on network topologies. Most of these topology based essential protein discovery methods were to use network centralities. In this paper, we investigate the essential proteins' topological characters from a completely new perspective. To our knowledge it is the first time that topology potential is used to identify essential proteins from a protein-protein interaction (PPI) network. The basic idea is that each protein in the network can be viewed as a material particle which creates a potential field around itself and the interaction of all proteins forms a topological field over the network. By defining and computing the value of each protein's topology potential, we can obtain a more precise ranking which reflects the importance of proteins from the PPI network. The experimental results show that topology potential-based methods TP and TP-NC outperform traditional topology measures: degree centrality (DC), betweenness centrality (BC), closeness centrality (CC), subgraph centrality (SC), eigenvector centrality (EC), information centrality (IC), and network centrality (NC) for predicting essential proteins. In addition, these centrality measures are improved on their performance for identifying essential proteins in biological network when controlled by topology potential.
Collapse
|
8
|
Park JH, Park S, Yang JS, Kwon OS, Kim S, Jang SK. Discovery of cellular proteins required for the early steps of HCV infection using integrative genomics. PLoS One 2013; 8:e60333. [PMID: 23593195 PMCID: PMC3625227 DOI: 10.1371/journal.pone.0060333] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2012] [Accepted: 02/25/2013] [Indexed: 02/06/2023] Open
Abstract
Successful viral infection requires intimate communication between virus and host cell, a process that absolutely requires various host proteins. However, current efforts to discover novel host proteins as therapeutic targets for viral infection are difficult. Here, we developed an integrative-genomics approach to predict human genes involved in the early steps of hepatitis C virus (HCV) infection. By integrating HCV and human protein associations, co-expression data, and tight junction-tetraspanin web specific networks, we identified host proteins required for the early steps in HCV infection. Moreover, we validated the roles of newly identified proteins in HCV infection by knocking down their expression using small interfering RNAs. Specifically, a novel host factor CD63 was shown to directly interact with HCV E2 protein. We further demonstrated that an antibody against CD63 blocked HCV infection, indicating that CD63 may serve as a new therapeutic target for HCV-related diseases. The candidate gene list provides a source for identification of new therapeutic targets.
Collapse
Affiliation(s)
- Ji Hoon Park
- Division of Molecular and Life Science, Pohang University of Science and Technology, Pohang, Korea
| | - Solip Park
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, Pohang, Korea
| | - Jae-Seong Yang
- Division of Molecular and Life Science, Pohang University of Science and Technology, Pohang, Korea
| | - Oh Sung Kwon
- Division of Molecular and Life Science, Pohang University of Science and Technology, Pohang, Korea
| | - Sanguk Kim
- Division of Molecular and Life Science, Pohang University of Science and Technology, Pohang, Korea
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, Pohang, Korea
- Division of IT Convergence Engineering, Pohang University of Science and Technology, Pohang, Korea
- * E-mail: (SK); (SKJ)
| | - Sung Key Jang
- Division of Molecular and Life Science, Pohang University of Science and Technology, Pohang, Korea
- School of Interdisciplinary Bioscience and Bioengineering, Pohang University of Science and Technology, Pohang, Korea
- Division of Integrative Biosciences and Biotechnology, Pohang University of Science and Technology, Pohang, Korea
- Biotechnology Research Center, Pohang University of Science and Technology, Pohang, Korea
- * E-mail: (SK); (SKJ)
| |
Collapse
|
9
|
Woods NT, Mesquita RD, Sweet M, Carvalho MA, Li X, Liu Y, Nguyen H, Thomas CE, Iversen ES, Marsillac S, Karchin R, Koomen J, Monteiro ANA. Charting the landscape of tandem BRCT domain-mediated protein interactions. Sci Signal 2012; 5:rs6. [PMID: 22990118 DOI: 10.1126/scisignal.2002255] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Eukaryotic cells have evolved an intricate system to resolve DNA damage to prevent its transmission to daughter cells. This system, collectively known as the DNA damage response (DDR) network, includes many proteins that detect DNA damage, promote repair, and coordinate progression through the cell cycle. Because defects in this network can lead to cancer, this network constitutes a barrier against tumorigenesis. The modular BRCA1 carboxyl-terminal (BRCT) domain is frequently present in proteins involved in the DDR, can exist either as an individual domain or as tandem domains (tBRCT), and can bind phosphorylated peptides. We performed a systematic analysis of protein-protein interactions involving tBRCT in the DDR by combining literature curation, yeast two-hybrid screens, and tandem affinity purification coupled to mass spectrometry. We identified 23 proteins containing conserved BRCT domains and generated a human protein-protein interaction network for seven proteins with tBRCT. This study also revealed previously unknown components in DNA damage signaling, such as COMMD1 and the target of rapamycin complex mTORC2. Additionally, integration of tBRCT domain interactions with DDR phosphoprotein studies and analysis of kinase-substrate interactions revealed signaling subnetworks that may aid in understanding the involvement of tBRCT in disease and DNA repair.
Collapse
Affiliation(s)
- Nicholas T Woods
- Cancer Epidemiology Program, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL 33612, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Ceres N, Lavery R. Coarse-grain Protein Models. INNOVATIONS IN BIOMOLECULAR MODELING AND SIMULATIONS 2012. [DOI: 10.1039/9781849735049-00219] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Coarse-graining is a powerful approach for modeling biomolecules that, over the last few decades, has been extensively applied to proteins. Coarse-grain models offer access to large systems and to slow processes without becoming computationally unmanageable. In addition, they are very versatile, enabling both the protein representation and the energy function to be adapted to the biological problem in hand. This review concentrates on modeling soluble proteins and their assemblies. It presents an overview of the coarse-grain representations, of the associated interaction potentials, and of the optimization procedures used to define them. It then shows how coarse-grain models have been used to understand processes involving proteins, from their initial folding to their functional properties, their binary interactions, and the assembly of large complexes.
Collapse
Affiliation(s)
- N. Ceres
- Bases Moléculaires et Structurales des Systèmes Infectieux Université Lyon1/CNRS UMR 5086, IBCP, 7 Passage du Vercors, 69367, Lyon France
| | - R. Lavery
- Bases Moléculaires et Structurales des Systèmes Infectieux Université Lyon1/CNRS UMR 5086, IBCP, 7 Passage du Vercors, 69367, Lyon France
| |
Collapse
|
11
|
Pancaldi V, Saraç ÖS, Rallis C, McLean JR, Převorovský M, Gould K, Beyer A, Bähler J. Predicting the fission yeast protein interaction network. G3 (BETHESDA, MD.) 2012; 2:453-67. [PMID: 22540037 PMCID: PMC3337474 DOI: 10.1534/g3.111.001560] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2011] [Accepted: 01/31/2012] [Indexed: 12/03/2022]
Abstract
A systems-level understanding of biological processes and information flow requires the mapping of cellular component interactions, among which protein-protein interactions are particularly important. Fission yeast (Schizosaccharomyces pombe) is a valuable model organism for which no systematic protein-interaction data are available. We exploited gene and protein properties, global genome regulation datasets, and conservation of interactions between budding and fission yeast to predict fission yeast protein interactions in silico. We have extensively tested our method in three ways: first, by predicting with 70-80% accuracy a selected high-confidence test set; second, by recapitulating interactions between members of the well-characterized SAGA co-activator complex; and third, by verifying predicted interactions of the Cbf11 transcription factor using mass spectrometry of TAP-purified protein complexes. Given the importance of the pathway in cell physiology and human disease, we explore the predicted sub-networks centered on the Tor1/2 kinases. Moreover, we predict the histidine kinases Mak1/2/3 to be vital hubs in the fission yeast stress response network, and we suggest interactors of argonaute 1, the principal component of the siRNA-mediated gene silencing pathway, lost in budding yeast but preserved in S. pombe. Of the new high-quality interactions that were discovered after we started this work, 73% were found in our predictions. Even though any predicted interactome is imperfect, the protein network presented here can provide a valuable basis to explore biological processes and to guide wet-lab experiments in fission yeast and beyond. Our predicted protein interactions are freely available through PInt, an online resource on our website (www.bahlerlab.info/PInt).
Collapse
Affiliation(s)
- Vera Pancaldi
- Department of Genetics, Evolution, and Environment and
- UCL Cancer Institute, University College London, London WC1E 6BT, United Kingdom
| | - Ömer S. Saraç
- Cellular Networks and Systems Biology, Biotechnology Center, Dresden University of Technology (TU Dresden), Dresden 01307, Germany, and
| | - Charalampos Rallis
- Department of Genetics, Evolution, and Environment and
- UCL Cancer Institute, University College London, London WC1E 6BT, United Kingdom
| | - Janel R. McLean
- Howard Hughes Medical Institute
- Department of Cell and Developmental Biology, Vanderbilt University School of Medicine, Nashville, Tennessee 37232
| | - Martin Převorovský
- Department of Genetics, Evolution, and Environment and
- UCL Cancer Institute, University College London, London WC1E 6BT, United Kingdom
| | - Kathleen Gould
- Howard Hughes Medical Institute
- Department of Cell and Developmental Biology, Vanderbilt University School of Medicine, Nashville, Tennessee 37232
| | - Andreas Beyer
- Cellular Networks and Systems Biology, Biotechnology Center, Dresden University of Technology (TU Dresden), Dresden 01307, Germany, and
| | - Jürg Bähler
- Department of Genetics, Evolution, and Environment and
- UCL Cancer Institute, University College London, London WC1E 6BT, United Kingdom
| |
Collapse
|
12
|
Abstract
The time-controlled transcardiac perfusion crosslinking (tcTPC) method differs from conventional perfusion fixation in that the crosslinking reagent is administered throughout the circulatory system for only a relatively short period of time, thereby allowing limited crosslinking to occur. Bait protein complexes are isolated by affinity capture (AC) under stringent conditions and are recovered from the AC matrix by acidic elution. Affinity-purified proteins are reduced, alkylated, and digested with a specific endoproteinase, such as trypsin. Subsequently, peptides are isotopically labeled, separated by reversed-phase chromatography and analyzed by quantitative tandem mass spectrometry (MS/MS). The proteins crosslinked to the bait protein during tcTPC are identified by database searches with conventional protein identification software. The tcTPC strategy offers unique advantages over alternative approaches for studying a subset of protein complexes which require a particular environment for their structural integrity, such as membrane protein complexes that are notorious for their tendency to dissociate upon detergent solubilization. The sensitivity and utility of this method are influenced by the spatial distribution of chemical groups within the bait protein complexes that can engage in productive crosslinks.
Collapse
|
13
|
Clarke D, Bhardwaj N, Gerstein MB. Novel insights through the integration of structural and functional genomics data with protein networks. J Struct Biol 2012; 179:320-6. [PMID: 22343087 DOI: 10.1016/j.jsb.2012.02.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2011] [Revised: 02/02/2012] [Accepted: 02/02/2012] [Indexed: 12/13/2022]
Abstract
In recent years, major advances in genomics, proteomics, macromolecular structure determination, and the computational resources capable of processing and disseminating the large volumes of data generated by each have played major roles in advancing a more systems-oriented appreciation of biological organization. One product of systems biology has been the delineation of graph models for describing genome-wide protein-protein interaction networks. The network organization and topology which emerges in such models may be used to address fundamental questions in an array of cellular processes, as well as biological features intrinsic to the constituent proteins (or "nodes") themselves. However, graph models alone constitute an abstraction which neglects the underlying biological and physical reality that the network's nodes and edges are highly heterogeneous entities. Here, we explore some of the advantages of introducing a protein structural dimension to such models, as the marriage of conventional network representations with macromolecular structural data helps to place static node and edge constructs in a biologically more meaningful context. We emphasize that 3D protein structures constitute a valuable conceptual and predictive framework by discussing examples of the insights provided, such as enabling in silico predictions of protein-protein interactions, providing rational and compelling classification schemes for network elements, as well as revealing interesting intrinsic differences between distinct node types, such as disorder and evolutionary features, which may then be rationalized in light of their respective functions within networks.
Collapse
Affiliation(s)
- Declan Clarke
- Department of Chemistry, Yale University, New Haven, CT 06520, USA
| | | | | |
Collapse
|
14
|
Mapping of Chikungunya virus interactions with host proteins identified nsP2 as a highly connected viral component. J Virol 2012; 86:3121-34. [PMID: 22258240 DOI: 10.1128/jvi.06390-11] [Citation(s) in RCA: 93] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Chikungunya virus (CHIKV) is a mosquito-transmitted alphavirus that has been responsible for an epidemic outbreak of unprecedented magnitude in recent years. Since then, significant efforts have been made to better understand the biology of this virus, but we still have poor knowledge of CHIKV interactions with host cell components at the molecular level. Here we describe the extensive use of high-throughput yeast two-hybrid (HT-Y2H) assays to characterize interactions between CHIKV and human proteins. A total of 22 high-confidence interactions, which essentially involved the viral nonstructural protein nsP2, were identified and further validated in protein complementation assay (PCA). These results were integrated to a larger network obtained by extensive mining of the literature for reports on alphavirus-host interactions. To investigate the role of cellular proteins interacting with nsP2, gene silencing experiments were performed in cells infected by a recombinant CHIKV expressing Renilla luciferase as a reporter. Collected data showed that heterogeneous nuclear ribonucleoprotein K (hnRNP-K) and ubiquilin 4 (UBQLN4) participate in CHIKV replication in vitro. In addition, we showed that CHIKV nsP2 induces a cellular shutoff, as previously reported for other Old World alphaviruses, and determined that among binding partners identified by yeast two-hybrid methods, the tetratricopeptide repeat protein 7B (TTC7B) plays a significant role in this activity. Altogether, this report provides the first interaction map between CHIKV and human proteins and describes new host cell proteins involved in the replication cycle of this virus.
Collapse
|
15
|
Systematic control of protein interactions for systems biology. Proc Natl Acad Sci U S A 2011; 108:20279-80. [PMID: 22160691 DOI: 10.1073/pnas.1118084109] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
16
|
Zhou P, Tian F, Ren Y, Shang Z. Systematic classification and analysis of themes in protein-DNA recognition. J Chem Inf Model 2010; 50:1476-88. [PMID: 20726602 DOI: 10.1021/ci100145d] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Protein-DNA recognition plays a central role in the regulation of gene expression. With the rapidly increasing number of protein-DNA complex structures available at atomic resolution in recent years, a systematic, complete, and intuitive framework to clarify the intrinsic relationship between the global binding modes of these complexes is needed. In this work, we modified, extended, and applied previously defined RNA-recognition themes to describe protein-DNA recognition and used a protocol that incorporates automatic methods into manual inspection to plant a comprehensive classification tree for currently available high-quality protein-DNA structures. Further, a nonredundant (representative) data set consisting of 200 thematically diverse complexes was extracted from the leaves of the classification tree by using a locally sensitive interface comparison algorithm. On the basis of the representative data set, various physical and chemical properties associated with protein-DNA interactions were analyzed using empirical or semiempirical methods. We also examined the individual energetic components involved in protein-DNA interactions and highlighted the importance of conformational entropy, which has been almost completely ignored in previous studies of protein-DNA binding energy.
Collapse
Affiliation(s)
- Peng Zhou
- Department of Chemistry, Zhejiang University, Hangzhou 310027, China, College of Bioengineering, Chongqing University, Chongqing 400044, China
| | | | | | | |
Collapse
|
17
|
Cornell M, Paton NW, Oliver SG. A critical and integrated view of the yeast interactome. Comp Funct Genomics 2010; 5:382-402. [PMID: 18629175 PMCID: PMC2447467 DOI: 10.1002/cfg.412] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2003] [Revised: 04/23/2004] [Accepted: 05/14/2004] [Indexed: 11/08/2022] Open
Abstract
Global studies of protein–protein interactions are crucial to both elucidating gene
function and producing an integrated view of the workings of living cells. High-throughput
studies of the yeast interactome have been performed using both genetic
and biochemical screens. Despite their size, the overlap between these experimental
datasets is very limited. This could be due to each approach sampling only a small
fraction of the total interactome. Alternatively, a large proportion of the data from
these screens may represent false-positive interactions. We have used the Genome
Information Management System (GIMS) to integrate interactome datasets with
transcriptome and protein annotation data and have found significant evidence that
the proportion of false-positive results is high. Not all high-throughput datasets are
similarly contaminated, and the tandem affinity purification (TAP) approach appears
to yield a high proportion of reliable interactions for which corroborating evidence
is available. From our integrative analyses, we have generated a set of verified
interactome data for yeast.
Collapse
Affiliation(s)
- Michael Cornell
- Department of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester M13 9PL, UK.
| | | | | |
Collapse
|
18
|
Aloy P, Russell RB. Understanding and predicting protein assemblies with 3D structures. Comp Funct Genomics 2010; 4:410-5. [PMID: 18629088 PMCID: PMC2447374 DOI: 10.1002/cfg.310] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2003] [Revised: 06/03/2003] [Accepted: 06/03/2003] [Indexed: 01/08/2023] Open
Abstract
Protein interactions are central to most biological processes, and are currently the subject of great interest. Yet despite the many recently developed methods for
interaction discovery, little attention has been paid to one of the best sources of
data: complexes of known three-dimensional (3D) structure. Here we discuss how
such complexes can be used to study and predict protein interactions and complexes,
and to interrogate interaction networks proposed by methods such as two-hybrid
screens or affinity purifications.
Collapse
Affiliation(s)
- Patrick Aloy
- EMBL, Meyerhofstrasse 1, Heidelberg D69117, Germany
| | | |
Collapse
|
19
|
Tuncbag N, Kar G, Gursoy A, Keskin O, Nussinov R. Towards inferring time dimensionality in protein-protein interaction networks by integrating structures: the p53 example. MOLECULAR BIOSYSTEMS 2010; 5:1770-8. [PMID: 19585003 PMCID: PMC2898629 DOI: 10.1039/b905661k] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Structural data, efficient structural comparison algorithms and appropriate datasets and filters can assist in getting an insight into time dimensionality in interaction networks; in predicting which interactions can and cannot co-exist; and in obtaining concrete predictions consistent with experiment.
Inspection of protein–protein interaction maps illustrates that a hub protein can interact with a very large number of proteins, reaching tens and even hundreds. Since a single protein cannot interact with such a large number of partners at the same time, this presents a challenge: can we figure out which interactions can occur simultaneously and which are mutually excluded? Addressing this question adds a fourth dimension into interaction maps: that of time. Including the time dimension in structural networks is an immense asset; time dimensionality transforms network node-and-edge maps into cellular processes, assisting in the comprehension of cellular pathways and their regulation. While the time dimensionality can be further enhanced by linking protein complexes to time series of mRNA expression data, current robust, network experimental data are lacking. Here we outline how, using structural data, efficient structural comparison algorithms and appropriate datasets and filters can assist in getting an insight into time dimensionality in interaction networks; in predicting which interactions can and cannot co-exist; and in obtaining concrete predictions consistent with experiment. As an example, we present p53-linked processes.
Collapse
Affiliation(s)
- Nurcan Tuncbag
- Koc University, Center for Computational Biology and Bioinformatics, College of Engineering, Rumelifeneri Yolu, 34450 Sariyer Istanbul, Turkey
| | | | | | | | | |
Collapse
|
20
|
Mendez-Rios J, Uetz P. Global approaches to study protein-protein interactions among viruses and hosts. Future Microbiol 2010; 5:289-301. [PMID: 20143950 DOI: 10.2217/fmb.10.7] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
While high-throughput protein-protein interaction screens were first published approximately 10 years ago, systematic attempts to map interactions among viruses and hosts started only a few years ago. HIV-human interactions dominate host-pathogen interaction databases (with approximately 2000 interactions) despite the fact that probably none of these interactions have been identified in systematic interaction screens. Recently, combinations of protein interaction data with RNAi and other functional genomics data allowed researchers to model more complex interaction networks. The rapid progress in this area promises a flood of new data in the near future, with clinical applications as soon as structural and functional genomics catches up with next-generation sequencing of human variation and structure-based drug design.
Collapse
Affiliation(s)
- Jorge Mendez-Rios
- J Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA.
| | | |
Collapse
|
21
|
Shi T, Weerasekera R, Yan C, Reginold W, Ball H, Kislinger T, Schmitt-Ulms G. Method for the Affinity Purification of Covalently Linked Peptides Following Cyanogen Bromide Cleavage of Proteins. Anal Chem 2009; 81:9885-95. [DOI: 10.1021/ac901373q] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Tujin Shi
- Centre for Research in Neurodegenerative Diseases, University of Toronto, Toronto, Ontario, Canada, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada, Department of Biochemistry, University of Texas Southwestern Medical School, Dallas, Texas, and Division of Cancer Genomics and Proteomics, Ontario Cancer Institute, Toronto, Ontario, Canada
| | - Rasanjala Weerasekera
- Centre for Research in Neurodegenerative Diseases, University of Toronto, Toronto, Ontario, Canada, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada, Department of Biochemistry, University of Texas Southwestern Medical School, Dallas, Texas, and Division of Cancer Genomics and Proteomics, Ontario Cancer Institute, Toronto, Ontario, Canada
| | - Chen Yan
- Centre for Research in Neurodegenerative Diseases, University of Toronto, Toronto, Ontario, Canada, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada, Department of Biochemistry, University of Texas Southwestern Medical School, Dallas, Texas, and Division of Cancer Genomics and Proteomics, Ontario Cancer Institute, Toronto, Ontario, Canada
| | - William Reginold
- Centre for Research in Neurodegenerative Diseases, University of Toronto, Toronto, Ontario, Canada, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada, Department of Biochemistry, University of Texas Southwestern Medical School, Dallas, Texas, and Division of Cancer Genomics and Proteomics, Ontario Cancer Institute, Toronto, Ontario, Canada
| | - Haydn Ball
- Centre for Research in Neurodegenerative Diseases, University of Toronto, Toronto, Ontario, Canada, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada, Department of Biochemistry, University of Texas Southwestern Medical School, Dallas, Texas, and Division of Cancer Genomics and Proteomics, Ontario Cancer Institute, Toronto, Ontario, Canada
| | - Thomas Kislinger
- Centre for Research in Neurodegenerative Diseases, University of Toronto, Toronto, Ontario, Canada, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada, Department of Biochemistry, University of Texas Southwestern Medical School, Dallas, Texas, and Division of Cancer Genomics and Proteomics, Ontario Cancer Institute, Toronto, Ontario, Canada
| | - Gerold Schmitt-Ulms
- Centre for Research in Neurodegenerative Diseases, University of Toronto, Toronto, Ontario, Canada, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Ontario, Canada, Department of Biochemistry, University of Texas Southwestern Medical School, Dallas, Texas, and Division of Cancer Genomics and Proteomics, Ontario Cancer Institute, Toronto, Ontario, Canada
| |
Collapse
|
22
|
Abu-Farha M, Elisma F, Zhou H, Tian R, Zhou H, Asmer MS, Figeys D. Proteomics: From Technology Developments to Biological Applications. Anal Chem 2009; 81:4585-99. [PMID: 19371061 DOI: 10.1021/ac900735j] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Mohamed Abu-Farha
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Fred Elisma
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Houjiang Zhou
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Ruijun Tian
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Hu Zhou
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Mehmet Selim Asmer
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| | - Daniel Figeys
- Ottawa Institute of Systems Biology (OISB), University of Ottawa, Ottawa, Ontario, Canada, and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|
23
|
Fornes O, Aragues R, Espadaler J, Marti-Renom MA, Sali A, Oliva B. ModLink+: improving fold recognition by using protein-protein interactions. ACTA ACUST UNITED AC 2009; 25:1506-12. [PMID: 19357100 DOI: 10.1093/bioinformatics/btp238] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
MOTIVATION Several strategies have been developed to predict the fold of a target protein sequence, most of which are based on aligning the target sequence to other sequences of known structure. Previously, we demonstrated that the consideration of protein-protein interactions significantly increases the accuracy of fold assignment compared with PSI-BLAST sequence comparisons. A drawback of our method was the low number of proteins to which a fold could be assigned. Here, we present an improved version of the method that addresses this limitation. We also compare our method to other state-of-the-art fold assignment methodologies. RESULTS Our approach (ModLink+) has been tested on 3716 proteins with domain folds classified in the Structural Classification Of Proteins (SCOP) as well as known interacting partners in the Database of Interacting Proteins (DIP). For this test set, the ratio of success [positive predictive value (PPV)] on fold assignment increases from 75% for PSI-BLAST, 83% for HHSearch and 81% for PRC to >90% for ModLink+at the e-value cutoff of 10(-3). Under this e-value, ModLink+can assign a fold to 30-45% of the proteins in the test set, while our previous method could cover <25%. When applied to 6384 proteins with unknown fold in the yeast proteome, ModLink+combined with PSI-BLAST assigns a fold for domains in 3738 proteins, while PSI-BLAST alone covers only 2122 proteins, HHSearch 2969 and PRC 2826 proteins, using a threshold e-value that would represent a PPV >82% for each method in the test set. AVAILABILITY The ModLink+server is freely accessible in the World Wide Web at http://sbi.imim.es/modlink/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Oriol Fornes
- Structural Bioinformatics Lab (GRIB-IMIM), Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona (PRBB), Barcelona, Catalonia, Spain.
| | | | | | | | | | | |
Collapse
|
24
|
Shin CJ, Wong S, Davis MJ, Ragan MA. Protein-protein interaction as a predictor of subcellular location. BMC SYSTEMS BIOLOGY 2009; 3:28. [PMID: 19243629 PMCID: PMC2663780 DOI: 10.1186/1752-0509-3-28] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2008] [Accepted: 02/25/2009] [Indexed: 11/10/2022]
Abstract
Background Many biological processes are mediated by dynamic interactions between and among proteins. In order to interact, two proteins must co-occur spatially and temporally. As protein-protein interactions (PPIs) and subcellular location (SCL) are discovered via separate empirical approaches, PPI and SCL annotations are independent and might complement each other in helping us to understand the role of individual proteins in cellular networks. We expect reliable PPI annotations to show that proteins interacting in vivo are co-located in the same cellular compartment. Our goal here is to evaluate the potential of using PPI annotation in determining SCL of proteins in human, mouse, fly and yeast, and to identify and quantify the factors that contribute to this complementarity. Results Using publicly available data, we evaluate the hypothesis that interacting proteins must be co-located within the same subcellular compartment. Based on a large, manually curated PPI dataset, we demonstrate that a substantial proportion of interacting proteins are in fact co-located. We develop an approach to predict the SCL of a protein based on the SCL of its interaction partners, given sufficient confidence in the interaction itself. The frequency of false positive PPIs can be reduced by use of six lines of supporting evidence, three based on type of recorded evidence (empirical approach, multiplicity of databases, and multiplicity of literature citations) and three based on type of biological evidence (inferred biological process, domain-domain interactions, and orthology relationships), with biological evidence more-effective than recorded evidence. Our approach performs better than four existing prediction methods in identifying the SCL of membrane proteins, and as well as or better for soluble proteins. Conclusion Understanding cellular systems requires knowledge of the SCL of interacting proteins. We show how PPI data can be used more effectively to yield reliable SCL predictions for both soluble and membrane proteins. Scope exists for further improvement in our understanding of cellular function through consideration of the biological context of molecular interactions.
Collapse
Affiliation(s)
- Chang Jin Shin
- The University of Queensland, Institute for Molecular Bioscience, and ARC Centre of Excellence in Bioinformatics, QLD, Australia.
| | | | | | | |
Collapse
|
25
|
Topological properties of protein interaction networks from a structural perspective. Biochem Soc Trans 2009; 36:1398-403. [PMID: 19021563 DOI: 10.1042/bst0361398] [Citation(s) in RCA: 101] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Protein-protein interactions are usually shown as interaction networks (graphs), where the proteins are represented as nodes and the connections between the interacting proteins are shown as edges. The graph abstraction of protein interactions is crucial for understanding the global behaviour of the network. In this mini review, we summarize basic graph topological properties, such as node degree and betweenness, and their relation to essentiality and modularity of protein interactions. The classification of hub proteins into date and party hubs with distinct properties has significant implications for relating topological properties to the behaviour of the network. We emphasize that the integration of protein interface structure into interaction graph models provides a better explanation of hub proteins, and strengthens the relationship between the role of the hubs in the cell and their topological properties.
Collapse
|
26
|
Gerber D, Maerkl SJ, Quake SR. An in vitro microfluidic approach to generating protein-interaction networks. Nat Methods 2009; 6:71-4. [PMID: 19098921 PMCID: PMC4117197 DOI: 10.1038/nmeth.1289] [Citation(s) in RCA: 129] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2008] [Accepted: 11/11/2008] [Indexed: 01/26/2023]
Abstract
We developed an in vitro protein expression and interaction analysis platform based on a highly parallel and sensitive microfluidic affinity assay, and used it for 14,792 on-chip experiments, which exhaustively measured the protein-protein interactions of 43 Streptococcus pneumoniae proteins in quadruplicate. The resulting network of 157 interactions was denser than expected based on known networks. Analysis of the network revealed previously undescribed physical interactions among members of some biochemical pathways.
Collapse
Affiliation(s)
- Doron Gerber
- Department of Bioengineering, Stanford University and Howard Hughes Medical Institute, 318 Campus Drive, Stanford, CA 94305, USA
| | | | | |
Collapse
|
27
|
Yoon HK, Sohn KC, Lee JS, Kim YJ, Bhak J, Yang JM, You KH, Kim CD, Lee JH. Prediction and evaluation of protein–protein interaction in keratinocyte differentiation. Biochem Biophys Res Commun 2008; 377:662-667. [DOI: 10.1016/j.bbrc.2008.10.051] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2008] [Accepted: 10/10/2008] [Indexed: 11/29/2022]
|
28
|
Krycer JR, Pang CNI, Wilkins MR. High throughput protein-protein interaction data: clues for the architecture of protein complexes. Proteome Sci 2008; 6:32. [PMID: 19032795 PMCID: PMC2621150 DOI: 10.1186/1477-5956-6-32] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2008] [Accepted: 11/26/2008] [Indexed: 11/23/2022] Open
Abstract
Background High-throughput techniques are becoming widely used to study protein-protein interactions and protein complexes on a proteome-wide scale. Here we have explored the potential of these techniques to accurately determine the constituent proteins of complexes and their architecture within the complex. Results Two-dimensional representations of the 19S and 20S proteasome, mediator, and SAGA complexes were generated and overlaid with high quality pairwise interaction data, core-module-attachment classifications from affinity purifications of complexes and predicted domain-domain interactions. Pairwise interaction data could accurately determine the members of each complex, but was unexpectedly poor at deciphering the topology of proteins in complexes. Core and module data from affinity purification studies were less useful for accurately defining the member proteins of these complexes. However, these data gave strong information on the spatial proximity of many proteins. Predicted domain-domain interactions provided some insight into the topology of proteins within complexes, but was affected by a lack of available structural data for the co-activator complexes and the presence of shared domains in paralogous proteins. Conclusion The constituent proteins of complexes are likely to be determined with accuracy by combining data from high-throughput techniques. The topology of some proteins in the complexes will be able to be clearly inferred. We finally suggest strategies that can be employed to use high throughput interaction data to define the membership and understand the architecture of proteins in novel complexes.
Collapse
|
29
|
Cheng TMK, Blundell TL, Fernandez-Recio J. Structural assembly of two-domain proteins by rigid-body docking. BMC Bioinformatics 2008; 9:441. [PMID: 18925951 PMCID: PMC2579442 DOI: 10.1186/1471-2105-9-441] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2008] [Accepted: 10/16/2008] [Indexed: 11/25/2022] Open
Abstract
Background Modelling proteins with multiple domains is one of the central challenges in Structural Biology. Although homology modelling has successfully been applied for prediction of protein structures, very often domain-domain interactions cannot be inferred from the structures of homologues and their prediction requires ab initio methods. Here we present a new structural prediction approach for modelling two-domain proteins based on rigid-body domain-domain docking. Results Here we focus on interacting domain pairs that are part of the same peptide chain and thus have an inter-domain peptide region (so called linker). We have developed a method called pyDockTET (tethered-docking), which uses rigid-body docking to generate domain-domain poses that are further scored by binding energy and a pseudo-energy term based on restraints derived from linker end-to-end distances. The method has been benchmarked on a set of 77 non-redundant pairs of domains with available X-ray structure. We have evaluated the docking method ZDOCK, which is able to generate acceptable domain-domain orientations in 51 out of the 77 cases. Among them, our method pyDockTET finds the correct assembly within the top 10 solutions in over 60% of the cases. As a further test, on a subset of 20 pairs where domains were built by homology modelling, ZDOCK generates acceptable orientations in 13 out of the 20 cases, among which the correct assembly is ranked lower than 10 in around 70% of the cases by our pyDockTET method. Conclusion Our results show that rigid-body docking approach plus energy scoring and linker-based restraints are useful for modelling domain-domain interactions. These positive results will encourage development of new methods for structural prediction of macromolecules with multiple (more than two) domains.
Collapse
Affiliation(s)
- Tammy M K Cheng
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK.
| | | | | |
Collapse
|
30
|
Pitre S, North C, Alamgir M, Jessulat M, Chan A, Luo X, Green JR, Dumontier M, Dehne F, Golshani A. Global investigation of protein-protein interactions in yeast Saccharomyces cerevisiae using re-occurring short polypeptide sequences. Nucleic Acids Res 2008; 36:4286-94. [PMID: 18586826 PMCID: PMC2490765 DOI: 10.1093/nar/gkn390] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
Protein–protein interaction (PPI) maps provide insight into cellular biology and have received considerable attention in the post-genomic era. While large-scale experimental approaches have generated large collections of experimentally determined PPIs, technical limitations preclude certain PPIs from detection. Recently, we demonstrated that yeast PPIs can be computationally predicted using re-occurring short polypeptide sequences between known interacting protein pairs. However, the computational requirements and low specificity made this method unsuitable for large-scale investigations. Here, we report an improved approach, which exhibits a specificity of ∼99.95% and executes 16 000 times faster. Importantly, we report the first all-to-all sequence-based computational screen of PPIs in yeast, Saccharomyces cerevisiae in which we identify 29 589 high confidence interactions of ∼2 × 107 possible pairs. Of these, 14 438 PPIs have not been previously reported and may represent novel interactions. In particular, these results reveal a richer set of membrane protein interactions, not readily amenable to experimental investigations. From the novel PPIs, a novel putative protein complex comprised largely of membrane proteins was revealed. In addition, two novel gene functions were predicted and experimentally confirmed to affect the efficiency of non-homologous end-joining, providing further support for the usefulness of the identified PPIs in biological investigations.
Collapse
Affiliation(s)
- S Pitre
- School of Computer Science, Carleton University, Ottawa, Canada
| | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Signaling networks during development: the case of asymmetric cell division in the Drosophila nervous system. Dev Biol 2008; 321:1-17. [PMID: 18586022 DOI: 10.1016/j.ydbio.2008.06.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2008] [Revised: 06/03/2008] [Accepted: 06/04/2008] [Indexed: 11/22/2022]
Abstract
Remarkable progress in genetics and molecular biology has made possible the sequencing of the genomes from numerous species. In the post-genomic era, technical developments in the fields of proteomics and bioinformatics are poised to further catapult our understanding of protein structure, function and organization into complex signaling networks. One of the greatest challenges in the field now is to unravel the functional signaling networks and their spatio-temporal regulation in living cells. Here, the need for such in vivo system-wide level approach is illustrated in relation to the mechanisms that underlie the biological process of asymmetric cell division. Genomic, post-genomic and live imaging techniques are reviewed in light of the huge impact they are having on this field for the discovering of new proteins and for the in vivo analysis of asymmetric cell division. The proteins, signals and the emerging networking of functional connections that is arising between them during this process in the Drosophila nervous system will be also discussed.
Collapse
|
32
|
Abstract
Interactive proteomics addresses the physical associations among proteins and establishes global, disease-, and pathway-specific protein interaction networks. The inherent chemical and structural diversity of proteins, their different expression levels, and their distinct subcellular localizations pose unique challenges for the exploration of these networks, necessitating the use of a variety of innovative and ingenious approaches. Consequently, recent years have seen exciting developments in protein interaction mapping and the establishment of very large interaction networks, especially in model organisms. In the near future, attention will shift to the establishment of interaction networks in humans and their application in drug discovery and understanding of diseases. In this review, we present an impressive toolbox of different technologies that we expect to be crucial for interactive proteomics in the coming years.
Collapse
|
33
|
Harrington ED, Jensen LJ, Bork P. Predicting biological networks from genomic data. FEBS Lett 2008; 582:1251-8. [PMID: 18294967 DOI: 10.1016/j.febslet.2008.02.033] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2008] [Accepted: 02/13/2008] [Indexed: 12/27/2022]
Abstract
Continuing improvements in DNA sequencing technologies are providing us with vast amounts of genomic data from an ever-widening range of organisms. The resulting challenge for bioinformatics is to interpret this deluge of data and place it back into its biological context. Biological networks provide a conceptual framework with which we can describe part of this context, namely the different interactions that occur between the molecular components of a cell. Here, we review the computational methods available to predict biological networks from genomic sequence data and discuss how they relate to high-throughput experimental methods.
Collapse
Affiliation(s)
- Eoghan D Harrington
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, D-69117 Heidelberg, Germany
| | | | | |
Collapse
|
34
|
Pitre S, Alamgir M, Green JR, Dumontier M, Dehne F, Golshani A. Computational methods for predicting protein-protein interactions. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2008; 110:247-67. [PMID: 18202838 DOI: 10.1007/10_2007_089] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Protein-protein interactions (PPIs) play a critical role in many cellular functions. A number of experimental techniques have been applied to discover PPIs; however, these techniques are expensive in terms of time, money, and expertise. There are also large discrepancies between the PPI data collected by the same or different techniques in the same organism. We therefore turn to computational techniques for the prediction of PPIs. Computational techniques have been applied to the collection, indexing, validation, analysis, and extrapolation of PPI data. This chapter will focus on computational prediction of PPI, reviewing a number of techniques including PIPE, developed in our own laboratory. For comparison, the conventional large-scale approaches to predict PPIs are also briefly discussed. The chapter concludes with a discussion of the limitations of both experimental and computational methods of determining PPIs.
Collapse
Affiliation(s)
- Sylvain Pitre
- School of Computer Science, Carleton University, 5304 Herzberg Building, 1125 Colonel By Drive, K1S 5B6, Ottawa, Ontario, Canada
| | | | | | | | | | | |
Collapse
|
35
|
Weerasekera R, She YM, Markham KA, Bai Y, Opalka N, Orlicky S, Sicheri F, Kislinger T, Schmitt-Ulms G. Interactome and interface protocol (2IP): A novel strategy for high sensitivity topology mapping of protein complexes. Proteomics 2007; 7:3835-52. [DOI: 10.1002/pmic.200700688] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
36
|
Heuser P, Schomburg D. Combination of scoring schemes for protein docking. BMC Bioinformatics 2007; 8:279. [PMID: 17678526 PMCID: PMC1978211 DOI: 10.1186/1471-2105-8-279] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2007] [Accepted: 08/01/2007] [Indexed: 12/02/2022] Open
Abstract
Background Docking algorithms are developed to predict in which orientation two proteins are likely to bind under natural conditions. The currently used methods usually consist of a sampling step followed by a scoring step. We developed a weighted geometric correlation based on optimised atom specific weighting factors and combined them with our previously published amino acid specific scoring and with a comprehensive SVM-based scoring function. Results The scoring with the atom specific weighting factors yields better results than the amino acid specific scoring. In combination with SVM-based scoring functions the percentage of complexes for which a near native structure can be predicted within the top 100 ranks increased from 14% with the geometric scoring to 54% with the combination of all scoring functions. Especially for the enzyme-inhibitor complexes the results of the ranking are excellent. For half of these complexes a near-native structure can be predicted within the first 10 proposed structures and for more than 86% of all enzyme-inhibitor complexes within the first 50 predicted structures. Conclusion We were able to develop a combination of different scoring schemes which considers a series of previously described and some new scoring criteria yielding a remarkable improvement of prediction quality.
Collapse
Affiliation(s)
- Philipp Heuser
- Cologne University Bioinformatics Center (CUBIC), University of Cologne, Zuelpicher Str. 47, 50674 Koeln, Germany
| | - Dietmar Schomburg
- Cologne University Bioinformatics Center (CUBIC), University of Cologne, Zuelpicher Str. 47, 50674 Koeln, Germany
| |
Collapse
|
37
|
Linding R, Jensen LJ, Ostheimer GJ, van Vugt MA, Jørgensen C, Miron IM, Diella F, Colwill K, Taylor L, Elder K, Metalnikov P, Nguyen V, Pasculescu A, Jin J, Park JG, Samson LD, Woodgett JR, Russell RB, Bork P, Yaffe MB, Pawson T. Systematic discovery of in vivo phosphorylation networks. Cell 2007; 129:1415-26. [PMID: 17570479 PMCID: PMC2692296 DOI: 10.1016/j.cell.2007.05.052] [Citation(s) in RCA: 588] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2006] [Revised: 04/24/2007] [Accepted: 05/30/2007] [Indexed: 01/23/2023]
Abstract
Protein kinases control cellular decision processes by phosphorylating specific substrates. Thousands of in vivo phosphorylation sites have been identified, mostly by proteome-wide mapping. However, systematically matching these sites to specific kinases is presently infeasible, due to limited specificity of consensus motifs, and the influence of contextual factors, such as protein scaffolds, localization, and expression, on cellular substrate specificity. We have developed an approach (NetworKIN) that augments motif-based predictions with the network context of kinases and phosphoproteins. The latter provides 60%-80% of the computational capability to assign in vivo substrate specificity. NetworKIN pinpoints kinases responsible for specific phosphorylations and yields a 2.5-fold improvement in the accuracy with which phosphorylation networks can be constructed. Applying this approach to DNA damage signaling, we show that 53BP1 and Rad50 are phosphorylated by CDK1 and ATM, respectively. We describe a scalable strategy to evaluate predictions, which suggests that BCLAF1 is a GSK-3 substrate.
Collapse
Affiliation(s)
- Rune Linding
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
- Center for Cancer Research, Massachusetts Institute of Technology, Cambridge, USA
| | | | - Gerard J. Ostheimer
- Center for Cancer Research, Massachusetts Institute of Technology, Cambridge, USA
- Center for Environmental Health Sciences, Massachusetts Institute of Technology, Cambridge, USA
| | - Marcel A.T.M. van Vugt
- Center for Cancer Research, Massachusetts Institute of Technology, Cambridge, USA
- Department of Cell Biology and Genetics, Erasmus University, Rotterdam, The Netherlands
| | - Claus Jørgensen
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Ioana M. Miron
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | | | - Karen Colwill
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Lorne Taylor
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Kelly Elder
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Pavel Metalnikov
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Vivian Nguyen
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Adrian Pasculescu
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Jing Jin
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Jin Gyoon Park
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Leona D. Samson
- Center for Environmental Health Sciences, Massachusetts Institute of Technology, Cambridge, USA
| | - James R. Woodgett
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | | | - Peer Bork
- European Molecular Biology Laboratory, Heidelberg, Germany
- Max-Delbrück-Centre for Molecular Medicine, Berlin, Germany
| | - Michael B. Yaffe
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| | - Tony Pawson
- Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Canada
| |
Collapse
|
38
|
Markham K, Bai Y, Schmitt-Ulms G. Co-immunoprecipitations revisited: an update on experimental concepts and their implementation for sensitive interactome investigations of endogenous proteins. Anal Bioanal Chem 2007; 389:461-73. [PMID: 17583802 DOI: 10.1007/s00216-007-1385-x] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2007] [Revised: 05/14/2007] [Accepted: 05/22/2007] [Indexed: 10/23/2022]
Abstract
The study of protein-protein interactions involving endogenous proteins frequently relies on the immunoaffinity capture of a protein of interest followed by mass spectrometry-based identification of co-purifying interactors. A notorious problem with this approach is the difficulty of distinguishing physiological interactors from unspecific binders. Additional challenges pose the need to employ a strategy that is compatible with downstream mass spectrometry and minimizes sample losses during handling steps. Finally, the complexity of data sets demands solutions for data filtering. Here we present an update on co-immunoprecipitation procedures for sensitive interactome mapping applications. We define the relevant terminology, review methodological advances that reduce sample losses, and discuss experimental strategies that facilitate recognition of candidate interactors through a combination of informative controls and data filtering. Finally, we provide starting points for initial validation experiments and propose conventions for manuscripts which report on co-immunoprecipitation work.
Collapse
Affiliation(s)
- Kelly Markham
- Centre for Research in Neurodegenerative Diseases, University of Toronto, Tanz Neuroscience Building, 6 Queen's Park Crescent West, Toronto, ON M5S 3H2, Canada
| | | | | |
Collapse
|
39
|
Devos D, Russell RB. A more complete, complexed and structured interactome. Curr Opin Struct Biol 2007; 17:370-7. [PMID: 17574831 DOI: 10.1016/j.sbi.2007.05.011] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2007] [Revised: 04/18/2007] [Accepted: 05/31/2007] [Indexed: 11/16/2022]
Abstract
Multiprotein complexes are key players in virtually all important cellular processes. The past year has seen the publication of several papers that have illuminated what we know about the number and composition of these molecular machines, using high-throughput purification methods. Other studies have illuminated structural and functional aspects of protein interactions, networks and molecular assemblies. As a result, we have a more complete view of how many complexes are in living systems, what they look like and the roles they play in the cell.
Collapse
Affiliation(s)
- Damien Devos
- EMBL, Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | | |
Collapse
|
40
|
Abstract
Many essential cellular processes such as signal transduction, transport, cellular motion and most regulatory mechanisms are mediated by protein-protein interactions. In recent years, new experimental techniques have been developed to discover the protein-protein interaction networks of several organisms. However, the accuracy and coverage of these techniques have proven to be limited, and computational approaches remain essential both to assist in the design and validation of experimental studies and for the prediction of interaction partners and detailed structures of protein complexes. Here, we provide a critical overview of existing structure-independent and structure-based computational methods. Although these techniques have significantly advanced in the past few years, we find that most of them are still in their infancy. We also provide an overview of experimental techniques for the detection of protein-protein interactions. Although the developments are promising, false positive and false negative results are common, and reliable detection is possible only by taking a consensus of different experimental approaches. The shortcomings of experimental techniques affect both the further development and the fair evaluation of computational prediction methods. For an adequate comparative evaluation of prediction and high-throughput experimental methods, an appropriately large benchmark set of biophysically characterized protein complexes would be needed, but is sorely lacking.
Collapse
Affiliation(s)
- András Szilágyi
- Center of Excellence in Bioinformatics, University at Buffalo, State University of New York, 901 Washington St, Buffalo, NY 14203, USA
| | | | | | | |
Collapse
|
41
|
Abstract
Two comprehensive studies of the total complement of protein complexes in yeast come up with surprisingly different answers. Simple eukaryotic cells such as yeast could contain around 800 protein complexes, as two new comprehensive studies show. But slightly different approaches resulted in surprising differences between the two datasets, showing that more work is required to get a complete picture of the yeast interactome.
Collapse
Affiliation(s)
- Johannes Goll
- Institut für Genetik, Forschungszentrum Karlsruhe, Box 3640, 76021 Karlsruhe, Germany
| | - Peter Uetz
- Institut für Genetik, Forschungszentrum Karlsruhe, Box 3640, 76021 Karlsruhe, Germany
- The Institute of Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA
| |
Collapse
|
42
|
Potluri S, Yan AK, Chou JJ, Donald BR, Bailey-Kellogg C. Structure determination of symmetric homo-oligomers by a complete search of symmetry configuration space, using NMR restraints and van der Waals packing. Proteins 2006; 65:203-19. [PMID: 16897780 DOI: 10.1002/prot.21091] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Structural studies of symmetric homo-oligomers provide mechanistic insights into their roles in essential biological processes, including cell signaling and cellular regulation. This paper presents a novel algorithm for homo-oligomeric structure determination, given the subunit structure, that is both complete, in that it evaluates all possible conformations, and data-driven, in that it evaluates conformations separately for consistency with experimental data and for quality of packing. Completeness ensures that the algorithm does not miss the native conformation, and being data-driven enables it to assess the structural precision possible from data alone. Our algorithm performs a branch-and-bound search in the symmetry configuration space, the space of symmetry axis parameters (positions and orientations) defining all possible C(n) homo-oligomeric complexes for a given subunit structure. It eliminates those symmetry axes inconsistent with intersubunit nuclear Overhauser effect (NOE) distance restraints and then identifies conformations representing any consistent, well-packed structure to within a user-defined similarity level. For the human phospholamban pentamer in dodecylphosphocholine micelles, using the structure of one subunit determined from a subset of the experimental NMR data, our algorithm identifies a diverse set of complex structures consistent with the nine intersubunit NOE restraints. The distribution of determined structures provides an objective characterization of structural uncertainty: backbone RMSD to the previously determined structure ranges from 1.07 to 8.85 A, and variance in backbone atomic coordinates is an average of 12.32 A(2). Incorporating vdW packing reduces structural diversity to a maximum backbone RMSD of 6.24 A and an average backbone variance of 6.80 A(2). By comparing data consistency and packing quality under different assumptions of oligomeric number, our algorithm identifies the pentamer as the most likely oligomeric state of phospholamban, demonstrating that it is possible to determine the oligomeric number directly from NMR data. Additional tests on a number of homo-oligomers, from dimer to heptamer, similarly demonstrate the power of our method to provide unbiased determination and evaluation of homo-oligomeric complex structures.
Collapse
Affiliation(s)
- Shobha Potluri
- Department of Computer Science, Dartmouth College, Hanover, New Hampshire 03755, USA
| | | | | | | | | |
Collapse
|
43
|
Heuser P, Schomburg D. Optimised amino acid specific weighting factors for unbound protein docking. BMC Bioinformatics 2006; 7:344. [PMID: 16842615 PMCID: PMC1534072 DOI: 10.1186/1471-2105-7-344] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2006] [Accepted: 07/14/2006] [Indexed: 11/10/2022] Open
Abstract
Background One of the most challenging aspects of protein-protein docking is the inclusion of flexibility into the docking procedure. We developed a postfilter where the grid-representation of proteins for docking is extended by an optimised weighting factor for each amino acid. Results For up to 86% of the evaluated complexes a near-native structure was within the top 5% of the ranked prediction output. The weighting factors obtained by the optimisation procedure correlate to a certain extent with the flexibility of the amino acids, their hydrophobicity and with their propensity to be in the interface. Conclusion Use of the optimised amino acid specific parameters yields a strong increase of near-native structures on the first ranks of the prediction.
Collapse
Affiliation(s)
- Philipp Heuser
- Cologne University Bioinformatics Center (CUBIC), University of Cologne, Zuelpicher Str. 47, 50674 Koeln, Germany
| | - Dietmar Schomburg
- Cologne University Bioinformatics Center (CUBIC), University of Cologne, Zuelpicher Str. 47, 50674 Koeln, Germany
| |
Collapse
|
44
|
Bonet J, Caltabiano G, Khan AK, Johnston MA, Corbí C, Gómez A, Rovira X, Teyra J, Villà-Freixa J. The role of residue stability in transient protein-protein interactions involved in enzymatic phosphate hydrolysis. A computational study. Proteins 2006; 63:65-77. [PMID: 16374872 DOI: 10.1002/prot.20791] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Finding why protein-protein interactions (PPIs) are so specific can provide a valuable tool in a variety of fields. Statistical surveys of so-called transient complexes (like those relevant for signal transduction mechanisms) have shown a tendency of polar residues to participate in the interaction region. Following this scheme, residues in the unbound partners have to compete between interacting with water or interacting with other residues of the protein. On the other hand, several works have shown that the notion of active site electrostatic preorganization can be used to interpret the high efficiency in enzyme reactions. This preorganization can be related to the instability of the residues important for catalysis. In some enzymes, in addition, conformational changes upon binding to other proteins lead to an increase in the activity of the enzymatic partner. In this article the linear response approximation version of the semimacroscopic protein dipoles Langevin dipoles (PDLD/S-LRA) model is used to evaluate the stability of several residues in two phosphate hydrolysis enzymes upon complexation with their activating partners. In particular, the residues relevant for PPI and for phosphate hydrolysis in the CDK2/Cyclin A and Ras/GAP complexes are analyzed. We find that the evaluation of the stability of residues in these systems can be used to identify not only active site regions but it can also be used as a guide to locate "hot spots" for PPIs. We also show that conformational changes play a major role in positioning interfacing residues in a proper "energetic" orientation, ready to interact with the residues in the partner protein surface. Thus, we extend the preorganization theory to PPIs, extrapolating the results we obtained from the above-mentioned complexes to a more general case. We conclude that the correlation between stability of a residue in the surface and the likelihood that it participates in the interaction can be a general fact for transient PPIs.
Collapse
Affiliation(s)
- Jaume Bonet
- Computational Biochemistry and Biophysics Laboratory, Research Group on Biomedical Informatics (GRIB), IMIM/UPF, Barcelona, Spain
| | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Bravo J, Aloy P. Target selection for complex structural genomics. Curr Opin Struct Biol 2006; 16:385-92. [PMID: 16713251 DOI: 10.1016/j.sbi.2006.05.003] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2006] [Revised: 04/25/2006] [Accepted: 05/04/2006] [Indexed: 01/05/2023]
Abstract
Most cellular processes are carried out by macromolecular assemblies and regulated through a complex network of transient protein-protein interactions. Genome-wide interaction discovery experiments are already delivering the first drafts of whole organism interactomes and, thus, depicting the limits of the interaction space. However, a complete understanding of molecular interactions can only come from high-resolution three-dimensional structures, as they provide key atomic details about the binding interfaces. The launch of structural genomics initiatives focused on protein interactions and complexes could quickly fill up the interaction space with structural details, offering a new perspective on how cell networks operate at atomic level. Clear target selection strategies that rationally identify the key interactions and complexes that should be first tackled are fundamental to maximize the return, minimize the costs and prevent experimental difficulties.
Collapse
Affiliation(s)
- Jerónimo Bravo
- Centro Nacional de Investigaciones Oncológicas, C/Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | | |
Collapse
|
46
|
Valente AXCN, Cusick ME. Yeast Protein Interactome topology provides framework for coordinated-functionality. Nucleic Acids Res 2006; 34:2812-9. [PMID: 16717286 PMCID: PMC1464412 DOI: 10.1093/nar/gkl325] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
The architecture of the network of protein–protein physical interactions in Saccharomyces cerevisiae is exposed through the combination of two complementary theoretical network measures, betweenness centrality and ‘Q-modularity’. The yeast interactome is characterized by well-defined topological modules connected via a small number of inter-module protein interactions. Should such topological inter-module connections turn out to constitute a form of functional coordination between the modules, we speculate that this coordination is occurring typically in a pairwise fashion, rather than by way of high-degree hub proteins responsible for coordinating multiple modules. The unique non-hub-centric hierarchical organization of the interactome is not reproduced by gene duplication-and-divergence stochastic growth models that disregard global selective pressures.
Collapse
Affiliation(s)
- André X C N Valente
- Biometry Research Group, Department of Health and Human Services, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA.
| | | |
Collapse
|
47
|
Mika S, Rost B. Protein-protein interactions more conserved within species than across species. PLoS Comput Biol 2006; 2:e79. [PMID: 16854211 PMCID: PMC1513270 DOI: 10.1371/journal.pcbi.0020079] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2005] [Indexed: 11/21/2022] Open
Abstract
Experimental high-throughput studies of protein–protein interactions are beginning to provide enough data for comprehensive computational studies. Today, about ten large data sets, each with thousands of interacting pairs, coarsely sample the interactions in fly, human, worm, and yeast. Another about 55,000 pairs of interacting proteins have been identified by more careful, detailed biochemical experiments. Most interactions are experimentally observed in prokaryotes and simple eukaryotes; very few interactions are observed in higher eukaryotes such as mammals. It is commonly assumed that pathways in mammals can be inferred through homology to model organisms, e.g. the experimental observation that two yeast proteins interact is transferred to infer that the two corresponding proteins in human also interact. Two pairs for which the interaction is conserved are often described as interologs. The goal of this investigation was a large-scale comprehensive analysis of such inferences, i.e. of the evolutionary conservation of interologs. Here, we introduced a novel score for measuring the overlap between protein–protein interaction data sets. This measure appeared to reflect the overall quality of the data and was the basis for our two surprising results from our large-scale analysis. Firstly, homology-based inferences of physical protein–protein interactions appeared far less successful than expected. In fact, such inferences were accurate only for extremely high levels of sequence similarity. Secondly, and most surprisingly, the identification of interacting partners through sequence similarity was significantly more reliable for protein pairs within the same organism than for pairs between species. Our analysis underlined that the discrepancies between different datasets are large, even when using the same type of experiment on the same organism. This reality considerably constrains the power of homology-based transfer of interactions. In particular, the experimental probing of interactions in distant model organisms has to be undertaken with some caution. More comprehensive images of protein–protein networks will require the combination of many high-throughput methods, including in silico inferences and predictions. http://www.rostlab.org/results/2006/ppi_homology/ The IntAct database contains about ten large-scale data sets of protein–protein interactions. Each set contains thousands of experimentally observed pair interactions. Most pairs were observed in yeast (Saccharomyces cerevisiae), fly (Drosophila melanogaster), and worm (Caenorhabditis elegans). These interactions are often perceived as model organisms in the sense that one can infer that two mouse proteins interact if one experimentally observes the two corresponding proteins in worm to interact. Here, the authors analyzed in detail how the sequence signals of physical protein–protein interactions are conserved. It is a common assumption that protein–protein interactions can easily be inferred through homology transfer from one model organism to another organism of interest. Here, the authors demonstrated that such homology transfers are only accurate at unexpectedly high levels of sequence identity. Even more surprisingly, homology transfers of protein–protein interactions are significantly more reliable for protein pairs from the same species than for two protein pairs from different organisms. The observation that interactions were much more conserved within than across species was valid for all levels of sequence similarity, i.e. for very similar as well as for more diverged interologs.
Collapse
Affiliation(s)
- Sven Mika
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, USA.
| | | |
Collapse
|
48
|
Bauch A, Superti-Furga G. Charting protein complexes, signaling pathways, and networks in the immune system. Immunol Rev 2006; 210:187-207. [PMID: 16623772 DOI: 10.1111/j.0105-2896.2006.00369.x] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Systematic deciphering of protein-protein interactions has the potential to generate comprehensive and instructive signaling networks and to fuel new therapeutic and diagnostic strategies. Here, we describe how recent advances in high-throughput proteomic technologies, involving biochemical purification methods and mass spectrometry analysis, can be applied systematically to the characterization of protein complexes and the computation of molecular networks. The networks obtained form the basis for further functional analyses, such as knockdown by RNA interference, ultimately leading to the identification of nodes that represent candidate targets for pharmacological exploitation. No individual experimental approach can accurately elucidate all critical modulatory components and biological aspects of a signaling network. Such functionally annotated protein-protein interaction networks, however, represent an ideal platform for the integration of additional datasets. By providing links between molecules, they also provide links to all previous observations associated with these molecules, be they of genetic, pharmacological, or other origin. As exemplified here by the analysis of the tumor necrosis factor (TNF)-alpha/nuclear factor-kappaB (NF-kappaB) signaling pathway, the approach is applicable to any mammalian cellular signaling pathway in the immune system.
Collapse
Affiliation(s)
- Angela Bauch
- CeMM, Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria.
| | | |
Collapse
|
49
|
Heuser P, Baù D, Benkert P, Schomburg D. Refinement of unbound protein docking studies using biological knowledge. Proteins 2006; 61:1059-67. [PMID: 16208723 DOI: 10.1002/prot.20634] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
In this work we present two methods for the reranking of protein-protein docking studies. One scoring method searches the InterDom database for domains that are available in the proteins to be docked and evaluates the interaction of these domains in other complexes of known structure. The second one analyzes the interface of each proposed conformation with regard to the conservation of Phe, Met, and Trp and their polar neighbor residues. The special relevance of these residues is based on a publication by Ma et al. (Proc Natl Acad Sci USA 2003;100:5772-5777), who compared the conservation of all residues in the interface region to the conservation on the rest of the protein's surface. The scoring functions were tested on 30 unbound docking test cases. The evaluation of the methods is based on the ability to rerank the output of a Fast Fourier Transformation (FFT) docking. Both were able to improve the ranking of the docking output. The best improvement was achieved for enzyme-inhibitor examples. Especially the domain-based scoring function was successful and able to place a near-native solution on one of the first six ranks for 13 of 17 (76%) enzyme-inhibitor complexes [in 53% (nine complexes) even on the first rank]. The method evaluating residue conservation allowed us to increase the number of good solutions within the first 100 ranks out of approximately 9000 in 82% of the 17 enzyme-inhibitor test cases, and for seven (41%) out of 17 enzyme-inhibitor complexes, a near native solution was placed within the first seven ranks.
Collapse
Affiliation(s)
- Philipp Heuser
- CUBIC-Cologne University BioInformatics Center, Cologne, Germany
| | | | | | | |
Collapse
|
50
|
Abstract
Much of systems biology aims to predict the behaviour of biological systems on the basis of the set of molecules involved. Understanding the interactions between these molecules is therefore crucial to such efforts. Although many thousands of interactions are known, precise molecular details are available for only a tiny fraction of them. The difficulties that are involved in experimentally determining atomic structures for interacting proteins make predictive methods essential for progress. Structural details can ultimately turn abstract system representations into models that more accurately reflect biological reality.
Collapse
Affiliation(s)
- Patrick Aloy
- Institució Catalana de Recerca i Estudis Avançats, Barcelona, Spain
| | | |
Collapse
|