1
|
Tumilovich A, Yablokov E, Mezentsev Y, Ershov P, Basina V, Gnedenko O, Kaluzhskiy L, Tsybruk T, Grabovec I, Kisel M, Shabunya P, Soloveva N, Vavilov N, Gilep A, Ivanov A. The Multienzyme Complex Nature of Dehydroepiandrosterone Sulfate Biosynthesis. Int J Mol Sci 2024; 25:2072. [PMID: 38396748 PMCID: PMC10889563 DOI: 10.3390/ijms25042072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 01/16/2024] [Accepted: 01/26/2024] [Indexed: 02/25/2024] Open
Abstract
Dehydroepiandrosterone (DHEA), a precursor of steroid sex hormones, is synthesized by steroid 17-alpha-hydroxylase/17,20-lyase (CYP17A1) with the participation of microsomal cytochrome b5 (CYB5A) and cytochrome P450 reductase (CPR), followed by sulfation by two cytosolic sulfotransferases, SULT1E1 and SULT2A1, for storage and transport to tissues in which its synthesis is not available. The involvement of CYP17A1 and SULTs in these successive reactions led us to consider the possible interaction of SULTs with DHEA-producing CYP17A1 and its redox partners. Text mining analysis, protein-protein network analysis, and gene co-expression analysis were performed to determine the relationships between SULTs and microsomal CYP isoforms. For the first time, using surface plasmon resonance, we detected interactions between CYP17A1 and SULT2A1 or SULT1E1. SULTs also interacted with CYB5A and CPR. The interaction parameters of SULT2A1/CYP17A1 and SULT2A1/CYB5A complexes seemed to be modulated by 3'-phosphoadenosine-5'-phosphosulfate (PAPS). Affinity purification, combined with mass spectrometry (AP-MS), allowed us to identify a spectrum of SULT1E1 potential protein partners, including CYB5A. We showed that the enzymatic activity of SULTs increased in the presence of only CYP17A1 or CYP17A1 and CYB5A mixture. The structures of CYP17A1/SULT1E1 and CYB5A/SULT1E1 complexes were predicted. Our data provide novel fundamental information about the organization of microsomal CYP-dependent macromolecular complexes.
Collapse
Affiliation(s)
- Anastasiya Tumilovich
- Institute of Bioorganic Chemistry NASB, 5 Building 2, V.F. Kuprevich Street, 220141 Minsk, Belarus; (A.T.); (T.T.); (I.G.); (M.K.); (P.S.); (A.G.)
| | - Evgeniy Yablokov
- Institute of Biomedical Chemistry, 10 Building 8, Pogodinskaya Street, 119121 Moscow, Russia; (E.Y.); (P.E.); (O.G.); (L.K.); (N.S.); (N.V.); (A.I.)
| | - Yuri Mezentsev
- Institute of Biomedical Chemistry, 10 Building 8, Pogodinskaya Street, 119121 Moscow, Russia; (E.Y.); (P.E.); (O.G.); (L.K.); (N.S.); (N.V.); (A.I.)
| | - Pavel Ershov
- Institute of Biomedical Chemistry, 10 Building 8, Pogodinskaya Street, 119121 Moscow, Russia; (E.Y.); (P.E.); (O.G.); (L.K.); (N.S.); (N.V.); (A.I.)
| | - Viktoriia Basina
- Research Centre for Medical Genetics, 1 Moskvorechye Street, 115522 Moscow, Russia;
| | - Oksana Gnedenko
- Institute of Biomedical Chemistry, 10 Building 8, Pogodinskaya Street, 119121 Moscow, Russia; (E.Y.); (P.E.); (O.G.); (L.K.); (N.S.); (N.V.); (A.I.)
| | - Leonid Kaluzhskiy
- Institute of Biomedical Chemistry, 10 Building 8, Pogodinskaya Street, 119121 Moscow, Russia; (E.Y.); (P.E.); (O.G.); (L.K.); (N.S.); (N.V.); (A.I.)
| | - Tatsiana Tsybruk
- Institute of Bioorganic Chemistry NASB, 5 Building 2, V.F. Kuprevich Street, 220141 Minsk, Belarus; (A.T.); (T.T.); (I.G.); (M.K.); (P.S.); (A.G.)
| | - Irina Grabovec
- Institute of Bioorganic Chemistry NASB, 5 Building 2, V.F. Kuprevich Street, 220141 Minsk, Belarus; (A.T.); (T.T.); (I.G.); (M.K.); (P.S.); (A.G.)
| | - Maryia Kisel
- Institute of Bioorganic Chemistry NASB, 5 Building 2, V.F. Kuprevich Street, 220141 Minsk, Belarus; (A.T.); (T.T.); (I.G.); (M.K.); (P.S.); (A.G.)
| | - Polina Shabunya
- Institute of Bioorganic Chemistry NASB, 5 Building 2, V.F. Kuprevich Street, 220141 Minsk, Belarus; (A.T.); (T.T.); (I.G.); (M.K.); (P.S.); (A.G.)
| | - Natalia Soloveva
- Institute of Biomedical Chemistry, 10 Building 8, Pogodinskaya Street, 119121 Moscow, Russia; (E.Y.); (P.E.); (O.G.); (L.K.); (N.S.); (N.V.); (A.I.)
| | - Nikita Vavilov
- Institute of Biomedical Chemistry, 10 Building 8, Pogodinskaya Street, 119121 Moscow, Russia; (E.Y.); (P.E.); (O.G.); (L.K.); (N.S.); (N.V.); (A.I.)
| | - Andrei Gilep
- Institute of Bioorganic Chemistry NASB, 5 Building 2, V.F. Kuprevich Street, 220141 Minsk, Belarus; (A.T.); (T.T.); (I.G.); (M.K.); (P.S.); (A.G.)
- Institute of Biomedical Chemistry, 10 Building 8, Pogodinskaya Street, 119121 Moscow, Russia; (E.Y.); (P.E.); (O.G.); (L.K.); (N.S.); (N.V.); (A.I.)
| | - Alexis Ivanov
- Institute of Biomedical Chemistry, 10 Building 8, Pogodinskaya Street, 119121 Moscow, Russia; (E.Y.); (P.E.); (O.G.); (L.K.); (N.S.); (N.V.); (A.I.)
| |
Collapse
|
2
|
Angles R, Arenas-Salinas M, García R, Ingram B. An optimized relational database for querying structural patterns in proteins. Database (Oxford) 2024; 2024:baad093. [PMID: 38236197 PMCID: PMC10939390 DOI: 10.1093/database/baad093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 11/16/2023] [Accepted: 12/19/2023] [Indexed: 01/19/2024]
Abstract
A database is an essential component in almost any software system, and its creation involves more than just data modeling and schema design. It also includes query optimization and tuning. This paper focuses on a web system called GSP4PDB, which is used for searching structural patterns in proteins. The system utilizes a normalized relational database, which has proven to be inefficient even for simple queries. This article discusses the optimization of the GSP4PDB database by implementing two techniques: denormalization and indexing. The empirical evaluation described in the article shows that combining these techniques enhances the efficiency of the database when querying both real and artificial graph-based structural patterns.
Collapse
Affiliation(s)
- Renzo Angles
- Department of Computer Science, Faculty of Engineering, Universidad de Talca, Camino a Los Niches Km. 1, Curicó, Región del Maule 3340000, Chile
- Millennium Institute for Foundational Research on Data (IMFD), Vicuña Mackenna 4860, Macul, Santiago, Región Metropolitana 7810000, Chile
| | - Mauricio Arenas-Salinas
- Centro de Bioinformática y Simulación Molecular (CBSM), Faculty of Engineering, Universidad de Talca, Av. Lircay s/n, Talca Región del Maule 34600000, Chile
| | - Roberto García
- Millennium Institute for Foundational Research on Data (IMFD), Vicuña Mackenna 4860, Macul, Santiago, Región Metropolitana 7810000, Chile
- Engineering Systems Doctoral Program, Faculty of Engineering, Universidad de Talca, Camino a Los Niches Km 1, Curicó, Región del Maule 3340000, Chile
| | - Ben Ingram
- School of Water, Energy and Environment, Cranfield University, College Road, Cranfield, Bedfordshire MK43 0AL, England
| |
Collapse
|
3
|
Meng Q, Guo F, Wang E, Tang J. ComDock: A novel approach for protein-protein docking with an efficient fusing strategy. Comput Biol Med 2023; 167:107660. [PMID: 37944303 DOI: 10.1016/j.compbiomed.2023.107660] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 10/08/2023] [Accepted: 10/31/2023] [Indexed: 11/12/2023]
Abstract
Protein-protein interaction plays an important role in studying the mechanism of protein functions from the structural perspective. Molecular docking is a powerful approach to detect protein-protein complexes using computational tools, due to the high cost and time-consuming of the traditional experimental methods. Among existing technologies, the template-based method utilizes the structural information of known homologous 3D complexes as available and reliable templates to achieve high accuracy and low computational complexity. However, the performance of the template-based method depends on the quality and quantity of templates. When insufficient or even no templates, the ab initio docking method is necessary and largely enriches the docking conformations. Therefore, it's a feasible strategy to fuse the effectivity of the template-based model and the universality of ab initio model to improve the docking performance. In this study, we construct a new, diverse, comprehensive template library derived from PDB, containing 77,685 complexes. We propose a template-based method (named TemDock), which retrieves the evolutionary relationship between the target sequence and samples in the template library and transfers similar structural information. Then, the target structure is built by superposing on the homologous template complex with TM-align. Moreover, we develop a consensus-based method (named ComDock) to integrate our TemDock and an existing ab initio method (ZDOCK). On 105 targets with templates from Benchmark 5.0, the TemDock and ComDock achieve a success rate of 68.57 % and 71.43 % in the top 10 conformations, respectively. Compared with the HDOCK, ComDock obtains better I-RMSD of hit configurations on 9 targets and more hit models in the top 100 conformations. As an efficient method for protein-protein docking, the ComDock is expected to study protein-protein recognition and reveal the various biological passways that are critical for developing drug discovery. The final results are stored at https://github.com/guofei-tju/mqz_ComDock_docking.
Collapse
Affiliation(s)
- Qiaozhen Meng
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Fei Guo
- School of Computer Science and Engineering, Central South University, Changsha, China.
| | - Ercheng Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, China; Zhejiang Laboratory, Hangzhou, Zhejiang, China.
| | - Jijun Tang
- Shenzhen Institute of Advanced Technology of Chinese Academy of Sciences, Shenzhen, China.
| |
Collapse
|
4
|
Lin Z, Schaefer K, Lui I, Yao Z, Fossati A, Swaney DL, Palar A, Sali A, Wells JA. Multi-scale photocatalytic proximity labeling reveals cell surface neighbors on and between cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.28.564055. [PMID: 37961561 PMCID: PMC10634877 DOI: 10.1101/2023.10.28.564055] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
The cell membrane proteome is the primary biohub for cell communication, yet we are only beginning to understand the dynamic protein neighborhoods that form on the cell surface and between cells. Proximity labeling proteomics (PLP) strategies using chemically reactive probes are powerful approaches to yield snapshots of protein neighborhoods but are currently limited to one single resolution based on the probe labeling radius. Here, we describe a multi-scale PLP method with tunable resolution using a commercially available histological dye, Eosin Y, which upon visible light illumination, activates three different photo-probes with labeling radii ranging from ∼100 to 3000 Å. We applied this platform to profile neighborhoods of the oncogenic epidermal growth factor receptor (EGFR) and orthogonally validated >20 neighbors using immuno-assays and AlphaFold-Multimer prediction that generated plausible binary interaction models. We further profiled the protein neighborhoods of cell-cell synapses induced by bi-specific T-cell engagers (BiTEs) and chimeric antigen receptor (CAR)T cells at longer length scales. This integrated multi-scale PLP platform maps local and distal protein networks on cell surfaces and between cells. We believe this information will aid in the systematic construction of the cell surface interactome and reveal new opportunities for immunotherapeutics.
Collapse
|
5
|
Collins KW, Copeland MM, Kotthoff I, Singh A, Kundrotas PJ, Vakser IA. Dockground resource for protein recognition studies. Protein Sci 2022; 31:e4481. [PMID: 36281025 PMCID: PMC9667896 DOI: 10.1002/pro.4481] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 10/19/2022] [Accepted: 10/20/2022] [Indexed: 12/13/2022]
Abstract
Structural information of protein-protein interactions is essential for characterization of life processes at the molecular level. While a small fraction of known protein interactions has experimentally determined structures, computational modeling of protein complexes (protein docking) has to fill the gap. The Dockground resource (http://dockground.compbio.ku.edu) provides a collection of datasets for the development and testing of protein docking techniques. Currently, Dockground contains datasets for the bound and the unbound (experimentally determined and simulated) protein structures, model-model complexes, docking decoys of experimentally determined and modeled proteins, and templates for comparative docking. The Dockground bound proteins dataset is a core set, from which other Dockground datasets are generated. It is devised as a relational PostgreSQL database containing information on experimentally determined protein-protein complexes. This report on the Dockground resource describes current status of the datasets, new automated update procedures and further development of the core datasets. We also present a new Dockground interactive web interface, which allows search by various parameters, such as release date, multimeric state, complex type, structure resolution, and so on, visualization of the search results with a number of customizable parameters, as well as downloadable datasets with predefined levels of sequence and structure redundancy.
Collapse
Affiliation(s)
| | | | - Ian Kotthoff
- Computational Biology ProgramThe University of KansasKansasUSA
| | - Amar Singh
- Computational Biology ProgramThe University of KansasKansasUSA
| | | | - Ilya A. Vakser
- Computational Biology ProgramThe University of KansasKansasUSA
- Department of Molecular BiosciencesThe University of KansasKansasUSA
| |
Collapse
|
6
|
Sen N, Madhusudhan MS. A structural database of chain–chain and domain–domain interfaces of proteins. Protein Sci 2022. [DOI: 10.1002/pro.4406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Neeladri Sen
- Indian Institute of Science Education and Research Pune India
- Institute of Structural and Molecular Biology University College London London UK
| | | |
Collapse
|
7
|
Dynamic stability of salt stable cowpea chlorotic mottle virus capsid protein dimers and pentamers of dimers. Sci Rep 2022; 12:14251. [PMID: 35995818 PMCID: PMC9395436 DOI: 10.1038/s41598-022-18019-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 08/03/2022] [Indexed: 12/03/2022] Open
Abstract
Intermediates of the self-assembly process of the salt stable cowpea chlorotic mottle virus (ss-CCMV) capsid can be modelled atomistically on realistic computational timescales either by studying oligomers in equilibrium or by focusing on their dissociation instead of their association. Our previous studies showed that among the three possible dimer interfaces in the icosahedral capsid, two are thermodynamically relevant for capsid formation. The aim of the current study is to evaluate the relative structural stabilities of the three different ss-CCMV dimers and to find and understand the conditions that lead to their dissociation. Long timescale molecular dynamics simulations at 300 K of the various dimers and of the pentamer of dimers underscore the importance of large contact surfaces on stabilizing the capsid subunits within an oligomer. Simulations in implicit solvent show that at higher temperature (350 K), the N-terminal tails of the protein units act as tethers, delaying dissociation for all but the most stable interface. The pentamer of dimers is also found to be stable on long timescales at 300 K, with an inherent flexibility of the outer protein chains.
Collapse
|
8
|
Ghadie M, Xia Y. Mutation Edgotype Drives Fitness Effect in Human. FRONTIERS IN BIOINFORMATICS 2021; 1:690769. [PMID: 36303776 PMCID: PMC9581054 DOI: 10.3389/fbinf.2021.690769] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Accepted: 08/18/2021] [Indexed: 11/24/2022] Open
Abstract
Missense mutations are known to perturb protein-protein interaction networks (known as interactome networks) in different ways. However, it remains unknown how different interactome perturbation patterns (“edgotypes”) impact organismal fitness. Here, we estimate the fitness effect of missense mutations with different interactome perturbation patterns in human, by calculating the fractions of neutral and deleterious mutations that do not disrupt PPIs (“quasi-wild-type”), or disrupt PPIs either by disrupting the binding interface (“edgetic”) or by disrupting overall protein stability (“quasi-null”). We first map pathogenic mutations and common non-pathogenic mutations onto homology-based three-dimensional structural models of proteins and protein-protein interactions in human. Next, we perform structure-based calculations to classify each mutation as either quasi-wild-type, edgetic, or quasi-null. Using our predicted as well as experimentally determined interactome perturbation patterns, we estimate that >∼40% of quasi-wild-type mutations are effectively neutral and the remaining are mostly mildly deleterious, that >∼75% of edgetic mutations are only mildly deleterious, and that up to ∼75% of quasi-null mutations may be strongly detrimental. These estimates are the first such estimates of fitness effect for different network perturbation patterns in any interactome. Our results suggest that while mutations that do not disrupt the interactome tend to be effectively neutral, the majority of human PPIs are under strong purifying selection and the stability of most human proteins is essential to human life.
Collapse
|
9
|
Zheng C, Liu Y, Sun F, Zhao L, Zhang L. Predicting Protein-Protein Interactions Between Rice and Blast Fungus Using Structure-Based Approaches. FRONTIERS IN PLANT SCIENCE 2021; 12:690124. [PMID: 34367213 PMCID: PMC8343130 DOI: 10.3389/fpls.2021.690124] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 06/21/2021] [Indexed: 05/18/2023]
Abstract
Rice blast, caused by the fungus Magnaporthe oryzae, is the most devastating disease affecting rice production. Identification of protein-protein interactions (PPIs) is a critical step toward understanding the molecular mechanisms underlying resistance to blast fungus in rice. In this study, we presented a computational framework for predicting plant-pathogen PPIs based on structural information. Compared with the sequence-based methods, the structure-based approach showed to be more powerful in discovering new PPIs between plants and pathogens. Using the structure-based method, we generated a global PPI network consisted of 2,018 interacting protein pairs involving 1,344 rice proteins and 418 blast fungus proteins. The network analysis showed that blast resistance genes were enriched in the PPI network. The network-based prediction enabled systematic discovery of new blast resistance genes in rice. The network provided a global map to help accelerate the identification of blast resistance genes and advance our understanding of plant-pathogen interactions.
Collapse
|
10
|
Upadhyayula RS. Computational Investigation of Structural Interfaces of Protein Complexes with Short Linear Motifs. J Proteome Res 2020; 19:3254-3263. [DOI: 10.1021/acs.jproteome.0c00212] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Raghavender Surya Upadhyayula
- Department of Biotechnology and Bioinformatics, Birla Institute of Scientific Research (BISR), Jaipur, Rajasthan 302001, India
| |
Collapse
|
11
|
Kundrotas PJ, Kotthoff I, Choi SW, Copeland MM, Vakser IA. Dockground Tool for Development and Benchmarking of Protein Docking Procedures. Methods Mol Biol 2020; 2165:289-300. [PMID: 32621232 DOI: 10.1007/978-1-0716-0708-4_17] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Databases of protein-protein complexes are essential for the development of protein modeling/docking techniques. Such databases provide a knowledge base for docking algorithms, intermolecular potentials, search procedures, scoring functions, and refinement protocols. Development of docking techniques requires systematic validation of the modeling protocols on carefully curated benchmark sets of complexes. We present a description and a guide to the DOCKGROUND resource ( http://dockground.compbio.ku.edu ) for structural modeling of protein interactions. The resource integrates various datasets of protein complexes and other data for the development and testing of protein docking techniques. The sets include bound complexes, experimentally determined unbound, simulated unbound, model-model complexes, and docking decoys. The datasets are available to the user community through a Web interface.
Collapse
Affiliation(s)
- Petras J Kundrotas
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, USA.
| | - Ian Kotthoff
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, USA
| | - Sherman W Choi
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, USA
| | - Matthew M Copeland
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, USA
| | - Ilya A Vakser
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, USA.
| |
Collapse
|
12
|
Gemovic B, Sumonja N, Davidovic R, Perovic V, Veljkovic N. Mapping of Protein-Protein Interactions: Web-Based Resources for Revealing Interactomes. Curr Med Chem 2019; 26:3890-3910. [PMID: 29446725 DOI: 10.2174/0929867325666180214113704] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2017] [Revised: 09/14/2017] [Accepted: 01/29/2018] [Indexed: 01/04/2023]
Abstract
BACKGROUND The significant number of protein-protein interactions (PPIs) discovered by harnessing concomitant advances in the fields of sequencing, crystallography, spectrometry and two-hybrid screening suggests astonishing prospects for remodelling drug discovery. The PPI space which includes up to 650 000 entities is a remarkable reservoir of potential therapeutic targets for every human disease. In order to allow modern drug discovery programs to leverage this, we should be able to discern complete PPI maps associated with a specific disorder and corresponding normal physiology. OBJECTIVE Here, we will review community available computational programs for predicting PPIs and web-based resources for storing experimentally annotated interactions. METHODS We compared the capacities of prediction tools: iLoops, Struck2Net, HOMCOS, COTH, PrePPI, InterPreTS and PRISM to predict recently discovered protein interactions. RESULTS We described sequence-based and structure-based PPI prediction tools and addressed their peculiarities. Additionally, since the usefulness of prediction algorithms critically depends on the quality and quantity of the experimental data they are built on; we extensively discussed community resources for protein interactions. We focused on the active and recently updated primary and secondary PPI databases, repositories specialized to the subject or species, as well as databases that include both experimental and predicted PPIs. CONCLUSION PPI complexes are the basis of important physiological processes and therefore, possible targets for cell-penetrating ligands. Reliable computational PPI predictions can speed up new target discoveries through prioritization of therapeutically relevant protein-protein complexes for experimental studies.
Collapse
Affiliation(s)
- Branislava Gemovic
- Center for Multidisciplinary Research, Institute of Nuclear Sciences Vinca, University of Belgrade, Belgrade, Serbia
| | - Neven Sumonja
- Center for Multidisciplinary Research, Institute of Nuclear Sciences Vinca, University of Belgrade, Belgrade, Serbia
| | - Radoslav Davidovic
- Center for Multidisciplinary Research, Institute of Nuclear Sciences Vinca, University of Belgrade, Belgrade, Serbia
| | - Vladimir Perovic
- Center for Multidisciplinary Research, Institute of Nuclear Sciences Vinca, University of Belgrade, Belgrade, Serbia
| | - Nevena Veljkovic
- Center for Multidisciplinary Research, Institute of Nuclear Sciences Vinca, University of Belgrade, Belgrade, Serbia
| |
Collapse
|
13
|
Siebenmorgen T, Zacharias M. Computational prediction of protein–protein binding affinities. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1448] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Affiliation(s)
- Till Siebenmorgen
- Physics Department T38 Technical University of Munich Garching Germany
| | - Martin Zacharias
- Physics Department T38 Technical University of Munich Garching Germany
| |
Collapse
|
14
|
Mirabello C, Wallner B. Topology independent structural matching discovers novel templates for protein interfaces. Bioinformatics 2019; 34:i787-i794. [PMID: 30423106 DOI: 10.1093/bioinformatics/bty587] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Motivation Protein-protein interactions (PPI) are essential for the function of the cellular machinery. The rapid growth of protein-protein complexes with known 3D structures offers a unique opportunity to study PPI to gain crucial insights into protein function and the causes of many diseases. In particular, it would be extremely useful to compare interaction surfaces of monomers, as this would enable the pinpointing of potential interaction surfaces based solely on the monomer structure, without the need to predict the complete complex structure. While there are many structural alignment algorithms for individual proteins, very few have been developed for protein interfaces, and none that can align only the interface residues to other interfaces or surfaces of interacting monomer subunits in a topology independent (non-sequential) manner. Results We present InterComp, a method for topology and sequence-order independent structural comparisons. The method is general and can be applied to various structural comparison applications. By representing residues as independent points in space rather than as a sequence of residues, InterComp can be applied to a wide range of problems including interface-surface comparisons and interface-interface comparisons. We demonstrate a use-case by applying InterComp to find similar protein interfaces on the surface of proteins. We show that InterComp pinpoints the correct interface for almost half of the targets (283 of 586) when considering the top 10 hits, and for 24% of the top 1, even when no templates can be found with regular sequence-order dependent structural alignment methods. Availability and implementation The source code and the datasets are available at: http://wallnerlab.org/InterComp. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Claudio Mirabello
- Division of Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, Linköping SE, Sweden
| | - Björn Wallner
- Division of Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, Linköping SE, Sweden
| |
Collapse
|
15
|
Zhao J, Lei Y, Hong J, Zheng C, Zhang L. AraPPINet: An Updated Interactome for the Analysis of Hormone Signaling Crosstalk in Arabidopsis thaliana. FRONTIERS IN PLANT SCIENCE 2019; 10:870. [PMID: 31333706 PMCID: PMC6625390 DOI: 10.3389/fpls.2019.00870] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 06/18/2019] [Indexed: 05/29/2023]
Abstract
Protein-protein interactions (PPIs) play fundamental roles in various cellular processes. Here, we present a new version of computational interactome that contains more than 345,000 predicted PPIs involving about 51.2% of the Arabidopsis proteins. Compared to the earlier version, the updated AraPPINet displays a higher accuracy in predicting protein interactions through performance evaluation with independent datasets. In addition to the experimental verifications of the previous version, the new version has been subjected to further validation test that demonstrates its ability to discover novel PPIs involved in hormone signaling pathways. Moreover, network analysis shows that many overlapping proteins are significantly involved in the interactions which mediated the crosstalk among plant hormones. The new version of AraPPINet provides a more reliable interactome which would facilitate the understanding of crosstalk among hormone signaling pathways in plants.
Collapse
|
16
|
Chasapis CT. Building Bridges Between Structural and Network-Based Systems Biology. Mol Biotechnol 2019; 61:221-229. [DOI: 10.1007/s12033-018-0146-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
17
|
Navigating Among Known Structures in Protein Space. Methods Mol Biol 2018. [PMID: 30298400 DOI: 10.1007/978-1-4939-8736-8_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
Present-day protein space is the result of 3.7 billion years of evolution, constrained by the underlying physicochemical qualities of the proteins. It is difficult to differentiate between evolutionary traces and effects of physicochemical constraints. Nonetheless, as a rule of thumb, instances of structural reuse, or focusing on structural similarity, are likely attributable to physicochemical constraints, whereas sequence reuse, or focusing on sequence similarity, may be more indicative of evolutionary relationships. Both types of relationships have been studied and can provide meaningful insights to protein biophysics and evolution, which in turn can lead to better algorithms for protein search, annotation, and maybe even design.In broad strokes, studies of protein space vary in the entities they represent, the similarity measure comparing these entities, and the representation used. The entities can be, for example, protein chains, domains, supra-domains, or smaller protein sub-parts denoted themes. The measures of similarity between the entities can be based on sequence, structure, function, or any combination of these. The representation can be global, encompassing the whole space, or local, focusing on a particular region surrounding protein(s) of interest. Global representations include lists of grouped proteins, protein networks, and maps. Networks are the abstraction that is derived most directly from the similarity data: each node is the protein entity (e.g., a domain), and edges connect similar domains. Selecting the entities, the similarity measure, and the abstraction are three intertwined decisions: the similarity measures allow us to identify the entities, and the selection of entities influences what is a meaningful similarity measure. Similarly, we seek entities that are related to each other in a way, for which a simple representation describes their relationships succinctly and accurately. This chapter will cover studies that rely on different entities, similarity measures, and a range of representations to better understand protein structure space. Scholars may use publicly available navigators offering a global representation, and in particular the hierarchical classifications SCOP, CATH, and ECOD, or a local representation, which encompass structural alignment algorithms. Alternatively, scholars can configure their own navigator using existing tools. To demonstrate this DIY (do it yourself) approach for navigating in protein space, we investigate substrate-binding proteins. By presenting sequence similarities among this large and diverse protein family as a network, we can infer that one member (pdb ID 4ntl; of yet unknown function) may bind methionine and suggest a putative binding mechanism.
Collapse
|
18
|
Computational and Experimental Approaches to Predict Host-Parasite Protein-Protein Interactions. Methods Mol Biol 2018; 1819:153-173. [PMID: 30421403 DOI: 10.1007/978-1-4939-8618-7_7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
In host-parasite systems, protein-protein interactions are key to allow the pathogen to enter the host and persist within the host. The study of host-parasite molecular communication improves the understanding the mechanisms of infection, evasion of the host immune system and tropism across different tissues. Current trends in parasitology focus on unraveling host-parasite protein-protein interactions to aid the development of new strategies to combat pathogenic parasites with better treatments and prevention mechanisms. Due to the complexity of capturing experimentally these interactions, computational approaches integrating data from different sources (mainly "omics" data) become key to complement or support experimental approaches. Here, we focus on the application of experimental and computational methods in the prediction of host-parasite interactions and highlight the potential of each of these methods in specific contexts.
Collapse
|
19
|
Ur Rehman H, Bari I, Ali A, Mahmood H. A Bayesian approach for estimating protein-protein interactions by integrating structural and non-structural biological data. MOLECULAR BIOSYSTEMS 2017; 13:2592-2602. [PMID: 29028065 DOI: 10.1039/c7mb00484b] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Accurate elucidation of genome wide protein-protein interactions is crucial for understanding the regulatory processes of the cell. High-throughput techniques, such as the yeast-2-hybrid (Y2H) assay, co-immunoprecipitation (co-IP), mass spectrometric (MS) protein complex identification, affinity purification (AP) etc., are generally relied upon to determine protein interactions. Unfortunately, each type of method is inherently subject to different types of noise and results in false positive interactions. On the other hand, precise understanding of proteins, especially knowledge of their functional associations is necessary for understanding how complex molecular machines function. To solve this problem, computational techniques are generally relied upon to precisely predict protein interactions. In this work, we present a novel method that combines structural and non-structural biological data to precisely predict protein interactions. The conceptual novelty of our approach lies in identifying and precisely associating biological information that provides substantial interaction clues. Our model combines structural and non-structural information using Bayesian statistics to calculate the likelihood of each interaction. The proposed model is tested on Saccharomyces cerevisiae's interactions extracted from the DIP and IntAct databases and provides substantial improvements in terms of accuracy, precision, recall and F1 score, as compared with the most widely used related state-of-the-art techniques.
Collapse
Affiliation(s)
- Hafeez Ur Rehman
- Department of Computer Science, FAST National University of Computer & Emerging Sciences, Peshawar, Pakistan.
| | | | | | | |
Collapse
|
20
|
Kundrotas PJ, Anishchenko I, Dauzhenka T, Kotthoff I, Mnevets D, Copeland MM, Vakser IA. Dockground: A comprehensive data resource for modeling of protein complexes. Protein Sci 2017; 27:172-181. [PMID: 28891124 DOI: 10.1002/pro.3295] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Revised: 09/06/2017] [Accepted: 09/07/2017] [Indexed: 12/28/2022]
Abstract
Characterization of life processes at the molecular level requires structural details of protein interactions. The number of experimentally determined structures of protein-protein complexes accounts only for a fraction of known protein interactions. This gap in structural description of the interactome has to be bridged by modeling. An essential part of the development of structural modeling/docking techniques for protein interactions is databases of protein-protein complexes. They are necessary for studying protein interfaces, providing a knowledge base for docking algorithms, and developing intermolecular potentials, search procedures, and scoring functions. Development of protein-protein docking techniques requires thorough benchmarking of different parts of the docking protocols on carefully curated sets of protein-protein complexes. We present a comprehensive description of the Dockground resource (http://dockground.compbio.ku.edu) for structural modeling of protein interactions, including previously unpublished unbound docking benchmark set 4, and the X-ray docking decoy set 2. The resource offers a variety of interconnected datasets of protein-protein complexes and other data for the development and testing of different aspects of protein docking methodologies. Based on protein-protein complexes extracted from the PDB biounit files, Dockground offers sets of X-ray unbound, simulated unbound, model, and docking decoy structures. All datasets are freely available for download, as a whole or selecting specific structures, through a user-friendly interface on one integrated website.
Collapse
Affiliation(s)
- Petras J Kundrotas
- Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66045
| | - Ivan Anishchenko
- Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66045
| | - Taras Dauzhenka
- Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66045
| | - Ian Kotthoff
- Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66045
| | - Daniil Mnevets
- Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66045
| | - Matthew M Copeland
- Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66045
| | - Ilya A Vakser
- Center for Computational Biology, The University of Kansas, Lawrence, Kansas, 66045.,Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas, 66045
| |
Collapse
|
21
|
Computational modeling of protein assemblies. Curr Opin Struct Biol 2017; 44:179-189. [DOI: 10.1016/j.sbi.2017.04.006] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Revised: 04/07/2017] [Accepted: 04/11/2017] [Indexed: 01/18/2023]
|
22
|
Liu S, Liu Y, Zhao J, Cai S, Qian H, Zuo K, Zhao L, Zhang L. A computational interactome for prioritizing genes associated with complex agronomic traits in rice (Oryza sativa). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 90:177-188. [PMID: 28074633 DOI: 10.1111/tpj.13475] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Revised: 12/20/2016] [Accepted: 12/22/2016] [Indexed: 05/18/2023]
Abstract
Rice (Oryza sativa) is one of the most important staple foods for more than half of the global population. Many rice traits are quantitative, complex and controlled by multiple interacting genes. Thus, a full understanding of genetic relationships will be critical to systematically identify genes controlling agronomic traits. We developed a genome-wide rice protein-protein interaction network (RicePPINet, http://netbio.sjtu.edu.cn/riceppinet) using machine learning with structural relationship and functional information. RicePPINet contained 708 819 predicted interactions for 16 895 non-transposable element related proteins. The power of the network for discovering novel protein interactions was demonstrated through comparison with other publicly available protein-protein interaction (PPI) prediction methods, and by experimentally determined PPI data sets. Furthermore, global analysis of domain-mediated interactions revealed RicePPINet accurately reflects PPIs at the domain level. Our studies showed the efficiency of the RicePPINet-based method in prioritizing candidate genes involved in complex agronomic traits, such as disease resistance and drought tolerance, was approximately 2-11 times better than random prediction. RicePPINet provides an expanded landscape of computational interactome for the genetic dissection of agronomically important traits in rice.
Collapse
Affiliation(s)
- Shiwei Liu
- Department of Plant Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yihui Liu
- Department of Plant Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Jiawei Zhao
- Department of Plant Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Shitao Cai
- Department of Plant Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Hongmei Qian
- Department of Plant Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Kaijing Zuo
- Department of Plant Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Lingxia Zhao
- Department of Plant Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Lida Zhang
- Department of Plant Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Urban Agriculture (South) Ministry of Agriculture, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| |
Collapse
|
23
|
Zhang L, Borthakur S, Buck M. Dissociation of a Dynamic Protein Complex Studied by All-Atom Molecular Simulations. Biophys J 2016; 110:877-86. [PMID: 26910424 PMCID: PMC4776036 DOI: 10.1016/j.bpj.2015.12.036] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Revised: 11/28/2015] [Accepted: 12/01/2015] [Indexed: 12/11/2022] Open
Abstract
The process of protein complex dissociation remains to be understood at the atomic level of detail. Computers now allow microsecond timescale molecular-dynamics simulations, which make the visualization of such processes possible. Here, we investigated the dissociation process of the EphA2-SHIP2 SAM-SAM domain heterodimer complex using unrestrained all-atom molecular-dynamics simulations. Previous studies on this system have shown that alternate configurations are sampled, that their interconversion can be fast, and that the complex is dynamic by nature. Starting from different NMR-derived structures, mutants were designed to stabilize a subset of configurations by swapping ion pairs across the protein-protein interface. We focused on two mutants, K956D/D1235K and R957D/D1223R, with attenuated binding affinity compared with the wild-type proteins. In contrast to calculations on the wild-type complexes, the majority of simulations of these mutants showed protein dissociation within 2.4 μs. During the separation process, we observed domain rotation and pivoting as well as a translation and simultaneous rolling, typically to alternate and weaker binding interfaces. Several unsuccessful recapturing attempts occurred once the domains were moderately separated. An analysis of protein solvation suggests that the dissociation process correlates with a progressive loss of protein-protein contacts. Furthermore, an evaluation of internal protein dynamics using quasi-harmonic and order parameter analyses indicates that changes in protein internal motions are expected to contribute significantly to the thermodynamics of protein dissociation. Considering protein association as the reverse of the separation process, the initial role of charged/polar interactions is emphasized, followed by changes in protein and solvent dynamics. The trajectories show that protein separation does not follow a single distinct pathway, but suggest that the mechanism of dissociation is common in that it initially involves transitions to surfaces with fewer, less favorable contacts compared with those seen in the fully formed complex.
Collapse
Affiliation(s)
- Liqun Zhang
- Department of Physiology and Biophysics, School of Medicine, Case Western Reserve University, Cleveland, Ohio
| | - Susmita Borthakur
- Department of Physiology and Biophysics, School of Medicine, Case Western Reserve University, Cleveland, Ohio
| | - Matthias Buck
- Department of Physiology and Biophysics, School of Medicine, Case Western Reserve University, Cleveland, Ohio; Department of Neurosciences, School of Medicine, Case Western Reserve University, Cleveland, Ohio; Department of Pharmacology, School of Medicine, Case Western Reserve University, Cleveland, Ohio; Comprehensive Cancer Center, School of Medicine, Case Western Reserve University, Cleveland, Ohio; Center for Proteomics and Bioinformatics, School of Medicine, Case Western Reserve University, Cleveland, Ohio.
| |
Collapse
|
24
|
Kuang X, Dhroso A, Han JG, Shyu CR, Korkin D. DOMMINO 2.0: integrating structurally resolved protein-, RNA-, and DNA-mediated macromolecular interactions. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:bav114. [PMID: 26827237 PMCID: PMC4733329 DOI: 10.1093/database/bav114] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2015] [Accepted: 11/16/2015] [Indexed: 11/14/2022]
Abstract
Macromolecular interactions are formed between proteins, DNA and RNA molecules. Being a principle building block in macromolecular assemblies and pathways, the interactions underlie most of cellular functions. Malfunctioning of macromolecular interactions is also linked to a number of diseases. Structural knowledge of the macromolecular interaction allows one to understand the interaction's mechanism, determine its functional implications and characterize the effects of genetic variations, such as single nucleotide polymorphisms, on the interaction. Unfortunately, until now the interactions mediated by different types of macromolecules, e.g. protein-protein interactions or protein-DNA interactions, are collected into individual and unrelated structural databases. This presents a significant obstacle in the analysis of macromolecular interactions. For instance, the homogeneous structural interaction databases prevent scientists from studying structural interactions of different types but occurring in the same macromolecular complex. Here, we introduce DOMMINO 2.0, a structural Database Of Macro-Molecular INteractiOns. Compared to DOMMINO 1.0, a comprehensive database on protein-protein interactions, DOMMINO 2.0 includes the interactions between all three basic types of macromolecules extracted from PDB files. DOMMINO 2.0 is automatically updated on a weekly basis. It currently includes ∼1,040,000 interactions between two polypeptide subunits (e.g. domains, peptides, termini and interdomain linkers), ∼43,000 RNA-mediated interactions, and ∼12,000 DNA-mediated interactions. All protein structures in the database are annotated using SCOP and SUPERFAMILY family annotation. As a result, protein-mediated interactions involving protein domains, interdomain linkers, C- and N- termini, and peptides are identified. Our database provides an intuitive web interface, allowing one to investigate interactions at three different resolution levels: whole subunit network, binary interaction and interaction interface. Database URL: http://dommino.org.
Collapse
Affiliation(s)
- Xingyan Kuang
- Informatics Institute, University of Missouri, Columbia, MO, USA
| | - Andi Dhroso
- Department of Computer Science and Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA
| | - Jing Ginger Han
- Informatics Institute, University of Missouri, Columbia, MO, USA
| | - Chi-Ren Shyu
- Informatics Institute, University of Missouri, Columbia, MO, USA, Department of Electrical and Computer Engineering, Department of Computer Science, University of Missouri, Columbia, MO, USA
| | - Dmitry Korkin
- Department of Computer Science and Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA, USA,
| |
Collapse
|
25
|
Ghoorah AW, Devignes MD, Smaïl-Tabbone M, Ritchie DW. Protein docking using case-based reasoning. Proteins 2015; 81:2150-8. [PMID: 24123156 DOI: 10.1002/prot.24433] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Protein docking algorithms aim to calculate the three-dimensional (3D) structure of a protein complex starting from its unbound components. Although ab initio docking algorithms are improving, there is a growing need to use homology modeling techniques to exploit the rapidly increasing volumes of structural information that now exist. However, most current homology modeling approaches involve finding a pair of complete single-chain structures in a homologous protein complex to use as a 3D template, despite the fact that protein complexes are often formed from one or more domain-domain interactions (DDIs). To model 3D protein complexes by domain-domain homology, we have developed a case-based reasoning approach called KBDOCK which systematically identifies and reuses domain family binding sites from our database of nonredundant DDIs. When tested on 54 protein complexes from the Protein Docking Benchmark, our approach provides a near-perfect way to model single-domain protein complexes when full-homology templates are available, and it extends our ability to model more difficult cases when only partial or incomplete templates exist. These promising early results highlight the need for a new and diverse docking benchmark set, specifically designed to assess homology docking approaches.
Collapse
Affiliation(s)
- Anisah W Ghoorah
- Department of Obstetrics and Gynaecology, University of Tuebingen, Tuebingen, Germany
| | | | | | | |
Collapse
|
26
|
Composition of Overlapping Protein-Protein and Protein-Ligand Interfaces. PLoS One 2015; 10:e0140965. [PMID: 26517868 PMCID: PMC4627654 DOI: 10.1371/journal.pone.0140965] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2015] [Accepted: 10/03/2015] [Indexed: 11/19/2022] Open
Abstract
Protein-protein interactions (PPIs) play a major role in many biological processes and they represent an important class of targets for therapeutic intervention. However, targeting PPIs is challenging because often no convenient natural substrates are available as starting point for small-molecule design. Here, we explored the characteristics of protein interfaces in five non-redundant datasets of 174 protein-protein (PP) complexes, and 161 protein-ligand (PL) complexes from the ABC database, 436 PP complexes, and 196 PL complexes from the PIBASE database and a dataset of 89 PL complexes from the Timbal database. In all cases, the small molecule ligands must bind at the respective PP interface. We observed similar amino acid frequencies in all three datasets. Remarkably, also the characteristics of PP contacts and overlapping PL contacts are highly similar.
Collapse
|
27
|
Surfing the Protein-Protein Interaction Surface Using Docking Methods: Application to the Design of PPI Inhibitors. Molecules 2015; 20:11569-603. [PMID: 26111183 PMCID: PMC6272567 DOI: 10.3390/molecules200611569] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Revised: 06/02/2015] [Accepted: 06/15/2015] [Indexed: 02/06/2023] Open
Abstract
Blocking protein-protein interactions (PPI) using small molecules or peptides modulates biochemical pathways and has therapeutic significance. PPI inhibition for designing drug-like molecules is a new area that has been explored extensively during the last decade. Considering the number of available PPI inhibitor databases and the limited number of 3D structures available for proteins, docking and scoring methods play a major role in designing PPI inhibitors as well as stabilizers. Docking methods are used in the design of PPI inhibitors at several stages of finding a lead compound, including modeling the protein complex, screening for hot spots on the protein-protein interaction interface and screening small molecules or peptides that bind to the PPI interface. There are three major challenges to the use of docking on the relatively flat surfaces of PPI. In this review we will provide some examples of the use of docking in PPI inhibitor design as well as its limitations. The combination of experimental and docking methods with improved scoring function has thus far resulted in few success stories of PPI inhibitors for therapeutic purposes. Docking algorithms used for PPI are in the early stages, however, and as more data are available docking will become a highly promising area in the design of PPI inhibitors or stabilizers.
Collapse
|
28
|
Mulder NJ, Akinola RO, Mazandu GK, Rapanoel H. Using biological networks to improve our understanding of infectious diseases. Comput Struct Biotechnol J 2014; 11:1-10. [PMID: 25379138 PMCID: PMC4212278 DOI: 10.1016/j.csbj.2014.08.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Infectious diseases are the leading cause of death, particularly in developing countries. Although many drugs are available for treating the most common infectious diseases, in many cases the mechanism of action of these drugs or even their targets in the pathogen remain unknown. In addition, the key factors or processes in pathogens that facilitate infection and disease progression are often not well understood. Since proteins do not work in isolation, understanding biological systems requires a better understanding of the interconnectivity between proteins in different pathways and processes, which includes both physical and other functional interactions. Such biological networks can be generated within organisms or between organisms sharing a common environment using experimental data and computational predictions. Though different data sources provide different levels of accuracy, confidence in interactions can be measured using interaction scores. Connections between interacting proteins in biological networks can be represented as graphs and edges, and thus studied using existing algorithms and tools from graph theory. There are many different applications of biological networks, and here we discuss three such applications, specifically applied to the infectious disease tuberculosis, with its causative agent Mycobacterium tuberculosis and host, Homo sapiens. The applications include the use of the networks for function prediction, comparison of networks for evolutionary studies, and the generation and use of host–pathogen interaction networks.
Collapse
Affiliation(s)
- Nicola J Mulder
- Computational Biology Group, Department of Clinical Laboratory Sciences, IDM, University of Cape Town Faculty of Health Sciences, Anzio Road, Observatory, Cape Town, South Africa
| | - Richard O Akinola
- Computational Biology Group, Department of Clinical Laboratory Sciences, IDM, University of Cape Town Faculty of Health Sciences, Anzio Road, Observatory, Cape Town, South Africa
| | - Gaston K Mazandu
- Computational Biology Group, Department of Clinical Laboratory Sciences, IDM, University of Cape Town Faculty of Health Sciences, Anzio Road, Observatory, Cape Town, South Africa
| | - Holifidy Rapanoel
- Computational Biology Group, Department of Clinical Laboratory Sciences, IDM, University of Cape Town Faculty of Health Sciences, Anzio Road, Observatory, Cape Town, South Africa
| |
Collapse
|
29
|
Huang SY. Search strategies and evaluation in protein–protein docking: principles, advances and challenges. Drug Discov Today 2014; 19:1081-96. [DOI: 10.1016/j.drudis.2014.02.005] [Citation(s) in RCA: 87] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2013] [Revised: 01/04/2014] [Accepted: 02/24/2014] [Indexed: 01/10/2023]
|
30
|
Andreani J, Guerois R. Evolution of protein interactions: From interactomes to interfaces. Arch Biochem Biophys 2014; 554:65-75. [DOI: 10.1016/j.abb.2014.05.010] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 04/28/2014] [Accepted: 05/12/2014] [Indexed: 12/16/2022]
|
31
|
Lu HC, Fornili A, Fraternali F. Protein-protein interaction networks studies and importance of 3D structure knowledge. Expert Rev Proteomics 2014; 10:511-20. [PMID: 24206225 DOI: 10.1586/14789450.2013.856764] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Protein-protein interaction networks (PPINs) are a powerful tool to study biological processes in living cells. In this review, we present the progress of PPIN studies from abstract to more detailed representations. We will focus on 3D interactome networks, which offer detailed information at the atomic level. This information can be exploited in understanding not only the underlying cellular mechanisms, but also how human variants and disease-causing mutations affect protein functions and complexes' stability. Recent studies have used structural information on PPINs to also understand the molecular mechanisms of binding partner selection. We will address the challenges in generating 3D PPINs due to the restricted number of solved protein structures. Finally, some of the current use of 3D PPINs will be discussed, highlighting their contribution to the studies in genotype-phenotype relationships and in the optimization of targeted studies to design novel chemical compounds for medical treatments.
Collapse
Affiliation(s)
- Hui-Chun Lu
- Randall Division of Cell and Molecular Biophysics, King's College London, New Hunt's House, London SE1 1UL, UK
| | | | | |
Collapse
|
32
|
Goncearenco A, Shoemaker BA, Zhang D, Sarychev A, Panchenko AR. Coverage of protein domain families with structural protein-protein interactions: current progress and future trends. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2014; 116:187-93. [PMID: 24931138 DOI: 10.1016/j.pbiomolbio.2014.05.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2013] [Revised: 04/14/2014] [Accepted: 05/17/2014] [Indexed: 11/16/2022]
Abstract
Protein interactions have evolved into highly precise and regulated networks adding an immense layer of complexity to cellular systems. The most accurate atomistic description of protein binding sites can be obtained directly from structures of protein complexes. The availability of structurally characterized protein interfaces significantly improves our understanding of interactomes, and the progress in structural characterization of protein-protein interactions (PPIs) can be measured by calculating the structural coverage of protein domain families. We analyze the coverage of protein domain families (defined according to CDD and Pfam databases) by structures, structural protein-protein complexes and unique protein binding sites. Structural PPI coverage of currently available protein families is about 30% without any signs of saturation in coverage growth dynamics. Given the current growth rates of domain databases and structural PPI deposition, complete domain coverage with PPIs is not expected in the near future. As a result of this study we identify families without any protein-protein interaction evidence (listed on a supporting website http://www.ncbi.nlm.nih.gov/Structure/ibis/coverage/) and propose them as potential targets for structural studies with a focus on protein interactions.
Collapse
Affiliation(s)
- Alexander Goncearenco
- Computational Biology Branch of the National Center for Biotechnology Information in Bethesda, Maryland, United States
| | - Benjamin A Shoemaker
- Computational Biology Branch of the National Center for Biotechnology Information in Bethesda, Maryland, United States
| | - Dachuan Zhang
- Computational Biology Branch of the National Center for Biotechnology Information in Bethesda, Maryland, United States
| | - Alexey Sarychev
- Computational Biology Branch of the National Center for Biotechnology Information in Bethesda, Maryland, United States
| | - Anna R Panchenko
- Computational Biology Branch of the National Center for Biotechnology Information in Bethesda, Maryland, United States.
| |
Collapse
|
33
|
Webb B, Eswar N, Fan H, Khuri N, Pieper U, Dong G, Sali A. Comparative Modeling of Drug Target Proteins☆. REFERENCE MODULE IN CHEMISTRY, MOLECULAR SCIENCES AND CHEMICAL ENGINEERING 2014. [PMCID: PMC7157477 DOI: 10.1016/b978-0-12-409547-2.11133-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
In this perspective, we begin by describing the comparative protein structure modeling technique and the accuracy of the corresponding models. We then discuss the significant role that comparative prediction plays in drug discovery. We focus on virtual ligand screening against comparative models and illustrate the state-of-the-art by a number of specific examples.
Collapse
|
34
|
Finn RD, Miller BL, Clements J, Bateman A. iPfam: a database of protein family and domain interactions found in the Protein Data Bank. Nucleic Acids Res 2013; 42:D364-73. [PMID: 24297255 PMCID: PMC3965099 DOI: 10.1093/nar/gkt1210] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The database iPfam, available at http://ipfam.org, catalogues Pfam domain interactions based on known 3D structures that are found in the Protein Data Bank, providing interaction data at the molecular level. Previously, the iPfam domain–domain interaction data was integrated within the Pfam database and website, but it has now been migrated to a separate database. This allows for independent development, improving data access and giving clearer separation between the protein family and interactions datasets. In addition to domain–domain interactions, iPfam has been expanded to include interaction data for domain bound small molecule ligands. Functional annotations are provided from source databases, supplemented by the incorporation of Wikipedia articles where available. iPfam (version 1.0) contains >9500 domain–domain and 15 500 domain–ligand interactions. The new website provides access to this data in a variety of ways, including interactive visualizations of the interaction data.
Collapse
Affiliation(s)
- Robert D Finn
- HHMI Janelia Farm Research Campus, 19700 Helix Drive, Ashburn VA 20147 USA and European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | | | | |
Collapse
|
35
|
Pieper U, Webb BM, Dong GQ, Schneidman-Duhovny D, Fan H, Kim SJ, Khuri N, Spill YG, Weinkam P, Hammel M, Tainer JA, Nilges M, Sali A. ModBase, a database of annotated comparative protein structure models and associated resources. Nucleic Acids Res 2013; 42:D336-46. [PMID: 24271400 PMCID: PMC3965011 DOI: 10.1093/nar/gkt1144] [Citation(s) in RCA: 216] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
ModBase (http://salilab.org/modbase) is a database of annotated comparative protein structure models. The models are calculated by ModPipe, an automated modeling pipeline that relies primarily on Modeller for fold assignment, sequence-structure alignment, model building and model assessment (http://salilab.org/modeller/). ModBase currently contains almost 30 million reliable models for domains in 4.7 million unique protein sequences. ModBase allows users to compute or update comparative models on demand, through an interface to the ModWeb modeling server (http://salilab.org/modweb). ModBase models are also available through the Protein Model Portal (http://www.proteinmodelportal.org/). Recently developed associated resources include the AllosMod server for modeling ligand-induced protein dynamics (http://salilab.org/allosmod), the AllosMod-FoXS server for predicting a structural ensemble that fits an SAXS profile (http://salilab.org/allosmod-foxs), the FoXSDock server for protein–protein docking filtered by an SAXS profile (http://salilab.org/foxsdock), the SAXS Merge server for automatic merging of SAXS profiles (http://salilab.org/saxsmerge) and the Pose & Rank server for scoring protein–ligand complexes (http://salilab.org/poseandrank). In this update, we also highlight two applications of ModBase: a PSI:Biology initiative to maximize the structural coverage of the human alpha-helical transmembrane proteome and a determination of structural determinants of human immunodeficiency virus-1 protease specificity.
Collapse
Affiliation(s)
- Ursula Pieper
- Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biosciences, Byers Hall at Mission Bay, Office 503B, University of California at San Francisco, 1700 4th Street, San Francisco, CA 94158, USA, Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, Byers Hall at Mission Bay, Office 503B, University of California at San Francisco, 1700 4th Street, San Francisco, CA 94158, USA, Graduate Group in Biophysics, University of California at San Francisco, CA 94158, USA, Structural Bioinformatics Unit, Structural Biology and Chemistry department, Institut Pasteur, 25 rue du Docteur Roux, 75015 Paris, France, Université Paris Diderot-Paris 7, école doctorale iViv, Paris Rive Gauche, 5 rue Thomas Mann, 75013 Paris, France, Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Department of Molecular Biology, Skaggs Institute of Chemical Biology, The Scripps Research Institute, La Jolla, CA 92037, USA, Life Sciences Division, Department of Molecular Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Van Voorst JR, Finzel BC. Searching for likeness in a database of macromolecular complexes. J Chem Inf Model 2013; 53:2634-47. [PMID: 24047445 DOI: 10.1021/ci4002537] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
A software tool and workflow based on distance geometry is presented that can be used to search for local similarity in substructures in a comprehensive database of experimentally derived macromolecular structure. The method does not rely on fold annotation, specific secondary structure assignments, or sequence homology and may be used to locate compound substructures of multiple segments spanning different macromolecules that share a queried backbone geometry. This generalized substructure searching capability is intended to allow users to play an active part in exploring the role specific substructures play in larger protein domains, quaternary assemblies of proteins, and macromolecular complexes of proteins and polynucleotides. The user may select any portion or portions of an existing structure or complex to serve as a template for searching, and other structures that share the same structural features are identified, retrieved and overlaid to emphasize substructural likeness. Matching structures may be compared using a variety of integrated tools including molecular graphics for structure visualization and matching substructure sequence logos. A number of examples are provided that illustrate how generalized substructure searching may be used to understand both the similarity, and individuality of specific macromolecular structures. Web-based access to our substructure searching services is freely available at https://drugsite.msi.umn.edu.
Collapse
Affiliation(s)
- Jeffrey R Van Voorst
- Department of Medicinal Chemistry, University of Minnesota College of Pharmacy , Minneapolis, Minnesota 55455, United States
| | | |
Collapse
|
37
|
Mosca R, Céol A, Stein A, Olivella R, Aloy P. 3did: a catalog of domain-based interactions of known three-dimensional structure. Nucleic Acids Res 2013; 42:D374-9. [PMID: 24081580 PMCID: PMC3965002 DOI: 10.1093/nar/gkt887] [Citation(s) in RCA: 163] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The database of 3D interacting domains (3did, available online for browsing and bulk download at http://3did.irbbarcelona.org) is a catalog of protein–protein interactions for which a high-resolution 3D structure is known. 3did collects and classifies all structural templates of domain–domain interactions in the Protein Data Bank, providing molecular details for such interactions. The current version also includes a pipeline for the discovery and annotation of novel domain–motif interactions. For every interaction, 3did identifies and groups different binding modes by clustering similar interfaces into ‘interaction topologies’. By maintaining a constantly updated collection of domain-based structural interaction templates, 3did is a reference source of information for the structural characterization of protein interaction networks. 3did is updated every 6 months.
Collapse
Affiliation(s)
- Roberto Mosca
- Joint IRB-BSC Program in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona), c/ Baldiri Reixac 10-12, 08028 Barcelona, Spain, Center for Genomic Science of IIT@SEMM, Istituto Italiano di Tecnologia (IIT), Via Adamello 16, 20139 Milan, Italy, California Institute for Quantitative Biomedical Research (qb3) and Department of Bioengineering and Therapeutic Sciences, MC 2530, University of California San Francisco (UCSF) CA 94158-2330, USA and Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluís Companys 23, 08010 Barcelona, Spain
| | | | | | | | | |
Collapse
|
38
|
Beta atomic contacts: identifying critical specific contacts in protein binding interfaces. PLoS One 2013; 8:e59737. [PMID: 23630569 PMCID: PMC3632532 DOI: 10.1371/journal.pone.0059737] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2012] [Accepted: 02/21/2013] [Indexed: 11/19/2022] Open
Abstract
Specific binding between proteins plays a crucial role in molecular functions and biological processes. Protein binding interfaces and their atomic contacts are typically defined by simple criteria, such as distance-based definitions that only use some threshold of spatial distance in previous studies. These definitions neglect the nearby atomic organization of contact atoms, and thus detect predominant contacts which are interrupted by other atoms. It is questionable whether such kinds of interrupted contacts are as important as other contacts in protein binding. To tackle this challenge, we propose a new definition called beta (β) atomic contacts. Our definition, founded on the β-skeletons in computational geometry, requires that there is no other atom in the contact spheres defined by two contact atoms; this sphere is similar to the van der Waals spheres of atoms. The statistical analysis on a large dataset shows that β contacts are only a small fraction of conventional distance-based contacts. To empirically quantify the importance of β contacts, we design βACV, an SVM classifier with β contacts as input, to classify homodimers from crystal packing. We found that our βACV is able to achieve the state-of-the-art classification performance superior to SVM classifiers with distance-based contacts as input. Our βACV also outperforms several existing methods when being evaluated on several datasets in previous works. The promising empirical performance suggests that β contacts can truly identify critical specific contacts in protein binding interfaces. β contacts thus provide a new model for more precise description of atomic organization in protein quaternary structures than distance-based contacts.
Collapse
|
39
|
Kysilka J, Vondrášek J. Towards a better understanding of the specificity of protein-protein interaction. J Mol Recognit 2013; 25:604-15. [PMID: 23108620 DOI: 10.1002/jmr.2219] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
In order to predict interaction interface for proteins, it is crucial to identify their characteristic features controlling the interaction process. We present analysis of 69 crystal structures of dimer protein complexes that provides a basis for reasonable description of the phenomenon. Interaction interfaces of two proteins at amino acids level were localized and described in terms of their chemical composition, binding preferences, and residue interaction energies utilizing Amber empirical force field. The characteristic properties of the interaction interface were compared against set of corresponding intramolecular binding parameters for amino acids in proteins. It has been found that geometrically distinct clusters of large hydrophobic amino acids (leucine, valine, isoleucine, and phenylalanine) as well as polar tyrosines and charged arginines are signatures of the protein-protein interaction interface. At some extent, we can generalize that protein-protein interaction (seen through interaction between amino acids) is very similar to the intramolecular arrangement of amino acids, although intermolecular pairs have generally lower interaction energies with their neighbors. Interfaces, therefore, possess high degree of complementarity suggesting also high selectivity of the process. The utilization of our results can improve interface prediction algorithms and improve our understanding of protein-protein recognition.
Collapse
Affiliation(s)
- Jiří Kysilka
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague, Czech Republic
| | | |
Collapse
|
40
|
Zhang QC, Petrey D, Garzón JI, Deng L, Honig B. PrePPI: a structure-informed database of protein-protein interactions. Nucleic Acids Res 2013; 41:D828-33. [PMID: 23193263 PMCID: PMC3531098 DOI: 10.1093/nar/gks1231] [Citation(s) in RCA: 182] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
PrePPI (http://bhapp.c2b2.columbia.edu/PrePPI) is a database that combines predicted and experimentally determined protein-protein interactions (PPIs) using a Bayesian framework. Predicted interactions are assigned probabilities of being correct, which are derived from calculated likelihood ratios (LRs) by combining structural, functional, evolutionary and expression information, with the most important contribution coming from structure. Experimentally determined interactions are compiled from a set of public databases that manually collect PPIs from the literature and are also assigned LRs. A final probability is then assigned to every interaction by combining the LRs for both predicted and experimentally determined interactions. The current version of PrePPI contains ∼2 million PPIs that have a probability more than ∼0.1 of which ∼60 000 PPIs for yeast and ∼370 000 PPIs for human are considered high confidence (probability > 0.5). The PrePPI database constitutes an integrated resource that enables users to examine aggregate information on PPIs, including both known and potentially novel interactions, and that provides structural models for many of the PPIs.
Collapse
Affiliation(s)
- Qiangfeng Cliff Zhang
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Center for Computational Biology and Bioinformatics, Columbia Initiative in Systems Biology, Columbia University, New York, NY 10032, USA and School of Software, Central South University, Changsha 410083, China
| | - Donald Petrey
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Center for Computational Biology and Bioinformatics, Columbia Initiative in Systems Biology, Columbia University, New York, NY 10032, USA and School of Software, Central South University, Changsha 410083, China
| | - José Ignacio Garzón
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Center for Computational Biology and Bioinformatics, Columbia Initiative in Systems Biology, Columbia University, New York, NY 10032, USA and School of Software, Central South University, Changsha 410083, China
| | - Lei Deng
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Center for Computational Biology and Bioinformatics, Columbia Initiative in Systems Biology, Columbia University, New York, NY 10032, USA and School of Software, Central South University, Changsha 410083, China
| | - Barry Honig
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Center for Computational Biology and Bioinformatics, Columbia Initiative in Systems Biology, Columbia University, New York, NY 10032, USA and School of Software, Central South University, Changsha 410083, China
- *To whom correspondence should be addressed. Tel: +1 212 851 4651; Fax: +1 212 851 4650,
| |
Collapse
|
41
|
Mosca R, Céol A, Aloy P. Interactome3D: adding structural details to protein networks. Nat Methods 2013; 10:47-53. [DOI: 10.1038/nmeth.2289] [Citation(s) in RCA: 339] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Accepted: 10/30/2012] [Indexed: 01/13/2023]
|
42
|
Basse MJ, Betzi S, Bourgeas R, Bouzidi S, Chetrit B, Hamon V, Morelli X, Roche P. 2P2Idb: a structural database dedicated to orthosteric modulation of protein-protein interactions. Nucleic Acids Res 2012. [PMID: 23203891 PMCID: PMC3531195 DOI: 10.1093/nar/gks1002] [Citation(s) in RCA: 117] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Protein–protein interactions are considered as one of the next generation of
therapeutic targets. Specific tools thus need to be developed to tackle this challenging
chemical space. In an effort to derive some common principles from recent successes, we
have built 2P2Idb (freely accessible at http://2p2idb.cnrs-mrs.fr), a
hand-curated structural database dedicated to protein–protein interactions with
known orthosteric modulators. It includes all interactions for which both the
protein–protein and protein–ligand complexes have been structurally
characterized. A web server provides links to related sites of interest, binding affinity
data, pre-calculated structural information about protein–protein interfaces and 3D
interactive views through java applets. Comparison of interfaces in 2P2Idb to those of
representative datasets of heterodimeric complexes has led to the identification of
geometrical parameters and residue properties to assess the druggability of
protein–protein complexes. A tool is proposed to calculate a series of biophysical
and geometrical parameters that characterize protein–protein interfaces. A large
range of descriptors are computed including, buried accessible surface area, gap volume,
non-bonded contacts, hydrogen-bonds, atom and residue composition, number of segments and
secondary structure contribution. All together the 2P2I database represents a structural
source of information for scientists from academic institutions or pharmaceutical
industries.
Collapse
Affiliation(s)
- Marie Jeanne Basse
- Laboratory of Integrative Structural and Chemical Biology (iSCB), Centre de Recherche en Cancérologie de Marseille (CRCM), CNRS UMR 7258, INSERM U 1068, Institut Paoli-Calmettes, Marseille, France
| | | | | | | | | | | | | | | |
Collapse
|
43
|
Zhou H, Jin J, Wong L. Progress in computational studies of host-pathogen interactions. J Bioinform Comput Biol 2012; 11:1230001. [PMID: 23600809 DOI: 10.1142/s0219720012300018] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Host-pathogen interactions are important for understanding infection mechanism and developing better treatment and prevention of infectious diseases. Many computational studies on host-pathogen interactions have been published. Here, we review recent progress and results in this field and provide a systematic summary, comparison and discussion of computational studies on host-pathogen interactions, including prediction and analysis of host-pathogen protein-protein interactions; basic principles revealed from host-pathogen interactions; and database and software tools for host-pathogen interaction data collection, integration and analysis.
Collapse
Affiliation(s)
- Hufeng Zhou
- NUS Graduate School for Integrative Sciences & Engineering, National University of Singapore, Singapore 117456, Singapore.
| | | | | |
Collapse
|
44
|
Biophysical and computational fragment-based approaches to targeting protein-protein interactions: applications in structure-guided drug discovery. Q Rev Biophys 2012; 45:383-426. [PMID: 22971516 DOI: 10.1017/s0033583512000108] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Drug discovery has classically targeted the active sites of enzymes or ligand-binding sites of receptors and ion channels. In an attempt to improve selectivity of drug candidates, modulation of protein-protein interfaces (PPIs) of multiprotein complexes that mediate conformation or colocation of components of cell-regulatory pathways has become a focus of interest. However, PPIs in multiprotein systems continue to pose significant challenges, as they are generally large, flat and poor in distinguishing features, making the design of small molecule antagonists a difficult task. Nevertheless, encouragement has come from the recognition that a few amino acids - so-called hotspots - may contribute the majority of interaction-free energy. The challenges posed by protein-protein interactions have led to a wellspring of creative approaches, including proteomimetics, stapled α-helical peptides and a plethora of antibody inspired molecular designs. Here, we review a more generic approach: fragment-based drug discovery. Fragments allow novel areas of chemical space to be explored more efficiently, but the initial hits have low affinity. This means that they will not normally disrupt PPIs, unless they are tethered, an approach that has been pioneered by Wells and co-workers. An alternative fragment-based approach is to stabilise the uncomplexed components of the multiprotein system in solution and employ conventional fragment-based screening. Here, we describe the current knowledge of the structures and properties of protein-protein interactions and the small molecules that can modulate them. We then describe the use of sensitive biophysical methods - nuclear magnetic resonance, X-ray crystallography, surface plasmon resonance, differential scanning fluorimetry or isothermal calorimetry - to screen and validate fragment binding. Fragment hits can subsequently be evolved into larger molecules with higher affinity and potency. These may provide new leads for drug candidates that target protein-protein interactions and have therapeutic value.
Collapse
|
45
|
The emergence of protein complexes: quaternary structure, dynamics and allostery. Colworth Medal Lecture. Biochem Soc Trans 2012; 40:475-91. [PMID: 22616857 DOI: 10.1042/bst20120056] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
All proteins require physical interactions with other proteins in order to perform their functions. Most of them oligomerize into homomers, and a vast majority of these homomers interact with other proteins, at least part of the time, forming transient or obligate heteromers. In the present paper, we review the structural, biophysical and evolutionary aspects of these protein interactions. We discuss how protein function and stability benefit from oligomerization, as well as evolutionary pathways by which oligomers emerge, mostly from the perspective of homomers. Finally, we emphasize the specificities of heteromeric complexes and their structure and evolution. We also discuss two analytical approaches increasingly being used to study protein structures as well as their interactions. First, we review the use of the biological networks and graph theory for analysis of protein interactions and structure. Secondly, we discuss recent advances in techniques for detecting correlated mutations, with the emphasis on their role in identifying pathways of allosteric communication.
Collapse
|
46
|
Winter C, Henschel A, Tuukkanen A, Schroeder M. Protein interactions in 3D: From interface evolution to drug discovery. J Struct Biol 2012; 179:347-58. [DOI: 10.1016/j.jsb.2012.04.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Revised: 03/27/2012] [Accepted: 04/18/2012] [Indexed: 11/25/2022]
|
47
|
Pang B, Kuang X, Zhao N, Korkin D, Shyu CR. PBSword: a web server for searching similar protein-protein binding sites. Nucleic Acids Res 2012; 40:W428-34. [PMID: 22689645 PMCID: PMC3394332 DOI: 10.1093/nar/gks527] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
PBSword is a web server designed for efficient and accurate comparisons and searches of geometrically similar protein–protein binding sites from a large-scale database. The basic idea of PBSword is that each protein binding site is first represented by a high-dimensional vector of ‘visual words’, which characterizes both the global and local shape features of the binding site. It then uses a scalable indexing technique to search for those binding sites whose visual words representations are similar to that of the query binding site. Our system is able to return ranked results of binding sites in short time from a database of 194 322 domain–domain binding sites. PBSword supports query by protein ID and by new structures uploaded by users. PBSword is a useful tool to investigate functional connections among proteins based on the local structures of binding site and has potential applications to protein–protein docking and drug discovery. The system is hosted at http://pbs.rnet.missouri.edu.
Collapse
Affiliation(s)
- Bin Pang
- Informatics Institute and Department of Computer Science, University of Missouri, Columbia, MO, USA
| | | | | | | | | |
Collapse
|
48
|
Garcia-Garcia J, Bonet J, Guney E, Fornes O, Planas J, Oliva B. Networks of ProteinProtein Interactions: From Uncertainty to Molecular Details. Mol Inform 2012; 31:342-62. [PMID: 27477264 DOI: 10.1002/minf.201200005] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2012] [Accepted: 03/09/2012] [Indexed: 11/08/2022]
Abstract
Proteins are the bricks and mortar of cells. The work of proteins is structural and functional, as they are the principal element of the organization of the cell architecture, but they also play a relevant role in its metabolism and regulation. To perform all these functions, proteins need to interact with each other and with other bio-molecules, either to form complexes or to recognize precise targets of their action. For instance, a particular transcription factor may activate one gene or another depending on its interactions with other proteins and not only with DNA. Hence, the ability of a protein to interact with other bio-molecules, and the partners they have at each particular time and location can be crucial to characterize the role of a protein. Proteins rarely act alone; they rather constitute a mingled network of physical interactions or other types of relationships (such as metabolic and regulatory) or signaling cascades. In this context, understanding the function of a protein implies to recognize the members of its neighborhood and to grasp how they associate, both at the systemic and atomic level. The network of physical interactions between the proteins of a system, cell or organism, is defined as the interactome. The purpose of this review is to deepen the description of interactomes at different levels of detail: from the molecular structure of complexes to the global topology of the network of interactions. The approaches and techniques applied experimentally and computationally to attain each level are depicted. The limits of each technique and its integration into a model network, the challenges and actual problems of completeness of an interactome, and the reliability of the interactions are reviewed and summarized. Finally, the application of the current knowledge of protein-protein interactions on modern network medicine and protein function annotation is also explored.
Collapse
Affiliation(s)
- Javier Garcia-Garcia
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Jaume Bonet
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Emre Guney
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Oriol Fornes
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Joan Planas
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain
| | - Baldo Oliva
- Structural Bioinformatics Group, GRIB-IMIM, Universitat Pompeu Fabra, Barcelona Research Park of Biomedicine (PRBB), Catalonia, Spain.
| |
Collapse
|
49
|
Feverati G, Achoch M, Zrimi J, Vuillon L, Lesieur C. Beta-strand interfaces of non-dimeric protein oligomers are characterized by scattered charged residue patterns. PLoS One 2012; 7:e32558. [PMID: 22496732 PMCID: PMC3322119 DOI: 10.1371/journal.pone.0032558] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2011] [Accepted: 01/29/2012] [Indexed: 11/19/2022] Open
Abstract
Protein oligomers are formed either permanently, transiently or even by default. The protein chains are associated through intermolecular interactions constituting the protein interface. The protein interfaces of 40 soluble protein oligomers of stœchiometries above two are investigated using a quantitative and qualitative methodology, which analyzes the x-ray structures of the protein oligomers and considers their interfaces as interaction networks. The protein oligomers of the dataset share the same geometry of interface, made by the association of two individual β-strands (β-interfaces), but are otherwise unrelated. The results show that the β-interfaces are made of two interdigitated interaction networks. One of them involves interactions between main chain atoms (backbone network) while the other involves interactions between side chain and backbone atoms or between only side chain atoms (side chain network). Each one has its own characteristics which can be associated to a distinct role. The secondary structure of the β-interfaces is implemented through the backbone networks which are enriched with the hydrophobic amino acids favored in intramolecular β-sheets (MCWIV). The intermolecular specificity is provided by the side chain networks via positioning different types of charged residues at the extremities (arginine) and in the middle (glutamic acid and histidine) of the interface. Such charge distribution helps discriminating between sequences of intermolecular β-strands, of intramolecular β-strands and of β-strands forming β-amyloid fibers. This might open new venues for drug designs and predictive tool developments. Moreover, the β-strands of the cholera toxin B subunit interface, when produced individually as synthetic peptides, are capable of inhibiting the assembly of the toxin into pentamers. Thus, their sequences contain the features necessary for a β-interface formation. Such β-strands could be considered as ‘assemblons’, independent associating units, by homology to the foldons (independent folding unit). Such property would be extremely valuable in term of assembly inhibitory drug development.
Collapse
Affiliation(s)
| | - Mounia Achoch
- Université de Savoie, Annecy le Vieux Cedex, France
- Laboratoire de Chimie Bioorganique et Macromoléculaire (LCBM), Faculté des Sciences et Techniques-Guéliz, Université Cadi Ayyad, Marrakech, Maroc
| | - Jihad Zrimi
- Université de Savoie, Annecy le Vieux Cedex, France
- Laboratoire de Chimie Bioorganique et Macromoléculaire (LCBM), Faculté des Sciences et Techniques-Guéliz, Université Cadi Ayyad, Marrakech, Maroc
| | | | - Claire Lesieur
- Université de Savoie, Annecy le Vieux Cedex, France
- AGIM, Université Joseph Fourier, Archamps, France
- * E-mail:
| |
Collapse
|
50
|
Pang B, Zhao N, Korkin D, Shyu CR. Fast protein binding site comparisons using visual words representation. ACTA ACUST UNITED AC 2012; 28:1345-52. [PMID: 22492639 DOI: 10.1093/bioinformatics/bts138] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION Finding geometrically similar protein binding sites is crucial for understanding protein functions and can provide valuable information for protein-protein docking and drug discovery. As the number of known protein-protein interaction structures has dramatically increased, a high-throughput and accurate protein binding site comparison method is essential. Traditional alignment-based methods can provide accurate correspondence between the binding sites but are computationally expensive. RESULTS In this article, we present a novel method for the comparisons of protein binding sites using a 'visual words' representation (PBSword). We first extract geometric features of binding site surfaces and build a vocabulary of visual words by clustering a large set of feature descriptors. We then describe a binding site surface with a high-dimensional vector that encodes the frequency of visual words, enhanced by the spatial relationships among them. Finally, we measure the similarity of binding sites by utilizing metric space operations, which provide speedy comparisons between protein binding sites. Our experimental results show that PBSword achieves a comparable classification accuracy to an alignment-based method and improves accuracy of a feature-based method by 36% on a non-redundant dataset. PBSword also exhibits a significant efficiency improvement over an alignment-based method.
Collapse
Affiliation(s)
- Bin Pang
- Informatics Institute, University of Missouri, Columbia, MO 65211, USA
| | | | | | | |
Collapse
|