1
|
Rasul HO, Ghafour DD, Aziz BK, Hassan BA, Rashid TA, Kivrak A. Decoding Drug Discovery: Exploring A-to-Z In Silico Methods for Beginners. Appl Biochem Biotechnol 2025; 197:1453-1503. [PMID: 39630336 DOI: 10.1007/s12010-024-05110-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/19/2024] [Indexed: 03/29/2025]
Abstract
The drug development process is a critical challenge in the pharmaceutical industry due to its time-consuming nature and the need to discover new drug potentials to address various ailments. The initial step in drug development, drug target identification, often consumes considerable time. While valid, traditional methods such as in vivo and in vitro approaches are limited in their ability to analyze vast amounts of data efficiently, leading to wasteful outcomes. To expedite and streamline drug development, an increasing reliance on computer-aided drug design (CADD) approaches has merged. These sophisticated in silico methods offer a promising avenue for efficiently identifying viable drug candidates, thus providing pharmaceutical firms with significant opportunities to uncover new prospective drug targets. The main goal of this work is to review in silico methods used in the drug development process with a focus on identifying therapeutic targets linked to specific diseases at the genetic or protein level. This article thoroughly discusses A-to-Z in silico techniques, which are essential for identifying the targets of bioactive compounds and their potential therapeutic effects. This review intends to improve drug discovery processes by illuminating the state of these cutting-edge approaches, thereby maximizing the effectiveness and duration of clinical trials for novel drug target investigation.
Collapse
Affiliation(s)
- Hezha O Rasul
- Department of Pharmaceutical Chemistry, College of Science, Charmo University, Peshawa Street, Chamchamal, 46023, Sulaimani, Iraq.
| | - Dlzar D Ghafour
- Department of Medical Laboratory Science, College of Science, Komar University of Science and Technology, 46001, Sulaimani, Iraq
- Department of Chemistry, College of Science, University of Sulaimani, 46001, Sulaimani, Iraq
| | - Bakhtyar K Aziz
- Department of Nanoscience and Applied Chemistry, College of Science, Charmo University, Peshawa Street, Chamchamal, 46023, Sulaimani, Iraq
| | - Bryar A Hassan
- Computer Science and Engineering Department, School of Science and Engineering, University of Kurdistan Hewler, KRI, Iraq
- Department of Computer Science, College of Science, Charmo University, Peshawa Street, Chamchamal, 46023, Sulaimani, Iraq
| | - Tarik A Rashid
- Computer Science and Engineering Department, School of Science and Engineering, University of Kurdistan Hewler, KRI, Iraq
| | - Arif Kivrak
- Department of Chemistry, Faculty of Sciences and Arts, Eskisehir Osmangazi University, Eskişehir, 26040, Turkey
| |
Collapse
|
2
|
Ismail CMKH, Abdul Hamid AA, Abdul Rashid NN, Lestari W, Mokhtar KI, Mustafa Alahmad BE, Abd Razak MRM, Ismail A. An ensemble docking-based virtual screening and molecular dynamics simulation of phytochemical compounds from Malaysian Kelulut Honey (KH) against SARS-CoV-2 target enzyme, human angiotensin-converting enzyme 2 (ACE-2). J Biomol Struct Dyn 2024:1-30. [PMID: 38279932 DOI: 10.1080/07391102.2024.2308762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 01/17/2024] [Indexed: 01/29/2024]
Abstract
The human angiotensin-converting enzyme 2 (ACE-2) receptor is a metalloenzyme that plays an important role in regulating blood pressure by modulating angiotensin II. This receptor facilitates SARS-CoV-2 entry into human cells via receptor-mediated endocytosis, causing the global COVID-19 pandemic and a major health crisis. Kelulut honey (KH), one of Malaysian honey recently gained attention for its distinct flavour and taste while having many nutritional and medicinal properties. Recent study demonstrates the antiviral potential of KH against SARS-CoV-2 by inhibiting ACE-2 in vitro, but the bioactive compound pertaining to the ACE-2 inhibition is yet unknown. An ensemble docking-based virtual screening was employed to screen the phytochemical compounds from KH with high binding affinity against the 10 best representative structures of ACE-2 that mostly formed from MD simulation. From 110 phytochemicals previously identified in KH, 27 compounds passed the ADMET analysis and proceeded to docking. Among the docked compound, SDC and FMN consistently exhibited strong binding to ACE-2's active site (-9.719 and -9.473 kcal/mol) and allosteric site (-7.305 and -7.464 kcal/mol) as compared to potent ACE-2 inhibitor, MLN 4760. Detailed trajectory analysis of MD simulation showed stable binding interaction towards active and allosteric sites of ACE-2. KH's compounds show promise in inhibiting SARS-CoV-2 binding to ACE-2 receptors, indicating potential for preventive use or as a supplement to other COVID-19 treatments. Additional research is needed to confirm KH's antiviral effects and its role in SARS-CoV-2 therapy, including prophylaxis and adjuvant treatment with vaccination.
Collapse
Affiliation(s)
- Che Muhammad Khairul Hisyam Ismail
- Department of Biotechnology, Kulliyyah of Science, International Islamic University Malaysia, Kuantan, Pahang, Malaysia
- Research Unit for Bioinformatics & Computational Biology (RUBIC), Kulliyyah of Science, International Islamic University Malaysia, Kuantan, Pahang, Malaysia
| | - Azzmer Azzar Abdul Hamid
- Department of Biotechnology, Kulliyyah of Science, International Islamic University Malaysia, Kuantan, Pahang, Malaysia
- Research Unit for Bioinformatics & Computational Biology (RUBIC), Kulliyyah of Science, International Islamic University Malaysia, Kuantan, Pahang, Malaysia
| | | | - Widya Lestari
- Department of Fundamental Dental and Medical Sciences, Kulliyyah of Dentistry, International Islamic University Malaysia, Kuantan, Pahang, Malaysia
| | - Khairani Idah Mokhtar
- Department of Fundamental Dental and Medical Sciences, Kulliyyah of Dentistry, International Islamic University Malaysia, Kuantan, Pahang, Malaysia
| | - Basma Ezzat Mustafa Alahmad
- Department of Fundamental Dental and Medical Sciences, Kulliyyah of Dentistry, International Islamic University Malaysia, Kuantan, Pahang, Malaysia
| | - Mohd Ridzuan Mohd Abd Razak
- Herbal Medicine Research Centre, Institute for Medical Research, National Institutes of Health, Shah Alam, Selangor, Malaysia
| | - Azlini Ismail
- Department of Fundamental Dental and Medical Sciences, Kulliyyah of Dentistry, International Islamic University Malaysia, Kuantan, Pahang, Malaysia
| |
Collapse
|
3
|
Hong X, Tong X, Xie J, Liu P, Liu X, Song Q, Liu S, Liu S. An updated dataset and a structure-based prediction model for protein-RNA binding affinity. Proteins 2023; 91:1245-1253. [PMID: 37186412 DOI: 10.1002/prot.26503] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 03/08/2023] [Accepted: 04/12/2023] [Indexed: 05/17/2023]
Abstract
Understanding the process of protein-RNA interaction is essential for structural biology. The thermodynamic process is an important part to uncover the protein-RNA interaction mechanism. The regulatory networks between protein and RNA in organisms are dominated by the binding or dissociation in the cells. Therefore, determining the binding affinity for protein-RNA complexes can help us to understand the regulation mechanism of protein-RNA interaction. Since it is time-consuming and labor-intensive to determine the binding affinity for protein-RNA complexes by experimental methods, it is necessary and urgent to develop computational methods to predict that. To develop a binding affinity prediction model, first we update the dataset of protein-RNA binding affinity benchmark (PRBAB), which includes 145 complexes now. Second, we extract the structural features based on complex structure, and then we analyze and select the representative structural features to train the regression model. Third, we random select the subset from the PRBAB2.0 to fit the protein-RNA binding affinity determined by experiment. In the end, we tested our model on the nonredundant PDBbind dataset, and the results showed that Pearson correlation coefficient r = .57 and RMSE = 2.51 kcal/mol. The Pearson correlation coefficient achieves 0.7 while removing 5 complex structures with modified residues/nucleotides and metal ions. While testing on ProNAB, the results showed that 71.60% of the prediction achieves Pearson correlation coefficient r = .61 and RMSE = 1.56 kcal/mol with experiment values.
Collapse
Affiliation(s)
- Xu Hong
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Xiaoxue Tong
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Juan Xie
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Pinyu Liu
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Xudong Liu
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Qi Song
- Key Laboratory of Fermentation Engineering (Ministry of Education), Hubei University of Technology, Wuhan, China
| | - Sen Liu
- Key Laboratory of Fermentation Engineering (Ministry of Education), Hubei University of Technology, Wuhan, China
| | - Shiyong Liu
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, China
| |
Collapse
|
4
|
Yang YX, Huang JY, Wang P, Zhu BT. AREA-AFFINITY: A Web Server for Machine Learning-Based Prediction of Protein-Protein and Antibody-Protein Antigen Binding Affinities. J Chem Inf Model 2023; 63:3230-3237. [PMID: 37235532 PMCID: PMC10268951 DOI: 10.1021/acs.jcim.2c01499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Indexed: 05/28/2023]
Abstract
Protein-Protein binding affinity reflects the binding strength between the binding partners. The prediction of protein-protein binding affinity is important for elucidating protein functions and also for designing protein-based therapeutics. The geometric characteristics such as area (both interface and surface areas) in the structure of a protein-protein complex play an important role in determining protein-protein interactions and their binding affinity. Here, we present a free web server for academic use, AREA-AFFINITY, for prediction of protein-protein or antibody-protein antigen binding affinity based on interface and surface areas in the structure of a protein-protein complex. AREA-AFFINITY implements 60 effective area-based protein-protein affinity predictive models and 37 effective area-based models specific for antibody-protein antigen binding affinity prediction developed in our recent studies. These models take into consideration the roles of interface and surface areas in binding affinity by using areas classified according to different amino acid types with different biophysical nature. The models with the best performances integrate machine learning methods such as neural network or random forest. These newly developed models have superior or comparable performance compared to the commonly used existing methods. AREA-AFFINITY is available for free at: https://affinity.cuhk.edu.cn/.
Collapse
Affiliation(s)
- Yong Xiao Yang
- Shenzhen
Key Laboratory of Steroid Drug Discovery and Development, School of
Medicine, The Chinese University of Hong
Kong, Shenzhen, Guangdong 518172, China
| | - Jin Yan Huang
- Shenzhen
Key Laboratory of Steroid Drug Discovery and Development, School of
Medicine, The Chinese University of Hong
Kong, Shenzhen, Guangdong 518172, China
| | - Pan Wang
- Shenzhen
Key Laboratory of Steroid Drug Discovery and Development, School of
Medicine, The Chinese University of Hong
Kong, Shenzhen, Guangdong 518172, China
| | - Bao Ting Zhu
- Shenzhen
Key Laboratory of Steroid Drug Discovery and Development, School of
Medicine, The Chinese University of Hong
Kong, Shenzhen, Guangdong 518172, China
- Shenzhen
Bay Laboratory, Shenzhen, 518055, China
| |
Collapse
|
5
|
Guo Z, Yamaguchi R. Machine learning methods for protein-protein binding affinity prediction in protein design. FRONTIERS IN BIOINFORMATICS 2022; 2:1065703. [PMID: 36591334 PMCID: PMC9800603 DOI: 10.3389/fbinf.2022.1065703] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 12/01/2022] [Indexed: 12/23/2022] Open
Abstract
Protein-protein interactions govern a wide range of biological activity. A proper estimation of the protein-protein binding affinity is vital to design proteins with high specificity and binding affinity toward a target protein, which has a variety of applications including antibody design in immunotherapy, enzyme engineering for reaction optimization, and construction of biosensors. However, experimental and theoretical modelling methods are time-consuming, hinder the exploration of the entire protein space, and deter the identification of optimal proteins that meet the requirements of practical applications. In recent years, the rapid development in machine learning methods for protein-protein binding affinity prediction has revealed the potential of a paradigm shift in protein design. Here, we review the prediction methods and associated datasets and discuss the requirements and construction methods of binding affinity prediction models for protein design.
Collapse
Affiliation(s)
- Zhongliang Guo
- Division of Cancer Systems Biology, Aichi Cancer Center Research Institute, Nagoya, Aichi, Japan
| | - Rui Yamaguchi
- Division of Cancer Systems Biology, Aichi Cancer Center Research Institute, Nagoya, Aichi, Japan,Division of Cancer Informatics, Nagoya University Graduate School of Medicine, Nagoya, Aichi, Japan,*Correspondence: Rui Yamaguchi,
| |
Collapse
|
6
|
Desantis F, Miotto M, Di Rienzo L, Milanetti E, Ruocco G. Spatial organization of hydrophobic and charged residues affects protein thermal stability and binding affinity. Sci Rep 2022; 12:12087. [PMID: 35840609 PMCID: PMC9287411 DOI: 10.1038/s41598-022-16338-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Accepted: 07/08/2022] [Indexed: 11/12/2022] Open
Abstract
What are the molecular determinants of protein–protein binding affinity and whether they are similar to those regulating fold stability are two major questions of molecular biology, whose answers bring important implications both from a theoretical and applicative point of view. Here, we analyze chemical and physical features on a large dataset of protein–protein complexes with reliable experimental binding affinity data and compare them with a set of monomeric proteins for which melting temperature data was available. In particular, we probed the spatial organization of protein (1) intramolecular and intermolecular interaction energies among residues, (2) amino acidic composition, and (3) their hydropathy features. Analyzing the interaction energies, we found that strong Coulombic interactions are preferentially associated with a high protein thermal stability, while strong intermolecular van der Waals energies correlate with stronger protein–protein binding affinity. Statistical analysis of amino acids abundances, exposed to the molecular surface and/or in interaction with the molecular partner, confirmed that hydrophobic residues present on the protein surfaces are preferentially located in the binding regions, while charged residues behave oppositely. Leveraging on the important role of van der Waals interface interactions in binding affinity, we focused on the molecular surfaces in the binding regions and evaluated their shape complementarity, decomposing the molecular patches in the 2D Zernike basis. For the first time, we quantified the correlation between local shape complementarity and binding affinity via the Zernike formalism. In addition, considering the solvent interactions via the residue hydropathy, we found that the hydrophobicity of the binding regions dictates their shape complementary as much as the correlation between van der Waals energy and binding affinity. In turn, these relationships pave the way to the fast and accurate prediction and design of optimal binding regions as the 2D Zernike formalism allows a rapid and superposition-free comparison between possible binding surfaces.
Collapse
Affiliation(s)
- Fausta Desantis
- Center for Life Nano and Neuro Science, Istituto Italiano di Tecnologia (IIT), Viale Regina Elena 291, 00161, Rome, Italy.,The Open University Affiliated Research Centre at Istituto Italiano di Tecnologia, Via Morego, 30, 16163, Genoa, Italy
| | - Mattia Miotto
- Center for Life Nano and Neuro Science, Istituto Italiano di Tecnologia (IIT), Viale Regina Elena 291, 00161, Rome, Italy.
| | - Lorenzo Di Rienzo
- Center for Life Nano and Neuro Science, Istituto Italiano di Tecnologia (IIT), Viale Regina Elena 291, 00161, Rome, Italy
| | - Edoardo Milanetti
- Center for Life Nano and Neuro Science, Istituto Italiano di Tecnologia (IIT), Viale Regina Elena 291, 00161, Rome, Italy.,Department of Physics, Sapienza University of Rome, Piazzale Aldo Moro, 5, 00185, Rome, Italy
| | - Giancarlo Ruocco
- Center for Life Nano and Neuro Science, Istituto Italiano di Tecnologia (IIT), Viale Regina Elena 291, 00161, Rome, Italy.,Department of Physics, Sapienza University of Rome, Piazzale Aldo Moro, 5, 00185, Rome, Italy
| |
Collapse
|
7
|
Grassmann G, Miotto M, Di Rienzo L, Gosti G, Ruocco G, Milanetti E. A novel computational strategy for defining the minimal protein molecular surface representation. PLoS One 2022; 17:e0266004. [PMID: 35421111 PMCID: PMC9009619 DOI: 10.1371/journal.pone.0266004] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 03/12/2022] [Indexed: 11/18/2022] Open
Abstract
Most proteins perform their biological function by interacting with one or more molecular partners. In this respect, characterizing local features of the molecular surface, that can potentially be involved in the interaction with other molecules, represents a step forward in the investigation of the mechanisms of recognition and binding between molecules. Predictive methods often rely on extensive samplings of molecular patches with the aim to identify hot spots on the surface. In this framework, analysis of large proteins and/or many molecular dynamics frames is often unfeasible due to the high computational cost. Thus, finding optimal ways to reduce the number of points to be sampled maintaining the biological information (including the surface shape) carried by the molecular surface is pivotal. In this perspective, we here present a new theoretical and computational algorithm with the aim of defining a set of molecular surfaces composed of points not uniformly distributed in space, in such a way as to maximize the information of the overall shape of the molecule by minimizing the number of total points. We test our procedure’s ability in recognizing hot-spots by describing the local shape properties of portions of molecular surfaces through a recently developed method based on the formalism of 2D Zernike polynomials. The results of this work show the ability of the proposed algorithm to preserve the key information of the molecular surface using a reduced number of points compared to the complete surface, where all points of the surface are used for the description. In fact, the methodology shows a significant gain of the information stored in the sampling procedure compared to uniform random sampling.
Collapse
Affiliation(s)
| | - Mattia Miotto
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
| | - Lorenzo Di Rienzo
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
| | - Giorgio Gosti
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
| | - Giancarlo Ruocco
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
- Department of Physics, Sapienza University, Rome, Italy
| | - Edoardo Milanetti
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
- Department of Physics, Sapienza University, Rome, Italy
- * E-mail:
| |
Collapse
|
8
|
Yang YX, Wang P, Zhu BT. Relative importance of interface and surface areas in protein-protein binding affinity prediction: A machine learning analysis based on linear regression and artificial neural network. Biophys Chem 2022; 283:106762. [DOI: 10.1016/j.bpc.2022.106762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Revised: 01/11/2022] [Accepted: 01/14/2022] [Indexed: 11/02/2022]
|
9
|
Dhusia K, Madrid C, Su Z, Wu Y. EXCESP: A Structure-Based Online Database for Extracellular Interactome of Cell Surface Proteins in Humans. J Proteome Res 2022; 21:349-359. [PMID: 34978816 DOI: 10.1021/acs.jproteome.1c00612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The interactions between ectodomains of cell surface proteins are vital players in many important cellular processes, such as regulating immune responses, coordinating cell differentiation, and shaping neural plasticity. However, while the construction of a large-scale protein interactome has been greatly facilitated by the development of high-throughput experimental techniques, little progress has been made to support the discovery of extracellular interactome for cell surface proteins. Harnessed by the recent advances in computational modeling of protein-protein interactions, here we present a structure-based online database for the extracellular interactome of cell surface proteins in humans, called EXCESP. The database contains both experimentally determined and computationally predicted interactions among all type-I transmembrane proteins in humans. All structural models for these interactions and their binding affinities were further computationally modeled. Moreover, information such as expression levels of each protein in different cell types and its relation to various signaling pathways from other online resources has also been integrated into the database. In summary, the database serves as a valuable addition to the existing online resources for the study of cell surface proteins. It can contribute to the understanding of the functions of cell surface proteins in the era of systems biology.
Collapse
Affiliation(s)
- Kalyani Dhusia
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, New York 10461, United States
| | - Carlos Madrid
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, New York 10461, United States.,Laboratory for Macromolecular Analysis and Proteomics, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, New York 10461, United States
| | - Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, New York 10461, United States
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, New York 10461, United States
| |
Collapse
|
10
|
Wu H, Ling H, Gao L, Fu Q, Lu W, Ding Y, Jiang M, Li H. Empirical Potential Energy Function Toward ab Initio Folding G Protein-Coupled Receptors. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1752-1762. [PMID: 32750885 DOI: 10.1109/tcbb.2020.3008014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Approximately 40-50 percent of all drugs targets are G protein-coupled receptors (GPCRs). Three-dimensional structure of GPCRs is important to probe their biophysical and biochemical functions and their pharmaceutical applications. Lacking reliable and high quality free function is one of the ugent problems of computational predicting the three-dimensional structure in this community. We proposed a GPCR-specified energy function composed of four novel empirical potential energy terms: a two-dimensional contact energy force field, knowledge-based helix pair connection distance energy term, knowledge-based helix pair angle restraint energy term and a disulfide bond energy term. To validate the energy function, we employed an ab initio GPCR three-dimensional structure predictor to test if the energy function improved the accuracy of prediction. We evaluated 28 solved GPCRs and found that 21(75 percent) targets were correctly folded (TM-score>0.5). Also, the average TM-score using the energy function was 0.54, which was improved 134 percent than the TM-score 0.23 for MODELLER energy function and 170 percent than the TM-score 0.20 for Rosetta membrane energy function. The results confirmed that our empirical potential energy function toward ab initio folding is competitive to state-of-the-art solutions for structural prediction of GPCRs.
Collapse
|
11
|
Wang B, Su Z, Wu Y. Computational Assessment of Protein-Protein Binding Affinity by Reverse Engineering the Energetics in Protein Complexes. GENOMICS PROTEOMICS & BIOINFORMATICS 2021; 19:1012-1022. [PMID: 33838354 PMCID: PMC9403033 DOI: 10.1016/j.gpb.2021.03.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 03/07/2019] [Accepted: 05/17/2019] [Indexed: 11/29/2022]
Abstract
The cellular functions of proteins are maintained by forming diverse complexes. The stability of these complexes is quantified by the measurement of binding affinity, and mutations that alter the binding affinity can cause various diseases such as cancer and diabetes. As a result, accurate estimation of the binding stability and the effects of mutations on changes of binding affinity is a crucial step to understanding the biological functions of proteins and their dysfunctional consequences. It has been hypothesized that the stability of a protein complex is dependent not only on the residues at its binding interface by pairwise interactions but also on all other remaining residues that do not appear at the binding interface. Here, we computationally reconstruct the binding affinity by decomposing it into the contributions of interfacial residues and other non-interfacial residues in a protein complex. We further assume that the contributions of both interfacial and non-interfacial residues to the binding affinity depend on their local structural environments such as solvent-accessible surfaces and secondary structural types. The weights of all corresponding parameters are optimized by Monte-Carlo simulations. After cross-validation against a large-scale dataset, we show that the model not only shows a strong correlation between the absolute values of the experimental and calculated binding affinities, but can also be an effective approach to predict the relative changes of binding affinity from mutations. Moreover, we have found that the optimized weights of many parameters can capture the first-principle chemical and physical features of molecular recognition, therefore reversely engineering the energetics of protein complexes. These results suggest that our method can serve as a useful addition to current computational approaches for predicting binding affinity and understanding the molecular mechanism of protein–protein interactions.
Collapse
Affiliation(s)
- Bo Wang
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA
| | - Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA.
| |
Collapse
|
12
|
Abbasi WA, Yaseen A, Hassan FU, Andleeb S, Minhas FUAA. ISLAND: in-silico proteins binding affinity prediction using sequence information. BioData Min 2020; 13:20. [PMID: 33292419 PMCID: PMC7688004 DOI: 10.1186/s13040-020-00231-w] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Accepted: 11/15/2020] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Determining binding affinity in protein-protein interactions is important in the discovery and design of novel therapeutics and mutagenesis studies. Determination of binding affinity of proteins in the formation of protein complexes requires sophisticated, expensive and time-consuming experimentation which can be replaced with computational methods. Most computational prediction techniques require protein structures that limit their applicability to protein complexes with known structures. In this work, we explore sequence-based protein binding affinity prediction using machine learning. METHOD We have used protein sequence information instead of protein structures along with machine learning techniques to accurately predict the protein binding affinity. RESULTS We present our findings that the true generalization performance of even the state-of-the-art sequence-only predictor is far from satisfactory and that the development of machine learning methods for binding affinity prediction with improved generalization performance is still an open problem. We have also proposed a sequence-based novel protein binding affinity predictor called ISLAND which gives better accuracy than existing methods over the same validation set as well as on external independent test dataset. A cloud-based webserver implementation of ISLAND and its python code are available at https://sites.google.com/view/wajidarshad/software . CONCLUSION This paper highlights the fact that the true generalization performance of even the state-of-the-art sequence-only predictor of binding affinity is far from satisfactory and that the development of effective and practical methods in this domain is still an open problem.
Collapse
Affiliation(s)
- Wajid Arshad Abbasi
- Computational Biology and Data Analysis Laboratory, Department of Computer Science and Information Technology, King Abdullah Campus, University of Azad Jammu & Kashmir, Muzaffarabad, Pakistan. .,Biomedical Informatics Research Laboratory, Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan.
| | - Adiba Yaseen
- Biomedical Informatics Research Laboratory, Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| | - Fahad Ul Hassan
- Biomedical Informatics Research Laboratory, Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| | - Saiqa Andleeb
- Biotechnology Laboratory, Department of Zoology, King Abdullah Campus, University of Azad Jammu & Kashmir, Muzaffarabad, Pakistan
| | | |
Collapse
|
13
|
PreDBA: A heterogeneous ensemble approach for predicting protein-DNA binding affinity. Sci Rep 2020; 10:1278. [PMID: 31992738 PMCID: PMC6987227 DOI: 10.1038/s41598-020-57778-1] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Accepted: 01/06/2020] [Indexed: 11/17/2022] Open
Abstract
The interaction between protein and DNA plays an essential function in various critical natural processes, like DNA replication, transcription, splicing, and repair. Studying the binding affinity of proteins to DNA helps to understand the recognition mechanism of protein-DNA complexes. Since there are still many limitations on the protein-DNA binding affinity data measured by experiments, accurate and reliable calculation methods are necessarily required. So we put forward a computational approach in this paper, called PreDBA, that can forecast protein-DNA binding affinity effectively by using heterogeneous ensemble models. One hundred protein-DNA complexes are manually collected from the related literature as a data set for protein-DNA binding affinity. Then, 52 sequence and structural features are obtained. Based on this, the correlation between these 52 characteristics and protein-DNA binding affinity is calculated. Furthermore, we found that the protein-DNA binding affinity is affected by the DNA molecule structure of the compound. We classify all protein-DNA compounds into five classifications based on the DNA structure related to the proteins that make up the protein-DNA complexes. In each group, a stacked heterogeneous ensemble model is constructed based on the obtained features. In the end, based on the binding affinity data set, we used the leave-one-out cross-validation to evaluate the proposed method comprehensively. In the five categories, the Pearson correlation coefficient values of our recommended method range from 0.735 to 0.926. We have demonstrated the advantages of the proposed method compared to other machine learning methods and currently existing protein-DNA binding affinity prediction approach.
Collapse
|
14
|
Diller DJ, Swanson J, Bayden AS, Brown CJ, Thean D, Lane DP, Partridge AW, Sawyer TK, Audie J. Rigorous Computational and Experimental Investigations on MDM2/MDMX-Targeted Linear and Macrocyclic Peptides. Molecules 2019; 24:E4586. [PMID: 31847417 PMCID: PMC6943714 DOI: 10.3390/molecules24244586] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2019] [Revised: 12/11/2019] [Accepted: 12/12/2019] [Indexed: 12/25/2022] Open
Abstract
There is interest in peptide drug design, especially for targeting intracellular protein-protein interactions. Therefore, the experimental validation of a computational platform for enabling peptide drug design is of interest. Here, we describe our peptide drug design platform (CMDInventus) and demonstrate its use in modeling and predicting the structural and binding aspects of diverse peptides that interact with oncology targets MDM2/MDMX in comparison to both retrospective (pre-prediction) and prospective (post-prediction) data. In the retrospective study, CMDInventus modules (CMDpeptide, CMDboltzmann, CMDescore and CMDyscore) were used to accurately reproduce structural and binding data across multiple MDM2/MDMX data sets. In the prospective study, CMDescore, CMDyscore and CMDboltzmann were used to accurately predict binding affinities for an Ala-scan of the stapled α-helical peptide ATSP-7041. Remarkably, CMDboltzmann was used to accurately predict the results of a novel D-amino acid scan of ATSP-7041. Our investigations rigorously validate CMDInventus and support its utility for enabling peptide drug design.
Collapse
Affiliation(s)
- David J. Diller
- CMDBioscience, 5 Park Avenue, New Haven, CT 06511, USA; (D.J.D.); (J.S.); (A.S.B.)
- Venenum BioDesign, LLC, 8 Black Forest Road, Hamilton, NJ 08691, USA
| | - Jon Swanson
- CMDBioscience, 5 Park Avenue, New Haven, CT 06511, USA; (D.J.D.); (J.S.); (A.S.B.)
- ChemModeling, LLC, Suite 101, 500 Huber Park Ct, Weldon Spring, MO 63304, USA
| | - Alexander S. Bayden
- CMDBioscience, 5 Park Avenue, New Haven, CT 06511, USA; (D.J.D.); (J.S.); (A.S.B.)
- Kleo Pharmaceuticals, 25 Science Park, Ste 235, New Haven, CT 06511, USA
| | - Chris J. Brown
- A*STAR, p53 Laboratory, Singapore 138648, Singapore; (C.J.B.); (D.T.); (D.P.L.)
| | - Dawn Thean
- A*STAR, p53 Laboratory, Singapore 138648, Singapore; (C.J.B.); (D.T.); (D.P.L.)
| | - David P. Lane
- A*STAR, p53 Laboratory, Singapore 138648, Singapore; (C.J.B.); (D.T.); (D.P.L.)
| | - Anthony W. Partridge
- MSD International GmbH, Singapore 138665, Singapore;
- Merck Research Laboratories, 33 Avenue Louis Pasteur, Boston, MA 02115, USA
| | - Tomi K. Sawyer
- Merck Research Laboratories, 33 Avenue Louis Pasteur, Boston, MA 02115, USA
| | - Joseph Audie
- CMDBioscience, 5 Park Avenue, New Haven, CT 06511, USA; (D.J.D.); (J.S.); (A.S.B.)
- College of Arts and Sciences, Department of Chemistry, Sacred Heart University, 5151 Park Avenue, Fairfield, CT 06825, USA
| |
Collapse
|
15
|
Nithin C, Mukherjee S, Bahadur RP. A structure-based model for the prediction of protein-RNA binding affinity. RNA (NEW YORK, N.Y.) 2019; 25:1628-1645. [PMID: 31395671 PMCID: PMC6859855 DOI: 10.1261/rna.071779.119] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2019] [Accepted: 08/05/2019] [Indexed: 05/28/2023]
Abstract
Protein-RNA recognition is highly affinity-driven and regulates a wide array of cellular functions. In this study, we have curated a binding affinity data set of 40 protein-RNA complexes, for which at least one unbound partner is available in the docking benchmark. The data set covers a wide affinity range of eight orders of magnitude as well as four different structural classes. On average, we find the complexes with single-stranded RNA have the highest affinity, whereas the complexes with the duplex RNA have the lowest. Nevertheless, free energy gain upon binding is the highest for the complexes with ribosomal proteins and the lowest for the complexes with tRNA with an average of -5.7 cal/mol/Å2 in the entire data set. We train regression models to predict the binding affinity from the structural and physicochemical parameters of protein-RNA interfaces. The best fit model with the lowest maximum error is provided with three interface parameters: relative hydrophobicity, conformational change upon binding and relative hydration pattern. This model has been used for predicting the binding affinity on a test data set, generated using mutated structures of yeast aspartyl-tRNA synthetase, for which experimentally determined ΔG values of 40 mutations are available. The predicted ΔGempirical values highly correlate with the experimental observations. The data set provided in this study should be useful for further development of the binding affinity prediction methods. Moreover, the model developed in this study enhances our understanding on the structural basis of protein-RNA binding affinity and provides a platform to engineer protein-RNA interfaces with desired affinity.
Collapse
Affiliation(s)
- Chandran Nithin
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Sunandan Mukherjee
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| |
Collapse
|
16
|
Su Z, Wu Y. Multiscale simulation unravel the kinetic mechanisms of inflammasome assembly. BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH 2019; 1867:118612. [PMID: 31758956 DOI: 10.1016/j.bbamcr.2019.118612] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/13/2019] [Revised: 11/11/2019] [Accepted: 11/18/2019] [Indexed: 01/16/2023]
Abstract
In the innate immune system, the host defense from the invasion of external pathogens triggers the inflammatory responses. Proteins involved in the inflammatory pathways were often found to aggregate into supramolecular oligomers, called 'inflammasome', mostly through the homotypic interaction between their domains that belong to the death domain superfamily. Although much has been known about the formation of these helical molecular machineries, the detailed correlation between the dynamics of their assembly and the structure of each domain is still not well understood. Using the filament formed by the PYD domains of adaptor molecule ASC as a test system, we constructed a new multiscale simulation framework to study the kinetics of inflammasome assembly. We found that the filament assembly is a multi-step, but highly cooperative process. Moreover, there are three types of binding interfaces between domain subunits in the ASCPYD filament. The multiscale simulation results suggest that dynamics of domain assembly are rooted in the primary protein sequence which defines the energetics of molecular recognition through three binding interfaces. Interface I plays a more regulatory role than the other two in mediating both the kinetics and the thermodynamics of assembly. Finally, the efficiency of our computational framework allows us to design mutants on a systematic scale and predict their impacts on filament assembly. In summary, this is, to the best of our knowledge, the first simulation method to model the spatial-temporal process of inflammasome assembly. Our work is a useful addition to a suite of existing experimental techniques to study the functions of inflammasome in innate immune system.
Collapse
Affiliation(s)
- Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, United States of America
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, United States of America.
| |
Collapse
|
17
|
|
18
|
Marín-López MA, Planas-Iglesias J, Aguirre-Plans J, Bonet J, Garcia-Garcia J, Fernandez-Fuentes N, Oliva B. On the mechanisms of protein interactions: predicting their affinity from unbound tertiary structures. Bioinformatics 2018; 34:592-598. [PMID: 29028891 PMCID: PMC5860604 DOI: 10.1093/bioinformatics/btx616] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Accepted: 09/26/2017] [Indexed: 12/12/2022] Open
Abstract
Motivation The characterization of the protein–protein association mechanisms is crucial to understanding how biological processes occur. It has been previously shown that the early formation of non-specific encounters enhances the realization of the stereospecific (i.e. native) complex by reducing the dimensionality of the search process. The association rate for the formation of such complex plays a crucial role in the cell biology and depends on how the partners diffuse to be close to each other. Predicting the binding free energy of proteins provides new opportunities to modulate and control protein–protein interactions. However, existing methods require the 3D structure of the complex to predict its affinity, severely limiting their application to interactions with known structures. Results We present a new approach that relies on the unbound protein structures and protein docking to predict protein–protein binding affinities. Through the study of the docking space (i.e. decoys), the method predicts the binding affinity of the query proteins when the actual structure of the complex itself is unknown. We tested our approach on a set of globular and soluble proteins of the newest affinity benchmark, obtaining accuracy values comparable to other state-of-art methods: a 0.4 correlation coefficient between the experimental and predicted values of ΔG and an error < 3 Kcal/mol. Availability and implementation The binding affinity predictor is implemented and available at http://sbi.upf.edu/BADock and https://github.com/badocksbi/BADock. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Manuel Alejandro Marín-López
- Structural Bioinformatics Lab, Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Joan Planas-Iglesias
- Division of Metabolic and Vascular Health, University of Warwick, Coventry CV4?7AL, UK
| | - Joaquim Aguirre-Plans
- Structural Bioinformatics Lab, Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Jaume Bonet
- Laboratory of Protein Design and Immunoenginneering, School of Engineering, Ecole Polytechnique Federale de Lausanne, Lausanne 1015, Switzerland
| | - Javier Garcia-Garcia
- Structural Bioinformatics Lab, Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Narcis Fernandez-Fuentes
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth SY23?3DA, UK
| | - Baldo Oliva
- Structural Bioinformatics Lab, Department of Experimental and Health Science, Universitat Pompeu Fabra, Barcelona 08003, Spain
| |
Collapse
|
19
|
Zhao J, Nussinov R, Wu WJ, Ma B. In Silico Methods in Antibody Design. Antibodies (Basel) 2018; 7:E22. [PMID: 31544874 PMCID: PMC6640671 DOI: 10.3390/antib7030022] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Revised: 06/28/2018] [Accepted: 06/28/2018] [Indexed: 01/10/2023] Open
Abstract
Antibody therapies with high efficiency and low toxicity are becoming one of the major approaches in antibody therapeutics. Based on high-throughput sequencing and increasing experimental structures of antibodies/antibody-antigen complexes, computational approaches can predict antibody/antigen structures, engineering the function of antibodies and design antibody-antigen complexes with improved properties. This review summarizes recent progress in the field of in silico design of antibodies, including antibody structure modeling, antibody-antigen complex prediction, antibody stability evaluation, and allosteric effects in antibodies and functions. We listed the cases in which these methods have helped experimental studies to improve the affinities and physicochemical properties of antibodies. We emphasized how the molecular dynamics unveiled the allosteric effects during antibody-antigen recognition and antibody-effector recognition.
Collapse
Affiliation(s)
- Jun Zhao
- Division of Biotechnology Review and Research I, Office of Biotechnology Products, Office of Pharmaceutical Quality, Center for Drug Evaluation and Research, US Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA.
- Interagency Oncology Task Force (IOTF) Fellowship: Oncology Product Research/Review Fellow, National Cancer Institute, Bethesda, MD 20892, USA.
- Cancer and Inflammation Program, National Cancer Institute, Frederick, MD 21702, USA.
| | - Ruth Nussinov
- Basic Science Program, Leidos Biomedical Research, Inc. Cancer and Inflammation Program, National Cancer Institute, Frederick, MD 21702, USA.
- Sackler Inst. of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | - Wen-Jin Wu
- Division of Biotechnology Review and Research I, Office of Biotechnology Products, Office of Pharmaceutical Quality, Center for Drug Evaluation and Research, US Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA.
| | - Buyong Ma
- Basic Science Program, Leidos Biomedical Research, Inc. Cancer and Inflammation Program, National Cancer Institute, Frederick, MD 21702, USA.
| |
Collapse
|
20
|
Raucci R, Laine E, Carbone A. Local Interaction Signal Analysis Predicts Protein-Protein Binding Affinity. Structure 2018; 26:905-915.e4. [PMID: 29779789 DOI: 10.1016/j.str.2018.04.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2017] [Revised: 02/06/2018] [Accepted: 04/10/2018] [Indexed: 12/27/2022]
Abstract
Several models estimating the strength of the interaction between proteins in a complex have been proposed. By exploring the geometry of contact distribution at protein-protein interfaces, we provide an improved model of binding energy. Local interaction signal analysis (LISA) is a radial function based on terms describing favorable and non-favorable contacts obtained by density functional theory, the support-core-rim interface residue distribution, non-interacting charged residues and secondary structures contribution. The three-dimensional organization of the contacts and their contribution on localized hot-sites over the entire interaction surface were numerically evaluated. LISA achieves a correlation of 0.81 (and a root-mean-square error of 2.35 ± 0.38 kcal/mol) when tested on 125 complexes for which experimental measurements were realized. LISA's performance is stable for subsets defined by functional composition and extent of conformational changes upon complex formation. A large-scale comparison with 17 other functions demonstrated the power of the geometrical model in the understanding of complex binding.
Collapse
Affiliation(s)
- Raffaele Raucci
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 4 Place Jussieu, 75005 Paris, France; Sorbonne Université, Institut des Sciences du Calcul et des Données (ISCD), 75005 Paris, France
| | - Elodie Laine
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 4 Place Jussieu, 75005 Paris, France
| | - Alessandra Carbone
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 4 Place Jussieu, 75005 Paris, France; Institut Universitaire de France, 75005 Paris, France.
| |
Collapse
|
21
|
Škrbić T, Zamuner S, Hong R, Seno F, Laio A, Trovato A. Vibrational entropy estimation can improve binding affinity prediction for non-obligatory protein complexes. Proteins 2018; 86:393-404. [DOI: 10.1002/prot.25454] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Revised: 12/22/2017] [Accepted: 01/05/2018] [Indexed: 01/10/2023]
Affiliation(s)
- Tatjana Škrbić
- Faculty of Physics; International School for Advanced Studies (SISSA/ISAS); Trieste Italy
- Department of Physics and Astronomy “Galileo Galilei”; University of Padova; Padova Italy
| | - Stefano Zamuner
- Department of Physics and Astronomy “Galileo Galilei”; University of Padova; Padova Italy
| | - Rolando Hong
- Faculty of Physics; International School for Advanced Studies (SISSA/ISAS); Trieste Italy
| | - Flavio Seno
- Department of Physics and Astronomy “Galileo Galilei”; University of Padova; Padova Italy
- Padova Section, National Institute of Nuclear Physics (INFN); Padova Italy
| | - Alessandro Laio
- Faculty of Physics; International School for Advanced Studies (SISSA/ISAS); Trieste Italy
| | - Antonio Trovato
- Department of Physics and Astronomy “Galileo Galilei”; University of Padova; Padova Italy
- Padova Section, National Institute of Nuclear Physics (INFN); Padova Italy
| |
Collapse
|
22
|
Integrating computational methods and experimental data for understanding the recognition mechanism and binding affinity of protein-protein complexes. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2017; 128:33-38. [PMID: 28069340 DOI: 10.1016/j.pbiomolbio.2017.01.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2016] [Revised: 01/04/2017] [Accepted: 01/05/2017] [Indexed: 01/09/2023]
Abstract
Protein-protein interactions perform several functions inside the cell. Understanding the recognition mechanism and binding affinity of protein-protein complexes is a challenging problem in experimental and computational biology. In this review, we focus on two aspects (i) understanding the recognition mechanism and (ii) predicting the binding affinity. The first part deals with computational techniques for identifying the binding site residues and the contribution of important interactions for understanding the recognition mechanism of protein-protein complexes in comparison with experimental observations. The second part is devoted to the methods developed for discriminating high and low affinity complexes, and predicting the binding affinity of protein-protein complexes using three-dimensional structural information and just from the amino acid sequence. The overall view enhances our understanding of the integration of experimental data and computational methods, recognition mechanism of protein-protein complexes and the binding affinity.
Collapse
|
23
|
Computational Approaches for Predicting Binding Partners, Interface Residues, and Binding Affinity of Protein-Protein Complexes. Methods Mol Biol 2017; 1484:237-253. [PMID: 27787830 DOI: 10.1007/978-1-4939-6406-2_16] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Studying protein-protein interactions leads to a better understanding of the underlying principles of several biological pathways. Cost and labor-intensive experimental techniques suggest the need for computational methods to complement them. Several such state-of-the-art methods have been reported for analyzing diverse aspects such as predicting binding partners, interface residues, and binding affinity for protein-protein complexes with reliable performance. However, there are specific drawbacks for different methods that indicate the need for their improvement. This review highlights various available computational algorithms for analyzing diverse aspects of protein-protein interactions and endorses the necessity for developing new robust methods for gaining deep insights about protein-protein interactions.
Collapse
|
24
|
Swanson J, Audie J. An unexpected way forward: towards a more accurate and rigorous protein-protein binding affinity scoring function by eliminating terms from an already simple scoring function. J Biomol Struct Dyn 2016; 36:83-97. [PMID: 27989231 DOI: 10.1080/07391102.2016.1268974] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
A fundamental and unsolved problem in biophysical chemistry is the development of a computationally simple, physically intuitive, and generally applicable method for accurately predicting and physically explaining protein-protein binding affinities from protein-protein interaction (PPI) complex coordinates. Here, we propose that the simplification of a previously described six-term PPI scoring function to a four term function results in a simple expression of all physically and statistically meaningful terms that can be used to accurately predict and explain binding affinities for a well-defined subset of PPIs that are characterized by (1) crystallographic coordinates, (2) rigid-body association, (3) normal interface size, and hydrophobicity and hydrophilicity, and (4) high quality experimental binding affinity measurements. We further propose that the four-term scoring function could be regarded as a core expression for future development into a more general PPI scoring function. Our work has clear implications for PPI modeling and structure-based drug design.
Collapse
Affiliation(s)
- Jon Swanson
- a ChemModeling LLC , Suite 101, 500 Huber Park Ct, Weldon Spring , MO 63304 , USA
| | - Joseph Audie
- b CMD Bioscience , 5 Science Park , New Haven , CT 06511 , USA.,c Department of Chemistry , Sacred Heart University , 5151 Park Ave, Fairfield , CT 06825 , USA
| |
Collapse
|
25
|
Vangone A, Bonvin AM. Contacts-based prediction of binding affinity in protein-protein complexes. eLife 2015. [PMID: 26193119 DOI: 10.7554/elife07454] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/12/2023] Open
Abstract
Almost all critical functions in cells rely on specific protein-protein interactions. Understanding these is therefore crucial in the investigation of biological systems. Despite all past efforts, we still lack a thorough understanding of the energetics of association of proteins. Here, we introduce a new and simple approach to predict binding affinity based on functional and structural features of the biological system, namely the network of interfacial contacts. We assess its performance against a protein-protein binding affinity benchmark and show that both experimental methods used for affinity measurements and conformational changes have a strong impact on prediction accuracy. Using a subset of complexes with reliable experimental binding affinities and combining our contacts and contact-types-based model with recent observations on the role of the non-interacting surface in protein-protein interactions, we reach a high prediction accuracy for such a diverse dataset outperforming all other tested methods.
Collapse
Affiliation(s)
- Anna Vangone
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science-Chemistry, Utrecht University, Utrecht, Netherlands
| | - Alexandre Mjj Bonvin
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science-Chemistry, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
26
|
Vangone A, Bonvin AMJJ. Contacts-based prediction of binding affinity in protein-protein complexes. eLife 2015; 4:e07454. [PMID: 26193119 PMCID: PMC4523921 DOI: 10.7554/elife.07454] [Citation(s) in RCA: 367] [Impact Index Per Article: 36.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Accepted: 07/08/2015] [Indexed: 12/13/2022] Open
Abstract
Almost all critical functions in cells rely on specific protein-protein interactions. Understanding these is therefore crucial in the investigation of biological systems. Despite all past efforts, we still lack a thorough understanding of the energetics of association of proteins. Here, we introduce a new and simple approach to predict binding affinity based on functional and structural features of the biological system, namely the network of interfacial contacts. We assess its performance against a protein-protein binding affinity benchmark and show that both experimental methods used for affinity measurements and conformational changes have a strong impact on prediction accuracy. Using a subset of complexes with reliable experimental binding affinities and combining our contacts and contact-types-based model with recent observations on the role of the non-interacting surface in protein-protein interactions, we reach a high prediction accuracy for such a diverse dataset outperforming all other tested methods.
Collapse
Affiliation(s)
- Anna Vangone
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry, Utrecht University, Utrecht, Netherlands
| | - Alexandre MJJ Bonvin
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Faculty of Science—Chemistry, Utrecht University, Utrecht, Netherlands
| |
Collapse
|
27
|
Moal IH, Dapkūnas J, Fernández-Recio J. Inferring the microscopic surface energy of protein-protein interfaces from mutation data. Proteins 2015; 83:640-50. [PMID: 25586563 DOI: 10.1002/prot.24761] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 12/04/2014] [Accepted: 12/21/2014] [Indexed: 11/11/2022]
Abstract
Mutations at protein-protein recognition sites alter binding strength by altering the chemical nature of the interacting surfaces. We present a simple surface energy model, parameterized with empirical ΔΔG values, yielding mean energies of -48 cal mol(-1) Å(-2) for interactions between hydrophobic surfaces, -51 to -80 cal mol(-1) Å(-2) for surfaces of complementary charge, and 66-83 cal mol(-1) Å(-2) for electrostatically repelling surfaces, relative to the aqueous phase. This places the mean energy of hydrophobic surface burial at -24 cal mol(-1) Å(-2) . Despite neglecting configurational entropy and intramolecular changes, the model correlates with empirical binding free energies of a functionally diverse set of rigid-body interactions (r = 0.66). When used to rerank docking poses, it can place near-native solutions in the top 10 for 37% of the complexes evaluated, and 82% in the top 100. The method shows that hydrophobic burial is the driving force for protein association, accounting for 50-95% of the cohesive energy. The model is available open-source from http://life.bsc.es/pid/web/surface_energy/ and via the CCharpPPI web server http://life.bsc.es/pid/ccharppi/.
Collapse
Affiliation(s)
- Iain H Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, 08034, Spain
| | | | | |
Collapse
|
28
|
Erijman A, Rosenthal E, Shifman JM. How structure defines affinity in protein-protein interactions. PLoS One 2014; 9:e110085. [PMID: 25329579 PMCID: PMC4199723 DOI: 10.1371/journal.pone.0110085] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2014] [Accepted: 09/14/2014] [Indexed: 01/29/2023] Open
Abstract
Protein-protein interactions (PPI) in nature are conveyed by a multitude of binding modes involving various surfaces, secondary structure elements and intermolecular interactions. This diversity results in PPI binding affinities that span more than nine orders of magnitude. Several early studies attempted to correlate PPI binding affinities to various structure-derived features with limited success. The growing number of high-resolution structures, the appearance of more precise methods for measuring binding affinities and the development of new computational algorithms enable more thorough investigations in this direction. Here, we use a large dataset of PPI structures with the documented binding affinities to calculate a number of structure-based features that could potentially define binding energetics. We explore how well each calculated biophysical feature alone correlates with binding affinity and determine the features that could be used to distinguish between high-, medium- and low- affinity PPIs. Furthermore, we test how various combinations of features could be applied to predict binding affinity and observe a slow improvement in correlation as more features are incorporated into the equation. In addition, we observe a considerable improvement in predictions if we exclude from our analysis low-resolution and NMR structures, revealing the importance of capturing exact intermolecular interactions in our calculations. Our analysis should facilitate prediction of new interactions on the genome scale, better characterization of signaling networks and design of novel binding partners for various target proteins.
Collapse
Affiliation(s)
- Ariel Erijman
- Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Eran Rosenthal
- Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Julia M. Shifman
- Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
- * E-mail:
| |
Collapse
|
29
|
Yugandhar K, Gromiha MM. Protein–protein binding affinity prediction from amino acid sequence. Bioinformatics 2014; 30:3583-9. [DOI: 10.1093/bioinformatics/btu580] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
|
30
|
Proteins Feel More Than They See: Fine-Tuning of Binding Affinity by Properties of the Non-Interacting Surface. J Mol Biol 2014; 426:2632-52. [DOI: 10.1016/j.jmb.2014.04.017] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2013] [Revised: 03/11/2014] [Accepted: 04/17/2014] [Indexed: 11/21/2022]
|
31
|
Yugandhar K, Gromiha MM. Feature selection and classification of protein-protein complexes based on their binding affinities using machine learning approaches. Proteins 2014; 82:2088-96. [PMID: 24648146 DOI: 10.1002/prot.24564] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2014] [Accepted: 03/14/2014] [Indexed: 12/16/2022]
Abstract
Protein-protein interactions are intrinsic to virtually every cellular process. Predicting the binding affinity of protein-protein complexes is one of the challenging problems in computational and molecular biology. In this work, we related sequence features of protein-protein complexes with their binding affinities using machine learning approaches. We set up a database of 185 protein-protein complexes for which the interacting pairs are heterodimers and their experimental binding affinities are available. On the other hand, we have developed a set of 610 features from the sequences of protein complexes and utilized Ranker search method, which is the combination of Attribute evaluator and Ranker method for selecting specific features. We have analyzed several machine learning algorithms to discriminate protein-protein complexes into high and low affinity groups based on their Kd values. Our results showed a 10-fold cross-validation accuracy of 76.1% with the combination of nine features using support vector machines. Further, we observed accuracy of 83.3% on an independent test set of 30 complexes. We suggest that our method would serve as an effective tool for identifying the interacting partners in protein-protein interaction networks and human-pathogen interactions based on the strength of interactions.
Collapse
Affiliation(s)
- K Yugandhar
- Department of Biotechnology, Indian Institute of Technology Madras, Chennai, 600036, Tamil Nadu, India
| | | |
Collapse
|
32
|
Wang L, Gu Q, Zheng X, Ye J, Liu Z, Li J, Hu X, Hagler A, Xu J. Discovery of new selective human aldose reductase inhibitors through virtual screening multiple binding pocket conformations. J Chem Inf Model 2013; 53:2409-22. [PMID: 23901876 DOI: 10.1021/ci400322j] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Aldose reductase reduces glucose to sorbitol. It plays a key role in many of the complications arising from diabetes. Thus, aldose reductase inhibitors (ARI) have been identified as promising therapeutic agents for treating such complications of diabetes, as neuropathy, nephropathy, retinopathy, and cataracts. In this paper, a virtual screening protocol applied to a library of compounds in house has been utilized to discover novel ARIs. IC50's were determined for 15 hits that inhibited ALR2 to greater than 50% at 50 μM, and ten of these have an IC50 of 10 μM or less, corresponding to a rather substantial hit rate of 14% at this level. The specificity of these compounds relative to their cross-reactivity with human ALR1 was also assessed by inhibition assays. This resulted in identification of novel inhibitors with IC50's comparable to the commercially available drug, epalrestat, and greater than an order of magnitude better selectivity.
Collapse
Affiliation(s)
- Ling Wang
- Research Center for Drug Discovery & Institute of Human Virology, School of Pharmaceutical Sciences, Sun Yat-Sen University , Guangzhou 510006, China
| | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Audie J, Swanson J. Advances in the Prediction of Protein-Peptide Binding Affinities: Implications for Peptide-Based Drug Discovery. Chem Biol Drug Des 2012; 81:50-60. [DOI: 10.1111/cbdd.12076] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
34
|
Kastritis PL, Bonvin AMJJ. On the binding affinity of macromolecular interactions: daring to ask why proteins interact. J R Soc Interface 2012; 10:20120835. [PMID: 23235262 PMCID: PMC3565702 DOI: 10.1098/rsif.2012.0835] [Citation(s) in RCA: 306] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Interactions between proteins are orchestrated in a precise and time-dependent manner, underlying cellular function. The binding affinity, defined as the strength of these interactions, is translated into physico-chemical terms in the dissociation constant (Kd), the latter being an experimental measure that determines whether an interaction will be formed in solution or not. Predicting binding affinity from structural models has been a matter of active research for more than 40 years because of its fundamental role in drug development. However, all available approaches are incapable of predicting the binding affinity of protein–protein complexes from coordinates alone. Here, we examine both theoretical and experimental limitations that complicate the derivation of structure–affinity relationships. Most work so far has concentrated on binary interactions. Systems of increased complexity are far from being understood. The main physico-chemical measure that relates to binding affinity is the buried surface area, but it does not hold for flexible complexes. For the latter, there must be a significant entropic contribution that will have to be approximated in the future. We foresee that any theoretical modelling of these interactions will have to follow an integrative approach considering the biology, chemistry and physics that underlie protein–protein recognition.
Collapse
Affiliation(s)
- Panagiotis L Kastritis
- Bijvoet Center for Biomolecular Research, Faculty of Science, Chemistry, Utrecht University, , Padualaan 8, Utrecht, The Netherlands
| | | |
Collapse
|
35
|
Recent work in the development and application of protein–peptide docking. Future Med Chem 2012; 4:1619-44. [DOI: 10.4155/fmc.12.99] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Interest in the development of novel peptide-based drugs is growing. There is, thus, a pressing need for the development of effective methods to enable novel peptide-based drug discovery. A cogent case can be made for the development and application of computational or in silico methods to assist with peptide discovery. In particular, there is a need for the development of effective protein–peptide docking methods. Here, recent work in the area of protein–peptide docking method development is reviewed and several drug-discovery projects that benefited from protein–peptide docking are discussed. In the present review, special attention is given to the search and scoring problems, the use of peptide docking to enable hit identification, and the use of peptide docking to help rationalize experimental results, and generate and test structure-based hypotheses. Finally, some recommendations are made for improving the future development and application of protein–peptide docking.
Collapse
|
36
|
Vreven T, Hwang H, Pierce BG, Weng Z. Prediction of protein-protein binding free energies. Protein Sci 2012; 21:396-404. [PMID: 22238219 DOI: 10.1002/pro.2027] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2011] [Revised: 12/23/2011] [Accepted: 01/04/2012] [Indexed: 11/09/2022]
Abstract
We present an energy function for predicting binding free energies of protein-protein complexes, using the three-dimensional structures of the complex and unbound proteins as input. Our function is a linear combination of nine terms and achieves a correlation coefficient of 0.63 with experimental measurements when tested on a benchmark of 144 complexes using leave-one-out cross validation. Although we systematically tested both atomic and residue-based scoring functions, the selected function is dominated by residue-based terms. Our function is stable for subsets of the benchmark stratified by experimental pH and extent of conformational change upon complex formation, with correlation coefficients ranging from 0.61 to 0.66.
Collapse
Affiliation(s)
- Thom Vreven
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts 01605, USA
| | | | | | | |
Collapse
|
37
|
Tian F, Lv Y, Yang L. Structure-based prediction of protein–protein binding affinity with consideration of allosteric effect. Amino Acids 2011; 43:531-43. [DOI: 10.1007/s00726-011-1101-1] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2011] [Accepted: 09/21/2011] [Indexed: 11/28/2022]
|
38
|
Moal IH, Agius R, Bates PA. Protein-protein binding affinity prediction on a diverse set of structures. Bioinformatics 2011; 27:3002-9. [PMID: 21903632 DOI: 10.1093/bioinformatics/btr513] [Citation(s) in RCA: 89] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/11/2024] Open
Abstract
MOTIVATION Accurate binding free energy functions for protein-protein interactions are imperative for a wide range of purposes. Their construction is predicated upon ascertaining the factors that influence binding and their relative importance. A recent benchmark of binding affinities has allowed, for the first time, the evaluation and construction of binding free energy models using a diverse set of complexes, and a systematic assessment of our ability to model the energetics of conformational changes. RESULTS We construct a large set of molecular descriptors using commonly available tools, introducing the use of energetic factors associated with conformational changes and disorder to order transitions, as well as features calculated on structural ensembles. The descriptors are used to train and test a binding free energy model using a consensus of four machine learning algorithms, whose performance constitutes a significant improvement over the other state of the art empirical free energy functions tested. The internal workings of the learners show how the descriptors are used, illuminating the determinants of protein-protein binding. AVAILABILITY The molecular descriptor set and descriptor values for all complexes are available in the Supplementary Material. A web server for the learners and coordinates for the bound and unbound structures can be accessed from the website: http://bmm.cancerresearchuk.org/~Affinity. CONTACT paul.bates@cancer.org.uk. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Iain H Moal
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London WC2A 3LY, UK
| | | | | |
Collapse
|
39
|
Melquiond AS, Karaca E, Kastritis PL, Bonvin AM. Next challenges in protein-protein docking: from proteome to interactome and beyond. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2011. [DOI: 10.1002/wcms.91] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
40
|
Kastritis PL, Moal IH, Hwang H, Weng Z, Bates PA, Bonvin AMJJ, Janin J. A structure-based benchmark for protein-protein binding affinity. Protein Sci 2011; 20:482-91. [PMID: 21213247 PMCID: PMC3064828 DOI: 10.1002/pro.580] [Citation(s) in RCA: 227] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2010] [Revised: 12/15/2010] [Accepted: 12/16/2010] [Indexed: 11/06/2022]
Abstract
We have assembled a nonredundant set of 144 protein-protein complexes that have high-resolution structures available for both the complexes and their unbound components, and for which dissociation constants have been measured by biophysical methods. The set is diverse in terms of the biological functions it represents, with complexes that involve G-proteins and receptor extracellular domains, as well as antigen/antibody, enzyme/inhibitor, and enzyme/substrate complexes. It is also diverse in terms of the partners' affinity for each other, with K(d) ranging between 10(-5) and 10(-14) M. Nine pairs of entries represent closely related complexes that have a similar structure, but a very different affinity, each pair comprising a cognate and a noncognate assembly. The unbound structures of the component proteins being available, conformation changes can be assessed. They are significant in most of the complexes, and large movements or disorder-to-order transitions are frequently observed. The set may be used to benchmark biophysical models aiming to relate affinity to structure in protein-protein interactions, taking into account the reactants and the conformation changes that accompany the association reaction, instead of just the final product.
Collapse
Affiliation(s)
- Panagiotis L Kastritis
- Bijvoet Center for Biomolecular Research, Faculty of Science, Utrecht University3584CH Utrecht, The Netherlands
| | - Iain H Moal
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, Lincoln's Inn Fields LaboratoriesLondon WC2A 3LY, United Kingdom
| | - Howook Hwang
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical SchoolWorcester, Massachusetts 01605
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical SchoolWorcester, Massachusetts 01605
| | - Paul A Bates
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, Lincoln's Inn Fields LaboratoriesLondon WC2A 3LY, United Kingdom
| | - Alexandre M J J Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science, Utrecht University3584CH Utrecht, The Netherlands
| | - Joël Janin
- Yeast Structural Genomics, IBBMC UMR 8619, Université Paris-Sud91405 Orsay, France
| |
Collapse
|
41
|
Ponomarev SY, Audie J. Computational prediction and analysis of the DR6-NAPP interaction. Proteins 2011; 79:1376-95. [DOI: 10.1002/prot.22962] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2010] [Revised: 11/02/2010] [Accepted: 11/24/2010] [Indexed: 01/14/2023]
|
42
|
Cohen M, Potapov V, Schreiber G. Four distances between pairs of amino acids provide a precise description of their interaction. PLoS Comput Biol 2009; 5:e1000470. [PMID: 19680437 PMCID: PMC2715887 DOI: 10.1371/journal.pcbi.1000470] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2009] [Accepted: 07/15/2009] [Indexed: 11/18/2022] Open
Abstract
The three-dimensional structures of proteins are stabilized by the interactions between amino acid residues. Here we report a method where four distances are calculated between any two side chains to provide an exact spatial definition of their bonds. The data were binned into a four-dimensional grid and compared to a random model, from which the preference for specific four-distances was calculated. A clear relation between the quality of the experimental data and the tightness of the distance distribution was observed, with crystal structure data providing far tighter distance distributions than NMR data. Since the four-distance data have higher information content than classical bond descriptions, we were able to identify many unique inter-residue features not found previously in proteins. For example, we found that the side chains of Arg, Glu, Val and Leu are not symmetrical in respect to the interactions of their head groups. The described method may be developed into a function, which computationally models accurately protein structures.
Collapse
Affiliation(s)
- Mati Cohen
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Vladimir Potapov
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Gideon Schreiber
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
43
|
Continued development of an empirical function for predicting and rationalizing protein–protein binding affinities. Biophys Chem 2009; 143:139-44. [DOI: 10.1016/j.bpc.2009.05.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2009] [Revised: 05/08/2009] [Accepted: 05/11/2009] [Indexed: 11/30/2022]
|
44
|
Durham E, Dorr B, Woetzel N, Staritzbichler R, Meiler J. Solvent accessible surface area approximations for rapid and accurate protein structure prediction. J Mol Model 2009; 15:1093-108. [PMID: 19234730 PMCID: PMC2712621 DOI: 10.1007/s00894-009-0454-9] [Citation(s) in RCA: 254] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2008] [Accepted: 01/02/2009] [Indexed: 12/01/2022]
Abstract
The burial of hydrophobic amino acids in the protein core is a driving force in protein folding. The extent to which an amino acid interacts with the solvent and the protein core is naturally proportional to the surface area exposed to these environments. However, an accurate calculation of the solvent-accessible surface area (SASA), a geometric measure of this exposure, is numerically demanding as it is not pair-wise decomposable. Furthermore, it depends on a full-atom representation of the molecule. This manuscript introduces a series of four SASA approximations of increasing computational complexity and accuracy as well as knowledge-based environment free energy potentials based on these SASA approximations. Their ability to distinguish correctly from incorrectly folded protein models is assessed to balance speed and accuracy for protein structure prediction. We find the newly developed “Neighbor Vector” algorithm provides the most optimal balance of accurate yet rapid exposure measures.
Collapse
Affiliation(s)
- Elizabeth Durham
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, 465 21st Ave South, Nashville, TN 37232-8725, USA
| | | | | | | | | |
Collapse
|
45
|
Dell'Orco D. Fast predictions of thermodynamics and kinetics of protein-protein recognition from structures: from molecular design to systems biology. MOLECULAR BIOSYSTEMS 2009; 5:323-34. [PMID: 19396368 DOI: 10.1039/b821580d] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The increasing call for an overall picture of the interactions between the components of a biological system that give rise to the observed function is often summarized by the expression systems biology. Both the interpretative and predictive capabilities of holistic models of biochemical systems, however, depend to a large extent on the level of physico-chemical knowledge of the individual molecular interactions making up the network. This review is focused on the structure-based quantitative characterization of protein-protein interactions, ubiquitous in any biochemical pathway. Recently developed, fast and effective computational methods are reviewed, which allow the assessment of kinetic and thermodynamic features of the association-dissociation processes of protein complexes, both in water soluble and membrane environments. The performance and the accuracy of fast and semi-empirical structure-based methods have reached comparable levels with respect to the classical and more elegant molecular simulations. Nevertheless, the broad accessibility and lower computational cost provide the former methods with the advantageous possibility to perform systems-level analyses including extensive in silico mutagenesis screenings and large-scale structural predictions of multiprotein complexes.
Collapse
Affiliation(s)
- Daniele Dell'Orco
- Department of Chemistry, University of Modena and Reggio Emilia, Via Campi 183, 41100, Modena, Italy.
| |
Collapse
|
46
|
Abstract
We present version 3.0 of our publicly available protein-protein docking benchmark. This update includes 40 new test cases, representing a 48% increase from Benchmark 2.0. For all of the new cases, the crystal structures of both binding partners are available. As with Benchmark 2.0, Structural Classification of Proteins (Murzin et al., J Mol Biol 1995;247:536-540) was used to remove redundant test cases. The 124 unbound-unbound test cases in Benchmark 3.0 are classified into 88 rigid-body cases, 19 medium-difficulty cases, and 17 difficult cases, based on the degree of conformational change at the interface upon complex formation. In addition to providing the community with more test cases for evaluating docking methods, the expansion of Benchmark 3.0 will facilitate the development of new algorithms that require a large number of training examples. Benchmark 3.0 is available to the public at http://zlab.bu.edu/benchmark.
Collapse
Affiliation(s)
- Howook Hwang
- Boston University Bioinformatics Program, Boston, USA
| | - Brian Pierce
- Boston University Bioinformatics Program, Boston, USA
| | | | - Joël Janin
- Yeast Structural Genomics, IBBMC Université Paris-Sud, CNRS UMR 8619, 91405-Orsay, France
| | - Zhiping Weng
- Boston University Bioinformatics Program, Boston, USA
- Boston University Biomedical Engineering Department, Boston, USA
| |
Collapse
|
47
|
Audie J. Development and validation of an empirical free energy function for calculating protein-protein binding free energy surfaces. Biophys Chem 2008; 139:84-91. [PMID: 19041170 DOI: 10.1016/j.bpc.2008.10.007] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2008] [Revised: 10/18/2008] [Accepted: 10/22/2008] [Indexed: 10/21/2022]
Abstract
In a previous paper, we described a novel empirical free energy function that was used to accurately predict experimental binding free energies for a diverse test set of 31 protein-protein complexes to within approximately 1.0 kcal. Here, we extend that work and show that an updated version of the function can be used to (1) accurately predict native binding free energies and (2) rank crystallographic, native-like and non-native binding modes in a physically realistic manner. The modified function includes terms designed to capture some of the unfavorable interactions that characterize non-native interfaces. The function was used to calculate one-dimensional binding free energy surfaces for 21 protein complexes. In roughly 90% of the cases tested, the function was used to place native-like and crystallographic binding modes in global free energy minima. Our analysis further suggests that buried hydrogen bonds might provide the key to distinguishing native from non-native interactions. To the best of our knowledge our function is the only one of its kind, a single expression that can be used to accurately calculate native and non-native binding free energies for a large number of proteins. Given the encouraging results presented in this paper, future work will focus on improving the function and applying it to the protein-protein docking problem.
Collapse
Affiliation(s)
- Joseph Audie
- Department of Chemistry, Sacred Heart University, Fairfield, CT 06825, USA.
| |
Collapse
|
48
|
Volume-based solvation models out-perform area-based models in combined studies of wild-type and mutated protein-protein interfaces. BMC Bioinformatics 2008; 9:448. [PMID: 18939984 PMCID: PMC2596146 DOI: 10.1186/1471-2105-9-448] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2008] [Accepted: 10/21/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Empirical binding models have previously been investigated for the energetics of protein complexation (DeltaG models) and for the influence of mutations on complexation (i.e. differences between wild-type and mutant complexes, DeltaDeltaG models). We construct binding models to directly compare these processes, which have generally been studied separately. RESULTS Although reasonable fit models were found for both DeltaG and DeltaDeltaG cases, they differ substantially. In a dataset curated for the absence of mainchain rearrangement upon binding, non-polar area burial is a major determinant of DeltaG models. However this DeltaG model does not fit well to the data for binding differences upon mutation. Burial of non-polar area is weighted down in fitting of DeltaDeltaG models. These calculations were made with no repacking of sidechains upon complexation, and only minimal packing upon mutation. We investigated the consequences of more extensive packing changes with a modified mean-field packing scheme. Rather than emphasising solvent exposure with relatively extended sidechains, rotamers are selected that exhibit maximal packing with protein. This provides solvent accessible areas for proteins that are much closer to those of experimental structures than the more extended sidechain regime. The new packing scheme increases changes in non-polar burial for mutants compared to wild-type proteins, but does not substantially improve agreement between DeltaG and DeltaDeltaG binding models. CONCLUSION We conclude that solvent accessible area, based on modelled mutant structures, is a poor correlate for DeltaDeltaG upon mutation. A simple volume-based, rather than solvent accessibility-based, model is constructed for DeltaG and DeltaDeltaG systems. This shows a more consistent behaviour. We discuss the efficacy of volume, as opposed to area, approaches to describe the energetic consequences of mutations at interfaces. This knowledge can be used to develop simple computational screens for binding in comparative modelled interfaces.
Collapse
|
49
|
Zhou HX, Qin S, Tjong H. Modeling Protein–Protein and Protein–Nucleic Acid Interactions: Structure, Thermodynamics, and Kinetics. ANNUAL REPORTS IN COMPUTATIONAL CHEMISTRY 2008. [DOI: 10.1016/s1574-1400(08)00004-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|