1
|
Shuvo MH, Bhattacharya D. EquiRank: Improved protein-protein interface quality estimation using protein language-model-informed equivariant graph neural networks. Comput Struct Biotechnol J 2024; 27:160-170. [PMID: 39850657 PMCID: PMC11755013 DOI: 10.1016/j.csbj.2024.12.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Revised: 12/18/2024] [Accepted: 12/20/2024] [Indexed: 01/25/2025] Open
Abstract
Quality estimation of the predicted interaction interface of protein complex structural models is not only important for complex model evaluation and selection but also useful for protein-protein docking. Despite recent progress fueled by symmetry-aware deep learning architectures and pretrained protein language models (pLMs), existing methods for estimating protein complex quality have yet to fully exploit the collective potentials of these advances for accurate estimation of protein-protein interface. Here we present EquiRank, an improved protein-protein interface quality estimation method by leveraging the strength of a symmetry-aware E(3) equivariant deep graph neural network (EGNN) and integrating pLM embeddings from the pretrained ESM-2 model. Our method estimates the quality of the protein-protein interface through an effective graph-based representation of interacting residue pairs, incorporating a diverse set of features, including ESM-2 embeddings, and then by learning the representation using symmetry-aware EGNNs. Our experimental results demonstrate improved ranking performance on diverse datasets over existing latest protein complex quality estimation methods including the top-performing CASP15 protein complex quality estimation method VoroIF_GNN and the self-assessment module of AlphaFold-Multimer repurposed for protein complex scoring and across different performance evaluation metrics. Additionally, our ablation studies demonstrate the contributions of both pLMs and the equivariant nature of EGNN for improved protein-protein interface quality estimation performance. EquiRank is freely available at https://github.com/mhshuvo1/EquiRank.
Collapse
Affiliation(s)
- Md Hossain Shuvo
- Department of Computer Science, Prairie View A&M University, Prairie View, 77446, TX, USA
| | | |
Collapse
|
2
|
Zeng C, Jian Y, Zhuo C, Li A, Zeng C, Zhao Y. Evaluation of DNA-protein complex structures using the deep learning method. Phys Chem Chem Phys 2023; 26:130-143. [PMID: 38063012 DOI: 10.1039/d3cp04980a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]
Abstract
Biological processes such as transcription, repair, and regulation require interactions between DNA and proteins. To unravel their functions, it is imperative to determine the high-resolution structures of DNA-protein complexes. However, experimental methods for this purpose are costly and technically demanding. Consequently, there is an urgent need for computational techniques to identify the structures of DNA-protein complexes. Despite technological advancements, accurately identifying DNA-protein complexes through computational methods still poses a challenge. Our team has developed a cutting-edge deep-learning approach called DDPScore that assesses DNA-protein complex structures. DDPScore utilizes a 4D convolutional neural network to overcome limited training data. This approach effectively captures local and global features while comprehensively considering the conformational changes arising from the flexibility during the DNA-protein docking process. DDPScore consistently outperformed the available methods in comprehensive DNA-protein complex docking evaluations, even for the flexible docking challenges. DDPScore has a wide range of applications in predicting and designing structures of DNA-protein complexes.
Collapse
Affiliation(s)
- Chengwei Zeng
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, 430079, China.
| | - Yiren Jian
- Department of Computer Science, Dartmouth College, Hanover, NH 03755, USA
| | - Chen Zhuo
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, 430079, China.
| | - Anbang Li
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, 430079, China.
| | - Chen Zeng
- Department of Physics, The George Washington University, Washington, DC 20052, USA
| | - Yunjie Zhao
- Institute of Biophysics and Department of Physics, Central China Normal University, Wuhan, 430079, China.
| |
Collapse
|
3
|
Shuvo MH, Karim M, Roche R, Bhattacharya D. PIQLE: protein-protein interface quality estimation by deep graph learning of multimeric interaction geometries. BIOINFORMATICS ADVANCES 2023; 3:vbad070. [PMID: 37351310 PMCID: PMC10281963 DOI: 10.1093/bioadv/vbad070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 05/17/2023] [Accepted: 06/01/2023] [Indexed: 06/24/2023]
Abstract
Motivation Accurate modeling of protein-protein interaction interface is essential for high-quality protein complex structure prediction. Existing approaches for estimating the quality of a predicted protein complex structural model utilize only the physicochemical properties or energetic contributions of the interacting atoms, ignoring evolutionarily information or inter-atomic multimeric geometries, including interaction distance and orientations. Results Here, we present PIQLE, a deep graph learning method for protein-protein interface quality estimation. PIQLE leverages multimeric interaction geometries and evolutionarily information along with sequence- and structure-derived features to estimate the quality of individual interactions between the interfacial residues using a multi-head graph attention network and then probabilistically combines the estimated quality for scoring the overall interface. Experimental results show that PIQLE consistently outperforms existing state-of-the-art methods including DProQA, TRScore, GNN-DOVE and DOVE on multiple independent test datasets across a wide range of evaluation metrics. Our ablation study and comparison with the self-assessment module of AlphaFold-Multimer repurposed for protein complex scoring reveal that the performance gains are connected to the effectiveness of the multi-head graph attention network in leveraging multimeric interaction geometries and evolutionary information along with other sequence- and structure-derived features adopted in PIQLE. Availability and implementation An open-source software implementation of PIQLE is freely available at https://github.com/Bhattacharya-Lab/PIQLE. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Md Hossain Shuvo
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
| | - Mohimenul Karim
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
| | - Rahmatullah Roche
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
| | | |
Collapse
|
4
|
Shuvo MH, Karim M, Roche R, Bhattacharya D. PIQLE: protein-protein interface quality estimation by deep graph learning of multimeric interaction geometries. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.14.528528. [PMID: 36824789 PMCID: PMC9949034 DOI: 10.1101/2023.02.14.528528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Accurate modeling of protein-protein interaction interface is essential for high-quality protein complex structure prediction. Existing approaches for estimating the quality of a predicted protein complex structural model utilize only the physicochemical properties or energetic contributions of the interacting atoms, ignoring evolutionarily information or inter-atomic multimeric geometries, including interaction distance and orientations. Here we present PIQLE, a deep graph learning method for protein-protein interface quality estimation. PIQLE leverages multimeric interaction geometries and evolutionarily information along with sequence- and structure-derived features to estimate the quality of the individual interactions between the interfacial residues using a multihead graph attention network and then probabilistically combines the estimated quality of the interfacial residues for scoring the overall interface. Experimental results show that PIQLE consistently outperforms existing state-of-the-art methods on multiple independent test datasets across a wide range of evaluation metrics. Our ablation study reveals that the performance gains are connected to the effectiveness of the multihead graph attention network in leveraging multimeric interaction geometries and evolutionary information along with other sequence- and structure-derived features adopted in PIQLE. An open-source software implementation of PIQLE, licensed under the GNU General Public License v3, is freely available at https://github.com/Bhattacharya-Lab/PIQLE .
Collapse
Affiliation(s)
- Md Hossain Shuvo
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, United States of America
| | - Mohimenul Karim
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, United States of America
| | - Rahmatullah Roche
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, United States of America
| | - Debswapna Bhattacharya
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, United States of America
| |
Collapse
|
5
|
Alam R, Samad A, Ahammad F, Nur SM, Alsaiari AA, Imon RR, Talukder MEK, Nain Z, Rahman MM, Mohammad F, Karpiński TM. In silico formulation of a next-generation multiepitope vaccine for use as a prophylactic candidate against Crimean-Congo hemorrhagic fever. BMC Med 2023; 21:36. [PMID: 36726141 PMCID: PMC9891764 DOI: 10.1186/s12916-023-02750-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Accepted: 01/24/2023] [Indexed: 02/03/2023] Open
Abstract
BACKGROUND Crimean-Congo hemorrhagic fever (CCHF) is a widespread disease transmitted to humans and livestock animals through the bite of infected ticks or close contact with infected persons' blood, organs, or other bodily fluids. The virus is responsible for severe viral hemorrhagic fever outbreaks, with a case fatality rate of up to 40%. Despite having the highest fatality rate of the virus, a suitable treatment option or vaccination has not been developed yet. Therefore, this study aimed to formulate a multiepitope vaccine against CCHF through computational vaccine design approaches. METHODS The glycoprotein, nucleoprotein, and RNA-dependent RNA polymerase of CCHF were utilized to determine immunodominant T- and B-cell epitopes. Subsequently, an integrative computational vaccinology approach was used to formulate a multi-epitopes vaccine candidate against the virus. RESULTS After rigorous assessment, a multiepitope vaccine was constructed, which was antigenic, immunogenic, and non-allergenic with desired physicochemical properties. Molecular dynamics (MD) simulations of the vaccine-receptor complex show strong stability of the vaccine candidates to the targeted immune receptor. Additionally, the immune simulation of the vaccine candidates found that the vaccine could trigger real-life-like immune responses upon administration to humans. CONCLUSIONS Finally, we concluded that the formulated multiepitope vaccine candidates would provide excellent prophylactic properties against CCHF.
Collapse
Affiliation(s)
- Rahat Alam
- Department of Genetic Engineering and Biotechnology, Jashore University of Science and Technology, Jashore, 7408, Bangladesh
- Laboratory of Computational Biology, Biological Solution Centre (BioSol Centre), Jashore, 7408, Bangladesh
| | - Abdus Samad
- Department of Genetic Engineering and Biotechnology, Jashore University of Science and Technology, Jashore, 7408, Bangladesh
- Laboratory of Computational Biology, Biological Solution Centre (BioSol Centre), Jashore, 7408, Bangladesh
| | - Foysal Ahammad
- Laboratory of Computational Biology, Biological Solution Centre (BioSol Centre), Jashore, 7408, Bangladesh
- Division of Biological and Biomedical Sciences (BBS), College of Health and Life Sciences (CHLS), Hamad Bin Khalifa University (HBKU), 34110, Doha, Qatar
| | - Suza Mohammad Nur
- Department of Biochemistry, School of Medicine Case, Western Reserve University, Cleveland, OH, 44106, USA
| | - Ahad Amer Alsaiari
- College of Applied Medical Science, Clinical Laboratories Science Department, Taif University, Taif, 21944, Saudi Arabia
| | - Raihan Rahman Imon
- Department of Genetic Engineering and Biotechnology, Jashore University of Science and Technology, Jashore, 7408, Bangladesh
- Laboratory of Computational Biology, Biological Solution Centre (BioSol Centre), Jashore, 7408, Bangladesh
| | - Md Enamul Kabir Talukder
- Department of Genetic Engineering and Biotechnology, Jashore University of Science and Technology, Jashore, 7408, Bangladesh
- Laboratory of Computational Biology, Biological Solution Centre (BioSol Centre), Jashore, 7408, Bangladesh
| | - Zulkar Nain
- School of Biomedical Sciences, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Md Mashiar Rahman
- Department of Genetic Engineering and Biotechnology, Jashore University of Science and Technology, Jashore, 7408, Bangladesh
| | - Farhan Mohammad
- Division of Biological and Biomedical Sciences (BBS), College of Health and Life Sciences (CHLS), Hamad Bin Khalifa University (HBKU), 34110, Doha, Qatar.
| | - Tomasz M Karpiński
- Chair and Department of Medical Microbiology, Poznań University of Medical Sciences, Rokietnicka 10, 60-806, Poznań, Poland.
| |
Collapse
|
6
|
Ozkan A, Sitharam M, Flores-Canales JC, Prabhu R, Kurnikova M. Baseline Comparisons of Complementary Sampling Methods for Assembly Driven by Short-Ranged Pair Potentials toward Fast and Flexible Hybridization. J Chem Theory Comput 2021; 17:1967-1987. [PMID: 33576635 DOI: 10.1021/acs.jctc.0c00945] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
This work measures baseline sampling characteristics that highlight fundamental differences between sampling methods for assembly driven by short-ranged pair potentials. Such granular comparison is essential for fast, flexible, and accurate hybridization of complementary methods. Besides sampling speed, efficiency, and accuracy of uniform grid coverage, other sampling characteristics measured are (i) accuracy of covering narrow low energy regions that have low effective dimension (ii) ability to localize sampling to specific basins, and (iii) flexibility in sampling distributions. As a proof of concept, we compare a recently developed geometric methodology EASAL (Efficient Atlasing and Search of Assembly Landscapes) and the traditional Monte Carlo (MC) method for sampling the energy landscape of two assembling trans-membrane helices, driven by short-range pair potentials. By measuring the above-mentioned sampling characteristics, we demonstrate that EASAL provides localized and accurate coverage of crucial regions of the energy landscape of low effective dimension, under flexible sampling distributions, with much fewer samples and computational resources than MC sampling. EASAL's empirically validated theoretical guarantees permit credible extrapolation of these measurements and comparisons to arbitrary number and size of assembling units. Promising avenues for hybridizing the complementary advantages of the two methods are discussed.
Collapse
Affiliation(s)
- Aysegul Ozkan
- CISE Department, University of Florida, Gainesville, Florida 32611-6120, United States
| | - Meera Sitharam
- CISE Department, University of Florida, Gainesville, Florida 32611-6120, United States
| | | | - Rahul Prabhu
- CISE Department, University of Florida, Gainesville, Florida 32611-6120, United States
| | - Maria Kurnikova
- Chemistry Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| |
Collapse
|
7
|
Tanemura KA, Pei J, Merz KM. Refinement of pairwise potentials via logistic regression to score protein-protein interactions. Proteins 2020; 88:1559-1568. [PMID: 32729132 DOI: 10.1002/prot.25973] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Revised: 05/17/2020] [Accepted: 06/14/2020] [Indexed: 12/20/2022]
Abstract
Protein-protein interactions (PPIs) are ubiquitous and functionally of great importance in biological systems. Hence, the accurate prediction of PPIs by protein-protein docking and scoring tools is highly desirable in order to characterize their structure and biological function. Ab initio docking protocols are divided into the sampling of docking poses to produce at least one near-native structure, and then to evaluate the vast candidate structures by scoring. Concurrent development in both sampling and scoring is crucial for the deployment of protein-protein docking software. In the present work, we apply a machine learning model on pairwise potentials to refine the task of protein quaternary structure native structure detection among decoys. A decoy set was featurized using the Knowledge and Empirical Combined Scoring Algorithm 2 (KECSA2) pairwise potential. The highly unbalanced decoy set was then balanced using a comparison concept between native and decoy structures. The resultant comparison descriptors were used to train a logistic regression (LR) classifier. The LR model yielded the optimal performance for native detection among decoys compared with conventional scoring functions, while exhibiting lesser performance for the detection of low root mean square deviation decoy structures. Its deployment on an independent benchmark set confirms that the scoring function performs competitively relative to other scoring functions. The scripts used are available at https://github.com/TanemuraKiyoto/PPI-native-detection-via-LR.
Collapse
Affiliation(s)
- Kiyoto A Tanemura
- Department of Chemistry, Michigan State University, East Lansing, Michigan, USA
| | - Jun Pei
- Department of Chemistry, Michigan State University, East Lansing, Michigan, USA
| | - Kenneth M Merz
- Department of Chemistry, Michigan State University, East Lansing, Michigan, USA
| |
Collapse
|
8
|
Yan Y, Tao H, He J, Huang SY. The HDOCK server for integrated protein–protein docking. Nat Protoc 2020; 15:1829-1852. [DOI: 10.1038/s41596-020-0312-x] [Citation(s) in RCA: 288] [Impact Index Per Article: 57.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2019] [Accepted: 02/03/2020] [Indexed: 12/27/2022]
|
9
|
Singh A, Dauzhenka T, Kundrotas PJ, Sternberg MJE, Vakser IA. Application of docking methodologies to modeled proteins. Proteins 2020; 88:1180-1188. [PMID: 32170770 DOI: 10.1002/prot.25889] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Revised: 02/15/2020] [Accepted: 03/07/2020] [Indexed: 12/12/2022]
Abstract
Protein docking is essential for structural characterization of protein interactions. Besides providing the structure of protein complexes, modeling of proteins and their complexes is important for understanding the fundamental principles and specific aspects of protein interactions. The accuracy of protein modeling, in general, is still less than that of the experimental approaches. Thus, it is important to investigate the applicability of docking techniques to modeled proteins. We present new comprehensive benchmark sets of protein models for the development and validation of protein docking, as well as a systematic assessment of free and template-based docking techniques on these sets. As opposed to previous studies, the benchmark sets reflect the real case modeling/docking scenario where the accuracy of the models is assessed by the modeling procedure, without reference to the native structure (which would be unknown in practical applications). We also expanded the analysis to include docking of protein pairs where proteins have different structural accuracy. The results show that, in general, the template-based docking is less sensitive to the structural inaccuracies of the models than the free docking. The near-native docking poses generated by the template-based approach, typically, also have higher ranks than those produces by the free docking (although the free docking is indispensable in modeling the multiplicity of protein interactions in a crowded cellular environment). The results show that docking techniques are applicable to protein models in a broad range of modeling accuracy. The study provides clear guidelines for practical applications of docking to protein models.
Collapse
Affiliation(s)
- Amar Singh
- Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA
| | - Taras Dauzhenka
- Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA
| | - Petras J Kundrotas
- Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA
| | - Michael J E Sternberg
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, South Kensington, London, UK
| | - Ilya A Vakser
- Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA.,Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas, USA
| |
Collapse
|
10
|
Yan Y, He J, Feng Y, Lin P, Tao H, Huang SY. Challenges and opportunities of automated protein-protein docking: HDOCK server vs human predictions in CAPRI Rounds 38-46. Proteins 2020; 88:1055-1069. [PMID: 31994779 DOI: 10.1002/prot.25874] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Revised: 01/02/2020] [Accepted: 01/22/2020] [Indexed: 12/12/2022]
Abstract
Protein-protein docking plays an important role in the computational prediction of the complex structure between two proteins. For years, a variety of docking algorithms have been developed, as witnessed by the critical assessment of prediction interactions (CAPRI) experiments. However, despite their successes, many docking algorithms often require a series of manual operations like modeling structures from sequences, incorporating biological information, and selecting final models. The difficulties in these manual steps have significantly limited the applications of protein-protein docking, as most of the users in the community are nonexperts in docking. Therefore, automated docking like a web server, which can give a comparable performance to human docking protocol, is pressingly needed. As such, we have participated in the blind CAPRI experiments for Rounds 38-45 and CASP13-CAPRI challenge for Round 46 with both our HDOCK automated docking web server and human docking protocol. It was shown that our HDOCK server achieved an "acceptable" or higher CAPRI-rated model in the top 10 submitted predictions for 65.5% and 59.1% of the targets in the docking experiments of CAPRI and CASP13-CAPRI, respectively, which are comparable to 66.7% and 54.5% for human docking protocol. Similar trends can also be observed in the scoring experiments. These results validated our HDOCK server as an efficient automated docking protocol for nonexpert users. Challenges and opportunities of automated docking are also discussed.
Collapse
Affiliation(s)
- Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Jiahua He
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Yuyu Feng
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Peicong Lin
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Huanyu Tao
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, People's Republic of China
| |
Collapse
|
11
|
Abstract
Many of the biological functions of the cell are driven by protein-protein interactions. However, determining which proteins interact and exactly how they do so to enable their functions, remain major research questions. Functional interactions are dependent on a number of complicated factors; therefore, modeling the three-dimensional structure of protein-protein complexes is still considered a complex endeavor. Nevertheless, the rewards for modeling protein interactions to atomic level detail are substantial, and there are numerous examples of how models can provide useful information for drug design, protein engineering, systems biology, and understanding of the immune system. Here, we provide practical guidelines for docking proteins using the web-server, SwarmDock, a flexible protein-protein docking method. Moreover, we provide an overview of the factors that need to be considered when deciding whether docking is likely to be successful.
Collapse
Affiliation(s)
- Iain H Moal
- European Bioinformatics Institute, Hinxton, UK
| | | | | | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK.
| |
Collapse
|
12
|
Abstract
Background Protein-protein docking is a valuable computational approach for investigating protein-protein interactions. Shape complementarity is the most basic component of a scoring function and plays an important role in protein-protein docking. Despite significant progresses, shape representation remains an open question in the development of protein-protein docking algorithms, especially for grid-based docking approaches. Results We have proposed a new pairwise shape-based scoring function (LSC) for protein-protein docking which adopts an exponential form to take into account long-range interactions between protein atoms. The LSC scoring function was incorporated into our FFT-based docking program and evaluated for both bound and unbound docking on the protein docking benchmark 4.0. It was shown that our LSC achieved a significantly better performance than four other similar docking methods, ZDOCK 2.1, MolFit/G, GRAMM, and FTDock/G, in both success rate and number of hits. When considering the top 10 predictions, LSC obtained a success rate of 51.71% and 6.82% for bound and unbound docking, respectively, compared to 42.61% and 4.55% for the second-best program ZDOCK 2.1. LSC also yielded an average of 8.38 and 3.94 hits per complex in the top 1000 predictions for bound and unbound docking, respectively, followed by 6.38 and 2.96 hits for the second-best ZDOCK 2.1. Conclusions The present LSC method will not only provide an initial-stage docking approach for post-docking processes but also have a general implementation for accurate representation of other energy terms on grids in protein-protein docking. The software has been implemented in our HDOCK web server at http://hdock.phys.hust.edu.cn/.
Collapse
|
13
|
Perthold JW, Oostenbrink C. GroScore: Accurate Scoring of Protein–Protein Binding Poses Using Explicit-Solvent Free-Energy Calculations. J Chem Inf Model 2019; 59:5074-5085. [DOI: 10.1021/acs.jcim.9b00687] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Jan Walther Perthold
- Institute of Molecular Modeling and Simulation, University of Natural Resources and Life Sciences, Muthgasse 18, 1190 Vienna, Austria
| | - Chris Oostenbrink
- Institute of Molecular Modeling and Simulation, University of Natural Resources and Life Sciences, Muthgasse 18, 1190 Vienna, Austria
| |
Collapse
|
14
|
Kong R, Wang F, Zhang J, Wang F, Chang S. CoDockPP: A Multistage Approach for Global and Site-Specific Protein–Protein Docking. J Chem Inf Model 2019; 59:3556-3564. [DOI: 10.1021/acs.jcim.9b00445] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Affiliation(s)
- Ren Kong
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou 213001, China
| | - Feng Wang
- School of Information Science & Engineering, Changzhou University, Changzhou 213164, China
| | - Jian Zhang
- Department of Pathophysiology, Key Laboratory of Cell Differentiation and Apoptosis of National Ministry of Education, Shanghai Jiao Tong University, School of Medicine, Shanghai 200025, China
| | - Fengfei Wang
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou 213001, China
| | - Shan Chang
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou 213001, China
| |
Collapse
|
15
|
Computational Modeling of Designed Ankyrin Repeat Protein Complexes with Their Targets. J Mol Biol 2019; 431:2852-2868. [DOI: 10.1016/j.jmb.2019.05.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Revised: 05/03/2019] [Accepted: 05/03/2019] [Indexed: 01/24/2023]
|
16
|
Yan Y, Zhang D, Zhou P, Li B, Huang SY. HDOCK: a web server for protein-protein and protein-DNA/RNA docking based on a hybrid strategy. Nucleic Acids Res 2019; 45:W365-W373. [PMID: 28521030 PMCID: PMC5793843 DOI: 10.1093/nar/gkx407] [Citation(s) in RCA: 768] [Impact Index Per Article: 128.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Accepted: 04/29/2017] [Indexed: 12/16/2022] Open
Abstract
Protein–protein and protein–DNA/RNA interactions play a fundamental role in a variety of biological processes. Determining the complex structures of these interactions is valuable, in which molecular docking has played an important role. To automatically make use of the binding information from the PDB in docking, here we have presented HDOCK, a novel web server of our hybrid docking algorithm of template-based modeling and free docking, in which cases with misleading templates can be rescued by the free docking protocol. The server supports protein–protein and protein–DNA/RNA docking and accepts both sequence and structure inputs for proteins. The docking process is fast and consumes about 10–20 min for a docking run. Tested on the cases with weakly homologous complexes of <30% sequence identity from five docking benchmarks, the HDOCK pipeline tied with template-based modeling on the protein–protein and protein–DNA benchmarks and performed better than template-based modeling on the three protein–RNA benchmarks when the top 10 predictions were considered. The performance of HDOCK became better when more predictions were considered. Combining the results of HDOCK and template-based modeling by ranking first of the template-based model further improved the predictive power of the server. The HDOCK web server is available at http://hdock.phys.hust.edu.cn/.
Collapse
Affiliation(s)
- Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Di Zhang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Pei Zhou
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Botong Li
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
17
|
Porter KA, Desta I, Kozakov D, Vajda S. What method to use for protein-protein docking? Curr Opin Struct Biol 2019; 55:1-7. [PMID: 30711743 PMCID: PMC6669123 DOI: 10.1016/j.sbi.2018.12.010] [Citation(s) in RCA: 70] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Accepted: 12/22/2018] [Indexed: 10/27/2022]
Abstract
A number of well-established servers perform 'free' docking of proteins of known structures. In contrast, template-based docking can start from sequences if structures are available for complexes that are homologous to the target. On the basis of the results of the CAPRI-CASP structure prediction experiments, template-based methods yield more accurate predictions if good templates can be found, but generally fail without such templates. However, free global docking, or focused docking around even poor quality template-based models, can still generate acceptable docked structures in these cases. In accordance with the analysis of a benchmark set, free docking of heterodimers yields acceptable or better predictions in the top 10 models for around 40% of structures. However, it is likely that a combination of template-based and free docking methods can perform better for targets that have template structures available. Another way of improving the reliability of predictions is adding experimental information as restraints, an option built into several docking servers.
Collapse
Affiliation(s)
- Kathryn A Porter
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Israel Desta
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, NY, USA.
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA; Department of Chemistry, Boston University, Boston, MA 02215, USA.
| |
Collapse
|
18
|
Galeazzi R, Laudadio E, Falconi E, Massaccesi L, Ercolani L, Mobbili G, Minnelli C, Scirè A, Cianfruglia L, Armeni T. Protein-protein interactions of human glyoxalase II: findings of a reliable docking protocol. Org Biomol Chem 2019; 16:5167-5177. [PMID: 29971290 DOI: 10.1039/c8ob01194j] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Glyoxalase II (GlxII) is an antioxidant glutathione-dependent enzyme, which catalyzes the hydrolysis of S-d-lactoylglutathione to form d-lactic acid and glutathione (GSH). The last product is the most important thiol reducing agent present in all eukaryotic cells that have mitochondria and chloroplasts. It is generally known that GSH plays a crucial role not only in the cellular redox state but also in various cellular processes. One of them is protein S-glutathionylation, a process that can occur through an oxidation reaction of proteins' thiol groups by GSH. Changes in protein S-glutathionylation have been associated with a range of human diseases such as diabetes, cardiovascular and pulmonary diseases, neurodegenerative diseases and cancer. Within a major project aimed at elucidating the role of GlxII in the mechanism of S-glutathionylation, a reliable computational protocol consisting of a protein-protein docking approach followed by atomistic Molecular Dynamics (MD) simulations was developed and it was applied to the prediction of molecular associations between human GlxII (in the presence and absence of GSH) and some proteins that are known to be S-glutathionylated in vitro, such as actin, malate dehydrogenase (MDH) and glyceraldehyde-3-phosphate dehydrogenase (GAPDH). The computational results show a high propensity of GlxII to interact with actin and MDH through its active site and a high stability of the GlxII-protein systems when GSH is present. Moreover, close proximities of GSH with actin and MDH cysteine residues have been found, suggesting that GlxII could be able to perform protein S-glutathionylation by using the GSH molecule present in its catalytic site.
Collapse
Affiliation(s)
- Roberta Galeazzi
- Department of Life and Environmental Sciences, Università Politecnica delle Marche, Ancona, Italy.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Samsonov SA, Zacharias M, Chauvot de Beauchene I. Modeling large protein-glycosaminoglycan complexes using a fragment-based approach. J Comput Chem 2019; 40:1429-1439. [PMID: 30768805 DOI: 10.1002/jcc.25797] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Revised: 01/10/2019] [Accepted: 01/16/2019] [Indexed: 11/07/2022]
Abstract
Glycosaminoglycans (GAGs), a major constituent of the extracellular matrix, participate in cell-signaling by binding specific proteins. Structural data on protein-GAG interactions are crucial to understand and modulate these signaling processes, with potential applications in regenerative medicine. However, experimental and theoretical approaches used to study GAG-protein systems are challenged by GAGs high flexibility limiting the conformational sampling above a certain size, and by the scarcity of GAG-specific docking tools compared to protein-protein or protein-drug docking approaches. We present for the first time an automated fragment-based method for docking GAGs on a protein binding site. In this approach, trimeric GAG fragments are flexibly docked to the protein, assembled based on their spacial overlap, and refined by molecular dynamics. The method appeared more successful than the classical full-ligand approach for most of 13 tested complexes with known structure. The approach is particularly promising for docking of long GAG chains, which represents a bottleneck for classical docking approaches applied to these systems. © 2019 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Sergey A Samsonov
- Faculty of Chemistry, University of Gdańsk, ul. Wita Stwosza 63, 80-308, Gdańsk, Poland
| | - Martin Zacharias
- Physics Department, Technical University of Munich, James-Franck Strasse 1, 85748, Garching, Germany
| | - Isaure Chauvot de Beauchene
- CNRS, LORIA (CNRS, Inria NGE, Université de Lorraine), Campus Scientifique, 615 rue du Jardin Botanique, Vandœuvre-lès-Nancy, F-54506, France
| |
Collapse
|
20
|
Siebenmorgen T, Zacharias M. Evaluation of Predicted Protein-Protein Complexes by Binding Free Energy Simulations. J Chem Theory Comput 2019; 15:2071-2086. [PMID: 30698954 DOI: 10.1021/acs.jctc.8b01022] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The accurate prediction of protein-protein complex geometries is of major importance to ultimately model the complete interactome of interacting proteins in a cell. A major bottleneck is the realistic free energy evaluation of predicted docked structures. Typically, simple scoring functions applied to single-complex structures are employed that neglect conformational entropy and often solvent effects completely. The binding free energy of a predicted protein-protein complex can, however, be calculated using umbrella sampling (US) along a predefined dissociation/association coordinate of a complex. We employed atomistic US-molecular dynamics simulations including appropriate conformational and axial restraints and an implicit generalized Born solvent model to calculate binding free energies of a large set of docked decoys for 20 different complexes. Free energies associated with the restraints were calculated separately. In principle, the approach includes all energetic and entropic contributions to the binding process. The evaluation of docked complexes based on binding free energy calculation was in better agreement with experiment compared to a simple scoring based on energy minimization or MD refinement using exactly the same force field description. Even calculated absolute binding free energies of structures close to the native binding geometry showed a reasonable correlation to experiment. However, still for a number of complexes docked decoys of lower free energy than near-native geometries were found indicating inaccuracies in the force field or the implicit solvent model. Although time consuming the approach may open up a new route for realistic ranking of predicted geometries based on calculated free energy of binding.
Collapse
Affiliation(s)
- Till Siebenmorgen
- Physik-Department T38 , Technische Universität München , James-Franck-Strasse 1 , 85748 Garching , Germany
| | - Martin Zacharias
- Physik-Department T38 , Technische Universität München , James-Franck-Strasse 1 , 85748 Garching , Germany
| |
Collapse
|
21
|
Nithin C, Ghosh P, Bujnicki JM. Bioinformatics Tools and Benchmarks for Computational Docking and 3D Structure Prediction of RNA-Protein Complexes. Genes (Basel) 2018; 9:genes9090432. [PMID: 30149645 PMCID: PMC6162694 DOI: 10.3390/genes9090432] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Revised: 07/26/2018] [Accepted: 08/21/2018] [Indexed: 12/29/2022] Open
Abstract
RNA-protein (RNP) interactions play essential roles in many biological processes, such as regulation of co-transcriptional and post-transcriptional gene expression, RNA splicing, transport, storage and stabilization, as well as protein synthesis. An increasing number of RNP structures would aid in a better understanding of these processes. However, due to the technical difficulties associated with experimental determination of macromolecular structures by high-resolution methods, studies on RNP recognition and complex formation present significant challenges. As an alternative, computational prediction of RNP interactions can be carried out. Structural models obtained by theoretical predictive methods are, in general, less reliable compared to models based on experimental measurements but they can be sufficiently accurate to be used as a basis for to formulating functional hypotheses. In this article, we present an overview of computational methods for 3D structure prediction of RNP complexes. We discuss currently available methods for macromolecular docking and for scoring 3D structural models of RNP complexes in particular. Additionally, we also review benchmarks that have been developed to assess the accuracy of these methods.
Collapse
Affiliation(s)
- Chandran Nithin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.
| | - Pritha Ghosh
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, PL-61-614 Poznan, Poland.
| |
Collapse
|
22
|
Simões ICM, Coimbra JTS, Neves RPP, Costa IPD, Ramos MJ, Fernandes PA. Properties that rank protein:protein docking poses with high accuracy. Phys Chem Chem Phys 2018; 20:20927-20942. [DOI: 10.1039/c8cp03888k] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
The development of docking algorithms to predict near-native structures of protein:protein complexes from the structure of the isolated monomers is of paramount importance for molecular biology and drug discovery.
Collapse
Affiliation(s)
- Inês C. M. Simões
- UCIBIO
- REQUIMTE
- Departamento de Química e Bioquímica
- Faculdade de Ciências
- Universidade do Porto
| | - João T. S. Coimbra
- UCIBIO
- REQUIMTE
- Departamento de Química e Bioquímica
- Faculdade de Ciências
- Universidade do Porto
| | - Rui P. P. Neves
- UCIBIO
- REQUIMTE
- Departamento de Química e Bioquímica
- Faculdade de Ciências
- Universidade do Porto
| | - Inês P. D. Costa
- UCIBIO
- REQUIMTE
- Departamento de Química e Bioquímica
- Faculdade de Ciências
- Universidade do Porto
| | - Maria J. Ramos
- UCIBIO
- REQUIMTE
- Departamento de Química e Bioquímica
- Faculdade de Ciências
- Universidade do Porto
| | - Pedro A. Fernandes
- UCIBIO
- REQUIMTE
- Departamento de Química e Bioquímica
- Faculdade de Ciências
- Universidade do Porto
| |
Collapse
|
23
|
Abstract
The immune systems protect our bodies from foreign molecules or antigens, where antibodies play important roles. Antibodies evolve over time upon antigen encounter by somatically mutating their genome sequences. The end result is a series of antibodies that display higher affinities and specificities to specific antigens. This process is called affinity maturation. Recent improvements in computer hardware and modeling algorithms now enable the rational design of protein structures and functions, and several works on computer-aided antibody design have been published. In this chapter, we briefly describe computational methods for antibody affinity maturation, focusing on methods for sampling antibody conformations and for scoring designed antibody variants. We also discuss lessons learned from the successful computer-aided design of antibodies.
Collapse
Affiliation(s)
- Daisuke Kuroda
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan
| | - Kouhei Tsumoto
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan.
- Medical Proteomics Laboratory, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan.
| |
Collapse
|
24
|
Abstract
The atomic structures of protein complexes can provide useful information for drug design, protein engineering, systems biology, and understanding pathology. Obtaining this information experimentally can be challenging. However, if the structures of the subunits are known, then it is often possible to model the complex computationally. This chapter provide practical guidelines for docking proteins using the SwarmDock flexible protein-protein docking method, providing an overview of the factors that need to be considered when deciding whether docking is likely to be successful, the preparation of structural input, generation of docked poses, analysis and ranking of docked poses, and the validation of models using external data.
Collapse
Affiliation(s)
- Iain H Moal
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK.
| | | | - Paul A Bates
- Biomolecular Modelling Laboratory, The Francis Crick Institute, London, UK
| |
Collapse
|
25
|
Vistoli G, Pedretti A, Mazzolari A, Testa B. Approaching Pharmacological Space: Events and Components. Methods Mol Biol 2018; 1800:245-274. [PMID: 29934897 DOI: 10.1007/978-1-4939-7899-1_12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
With a view to introducing the concept of pharmacological space and its potential applications in investigating and predicting the toxic mechanisms of xenobiotics, this opening chapter describes the logical relations between conformational behavior, physicochemical properties and binding spaces, which are seen as the three key elements composing the pharmacological space. While the concept of conformational space is routinely used to encode molecular flexibility, the concepts of property spaces and, particularly, of binding spaces are more innovative. Indeed, their descriptors can find fruitful applications (a) in describing the dynamic adaptability a given ligand experiences when inserted into a specific environment, and (b) in parameterizing the flexibility a ligand retains when bound to a biological target. Overall, these descriptors can conveniently account for the often disregarded entropic factors and as such they prove successful when inserted in ligand- or structure-based predictive models. Notably, and although binding space parameters can clearly be derived from MD simulations, the chapter will illustrate how docking calculations, despite their static nature, are able to evaluate ligand's flexibility by analyzing several poses for each ligand. Such an approach, which represents the founding core of the binding space concept, can find various applications in which the related descriptors show an impressive enhancing effect on the statistical performances of the resulting predictive models.
Collapse
Affiliation(s)
- Giulio Vistoli
- Dipartimento di Scienze Farmaceutiche Università degli Studi di Milano, Milan, Italy.
| | - Alessandro Pedretti
- Dipartimento di Scienze Farmaceutiche Università degli Studi di Milano, Milan, Italy
| | - Angelica Mazzolari
- Dipartimento di Scienze Farmaceutiche Università degli Studi di Milano, Milan, Italy
| | | |
Collapse
|
26
|
de Vries SJ, Zacharias M. Fast and accurate grid representations for atom-based docking with partner flexibility. J Comput Chem 2017; 38:1538-1546. [DOI: 10.1002/jcc.24795] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2016] [Revised: 01/18/2017] [Accepted: 01/19/2017] [Indexed: 12/12/2022]
Affiliation(s)
- Sjoerd J. de Vries
- MTi, UMR-S 973, Physics Department T38; Technische Universität München; James-Franck-Strasse 1 85748 Garching Germany
| | - Martin Zacharias
- MTi, UMR-S 973, Physics Department T38; Technische Universität München; James-Franck-Strasse 1 85748 Garching Germany
| |
Collapse
|
27
|
Pfeiffenberger E, Chaleil RA, Moal IH, Bates PA. A machine learning approach for ranking clusters of docked protein-protein complexes by pairwise cluster comparison. Proteins 2017; 85:528-543. [PMID: 27935158 PMCID: PMC5396268 DOI: 10.1002/prot.25218] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Revised: 11/14/2016] [Accepted: 11/21/2016] [Indexed: 01/28/2023]
Abstract
Reliable identification of near-native poses of docked protein-protein complexes is still an unsolved problem. The intrinsic heterogeneity of protein-protein interactions is challenging for traditional biophysical or knowledge based potentials and the identification of many false positive binding sites is not unusual. Often, ranking protocols are based on initial clustering of docked poses followed by the application of an energy function to rank each cluster according to its lowest energy member. Here, we present an approach of cluster ranking based not only on one molecular descriptor (e.g., an energy function) but also employing a large number of descriptors that are integrated in a machine learning model, whereby, an extremely randomized tree classifier based on 109 molecular descriptors is trained. The protocol is based on first locally enriching clusters with additional poses, the clusters are then characterized using features describing the distribution of molecular descriptors within the cluster, which are combined into a pairwise cluster comparison model to discriminate near-native from incorrect clusters. The results show that our approach is able to identify clusters containing near-native protein-protein complexes. In addition, we present an analysis of the descriptors with respect to their power to discriminate near native from incorrect clusters and how data transformations and recursive feature elimination can improve the ranking performance. Proteins 2017; 85:528-543. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
| | | | - Iain H. Moal
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute, Wellcome Trust Genome Campus, HinxtonCambridgeCB10 1SDUK
| | - Paul A. Bates
- Biomolecular Modelling LaboratoryThe Francis Crick InstituteLondonNW1 1ATUK
| |
Collapse
|
28
|
Molecular Simulations of Disulfide-Rich Venom Peptides with Ion Channels and Membranes. Molecules 2017; 22:molecules22030362. [PMID: 28264446 PMCID: PMC6155311 DOI: 10.3390/molecules22030362] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Revised: 02/23/2017] [Accepted: 02/24/2017] [Indexed: 12/12/2022] Open
Abstract
Disulfide-rich peptides isolated from the venom of arthropods and marine animals are a rich source of potent and selective modulators of ion channels. This makes these peptides valuable lead molecules for the development of new drugs to treat neurological disorders. Consequently, much effort goes into understanding their mechanism of action. This paper presents an overview of how molecular simulations have been used to study the interactions of disulfide-rich venom peptides with ion channels and membranes. The review is focused on the use of docking, molecular dynamics simulations, and free energy calculations to (i) predict the structure of peptide-channel complexes; (ii) calculate binding free energies including the effect of peptide modifications; and (iii) study the membrane-binding properties of disulfide-rich venom peptides. The review concludes with a summary and outlook.
Collapse
|
29
|
Yan Y, Wen Z, Wang X, Huang SY. Addressing recent docking challenges: A hybrid strategy to integrate template-based and free protein-protein docking. Proteins 2017; 85:497-512. [PMID: 28026062 DOI: 10.1002/prot.25234] [Citation(s) in RCA: 101] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2016] [Revised: 12/15/2016] [Accepted: 12/16/2016] [Indexed: 12/23/2022]
Abstract
Protein-protein docking is an important computational tool for predicting protein-protein interactions. With the rapid development of proteomics projects, more and more experimental binding information ranging from mutagenesis data to three-dimensional structures of protein complexes are becoming available. Therefore, how to appropriately incorporate the biological information into traditional ab initio docking has been an important issue and challenge in the field of protein-protein docking. To address these challenges, we have developed a Hybrid DOCKing protocol of template-based and template-free approaches, referred to as HDOCK. The basic procedure of HDOCK is to model the structures of individual components based on the template complex by a template-based method if a template is available; otherwise, the component structures will be modeled based on monomer proteins by regular homology modeling. Then, the complex structure of the component models is predicted by traditional protein-protein docking. With the HDOCK protocol, we have participated in the CPARI experiment for rounds 28-35. Out of the 25 CASP-CAPRI targets for oligomer modeling, our HDOCK protocol predicted correct models for 16 targets, ranking one of the top algorithms in this challenge. Our docking method also made correct predictions on other CAPRI challenges such as protein-peptide binding for 6 out of 8 targets and water predictions for 2 out of 2 targets. The advantage of our hybrid docking approach over pure template-based docking was further confirmed by a comparative evaluation on 20 CASP-CAPRI targets. Proteins 2017; 85:497-512. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan Hubei, 430074, People's Republic of China
| | - Zeyu Wen
- School of Physics, Huazhong University of Science and Technology, Wuhan Hubei, 430074, People's Republic of China
| | - Xinxiang Wang
- School of Physics, Huazhong University of Science and Technology, Wuhan Hubei, 430074, People's Republic of China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan Hubei, 430074, People's Republic of China
| |
Collapse
|
30
|
Sasse A, de Vries SJ, Schindler CEM, de Beauchêne IC, Zacharias M. Rapid Design of Knowledge-Based Scoring Potentials for Enrichment of Near-Native Geometries in Protein-Protein Docking. PLoS One 2017; 12:e0170625. [PMID: 28118389 PMCID: PMC5261736 DOI: 10.1371/journal.pone.0170625] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2016] [Accepted: 01/07/2017] [Indexed: 01/15/2023] Open
Abstract
Protein-protein docking protocols aim to predict the structures of protein-protein complexes based on the structure of individual partners. Docking protocols usually include several steps of sampling, clustering, refinement and re-scoring. The scoring step is one of the bottlenecks in the performance of many state-of-the-art protocols. The performance of scoring functions depends on the quality of the generated structures and its coupling to the sampling algorithm. A tool kit, GRADSCOPT (GRid Accelerated Directly SCoring OPTimizing), was designed to allow rapid development and optimization of different knowledge-based scoring potentials for specific objectives in protein-protein docking. Different atomistic and coarse-grained potentials can be created by a grid-accelerated directly scoring dependent Monte-Carlo annealing or by a linear regression optimization. We demonstrate that the scoring functions generated by our approach are similar to or even outperform state-of-the-art scoring functions for predicting near-native solutions. Of additional importance, we find that potentials specifically trained to identify the native bound complex perform rather poorly on identifying acceptable or medium quality (near-native) solutions. In contrast, atomistic long-range contact potentials can increase the average fraction of near-native poses by up to a factor 2.5 in the best scored 1% decoys (compared to existing scoring), emphasizing the need of specific docking potentials for different steps in the docking protocol.
Collapse
Affiliation(s)
- Alexander Sasse
- Physik Department T38, Technische Universität München, James-Franck-Straße, Garching, Germany
| | - Sjoerd J. de Vries
- Physik Department T38, Technische Universität München, James-Franck-Straße, Garching, Germany
| | | | | | - Martin Zacharias
- Physik Department T38, Technische Universität München, James-Franck-Straße, Garching, Germany
- * E-mail:
| |
Collapse
|
31
|
Affiliation(s)
- Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, Maryland, United States of America
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
- * E-mail:
| | - Jason A. Papin
- University of Virginia, Charlottesville, Virginia, United States of America
| | - Ilya Vakser
- Center for Computational Biology, The University of Kansas, Lawrence, Kansas, United States of America
| |
Collapse
|
32
|
Vajda S, Yueh C, Beglov D, Bohnuud T, Mottarella SE, Xia B, Hall DR, Kozakov D. New additions to the ClusPro server motivated by CAPRI. Proteins 2017; 85:435-444. [PMID: 27936493 DOI: 10.1002/prot.25219] [Citation(s) in RCA: 418] [Impact Index Per Article: 52.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2016] [Revised: 11/28/2016] [Accepted: 11/29/2016] [Indexed: 12/12/2022]
Abstract
The heavily used protein-protein docking server ClusPro performs three computational steps as follows: (1) rigid body docking, (2) RMSD based clustering of the 1000 lowest energy structures, and (3) the removal of steric clashes by energy minimization. In response to challenges encountered in recent CAPRI targets, we added three new options to ClusPro. These are (1) accounting for small angle X-ray scattering data in docking; (2) considering pairwise interaction data as restraints; and (3) enabling discrimination between biological and crystallographic dimers. In addition, we have developed an extremely fast docking algorithm based on 5D rotational manifold FFT, and an algorithm for docking flexible peptides that include known sequence motifs. We feel that these developments will further improve the utility of ClusPro. However, CAPRI emphasized several shortcomings of the current server, including the problem of selecting the right energy parameters among the five options provided, and the problem of selecting the best models among the 10 generated for each parameter set. In addition, results convinced us that further development is needed for docking homology models. Finally, we discuss the difficulties we have encountered when attempting to develop a refinement algorithm that would be computationally efficient enough for inclusion in a heavily used server. Proteins 2017; 85:435-444. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, 02215.,Department of Chemistry, Boston University, Boston, Massachusetts, 02215
| | - Christine Yueh
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, 02215
| | - Dmitri Beglov
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, 02215
| | - Tanggis Bohnuud
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, 02215.,Program in Bioinformatics, Boston University, Boston, Massachusetts, 02215
| | - Scott E Mottarella
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, 02215.,Program in Bioinformatics, Boston University, Boston, Massachusetts, 02215
| | - Bing Xia
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, 02215
| | | | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, New York.,Laufer Center for Physical and Quantitative Biology, Stony Brook University, New York
| |
Collapse
|
33
|
Hamzeh-Mivehroud M, Sokouti B, Dastmalchi S. Molecular Docking at a Glance. Oncology 2017. [DOI: 10.4018/978-1-5225-0549-5.ch030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The current chapter introduces different aspects of molecular docking technique in order to give an overview to the readers about the topics which will be dealt with throughout this volume. Like many other fields of science, molecular docking studies has experienced a lagging period of slow and steady increase in terms of acquiring attention of scientific community as well as its frequency of application, followed by a pronounced era of exponential expansion in theory, methodology, areas of application and performance due to developments in related technologies such as computational resources and theoretical as well as experimental biophysical methods. In the following sections the evolution of molecular docking will be reviewed and its different components including methods, search algorithms, scoring functions, validation of the methods, and area of applications plus few case studies will be touched briefly.
Collapse
Affiliation(s)
| | | | - Siavoush Dastmalchi
- Biotechnology Research Center, Tabriz University of Medical Sciences, Iran & School of Pharmacy, Tabriz University of Medical Sciences, Iran
| |
Collapse
|
34
|
Spyrakis F, Cozzini P, Eugene Kellogg G. Applying Computational Scoring Functions to Assess Biomolecular Interactions in Food Science: Applications to the Estrogen Receptors. NUCLEAR RECEPTOR RESEARCH 2016. [DOI: 10.11131/2016/101202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Francesca Spyrakis
- University of Parma, Department of Food Science, Molecular Modelling Laboratory, Parma, Italy
| | - Pietro Cozzini
- University of Parma, Department of Food Science, Molecular Modelling Laboratory, Parma, Italy
| | - Glen Eugene Kellogg
- Virginia Commonwealth University, Department of Medicinal Chemistry & Institute for Structural Biology, Drug Discovery and Development Richmond, Virginia, USA
| |
Collapse
|
35
|
Schindler C, de Vries S, Sasse A, Zacharias M. SAXS Data Alone can Generate High-Quality Models of Protein-Protein Complexes. Structure 2016; 24:1387-1397. [DOI: 10.1016/j.str.2016.06.007] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2016] [Revised: 06/08/2016] [Accepted: 06/08/2016] [Indexed: 11/29/2022]
|
36
|
Im W, Liang J, Olson A, Zhou HX, Vajda S, Vakser IA. Challenges in structural approaches to cell modeling. J Mol Biol 2016; 428:2943-64. [PMID: 27255863 PMCID: PMC4976022 DOI: 10.1016/j.jmb.2016.05.024] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2016] [Revised: 05/19/2016] [Accepted: 05/24/2016] [Indexed: 11/17/2022]
Abstract
Computational modeling is essential for structural characterization of biomolecular mechanisms across the broad spectrum of scales. Adequate understanding of biomolecular mechanisms inherently involves our ability to model them. Structural modeling of individual biomolecules and their interactions has been rapidly progressing. However, in terms of the broader picture, the focus is shifting toward larger systems, up to the level of a cell. Such modeling involves a more dynamic and realistic representation of the interactomes in vivo, in a crowded cellular environment, as well as membranes and membrane proteins, and other cellular components. Structural modeling of a cell complements computational approaches to cellular mechanisms based on differential equations, graph models, and other techniques to model biological networks, imaging data, etc. Structural modeling along with other computational and experimental approaches will provide a fundamental understanding of life at the molecular level and lead to important applications to biology and medicine. A cross section of diverse approaches presented in this review illustrates the developing shift from the structural modeling of individual molecules to that of cell biology. Studies in several related areas are covered: biological networks; automated construction of three-dimensional cell models using experimental data; modeling of protein complexes; prediction of non-specific and transient protein interactions; thermodynamic and kinetic effects of crowding; cellular membrane modeling; and modeling of chromosomes. The review presents an expert opinion on the current state-of-the-art in these various aspects of structural modeling in cellular biology, and the prospects of future developments in this emerging field.
Collapse
Affiliation(s)
- Wonpil Im
- Center for Computational Biology and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS 66047, United States.
| | - Jie Liang
- Department of Bioengineering, University of Illinois at Chicago, Chicago, IL 60607, United States.
| | - Arthur Olson
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, United States.
| | - Huan-Xiang Zhou
- Department of Physics and Institute of Molecular Biophysics, Florida State University, Tallahassee, FL 32306, United States.
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, MA 02215, United States.
| | - Ilya A Vakser
- Center for Computational Biology and Department of Molecular Biosciences, The University of Kansas, Lawrence, KS 66047, United States.
| |
Collapse
|
37
|
Mamonov AB, Moghadasi M, Mirzaei H, Zarbafian S, Grove LE, Bohnuud T, Vakili P, Paschalidis IC, Vajda S, Kozakov D. Focused grid-based resampling for protein docking and mapping. J Comput Chem 2016; 37:961-70. [PMID: 26837000 PMCID: PMC4814242 DOI: 10.1002/jcc.24273] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2015] [Revised: 08/31/2015] [Accepted: 09/26/2015] [Indexed: 12/27/2022]
Abstract
The fast Fourier transform (FFT) sampling algorithm has been used with success in application to protein-protein docking and for protein mapping, the latter docking a variety of small organic molecules for the identification of binding hot spots on the target protein. Here we explore the local rather than global usage of the FFT sampling approach in docking applications. If the global FFT based search yields a near-native cluster of docked structures for a protein complex, then focused resampling of the cluster generally leads to a substantial increase in the number of conformations close to the native structure. In protein mapping, focused resampling of the selected hot spot regions generally reveals further hot spots that, while not as strong as the primary hot spots, also contribute to ligand binding. The detection of additional ligand binding regions is shown by the improved overlap between hot spots and bound ligands.
Collapse
Affiliation(s)
- Artem B. Mamonov
- Department of Biomedical Engineering, Boston University, Boston MA 02215
| | - Mohammad Moghadasi
- Center for Information and Systems Engineering, Boston University, Boston, MA 02215
| | - Hanieh Mirzaei
- Center for Information and Systems Engineering, Boston University, Boston, MA 02215
| | - Shahrooz Zarbafian
- Department of Mechanical Engineering, Boston University, Boston MA 02215
| | - Laurie E. Grove
- Department of Sciences, Wentworth Institute of Technology, Boston, MA 02115, USA
| | - Tanggis Bohnuud
- Department of Biomedical Engineering, Boston University, Boston MA 02215
| | - Pirooz Vakili
- Center for Information and Systems Engineering, Boston University, Boston, MA 02215
- Department of Mechanical Engineering, Boston University, Boston MA 02215
| | - Ioannis Ch. Paschalidis
- Center for Information and Systems Engineering, Boston University, Boston, MA 02215
- Department of Electrical and Computer Engineering, Boston University, Boston MA 02215
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston MA 02215
- Center for Information and Systems Engineering, Boston University, Boston, MA 02215
- Department of Chemistry, Boston University, Boston MA 02215
| | - Dima Kozakov
- Department of Biomedical Engineering, Boston University, Boston MA 02215
- Departemnt of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, 11790
| |
Collapse
|
38
|
Rigid-Docking Approaches to Explore Protein-Protein Interaction Space. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2016; 160:33-55. [PMID: 27830312 DOI: 10.1007/10_2016_41] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Protein-protein interactions play core roles in living cells, especially in the regulatory systems. As information on proteins has rapidly accumulated on publicly available databases, much effort has been made to obtain a better picture of protein-protein interaction networks using protein tertiary structure data. Predicting relevant interacting partners from their tertiary structure is a challenging task and computer science methods have the potential to assist with this. Protein-protein rigid docking has been utilized by several projects, docking-based approaches having the advantages that they can suggest binding poses of predicted binding partners which would help in understanding the interaction mechanisms and that comparing docking results of both non-binders and binders can lead to understanding the specificity of protein-protein interactions from structural viewpoints. In this review we focus on explaining current computational prediction methods to predict pairwise direct protein-protein interactions that form protein complexes.
Collapse
|
39
|
Dawson WK, Bujnicki JM. Computational modeling of RNA 3D structures and interactions. Curr Opin Struct Biol 2015; 37:22-8. [PMID: 26689764 DOI: 10.1016/j.sbi.2015.11.007] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Revised: 11/11/2015] [Accepted: 11/12/2015] [Indexed: 11/25/2022]
Abstract
RNA molecules have key functions in cellular processes beyond being carriers of protein-coding information. These functions are often dependent on the ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is difficult, which has prompted the development of computational methods for structure prediction from sequence. Recent progress in 3D structure modeling of RNA and emerging approaches for predicting RNA interactions with ions, ligands and proteins have been stimulated by successes in protein 3D structure modeling.
Collapse
Affiliation(s)
- Wayne K Dawson
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, ul. Ks. Trojdena 4, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, ul. Ks. Trojdena 4, 02-109 Warsaw, Poland; Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, 61-614 Poznan, Poland.
| |
Collapse
|
40
|
Deng H, Jia Y, Zhang Y. 3DRobot: automated generation of diverse and well-packed protein structure decoys. Bioinformatics 2015; 32:378-87. [PMID: 26471454 DOI: 10.1093/bioinformatics/btv601] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Accepted: 10/10/2015] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Computationally generated non-native protein structure conformations (or decoys) are often used for designing protein folding simulation methods and force fields. However, almost all the decoy sets currently used in literature suffer from uneven root mean square deviation (RMSD) distribution with bias to non-protein like hydrogen-bonding and compactness patterns. Meanwhile, most protein decoy sets are pre-calculated and there is a lack of methods for automated generation of high-quality decoys for any target proteins. RESULTS We developed a new algorithm, 3DRobot, to create protein structure decoys by free fragment assembly with enhanced hydrogen-bonding and compactness interactions. The method was benchmarked with three widely used decoy sets from ab initio folding and comparative modeling simulations. The decoys generated by 3DRobot are shown to have significantly enhanced diversity and evenness with a continuous distribution in the RMSD space. The new energy terms introduced in 3DRobot improve the hydrogen-bonding network and compactness of decoys, which eliminates the possibility of native structure recognition by trivial potentials. Algorithms that can automatically create such diverse and well-packed non-native conformations from any protein structure should have a broad impact on the development of advanced protein force field and folding simulation methods. AVAILIABLITY AND IMPLEMENTATION: http://zhanglab.ccmb.med.umich.edu/3DRobot/ CONTACT jiay@phy.ccnu.edu.cn; zhng@umich.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Haiyou Deng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 45108, USA, Department of Physics and Institute of Biophysics, Central China Normal University, Wuhan 430079, China and
| | - Ya Jia
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 45108, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 45108, USA, Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 45108, USA
| |
Collapse
|
41
|
Exploring the potential of global protein–protein docking: an overview and critical assessment of current programs for automatic ab initio docking. Drug Discov Today 2015; 20:969-77. [DOI: 10.1016/j.drudis.2015.03.007] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2014] [Revised: 02/24/2015] [Accepted: 03/13/2015] [Indexed: 12/24/2022]
|
42
|
Kirys T, Ruvinsky AM, Singla D, Tuzikov AV, Kundrotas PJ, Vakser IA. Simulated unbound structures for benchmarking of protein docking in the DOCKGROUND resource. BMC Bioinformatics 2015; 16:243. [PMID: 26227548 PMCID: PMC4521349 DOI: 10.1186/s12859-015-0672-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Accepted: 07/10/2015] [Indexed: 11/10/2022] Open
Abstract
Background Proteins play an important role in biological processes in living organisms. Many protein functions are based on interaction with other proteins. The structural information is important for adequate description of these interactions. Sets of protein structures determined in both bound and unbound states are essential for benchmarking of the docking procedures. However, the number of such proteins in PDB is relatively small. A radical expansion of such sets is possible if the unbound structures are computationally simulated. Results The Dockground public resource provides data to improve our understanding of protein–protein interactions and to assist in the development of better tools for structural modeling of protein complexes, such as docking algorithms and scoring functions. A large set of simulated unbound protein structures was generated from the bound structures. The modeling protocol was based on 1 ns Langevin dynamics simulation. The simulated structures were validated on the ensemble of experimentally determined unbound and bound structures. The set is intended for large scale benchmarking of docking algorithms and scoring functions. Conclusions A radical expansion of the unbound protein docking benchmark set was achieved by simulating the unbound structures. The simulated unbound structures were selected according to criteria from systematic comparison of experimentally determined bound and unbound structures. The set is publicly available at http://dockground.compbio.ku.edu.
Collapse
Affiliation(s)
- Tatsiana Kirys
- Center for Computational Biology, The University of Kansas, Lawrence, KS, 66047, USA. .,United Institute of Informatics Problems, National Academy of Sciences, 220012, Minsk, Belarus.
| | - Anatoly M Ruvinsky
- Center for Computational Biology, The University of Kansas, Lawrence, KS, 66047, USA. .,Schrödinger, Inc., Cambridge, MA, 02142, USA.
| | - Deepak Singla
- Center for Computational Biology, The University of Kansas, Lawrence, KS, 66047, USA.
| | - Alexander V Tuzikov
- United Institute of Informatics Problems, National Academy of Sciences, 220012, Minsk, Belarus.
| | - Petras J Kundrotas
- Center for Computational Biology, The University of Kansas, Lawrence, KS, 66047, USA.
| | - Ilya A Vakser
- Center for Computational Biology, The University of Kansas, Lawrence, KS, 66047, USA. .,Department of Molecular Biosciences, The University of Kansas, Lawrence, KS, 66045, USA.
| |
Collapse
|
43
|
Vakser IA. Protein-protein docking: from interaction to interactome. Biophys J 2015; 107:1785-1793. [PMID: 25418159 DOI: 10.1016/j.bpj.2014.08.033] [Citation(s) in RCA: 204] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2014] [Revised: 08/17/2014] [Accepted: 08/27/2014] [Indexed: 12/29/2022] Open
Abstract
The protein-protein docking problem is one of the focal points of activity in computational biophysics and structural biology. The three-dimensional structure of a protein-protein complex, generally, is more difficult to determine experimentally than the structure of an individual protein. Adequate computational techniques to model protein interactions are important because of the growing number of known protein structures, particularly in the context of structural genomics. Docking offers tools for fundamental studies of protein interactions and provides a structural basis for drug design. Protein-protein docking is the prediction of the structure of the complex, given the structures of the individual proteins. In the heart of the docking methodology is the notion of steric and physicochemical complementarity at the protein-protein interface. Originally, mostly high-resolution, experimentally determined (primarily by x-ray crystallography) protein structures were considered for docking. However, more recently, the focus has been shifting toward lower-resolution modeled structures. Docking approaches have to deal with the conformational changes between unbound and bound structures, as well as the inaccuracies of the interacting modeled structures, often in a high-throughput mode needed for modeling of large networks of protein interactions. The growing number of docking developers is engaged in the community-wide assessments of predictive methodologies. The development of more powerful and adequate docking approaches is facilitated by rapidly expanding information and data resources, growing computational capabilities, and a deeper understanding of the fundamental principles of protein interactions.
Collapse
Affiliation(s)
- Ilya A Vakser
- Center for Bioinformatics and Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas.
| |
Collapse
|
44
|
Krull F, Korff G, Elghobashi-Meinhardt N, Knapp EW. ProPairs: A Data Set for Protein–Protein Docking. J Chem Inf Model 2015; 55:1495-507. [DOI: 10.1021/acs.jcim.5b00082] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- Florian Krull
- Institute of Chemistry and
Biochemistry, Freie Universität Berlin, Fabeckstrasse 36a, 14195 Berlin, Germany
| | - Gerrit Korff
- Institute of Chemistry and
Biochemistry, Freie Universität Berlin, Fabeckstrasse 36a, 14195 Berlin, Germany
| | - Nadia Elghobashi-Meinhardt
- Institute of Chemistry and
Biochemistry, Freie Universität Berlin, Fabeckstrasse 36a, 14195 Berlin, Germany
| | - Ernst-Walter Knapp
- Institute of Chemistry and
Biochemistry, Freie Universität Berlin, Fabeckstrasse 36a, 14195 Berlin, Germany
| |
Collapse
|
45
|
van Zundert GCP, Bonvin AMJJ. DisVis: quantifying and visualizing accessible interaction space of distance-restrained biomolecular complexes. Bioinformatics 2015; 31:3222-4. [PMID: 26026169 PMCID: PMC4576694 DOI: 10.1093/bioinformatics/btv333] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Accepted: 05/23/2015] [Indexed: 11/15/2022] Open
Abstract
Summary: We present DisVis, a Python package and command line tool to calculate the reduced accessible interaction space of distance-restrained binary protein complexes, allowing for direct visualization and quantification of the information content of the distance restraints. The approach is general and can also be used as a knowledge-based distance energy term in FFT-based docking directly during the sampling stage. Availability and implementation: The source code with documentation is freely available from https://github.com/haddocking/disvis. Contact:a.m.j.j.bonvin@uu.nl Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- G C P van Zundert
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht 3584CH, The Netherlands
| | - A M J J Bonvin
- Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht 3584CH, The Netherlands
| |
Collapse
|
46
|
Yuriev E, Holien J, Ramsland PA. Improvements, trends, and new ideas in molecular docking: 2012-2013 in review. J Mol Recognit 2015; 28:581-604. [PMID: 25808539 DOI: 10.1002/jmr.2471] [Citation(s) in RCA: 168] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2014] [Revised: 01/16/2015] [Accepted: 02/05/2015] [Indexed: 12/11/2022]
Abstract
Molecular docking is a computational method for predicting the placement of ligands in the binding sites of their receptor(s). In this review, we discuss the methodological developments that occurred in the docking field in 2012 and 2013, with a particular focus on the more difficult aspects of this computational discipline. The main challenges and therefore focal points for developments in docking, covered in this review, are receptor flexibility, solvation, scoring, and virtual screening. We specifically deal with such aspects of molecular docking and its applications as selection criteria for constructing receptor ensembles, target dependence of scoring functions, integration of higher-level theory into scoring, implicit and explicit handling of solvation in the binding process, and comparison and evaluation of docking and scoring methods.
Collapse
Affiliation(s)
- Elizabeth Yuriev
- Medicinal Chemistry, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, 3052, Australia
| | - Jessica Holien
- ACRF Rational Drug Discovery Centre and Structural Biology Laboratory, St. Vincent's Institute of Medical Research, Fitzroy, Victoria, 3065, Australia
| | - Paul A Ramsland
- Centre for Biomedical Research, Burnet Institute, Melbourne, Victoria, 3004, Australia.,Department of Surgery Austin Health, University of Melbourne, Melbourne, Victoria, 3084, Australia.,Department of Immunology, Monash University, Alfred Medical Research and Education Precinct, Melbourne, Victoria, 3004, Australia.,School of Biomedical Sciences, CHIRI Biosciences, Curtin University, Perth, Western Australia, 6845, Australia
| |
Collapse
|
47
|
Lensink MF, Wodak SJ. Score_set: A CAPRI benchmark for scoring protein complexes. Proteins 2014; 82:3163-9. [DOI: 10.1002/prot.24678] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Revised: 08/05/2014] [Accepted: 08/22/2014] [Indexed: 12/26/2022]
Affiliation(s)
- Marc F. Lensink
- CNRS USR3078; University Lille North of France, Parc de la Haute Borne; F-59658 Villeneuve d'Ascq France
| | - Shoshana J. Wodak
- Structural Biology Program; Hospital for Sick Children; Toronto Ontario M5G 1X8 Canada
- Department of Biochemistry; University of Toronto; Ontario Canada
- Department of Molecular Genetics; University of Toronto; Ontario Canada
| |
Collapse
|
48
|
Huang SY. Search strategies and evaluation in protein–protein docking: principles, advances and challenges. Drug Discov Today 2014; 19:1081-96. [DOI: 10.1016/j.drudis.2014.02.005] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2013] [Revised: 01/04/2014] [Accepted: 02/24/2014] [Indexed: 01/10/2023]
|
49
|
Grinter SZ, Zou X. Challenges, applications, and recent advances of protein-ligand docking in structure-based drug design. Molecules 2014; 19:10150-76. [PMID: 25019558 PMCID: PMC6270832 DOI: 10.3390/molecules190710150] [Citation(s) in RCA: 138] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2014] [Revised: 06/13/2014] [Accepted: 07/02/2014] [Indexed: 11/16/2022] Open
Abstract
The docking methods used in structure-based virtual database screening offer the ability to quickly and cheaply estimate the affinity and binding mode of a ligand for the protein receptor of interest, such as a drug target. These methods can be used to enrich a database of compounds, so that more compounds that are subsequently experimentally tested are found to be pharmaceutically interesting. In addition, like all virtual screening methods used for drug design, structure-based virtual screening can focus on curated libraries of synthesizable compounds, helping to reduce the expense of subsequent experimental verification. In this review, we introduce the protein-ligand docking methods used for structure-based drug design and other biological applications. We discuss the fundamental challenges facing these methods and some of the current methodological topics of interest. We also discuss the main approaches for applying protein-ligand docking methods. We end with a discussion of the challenging aspects of evaluating or benchmarking the accuracy of docking methods for their improvement, and discuss future directions.
Collapse
Affiliation(s)
- Sam Z Grinter
- Informatics Institute, University of Missouri, Columbia, MO 65211, USA.
| | - Xiaoqin Zou
- Informatics Institute, University of Missouri, Columbia, MO 65211, USA.
| |
Collapse
|
50
|
Bogorad AM, Xia B, Sandor DG, Mamonov AB, Cafarella TR, Jehle S, Vajda S, Kozakov D, Marintchev A. Insights into the architecture of the eIF2Bα/β/δ regulatory subcomplex. Biochemistry 2014; 53:3432-45. [PMID: 24811713 PMCID: PMC4045321 DOI: 10.1021/bi500346u] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Eukaryotic translation initiation factor 2B (eIF2B), the guanine nucleotide exchange factor for the G-protein eIF2, is one of the main targets for the regulation of protein synthesis. The eIF2B activity is inhibited in response to a wide range of stress factors and diseases, including viral infections, hypoxia, nutrient starvation, and heme deficiency, collectively known as the integrated stress response. eIF2B has five subunits (α-ε). The α, β, and δ subunits are homologous to each other and form the eIF2B regulatory subcomplex, which is believed to be a trimer consisting of monomeric α, β, and δ subunits. Here we use a combination of biophysical methods, site-directed mutagenesis, and bioinformatics to show that the human eIF2Bα subunit is in fact a homodimer, at odds with the current trimeric model for the eIF2Bα/β/δ regulatory complex. eIF2Bα dimerizes using the same interface that is found in the homodimeric archaeal eIF2Bα/β/δ homolog aIF2B and related metabolic enzymes. We also present evidence that the eIF2Bβ/δ binding interface is similar to that in the eIF2Bα2 homodimer. Mutations at the predicted eIF2Bβ/δ dimer interface cause genetic neurological disorders in humans. We propose that the eIF2B regulatory subcomplex is an α2β2δ2 hexamer, composed of one α2 homodimer and two βδ heterodimers. Our results offer novel insights into the architecture of eIF2B and its interactions with the G-protein eIF2.
Collapse
Affiliation(s)
- Andrew M Bogorad
- Department of Physiology and Biophysics, Boston University School of Medicine , Boston, Massachusetts 02118, United States
| | | | | | | | | | | | | | | | | |
Collapse
|