1
|
Statistical potentials from the Gaussian scaling behaviour of chain fragments buried within protein globules. PLoS One 2022; 17:e0254969. [PMID: 35085247 PMCID: PMC8794220 DOI: 10.1371/journal.pone.0254969] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 10/28/2021] [Indexed: 11/19/2022] Open
Abstract
Knowledge-based approaches use the statistics collected from protein data-bank structures to estimate effective interaction potentials between amino acid pairs. Empirical relations are typically employed that are based on the crucial choice of a reference state associated to the null interaction case. Despite their significant effectiveness, the physical interpretation of knowledge-based potentials has been repeatedly questioned, with no consensus on the choice of the reference state. Here we use the fact that the Flory theorem, originally derived for chains in a dense polymer melt, holds also for chain fragments within the core of globular proteins, if the average over buried fragments collected from different non-redundant native structures is considered. After verifying that the ensuing Gaussian statistics, a hallmark of effectively non-interacting polymer chains, holds for a wide range of fragment lengths, although with significant deviations at short spatial scales, we use it to define a ‘bona fide’ reference state. Notably, despite the latter does depend on fragment length, deviations from it do not. This allows to estimate an effective interaction potential which is not biased by the presence of correlations due to the connectivity of the protein chain. We show how different sequence-independent effective statistical potentials can be derived using this approach by coarse-graining the protein representation at varying levels. The possibility of defining sequence-dependent potentials is explored.
Collapse
|
2
|
Ochoa R, Soler MA, Gladich I, Battisti A, Minovski N, Rodriguez A, Fortuna S, Cossio P, Laio A. Computational Evolution Protocol for Peptide Design. Methods Mol Biol 2022; 2405:335-359. [PMID: 35298821 DOI: 10.1007/978-1-0716-1855-4_16] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Computational peptide design is useful for therapeutics, diagnostics, and vaccine development. To select the most promising peptide candidates, the key is describing accurately the peptide-target interactions at the molecular level. We here review a computational peptide design protocol whose key feature is the use of all-atom explicit solvent molecular dynamics for describing the different peptide-target complexes explored during the optimization. We describe the milestones behind the development of this protocol, which is now implemented in an open-source code called PARCE. We provide a basic tutorial to run the code for an antibody fragment design example. Finally, we describe three additional applications of the method to design peptides for different targets, illustrating the broad scope of the proposed approach.
Collapse
Affiliation(s)
- Rodrigo Ochoa
- Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia, Medellin, Colombia
| | | | - Ivan Gladich
- Qatar Environment and Energy Research Institute, Hamad Bin Khalifa University, Doha, Qatar
- SISSA, Trieste, Italy
| | | | - Nikola Minovski
- Department of Chemical and Pharmaceutical Sciences, University of Trieste, Trieste, Italy
- Theory Department, Laboratory for Cheminformatics, National Institute of Chemistry, Ljubljana, Slovenia
| | - Alex Rodriguez
- The Abdus Salam International Centre for Theoretical Physics, Trieste, Italy
| | - Sara Fortuna
- Italian Institute of Technology (IIT), Genova, Italy
- Department of Chemical and Pharmaceutical Sciences, University of Trieste, Trieste, Italy
| | - Pilar Cossio
- Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia, Medellin, Colombia
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, Frankfurt am Main, Germany
| | - Alessandro Laio
- The Abdus Salam International Centre for Theoretical Physics, Trieste, Italy
- SISSA, Trieste, Italy
| |
Collapse
|
3
|
Wang W, Wang J, Li Z, Xu D, Shang Y. MUfoldQA_G: High-accuracy protein model QA via retraining and transformation. Comput Struct Biotechnol J 2021; 19:6282-6290. [PMID: 34900138 PMCID: PMC8636996 DOI: 10.1016/j.csbj.2021.11.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 11/10/2021] [Accepted: 11/14/2021] [Indexed: 11/21/2022] Open
Abstract
Protein tertiary structure prediction is an active research area and has attracted significant attention recently due to the success of AlphaFold from DeepMind. Methods capable of accurately evaluating the quality of predicted models are of great importance. In the past, although many model quality assessment (QA) methods have been developed, their accuracies are not consistently high across different QA performance metrics for diverse target proteins. In this paper, we propose MUfoldQA_G, a new multi-model QA method that aims at simultaneously optimizing Pearson correlation and average GDT-TS difference, two commonly used QA performance metrics. This method is based on two new algorithms MUfoldQA_Gp and MUfoldQA_Gr. MUfoldQA_Gp uses a new technique to combine information from protein templates and reference protein models to maximize the Pearson correlation QA metric. MUfoldQA_Gr employs a new machine learning technique that resamples training data and retrains adaptively to learn a consensus model that is better than naïve consensus while minimizing average GDT-TS difference. MUfoldQA_G uses a new method to combine the results of MUfoldQA_Gr and MUfoldQA_Gp so that the final QA prediction results achieve low average GDT-TS difference that is close to the results from MUfoldQA_Gr, while maintaining high Pearson correlation that is the same as the results from MUfoldQA_Gp. In CASP14 QA categories, MUfoldQA_G ranked No. 1 in Pearson correlation and No. 2 in average GDT-TS difference.
Collapse
Affiliation(s)
- Wenbo Wang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
| | - Junlin Wang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
| | - Zhaoyu Li
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| | - Yi Shang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
4
|
Ochoa R, Laskowski RA, Thornton JM, Cossio P. Impact of Structural Observables From Simulations to Predict the Effect of Single-Point Mutations in MHC Class II Peptide Binders. Front Mol Biosci 2021; 8:636562. [PMID: 34222328 PMCID: PMC8253603 DOI: 10.3389/fmolb.2021.636562] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Accepted: 02/15/2021] [Indexed: 11/23/2022] Open
Abstract
The prediction of peptide binders to Major Histocompatibility Complex (MHC) class II receptors is of great interest to study autoimmune diseases and for vaccine development. Most approaches predict the affinities using sequence-based models trained on experimental data and multiple alignments from known peptide substrates. However, detecting activity differences caused by single-point mutations is a challenging task. In this work, we used interactions calculated from simulations to build scoring matrices for quickly estimating binding differences by single-point mutations. We modelled a set of 837 peptides bound to an MHC class II allele, and optimized the sampling of the conformations using the Rosetta backrub method by comparing the results to molecular dynamics simulations. From the dynamic trajectories of each complex, we averaged and compared structural observables for each amino acid at each position of the 9°mer peptide core region. With this information, we generated the scoring-matrices to predict the sign of the binding differences. We then compared the performance of the best scoring-matrix to different computational methodologies that range in computational costs. Overall, the prediction of the activity differences caused by single mutated peptides was lower than 60% for all the methods. However, the developed scoring-matrix in combination with existing methods reports an increase in the performance, up to 86% with a scoring method that uses molecular dynamics.
Collapse
Affiliation(s)
- Rodrigo Ochoa
- Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia UdeA, Medellin, Colombia.,European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
| | - Roman A Laskowski
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
| | - Janet M Thornton
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
| | - Pilar Cossio
- Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia UdeA, Medellin, Colombia.,Department of Theoretical Biophysics, Max Planck Institute of Biophysics, Frankfurt am Main, Germany
| |
Collapse
|
5
|
Lans I, Palacio-Rodríguez K, Cavasotto CN, Cossio P. Flexi-pharma: a molecule-ranking strategy for virtual screening using pharmacophores from ligand-free conformational ensembles. J Comput Aided Mol Des 2020; 34:1063-1077. [PMID: 32656619 PMCID: PMC7449997 DOI: 10.1007/s10822-020-00329-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Accepted: 06/27/2020] [Indexed: 01/27/2023]
Abstract
Computer-aided strategies are useful for reducing the costs and increasing the success-rate in drug discovery. Among these strategies, methods based on pharmacophores (an ensemble of electronic and steric features representing the target active site) are efficient to implement over large compound libraries. However, traditional pharmacophore-based methods require knowledge of active compounds or ligand-receptor structures, and only few ones account for target flexibility. Here, we developed a pharmacophore-based virtual screening protocol, Flexi-pharma, that overcomes these limitations. The protocol uses molecular dynamics (MD) simulations to explore receptor flexibility, and performs a pharmacophore-based virtual screening over a set of MD conformations without requiring prior knowledge about known ligands or ligand-receptor structures for building the pharmacophores. The results from the different receptor conformations are combined using a "voting" approach, where a vote is given to each molecule that matches at least one pharmacophore from each MD conformation. Contrarily to other approaches that reduce the pharmacophore ensemble to some representative models and score according to the matching models or molecule conformers, the Flexi-pharma approach takes directly into account the receptor flexibility by scoring in regards to the receptor conformations. We tested the method over twenty systems, finding an enrichment of the dataset for 19 of them. Flexi-pharma is computationally efficient allowing for the screening of thousands of compounds in minutes on a single CPU core. Moreover, the ranking of molecules by vote is a general strategy that can be applied with any pharmacophore-filtering program.
Collapse
Affiliation(s)
- Isaias Lans
- Biophysics of Tropical Diseases Max Planck Tandem Group, University of Antioquia UdeA, Calle 70 No. 52-21, Medellín, Colombia
| | - Karen Palacio-Rodríguez
- Biophysics of Tropical Diseases Max Planck Tandem Group, University of Antioquia UdeA, Calle 70 No. 52-21, Medellín, Colombia
| | - Claudio N Cavasotto
- Computational Drug Design and Biomedical Informatics Laboratory, Translational Medicine Research Institute (IIMT), CONICET-Universidad Austral, Pilar, Buenos Aires, Argentina
- Facultad de Ciencias Biomédicas, and Facultad de Ingeniería, Universidad Austral, Pilar, Buenos Aires, Argentina
- Austral Institute for Applied Artificial Intelligence, Universidad Austral, Pilar, Buenos Aires, Argentina
| | - Pilar Cossio
- Biophysics of Tropical Diseases Max Planck Tandem Group, University of Antioquia UdeA, Calle 70 No. 52-21, Medellín, Colombia.
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, 60438, Frankfurt am Main, Germany.
| |
Collapse
|
6
|
Lans I, Anoz-Carbonell E, Palacio-Rodríguez K, Aínsa JA, Medina M, Cossio P. In silico discovery and biological validation of ligands of FAD synthase, a promising new antimicrobial target. PLoS Comput Biol 2020; 16:e1007898. [PMID: 32797038 PMCID: PMC7449411 DOI: 10.1371/journal.pcbi.1007898] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 08/26/2020] [Accepted: 07/09/2020] [Indexed: 01/06/2023] Open
Abstract
New treatments for diseases caused by antimicrobial-resistant microorganisms can be developed by identifying unexplored therapeutic targets and by designing efficient drug screening protocols. In this study, we have screened a library of compounds to find ligands for the flavin-adenine dinucleotide synthase (FADS) -a potential target for drug design against tuberculosis and pneumonia- by implementing a new and efficient virtual screening protocol. The protocol has been developed for the in silico search of ligands of unexplored therapeutic targets, for which limited information about ligands or ligand-receptor structures is available. It implements an integrative funnel-like strategy with filtering layers that increase in computational accuracy. The protocol starts with a pharmacophore-based virtual screening strategy that uses ligand-free receptor conformations from molecular dynamics (MD) simulations. Then, it performs a molecular docking stage using several docking programs and an exponential consensus ranking strategy. The last filter, samples the conformations of compounds bound to the target using MD simulations. The MD conformations are scored using several traditional scoring functions in combination with a newly-proposed score that takes into account the fluctuations of the molecule with a Morse-based potential. The protocol was optimized and validated using a compound library with known ligands of the Corynebacterium ammoniagenes FADS. Then, it was used to find new FADS ligands from a compound library of 14,000 molecules. A small set of 17 in silico filtered molecules were tested experimentally. We identified five inhibitors of the activity of the flavin adenylyl transferase module of the FADS, and some of them were able to inhibit growth of three bacterial species: C. ammoniagenes, Mycobacterium tuberculosis, and Streptococcus pneumoniae, where the last two are human pathogens. Overall, the results show that the integrative VS protocol is a cost-effective solution for the discovery of ligands of unexplored therapeutic targets.
Collapse
Affiliation(s)
- Isaias Lans
- Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia UdeA, Medellin, Colombia
| | - Ernesto Anoz-Carbonell
- Departamento de Bioquímica y Biología Molecular y Celular, Facultad de Ciencias, Universidad de Zaragoza, Spain
- Instituto de Biocomputación y Física de Sistemas Complejos (Unidades Asociadas BIFI-IQFR y CBsC-CSIC), Universidad de Zaragoza, Spain
- Grupo de Genética de Micobacterias, Departamento de Microbiología, Pediatría, Radiología y Salud Pública. Facultad de Medicina, Universidad de Zaragoza, Zaragoza, Spain
| | - Karen Palacio-Rodríguez
- Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia UdeA, Medellin, Colombia
| | - José Antonio Aínsa
- Instituto de Biocomputación y Física de Sistemas Complejos (Unidades Asociadas BIFI-IQFR y CBsC-CSIC), Universidad de Zaragoza, Spain
- Grupo de Genética de Micobacterias, Departamento de Microbiología, Pediatría, Radiología y Salud Pública. Facultad de Medicina, Universidad de Zaragoza, Zaragoza, Spain
- CIBER Enfermedades Respiratorias (CIBERES), Instituto de Salud Carlos III, Spain
| | - Milagros Medina
- Departamento de Bioquímica y Biología Molecular y Celular, Facultad de Ciencias, Universidad de Zaragoza, Spain
- Instituto de Biocomputación y Física de Sistemas Complejos (Unidades Asociadas BIFI-IQFR y CBsC-CSIC), Universidad de Zaragoza, Spain
| | - Pilar Cossio
- Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia UdeA, Medellin, Colombia
- Department of Theoretical Biophysics, Max Planck Institute of Biophysics, Frankfurt, Germany
- * E-mail:
| |
Collapse
|
7
|
Sanchez-Muñoz R, Perez-Mata E, Almagro L, Cusido RM, Bonfill M, Palazon J, Moyano E. A Novel Hydroxylation Step in the Taxane Biosynthetic Pathway: A New Approach to Paclitaxel Production by Synthetic Biology. Front Bioeng Biotechnol 2020; 8:410. [PMID: 32528936 PMCID: PMC7247824 DOI: 10.3389/fbioe.2020.00410] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 04/14/2020] [Indexed: 11/13/2022] Open
Abstract
Engineered plant cell lines have the potential to achieve enhanced metabolite production rates, providing a high-yielding source of compounds of interest. Improving the production of taxanes, pharmacologically valuable secondary metabolites of Taxus spp., is hindered by an incomplete knowledge of the taxane biosynthetic pathway. Of the five unknown steps, three are thought to involve cytochrome P450-like hydroxylases. In the current work, after an in-depth in silico characterization of four candidate enzymes proposed in a previous cDNA-AFLP assay, TB506 was selected as a candidate for the hydroxylation of the taxane side chain. A docking assay indicated TB506 is active after the attachment of the side chain based on its affinity to the ligand 3'N-dehydroxydebenzoyltaxol. Finally, the involvement of TB506 in the last hydroxylation step of the paclitaxel biosynthetic pathway was confirmed by functional assays. The identification of this hydroxylase will contribute to the development of alternative sustainable paclitaxel production systems using synthetic biology techniques.
Collapse
Affiliation(s)
- Raul Sanchez-Muñoz
- Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, Barcelona, Spain
| | - Edgar Perez-Mata
- Secció de Fisiologia Vegetal, Facultat de Farmacia, Universitat de Barcelona, Barcelona, Spain
| | - Lorena Almagro
- Departamento de Biología Vegetal, Facultad de Biología, Universidad de Murcia, Murcia, Spain
| | - Rosa M. Cusido
- Secció de Fisiologia Vegetal, Facultat de Farmacia, Universitat de Barcelona, Barcelona, Spain
| | - Mercedes Bonfill
- Secció de Fisiologia Vegetal, Facultat de Farmacia, Universitat de Barcelona, Barcelona, Spain
| | - Javier Palazon
- Secció de Fisiologia Vegetal, Facultat de Farmacia, Universitat de Barcelona, Barcelona, Spain
| | - Elisabeth Moyano
- Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
8
|
Koirala M, Alexov E. Computational chemistry methods to investigate the effects caused by DNA variants linked with disease. JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY 2019. [DOI: 10.1142/s0219633619300015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Computational chemistry offers variety of tools to study properties of biological macromolecules. These tools vary in terms of levels of details from quantum mechanical treatment to numerous macroscopic approaches. Here, we provide a review of computational chemistry algorithms and tools for modeling the effects of genetic variations and their association with diseases. Particular emphasis is given on modeling the effects of missense mutations on stability, conformational dynamics, binding, hydrogen bond network, salt bridges, and pH-dependent properties of the corresponding macromolecules. It is outlined that the disease may be caused by alteration of one or several of above-mentioned biophysical characteristics, and a successful prediction of pathogenicity requires detailed analysis of how the alterations affect the function of involved macromolecules. The review provides a short list of most commonly used algorithms to predict the molecular effects of mutations as well.
Collapse
Affiliation(s)
- Mahesh Koirala
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29630, USA
| | - Emil Alexov
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29630, USA
| |
Collapse
|
9
|
Ochoa R, Laio A, Cossio P. Predicting the Affinity of Peptides to Major Histocompatibility Complex Class II by Scoring Molecular Dynamics Simulations. J Chem Inf Model 2019; 59:3464-3473. [PMID: 31290667 DOI: 10.1021/acs.jcim.9b00403] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Predicting the binding affinity of peptides able to interact with major histocompatibility complex (MHC) molecules is a priority for researchers working in the identification of novel vaccines candidates. Most available approaches are based on the analysis of the sequence of peptides of known experimental affinity. However, for MHC class II receptors, these approaches are not very accurate, due to the intrinsic flexibility of the complex. To overcome these limitations, we propose to estimate the binding affinity of peptides bound to an MHC class II by averaging the score of the configurations from finite-temperature molecular dynamics simulations. The score is estimated for 18 different scoring functions, and we explored the optimal manner for combining them. To test the predictions, we considered eight peptides of known binding affinity. We found that six scoring functions correlate with the experimental ranking of the peptides significantly better than the others. We then assessed a set of techniques for combining the scoring functions by linear regression and logistic regression. We obtained a maximum accuracy of 82% for the predicted sign of the binding affinity using a logistic regression with optimized weights. These results are potentially useful to improve the reliability of in silico protocols to design high-affinity binding peptides for MHC class II receptors.
Collapse
Affiliation(s)
- Rodrigo Ochoa
- Biophysics of Tropical Diseases, Max Planck Tandem Group , University of Antioquia , 050010 Medellin , Colombia
| | - Alessandro Laio
- International School for Advanced Studies (SISSA) , Via Bonomea 265 , 34136 Trieste , Italy.,The Abdus Salam International Centre for Theoretical Physics (ICTP) , Strada Costiera 11 , 34151 Trieste , Italy
| | - Pilar Cossio
- Biophysics of Tropical Diseases, Max Planck Tandem Group , University of Antioquia , 050010 Medellin , Colombia.,Department of Theoretical Biophysics , Max Planck Institute of Biophysics , 60438 Frankfurt am Main , Germany
| |
Collapse
|
10
|
Baiesi M, Orlandini E, Seno F, Trovato A. Sequence and structural patterns detected in entangled proteins reveal the importance of co-translational folding. Sci Rep 2019; 9:8426. [PMID: 31182755 PMCID: PMC6557820 DOI: 10.1038/s41598-019-44928-3] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Accepted: 05/23/2019] [Indexed: 11/09/2022] Open
Abstract
Proteins must fold quickly to acquire their biologically functional three-dimensional native structures. Hence, these are mainly stabilized by local contacts, while intricate topologies such as knots are rare. Here, we reveal the existence of specific patterns adopted by protein sequences and structures to deal with backbone self-entanglement. A large scale analysis of the Protein Data Bank shows that loops significantly intertwined with another chain portion are typically closed by weakly bound amino acids. Why is this energetic frustration maintained? A possible picture is that entangled loops are formed only toward the end of the folding process to avoid kinetic traps. Consistently, these loops are more frequently found to be wrapped around a portion of the chain on their N-terminal side, the one translated earlier at the ribosome. Finally, these motifs are less abundant in natural native states than in simulated protein-like structures, yet they appear in 32% of proteins, which in some cases display an amazingly complex intertwining.
Collapse
Affiliation(s)
- Marco Baiesi
- Department of Physics and Astronomy, University of Padova, Via Marzolo 8, I-35131, Padova, Italy
- INFN, Sezione di Padova, Via Marzolo 8, I-35131, Padova, Italy
| | - Enzo Orlandini
- Department of Physics and Astronomy, University of Padova, Via Marzolo 8, I-35131, Padova, Italy
- INFN, Sezione di Padova, Via Marzolo 8, I-35131, Padova, Italy
| | - Flavio Seno
- Department of Physics and Astronomy, University of Padova, Via Marzolo 8, I-35131, Padova, Italy.
- INFN, Sezione di Padova, Via Marzolo 8, I-35131, Padova, Italy.
| | - Antonio Trovato
- Department of Physics and Astronomy, University of Padova, Via Marzolo 8, I-35131, Padova, Italy
- INFN, Sezione di Padova, Via Marzolo 8, I-35131, Padova, Italy
| |
Collapse
|
11
|
Methods for the Refinement of Protein Structure 3D Models. Int J Mol Sci 2019; 20:ijms20092301. [PMID: 31075942 PMCID: PMC6539982 DOI: 10.3390/ijms20092301] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 04/24/2019] [Accepted: 05/07/2019] [Indexed: 12/25/2022] Open
Abstract
The refinement of predicted 3D protein models is crucial in bringing them closer towards experimental accuracy for further computational studies. Refinement approaches can be divided into two main stages: The sampling and scoring stages. Sampling strategies, such as the popular Molecular Dynamics (MD)-based protocols, aim to generate improved 3D models. However, generating 3D models that are closer to the native structure than the initial model remains challenging, as structural deviations from the native basin can be encountered due to force-field inaccuracies. Therefore, different restraint strategies have been applied in order to avoid deviations away from the native structure. For example, the accurate prediction of local errors and/or contacts in the initial models can be used to guide restraints. MD-based protocols, using physics-based force fields and smart restraints, have made significant progress towards a more consistent refinement of 3D models. The scoring stage, including energy functions and Model Quality Assessment Programs (MQAPs) are also used to discriminate near-native conformations from non-native conformations. Nevertheless, there are often very small differences among generated 3D models in refinement pipelines, which makes model discrimination and selection problematic. For this reason, the identification of the most native-like conformations remains a major challenge.
Collapse
|
12
|
Battisti A, Zamuner S, Sarti E, Laio A. Toward a unified scoring function for native state discrimination and drug-binding pocket recognition. Phys Chem Chem Phys 2019; 20:17148-17155. [PMID: 29900428 DOI: 10.1039/c7cp08170g] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Protein folding and receptor-ligand recognition are fundamental processes for any living organism. Although folding and ligand recognition are based on the same chemistry, the existing empirical scoring functions target just one problem: predicting the correct fold or the correct binding pose. We here introduce a statistical potential which considers moieties as fundamental units. The scoring function is able to deal with both folding and ligand pocket recognition problems with a performance comparable to the scoring functions specifically tailored for one of the two tasks. We foresee that the capability of the new scoring function to tackle both problems in a unified framework will be a key to deal with the induced fit phenomena, in which a target protein changes significantly its conformation upon binding. Moreover, the new scoring function might be useful in docking protocols towards intrinsically disordered proteins, whose flexibility cannot be handled with the available docking software.
Collapse
Affiliation(s)
- Anna Battisti
- International School for Advanced Studies (SISSA), Via Bonomea 265, I-34136 Trieste, Italy.
| | | | | | | |
Collapse
|
13
|
Ciemny MP, Badaczewska-Dawid AE, Pikuzinska M, Kolinski A, Kmiecik S. Modeling of Disordered Protein Structures Using Monte Carlo Simulations and Knowledge-Based Statistical Force Fields. Int J Mol Sci 2019; 20:E606. [PMID: 30708941 PMCID: PMC6386871 DOI: 10.3390/ijms20030606] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 01/23/2019] [Accepted: 01/29/2019] [Indexed: 12/20/2022] Open
Abstract
The description of protein disordered states is important for understanding protein folding mechanisms and their functions. In this short review, we briefly describe a simulation approach to modeling protein interactions, which involve disordered peptide partners or intrinsically disordered protein regions, and unfolded states of globular proteins. It is based on the CABS coarse-grained protein model that uses a Monte Carlo (MC) sampling scheme and a knowledge-based statistical force field. We review several case studies showing that description of protein disordered states resulting from CABS simulations is consistent with experimental data. The case studies comprise investigations of protein⁻peptide binding and protein folding processes. The CABS model has been recently made available as the simulation engine of multiscale modeling tools enabling studies of protein⁻peptide docking and protein flexibility. Those tools offer customization of the modeling process, driving the conformational search using distance restraints, reconstruction of selected models to all-atom resolution, and simulation of large protein systems in a reasonable computational time. Therefore, CABS can be combined in integrative modeling pipelines incorporating experimental data and other modeling tools of various resolution.
Collapse
Affiliation(s)
- Maciej Pawel Ciemny
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
- Faculty of Physics, University of Warsaw, Pasteura 5, 02-093 Warsaw, Poland.
| | | | - Monika Pikuzinska
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | - Andrzej Kolinski
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | - Sebastian Kmiecik
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| |
Collapse
|
14
|
Ochoa R, Soler MA, Laio A, Cossio P. Assessing the capability of in silico mutation protocols for predicting the finite temperature conformation of amino acids. Phys Chem Chem Phys 2018; 20:25901-25909. [PMID: 30289133 DOI: 10.1039/c8cp03826k] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Mutation protocols are a key tool in computational biophysics for modelling unknown side chain conformations. In particular, these protocols are used to generate the starting structures for molecular dynamics simulations. The accuracy of the initial side chain and backbone placement is crucial to obtain a stable and quickly converging simulation. In this work, we assessed the performance of several mutation protocols in predicting the most probable conformer observed in finite temperature molecular dynamics simulations for a set of protein-peptide crystals differing only by single-point mutations in the peptide sequence. Our results show that several programs which predict well the crystal conformations fail to predict the most probable finite temperature configuration. Methods relying on backbone-dependent rotamer libraries have, in general, a better performance, but even the best protocol fails in predicting approximately 30% of the mutations.
Collapse
Affiliation(s)
- Rodrigo Ochoa
- Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia, Medellin, Colombia.
| | | | | | | |
Collapse
|
15
|
Anishchenko I, Kundrotas PJ, Vakser IA. Contact Potential for Structure Prediction of Proteins and Protein Complexes from Potts Model. Biophys J 2018; 115:809-821. [PMID: 30122295 DOI: 10.1016/j.bpj.2018.07.035] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2018] [Revised: 07/16/2018] [Accepted: 07/31/2018] [Indexed: 12/18/2022] Open
Abstract
The energy function is the key component of protein modeling methodology. This work presents a semianalytical approach to the development of contact potentials for protein structure modeling. Residue-residue and atom-atom contact energies were derived by maximizing the probability of observing native sequences in a nonredundant set of protein structures. The optimization task was formulated as an inverse statistical mechanics problem applied to the Potts model. Its solution by pseudolikelihood maximization provides consistent estimates of coupling constants at atomic and residue levels. The best performance was achieved when interacting atoms were grouped according to their physicochemical properties. For individual protein structures, the performance of the contact potentials in distinguishing near-native structures from the decoys is similar to the top-performing scoring functions. The potentials also yielded significant improvement in the protein docking success rates. The potentials recapitulated experimentally determined protein stability changes upon point mutations and protein-protein binding affinities. The approach offers a different perspective on knowledge-based potentials and may serve as the basis for their further development.
Collapse
Affiliation(s)
- Ivan Anishchenko
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas
| | - Petras J Kundrotas
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas.
| | - Ilya A Vakser
- Computational Biology Program and Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas.
| |
Collapse
|
16
|
Škrbić T, Zamuner S, Hong R, Seno F, Laio A, Trovato A. Vibrational entropy estimation can improve binding affinity prediction for non-obligatory protein complexes. Proteins 2018; 86:393-404. [DOI: 10.1002/prot.25454] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Revised: 12/22/2017] [Accepted: 01/05/2018] [Indexed: 01/10/2023]
Affiliation(s)
- Tatjana Škrbić
- Faculty of Physics; International School for Advanced Studies (SISSA/ISAS); Trieste Italy
- Department of Physics and Astronomy “Galileo Galilei”; University of Padova; Padova Italy
| | - Stefano Zamuner
- Department of Physics and Astronomy “Galileo Galilei”; University of Padova; Padova Italy
| | - Rolando Hong
- Faculty of Physics; International School for Advanced Studies (SISSA/ISAS); Trieste Italy
| | - Flavio Seno
- Department of Physics and Astronomy “Galileo Galilei”; University of Padova; Padova Italy
- Padova Section, National Institute of Nuclear Physics (INFN); Padova Italy
| | - Alessandro Laio
- Faculty of Physics; International School for Advanced Studies (SISSA/ISAS); Trieste Italy
| | - Antonio Trovato
- Department of Physics and Astronomy “Galileo Galilei”; University of Padova; Padova Italy
- Padova Section, National Institute of Nuclear Physics (INFN); Padova Italy
| |
Collapse
|
17
|
Yao Y, Gui R, Liu Q, Yi M, Deng H. Diverse effects of distance cutoff and residue interval on the performance of distance-dependent atom-pair potential in protein structure prediction. BMC Bioinformatics 2017; 18:542. [PMID: 29221443 PMCID: PMC5723101 DOI: 10.1186/s12859-017-1983-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Accepted: 12/04/2017] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND As one of the most successful knowledge-based energy functions, the distance-dependent atom-pair potential is widely used in all aspects of protein structure prediction, including conformational search, model refinement, and model assessment. During the last two decades, great efforts have been made to improve the reference state of the potential, while other factors that also strongly affect the performance of the potential have been relatively less investigated. RESULTS Based on different distance cutoffs (from 5 to 22 Å) and residue intervals (from 0 to 15) as well as six different reference states, we constructed a series of distance-dependent atom-pair potentials and tested them on several groups of structural decoy sets collected from diverse sources. A comprehensive investigation has been performed to clarify the effects of distance cutoff and residue interval on the potential's performance. Our results provide a new perspective as well as a practical guidance for optimizing distance-dependent statistical potentials. CONCLUSIONS The optimal distance cutoff and residue interval are highly related with the reference state that the potential is based on, the measurements of the potential's performance, and the decoy sets that the potential is applied to. The performance of distance-dependent statistical potential can be significantly improved when the best statistical parameters for the specific application environment are adopted.
Collapse
Affiliation(s)
- Yuangen Yao
- Department of Physics, College of Science, Huazhong Agricultural University, Wuhan, 430070 China
| | - Rong Gui
- Department of Physics, College of Science, Huazhong Agricultural University, Wuhan, 430070 China
| | - Quan Liu
- Department of Physics, College of Science, Huazhong Agricultural University, Wuhan, 430070 China
| | - Ming Yi
- Department of Physics, College of Science, Huazhong Agricultural University, Wuhan, 430070 China
| | - Haiyou Deng
- Department of Physics, College of Science, Huazhong Agricultural University, Wuhan, 430070 China
- Institute of Applied Physics, Huazhong Agricultural University, Wuhan, 430070 China
| |
Collapse
|
18
|
Srinivasan E, Rajasekaran R. Deciphering the loss of metal binding due to mutation D83G of human SOD1 protein causing FALS disease. Int J Biol Macromol 2017; 107:521-529. [PMID: 28899654 DOI: 10.1016/j.ijbiomac.2017.09.019] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Revised: 09/07/2017] [Accepted: 09/08/2017] [Indexed: 01/23/2023]
Abstract
Mutations in Cu/Zn superoxide dismutase 1 (SOD1) protein are found to be the causative factor, behind the majority of familial amyotrophic later sclerosis (FALS) cases. The mutations particularly on the metal (Zn) binding residues are found to increase the disease onset in the individuals suffering from FALS, while the presence of the metal ion (Zn) is essential for the catalytic activity and retaining the protein stability. Thus in our study, we focused on one such metal binding mutant (D83G) and assessed the impact of the mutation on protein structure and function. The influence of mutation was examined dynamically, using discrete molecular dynamics on both the native and mutant SOD1 protein respectively. Accordingly, the variation in conformational stability, residual flexibility and protein compactness along with the change in conformational free energy were monitored over the entire dynamic period. Moreover, the motion of native and mutant SOD1 was also observed via the essential dynamics. Besides, the disparity in Zn ion binding was inspected through distance analysis and steered molecular dynamics, correspondingly. Therefore, the study provides a better understanding over the profound effect of mutation on SOD1, both structurally and functionally, using computational approaches.
Collapse
Affiliation(s)
- E Srinivasan
- Bioinformatics lab, Department of Biotechnology, School of Biosciences and Technology, VIT University, Vellore - 632014, Tamil Nadu, India
| | - R Rajasekaran
- Bioinformatics lab, Department of Biotechnology, School of Biosciences and Technology, VIT University, Vellore - 632014, Tamil Nadu, India.
| |
Collapse
|
19
|
Srinivasan E, Sethumadhavan R, Rajasekaran R. A theoretical study on Zn binding loop mutants instigating destabilization and metal binding loss in human SOD1 protein. J Mol Model 2017; 23:103. [PMID: 28271284 DOI: 10.1007/s00894-017-3286-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2016] [Accepted: 02/20/2017] [Indexed: 01/22/2023]
Abstract
Mutations in Cu/Zn superoxide dismutase 1 (SOD1) protein are a major cause of the devastating neurodegenerative disorder Amyotrophic lateral sclerosis. Evidence suggests that SOD1 functions as a free radical scavenger in humans. However, neither the mechanism nor a cure for this neurodegenerative disease are yet known. In the present study, we explored the effect of mutations on the mechanistic action on the Zn binding loop of SOD1 through discrete molecular dynamics. The results were analyzed in detail using statistical potential (BACH) to find the mutant structures having the least potential energy. Subsequently, we studied the impact of those mutations on metal ions bound in SOD1 using the program Check My Metal. Remarkably, our results recognized certain mutants, viz. His80Arg and Asp83Gly, that were more damaging to the Zn binding loop than all other mutants, leading to a loss of Zn binding with altered coordination of the Zn ion. Furthermore, the conformational stability, compactness, and secondary structural alteration of the His80Arg and Asp83Gly mutants were monitored using distinct parameters. Hence, at low computational expense, our study provides helpful insight into this emergent neurodegenerative disorder affecting mankind.
Collapse
Affiliation(s)
- E Srinivasan
- Bioinformatics Laboratory, Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, 632014, Tamil Nadu, India
| | - Rao Sethumadhavan
- Bioinformatics Laboratory, Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, 632014, Tamil Nadu, India
| | - R Rajasekaran
- Bioinformatics Laboratory, Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, 632014, Tamil Nadu, India.
| |
Collapse
|
20
|
Coucke A, Uguzzoni G, Oteri F, Cocco S, Monasson R, Weigt M. Direct coevolutionary couplings reflect biophysical residue interactions in proteins. J Chem Phys 2016; 145:174102. [DOI: 10.1063/1.4966156] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Affiliation(s)
- Alice Coucke
- Laboratoire de Physique Théorique, Ecole Normale Supérieure and CNRS-UMR8549, PSL Research University, Sorbonne Universités UPMC, 24 Rue Lhomond, 75005 Paris, France
- Sorbonne Universités, UPMC, Institut de Biologie Paris-Seine, CNRS, Laboratoire de Biologie Computationnelle et Quantitative UMR 7238, 75005 Paris, France
| | - Guido Uguzzoni
- Sorbonne Universités, UPMC, Institut de Biologie Paris-Seine, CNRS, Laboratoire de Biologie Computationnelle et Quantitative UMR 7238, 75005 Paris, France
| | - Francesco Oteri
- Sorbonne Universités, UPMC, Institut de Biologie Paris-Seine, CNRS, Laboratoire de Biologie Computationnelle et Quantitative UMR 7238, 75005 Paris, France
| | - Simona Cocco
- Laboratoire de Physique Statistique, Ecole Normale Supérieure and CNRS-UMR8550, PSL Research University, Sorbonne Universités UPMC, 24 Rue Lhomond, 75005 Paris, France
| | - Remi Monasson
- Laboratoire de Physique Théorique, Ecole Normale Supérieure and CNRS-UMR8549, PSL Research University, Sorbonne Universités UPMC, 24 Rue Lhomond, 75005 Paris, France
| | - Martin Weigt
- Sorbonne Universités, UPMC, Institut de Biologie Paris-Seine, CNRS, Laboratoire de Biologie Computationnelle et Quantitative UMR 7238, 75005 Paris, France
| |
Collapse
|
21
|
Baiesi M, Orlandini E, Trovato A, Seno F. Linking in domain-swapped protein dimers. Sci Rep 2016; 6:33872. [PMID: 27659606 PMCID: PMC5034241 DOI: 10.1038/srep33872] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Accepted: 09/05/2016] [Indexed: 11/24/2022] Open
Abstract
The presence of knots has been observed in a small fraction of single-domain proteins and related to their thermodynamic and kinetic properties. The exchanging of identical structural elements, typical of domain-swapped proteins, makes such dimers suitable candidates to validate the possibility that mutual entanglement between chains may play a similar role for protein complexes. We suggest that such entanglement is captured by the linking number. This represents, for two closed curves, the number of times that each curve winds around the other. We show that closing the curves is not necessary, as a novel parameter G', termed Gaussian entanglement, is strongly correlated with the linking number. Based on 110 non redundant domain-swapped dimers, our analysis evidences a high fraction of chains with a significant intertwining, that is with |G'| > 1. We report that Nature promotes configurations with negative mutual entanglement and surprisingly, it seems to suppress intertwining in long protein dimers. Supported by numerical simulations of dimer dissociation, our results provide a novel topology-based classification of protein-swapped dimers together with some preliminary evidence of its impact on their physical and biological properties.
Collapse
Affiliation(s)
- Marco Baiesi
- Department of Physics and Astronomy, University of Padova, Padova, Italy
- INFN-Sezione di Padova, Padova, Italy
| | - Enzo Orlandini
- Department of Physics and Astronomy, University of Padova, Padova, Italy
- INFN-Sezione di Padova, Padova, Italy
| | - Antonio Trovato
- Department of Physics and Astronomy, University of Padova, Padova, Italy
- CNISM-Unità di Padova, Padova, Italy
| | - Flavio Seno
- Department of Physics and Astronomy, University of Padova, Padova, Italy
- CNISM-Unità di Padova, Padova, Italy
| |
Collapse
|
22
|
Srinivasan E, Rajasekaran R. Computational simulation analysis on human SOD1 mutant (H80R) exposes the structural destabilization and the deviation of Zn binding that directs familial amyotrophic lateral sclerosis. J Biomol Struct Dyn 2016; 35:2645-2653. [DOI: 10.1080/07391102.2016.1227723] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- E. Srinivasan
- Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore 632014, Tamil Nadu, India
| | - R. Rajasekaran
- Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore 632014, Tamil Nadu, India
| |
Collapse
|
23
|
Meshach Paul D, Rajasekaran R. In silico approach to explore the disruption in the molecular mechanism of human hyaluronidase 1 by mutant E268K that directs Natowicz syndrome. EUROPEAN BIOPHYSICS JOURNAL: EBJ 2016; 46:157-169. [PMID: 27424109 DOI: 10.1007/s00249-016-1151-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Revised: 06/02/2016] [Accepted: 07/01/2016] [Indexed: 01/27/2023]
Abstract
Natowicz syndrome (mucopolysaccharidoses type 9) is a lysosomal storage disorder caused by deficient or defective human hyaluronidase 1. The disorder is not well studied at the molecular level. Therefore, a new in silico approach was proposed to study the molecular basis on which one clinically observed mutation, Glu268Lys, results in a defective enzyme. The native and mutant structures were subjected to comparative analyses using a conformational sampling approach for geometrical variables viz, RMSF, RMSD, and Ramachandran plot. In addition, the strength of a Cys207-Cys221 disulfide bond and electrostatic interaction between Arg265 and Asp206 were studied, as they are known to be involved in the catalytic activity of the enzyme. Native and mutant E268K showed statistically significant variations with p < 0.05 in RMSD, Ramachandran plot, strengths of disulfide bond, and electrostatic interactions. Further, single model analysis showed variations between native and mutant structures in terms of intra-protein interactions, hydrogen bond dilution, secondary structure, and dihedral angles. Docking analysis predicted the mutant to have a less favorable substrate binding energy compared to the native protein. Additionally, steered MD analysis indicated that the substrate should have more affinity to the native than mutant enzymes. The observed changes theoretically explain the less favorable binding energy of substrate towards mutant E268K, thereby providing a structural basis for its reduced catalytic activity. Hence, our study provides a basis for understanding the disruption in the molecular mechanism of human hyaluronidase 1 by mutation E268K, which may prove useful for the development of synthetic chaperones as a treatment option for Natowicz syndrome.
Collapse
Affiliation(s)
- D Meshach Paul
- Computational Biology Lab, Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, 632014, Tamil Nadu, India
| | - R Rajasekaran
- Computational Biology Lab, Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, 632014, Tamil Nadu, India.
| |
Collapse
|
24
|
Exploration of Structural and Functional Variations Owing to Point Mutations in α-NAGA. Interdiscip Sci 2016; 10:81-92. [PMID: 27138754 DOI: 10.1007/s12539-016-0173-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Revised: 04/07/2016] [Accepted: 04/13/2016] [Indexed: 10/21/2022]
Abstract
Schindler disease is a lysosomal storage disorder caused due to deficiency or defective activity of alpha-N-acetylgalactosaminidase (α-NAGA). Mutations in gene encoding α-NAGA cause wide range of diseases, characterized with mild to severe clinical features. Molecular effects of these mutations are yet to be explored in detail. Therefore, this study was focused on four missense mutations of α-NAGA namely, S160C, E325K, R329Q and R329W. Native and mutant structures of α-NAGA were analysed to determine geometrical deviations such as the contours of root mean square deviation, root mean square fluctuation, percentage of residues in allowed regions of Ramachandran plot and solvent accessible surface area, using conformational sampling technique. Additionally, global energy-minimized structures of native and mutants were further analysed to compute their intra-molecular interactions, hydrogen bond dilution and distribution of secondary structure. In addition, docking studies were also performed to determine variations in binding energies between native and mutants. The deleterious effects of mutants were evident due to variations in their active site residues pertaining to spatial conformation and flexibility, comparatively. Hence, variations exhibited by mutants, namely S160C, E325K, R329Q and R329W to that of native, consequently, lead to the detrimental effects causing Schindler disease. This study computationally explains the underlying reasons for the pathogenesis of the disease, thereby aiding future researchers in drug development and disease management.
Collapse
|
25
|
Geometric Simulation Approach for Grading and Assessing the Thermostability of CALBs. Biochem Res Int 2016; 2016:4101059. [PMID: 27123343 PMCID: PMC4830698 DOI: 10.1155/2016/4101059] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 03/16/2016] [Indexed: 11/17/2022] Open
Abstract
Candida antarctica lipase B (CALB) is a known stable and highly active enzyme used widely in biodiesel synthesis. In this work, the stability of native (4K6G) and mutant (4K5Q) CALB was studied through various structural parameters using conformational sampling approach. The contours of polar surface area and surface area of mutant CALB were 11357.67 Å(2) and 30007.4 Å(2), respectively, showing an enhanced stability compared to native CALB with a statistically significant P value of < 0.0001. Moreover, simulated thermal denaturation of CALB, a process involving dilution of hydrogen bond, significantly shielded against different intervals of energy application in mutant CALB revealing its augmentation of structural rigidity against native CALB. Finally, computational docking analysis showed an increase in the binding affinity of CALB and its substrate (triglyceride) in mutant CALB with Atomic Contact Energy (ACE) of -91.23 kcal/mol compared to native CALB (ACE of -70.3 kcal/mol). The computational observations proposed that the use of mutant CALB (4K5Q) could serve as a best template for production of biodiesel in the future. Additionally, it can also be used as a template to identify efficient thermostable lipases through further mutations.
Collapse
|
26
|
Lopus M, Paul DM, Rajasekaran R. Unraveling the Deleterious Effects of Cancer-Driven STK11 Mutants Through Conformational Sampling Approach. Cancer Inform 2016; 15:35-44. [PMID: 27081308 PMCID: PMC4821432 DOI: 10.4137/cin.s38044] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Revised: 02/17/2016] [Accepted: 02/17/2016] [Indexed: 01/18/2023] Open
Abstract
Tumor suppressor gene, STK11, encodes for serine-threonine kinase, which has a critical role in regulating cell growth and apoptosis. Mutations of the same lead to the inactivation of STK11, which eventually causes different types of cancer. In this study, we focused on identifying those driver mutations through analyzing structural variations of mutants, viz., D194N, E199K, L160P, and Y49D. Native and the mutants were analyzed to determine their geometrical deviations such as root-mean-square deviation, root-mean-square fluctuation, radius of gyration, potential energy, and solvent-accessible surface area using conformational sampling technique. Additionally, the global minimized structure of native and mutants was further analyzed to compute their intramolecular interactions and distribution of secondary structure. Subsequently, simulated thermal denaturation and docking studies were performed to determine their structural variations, which in turn alter the formation of active complex that comprises STK11, STRAD, and MO25. The deleterious effect of the mutants would result in a comparative loss of enzyme function due to variations in their binding energy pertaining to spatial conformation and flexibility. Hence, the structural variations in binding energy exhibited by the mutants, viz., D194N, E199K, L160P, and Y49D, to that of the native, consequently lead to pathogenesis.
Collapse
Affiliation(s)
- Merlin Lopus
- Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, Tamil Nadu, India
| | - D Meshach Paul
- Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, Tamil Nadu, India
| | - R Rajasekaran
- Department of Biotechnology, School of Bio Sciences and Technology, VIT University, Vellore, Tamil Nadu, India
| |
Collapse
|
27
|
Chen SH, Meller J, Elber R. Comprehensive analysis of sequences of a protein switch. Protein Sci 2016; 25:135-46. [PMID: 26073558 PMCID: PMC4815306 DOI: 10.1002/pro.2723] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Revised: 05/28/2015] [Accepted: 05/28/2015] [Indexed: 11/08/2022]
Abstract
Switches form a special class of proteins that dramatically change their three-dimensional structures upon a small perturbation. One possible perturbation that we explore is that of a single point mutation. Building on the pioneering experimental work of Alexander et al. (Alexander et al. PNAS, 2007; 104,11963-11968) that determines switch sequences between α and α+β folds we conduct a comprehensive sequence sampling by a Markov Chain with multiple fitness criteria to identify new switches given the experimental folds. We screen for switch sequences using a combination of contact potential, secondary structure prediction, and finally molecular dynamics simulations. Statistical properties of switch sequences are discussed and illustrated to be most sensitive to mutation at the N- and C- termini of the switch protein. Based on this analysis, a particularly stable putative switch pair is identified and proposed for further experimental analysis.
Collapse
Affiliation(s)
- Szu-Hua Chen
- Department of Molecular Biosciences, University of Texas at Austin, Austin, Texas
- Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, Texas
| | - Jaroslaw Meller
- Department of Environmental Health, University of Cincinnati College of Medicine, Cincinnati, Ohio
- Department of Electrical Engineering and Computing Systems, University of Cincinnati College of Medicine, Cincinnati, Ohio
- Department of Biomedical Informatics, University of Cincinnati College of Medicine, Cincinnati, Ohio
- Department of Informatics, Nicholas Copernicus University, Torun, Poland
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Ron Elber
- Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, Texas
- Department of Chemistry, University of Texas at Austin, Austin, Texas
| |
Collapse
|
28
|
Anoosha P, Sakthivel R, Gromiha MM. Prediction of protein disorder on amino acid substitutions. Anal Biochem 2015; 491:18-22. [PMID: 26348538 DOI: 10.1016/j.ab.2015.08.028] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Revised: 07/27/2015] [Accepted: 08/27/2015] [Indexed: 12/22/2022]
Abstract
Intrinsically disordered regions of proteins are known to have many functional roles in cell signaling and regulatory pathways. The altered expression of these proteins due to mutations is associated with various diseases. Currently, most of the available methods focus on predicting the disordered proteins or the disordered regions in a protein. On the other hand, methods developed for predicting protein disorder on mutation showed a poor performance with a maximum accuracy of 70%. Hence, in this work, we have developed a novel method to classify the disorder-related amino acid substitutions using amino acid properties, substitution matrices, and the effect of neighboring residues that showed an accuracy of 90.0% with a sensitivity and specificity of 94.9 and 80.6%, respectively, in 10-fold cross-validation. The method was evaluated with a test set of 20% data using 10 iterations, which showed an average accuracy of 88.9%. Furthermore, we systematically analyzed the features responsible for the better performance of our method and observed that neighboring residues play an important role in defining the disorder of a given residue in a protein sequence. We have developed a prediction server to identify disorder-related mutations, and it is available at http://www.iitm.ac.in/bioinfo/DIM_Pred/.
Collapse
Affiliation(s)
- P Anoosha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamilnadu, India
| | - R Sakthivel
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamilnadu, India
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamilnadu, India.
| |
Collapse
|
29
|
Chae MH, Krull F, Knapp EW. Optimized distance-dependent atom-pair-based potential DOOP for protein structure prediction. Proteins 2015; 83:881-90. [PMID: 25693513 DOI: 10.1002/prot.24782] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2014] [Revised: 02/06/2015] [Accepted: 02/10/2015] [Indexed: 12/20/2022]
Abstract
The DOcking decoy-based Optimized Potential (DOOP) energy function for protein structure prediction is based on empirical distance-dependent atom-pair interactions. To optimize the atom-pair interactions, native protein structures are decomposed into polypeptide chain segments that correspond to structural motives involving complete secondary structure elements. They constitute near native ligand-receptor systems (or just pairs). Thus, a total of 8609 ligand-receptor systems were prepared from 954 selected proteins. For each of these hypothetical ligand-receptor systems, 1000 evenly sampled docking decoys with 0-10 Å interface root-mean-square-deviation (iRMSD) were generated with a method used before for protein-protein docking. A neural network-based optimization method was applied to derive the optimized energy parameters using these decoys so that the energy function mimics the funnel-like energy landscape for the interaction between these hypothetical ligand-receptor systems. Thus, our method hierarchically models the overall funnel-like energy landscape of native protein structures. The resulting energy function was tested on several commonly used decoy sets for native protein structure recognition and compared with other statistical potentials. In combination with a torsion potential term which describes the local conformational preference, the atom-pair-based potential outperforms other reported statistical energy functions in correct ranking of native protein structures for a variety of decoy sets. This is especially the case for the most challenging ROSETTA decoy set, although it does not take into account side chain orientation-dependence explicitly. The DOOP energy function for protein structure prediction, the underlying database of protein structures with hypothetical ligand-receptor systems and their decoys are freely available at http://agknapp.chemie.fu-berlin.de/doop/.
Collapse
Affiliation(s)
- Myong-Ho Chae
- Department of Biology, University of Science, Unjong-District, Pyongyang, DPR Korea
| | | | | |
Collapse
|
30
|
Sarti E, Granata D, Seno F, Trovato A, Laio A. Native fold and docking pose discrimination by the same residue-based scoring function. Proteins 2015; 83:621-30. [DOI: 10.1002/prot.24764] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Revised: 12/15/2014] [Accepted: 12/31/2014] [Indexed: 11/10/2022]
Affiliation(s)
| | - Daniele Granata
- SISSA, Physics Faculty; Trieste I-34136 Italy
- Institute of Computational and Molecular Science; Temple University; Philadelphia, PA 19122 USA
| | - Flavio Seno
- CNISM, Padova Unit, and Department of Physics and Astronomy; University of Padova; 35131 Padova Italy
| | - Antonio Trovato
- INFN, Padova Section, and Department of Physics and Astronomy; University of Padova; 35131 Padova Italy
| | | |
Collapse
|
31
|
Bottaro S, Di Palma F, Bussi G. The role of nucleobase interactions in RNA structure and dynamics. Nucleic Acids Res 2014; 42:13306-14. [PMID: 25355509 PMCID: PMC4245972 DOI: 10.1093/nar/gku972] [Citation(s) in RCA: 111] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
The intricate network of interactions observed in RNA three-dimensional structures is often described in terms of a multitude of geometrical properties, including helical parameters, base pairing/stacking, hydrogen bonding and backbone conformation. We show that a simple molecular representation consisting in one oriented bead per nucleotide can account for the fundamental structural properties of RNA. In this framework, canonical Watson-Crick, non-Watson-Crick base-pairing and base-stacking interactions can be unambiguously identified within a well-defined interaction shell. We validate this representation by performing two independent, complementary tests. First, we use it to construct a sequence-independent, knowledge-based scoring function for RNA structural prediction, which compares favorably to fully atomistic, state-of-the-art techniques. Second, we define a metric to measure deviation between RNA structures that directly reports on the differences in the base–base interaction network. The effectiveness of this metric is tested with respect to the ability to discriminate between structurally and kinetically distant RNA conformations, performing better compared to standard techniques. Taken together, our results suggest that this minimalist, nucleobase-centric representation captures the main interactions that are relevant for describing RNA structure and dynamics.
Collapse
Affiliation(s)
- Sandro Bottaro
- Scuola Internazionale Superiore di Studi Avanzati, International School for Advanced Studies, 265, Via Bonomea I-34136 Trieste, Italy
| | - Francesco Di Palma
- Scuola Internazionale Superiore di Studi Avanzati, International School for Advanced Studies, 265, Via Bonomea I-34136 Trieste, Italy
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati, International School for Advanced Studies, 265, Via Bonomea I-34136 Trieste, Italy
| |
Collapse
|
32
|
Park J, Saitou K. ROTAS: a rotamer-dependent, atomic statistical potential for assessment and prediction of protein structures. BMC Bioinformatics 2014; 15:307. [PMID: 25236673 PMCID: PMC4262145 DOI: 10.1186/1471-2105-15-307] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2014] [Accepted: 09/09/2014] [Indexed: 12/31/2022] Open
Abstract
Background Multibody potentials accounting for cooperative effects of molecular interactions have shown better accuracy than typical pairwise potentials. The main challenge in the development of such potentials is to find relevant structural features that characterize the tightly folded proteins. Also, the side-chains of residues adopt several specific, staggered conformations, known as rotamers within protein structures. Different molecular conformations result in different dipole moments and induce charge reorientations. However, until now modeling of the rotameric state of residues had not been incorporated into the development of multibody potentials for modeling non-bonded interactions in protein structures. Results In this study, we develop a new multibody statistical potential which can account for the influence of rotameric states on the specificity of atomic interactions. In this potential, named “rotamer-dependent atomic statistical potential” (ROTAS), the interaction between two atoms is specified by not only the distance and relative orientation but also by two state parameters concerning the rotameric state of the residues to which the interacting atoms belong. It was clearly found that the rotameric state is correlated to the specificity of atomic interactions. Such rotamer-dependencies are not limited to specific type or certain range of interactions. The performance of ROTAS was tested using 13 sets of decoys and was compared to those of existing atomic-level statistical potentials which incorporate orientation-dependent energy terms. The results show that ROTAS performs better than other competing potentials not only in native structure recognition, but also in best model selection and correlation coefficients between energy and model quality. Conclusions A new multibody statistical potential, ROTAS accounting for the influence of rotameric states on the specificity of atomic interactions was developed and tested on decoy sets. The results show that ROTAS has improved ability to recognize native structure from decoy models compared to other potentials. The effectiveness of ROTAS may provide insightful information for the development of many applications which require accurate side-chain modeling such as protein design, mutation analysis, and docking simulation. Electronic supplementary material The online version of this article (doi:10.1186/1471-2105-15-307) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Kazuhiro Saitou
- Department of Mechanical Engineering, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
33
|
Walsh I, Seno F, Tosatto SCE, Trovato A. PASTA 2.0: an improved server for protein aggregation prediction. Nucleic Acids Res 2014; 42:W301-7. [PMID: 24848016 PMCID: PMC4086119 DOI: 10.1093/nar/gku399] [Citation(s) in RCA: 346] [Impact Index Per Article: 31.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
The formation of amyloid aggregates upon protein misfolding is related to several devastating degenerative diseases. The propensities of different protein sequences to aggregate into amyloids, how they are enhanced by pathogenic mutations, the presence of aggregation hot spots stabilizing pathological interactions, the establishing of cross-amyloid interactions between co-aggregating proteins, all rely at the molecular level on the stability of the amyloid cross-beta structure. Our redesigned server, PASTA 2.0, provides a versatile platform where all of these different features can be easily predicted on a genomic scale given input sequences. The server provides other pieces of information, such as intrinsic disorder and secondary structure predictions, that complement the aggregation data. The PASTA 2.0 energy function evaluates the stability of putative cross-beta pairings between different sequence stretches. It was re-derived on a larger dataset of globular protein domains. The resulting algorithm was benchmarked on comprehensive peptide and protein test sets, leading to improved, state-of-the-art results with more amyloid forming regions correctly detected at high specificity. The PASTA 2.0 server can be accessed at http://protein.bio.unipd.it/pasta2/.
Collapse
Affiliation(s)
- Ian Walsh
- Department of Biomedical Sciences, University of Padova, Padova I-35131, Italy
| | - Flavio Seno
- INFN, Padova Section, and Department of Physics and Astronomy 'G. Galilei', University of Padova, Padova I-35121, Italy
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, Padova I-35131, Italy
| | - Antonio Trovato
- INFN, Padova Section, and Department of Physics and Astronomy 'G. Galilei', University of Padova, Padova I-35121, Italy
| |
Collapse
|
34
|
Abstract
Protein switches are made of highly similar sequences that fold to dramatically different structures. A structural switching system with 31 sequence variants for α and α+β folds has been illustrated experimentally by He et al., Structure, 2012, 20, 283 and is investigated computationally in the present study. Methods to assign a sequence to one of the two folds are reported and analyzed. A fast and accurate protocol to identify the correct fold of the 31 sequences is based on enriching modeled structures using short molecular dynamics (MD) trajectories and scoring these structures with coarse-grained energy functions. We examine five coarse-grained energy functions and illustrate that the Hinds-Levitt potential works the best for this task. We show that enrichment by MD significantly enhances prediction accuracy. Finally, we find that melting temperature correlates well with the energy difference between the two folds (correlation coefficient ∼-0.7). The correlation reduces dramatically (∼0.4) if the absolute energy of the correct fold is considered. Moreover, prediction of melting temperature is sensitive to the structural templates. We emphasize in our analyses the use of native structures as templates since these folds are more readily available from structural biology experiments.
Collapse
Affiliation(s)
- Szu-Hua Chen
- Department of Molecular Biosciences, Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, TX 78712, USA
| | | |
Collapse
|
35
|
Zangi R. Side-chain-side-chain interactions and stability of the helical state. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 89:012723. [PMID: 24580273 DOI: 10.1103/physreve.89.012723] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2013] [Indexed: 06/03/2023]
Abstract
Understanding the driving forces that lead to the stability of the secondary motifs found in proteins, namely α-helix and β-sheet, is a major goal in structural biology. The thermodynamic stability of these repetitive units is a result of a delicate balance between many factors, which in addition to the peptide chain involves also the solvent. Despite the fact that the backbones of all amino acids are the same (except of that of proline), there are large differences in the propensity of the different amino acids to promote the helical structure. In this paper, we investigate by explicit-solvent molecular dynamics simulations the role of the side chains (modeled as coarse-grained single sites) in stabilizing α helices in an aqueous solution. Our model systems include four (six-mer-nine-mer) peptide lengths in which the magnitude of the effective attraction between the side chains is systematically increased. We find that these interactions between the side chains can induce (for the nine-mer almost completely) a transition from a coil to a helical state. This transition is found to be characterized by three states in which the intermediate state is a partially folded α-helical conformation. In the absence of any interactions between the side chains the free energy change for helix formation has a small positive value indicating that favorable contributions from the side chains are necessary to stabilize the helical conformation. Thus, the helix-coil transition is controlled by the effective potentials between the side-chain residues and the magnitude of the required attraction per residue, which is on the order of the thermal energy, reduces with the length of the peptide. Surprisingly, the plots of the population of the helical state (or the change in the free energy for helix formation) as a function of the total effective interactions between the side chains in the helical state for all peptide lengths fall on the same curve.
Collapse
Affiliation(s)
- Ronen Zangi
- Department of Organic Chemistry I, University of the Basque Country UPV/EHU, Avenida de Tolosa 72, 20018, San Sebastian, Spain and IKERBASQUE, Basque Foundation for Science, 48011, Bilbao, Spain
| |
Collapse
|
36
|
Dong GQ, Fan H, Schneidman-Duhovny D, Webb B, Sali A. Optimized atomic statistical potentials: assessment of protein interfaces and loops. Bioinformatics 2013; 29:3158-66. [PMID: 24078704 PMCID: PMC3842762 DOI: 10.1093/bioinformatics/btt560] [Citation(s) in RCA: 98] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2013] [Revised: 08/13/2013] [Accepted: 09/22/2013] [Indexed: 01/16/2023] Open
Abstract
MOTIVATION Statistical potentials have been widely used for modeling whole proteins and their parts (e.g. sidechains and loops) as well as interactions between proteins, nucleic acids and small molecules. Here, we formulate the statistical potentials entirely within a statistical framework, avoiding questionable statistical mechanical assumptions and approximations, including a definition of the reference state. RESULTS We derive a general Bayesian framework for inferring statistically optimized atomic potentials (SOAP) in which the reference state is replaced with data-driven 'recovery' functions. Moreover, we restrain the relative orientation between two covalent bonds instead of a simple distance between two atoms, in an effort to capture orientation-dependent interactions such as hydrogen bonds. To demonstrate this general approach, we computed statistical potentials for protein-protein docking (SOAP-PP) and loop modeling (SOAP-Loop). For docking, a near-native model is within the top 10 scoring models in 40% of the PatchDock benchmark cases, compared with 23 and 27% for the state-of-the-art ZDOCK and FireDock scoring functions, respectively. Similarly, for modeling 12-residue loops in the PLOP benchmark, the average main-chain root mean square deviation of the best scored conformations by SOAP-Loop is 1.5 Å, close to the average root mean square deviation of the best sampled conformations (1.2 Å) and significantly better than that selected by Rosetta (2.1 Å), DFIRE (2.3 Å), DOPE (2.5 Å) and PLOP scoring functions (3.0 Å). Our Bayesian framework may also result in more accurate statistical potentials for additional modeling applications, thus affording better leverage of the experimentally determined protein structures. AVAILABILITY AND IMPLEMENTATION SOAP-PP and SOAP-Loop are available as part of MODELLER (http://salilab.org/modeller).
Collapse
Affiliation(s)
- Guang Qiang Dong
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry and California Institute for Quantitative Biosciences (QB3), University of California, San Francisco, CA 94158, USA
| | | | | | | | | |
Collapse
|
37
|
Khoury GA, Tamamis P, Pinnaduwage N, Smadbeck J, Kieslich CA, Floudas CA. Princeton_TIGRESS: protein geometry refinement using simulations and support vector machines. Proteins 2013; 82:794-814. [PMID: 24174311 DOI: 10.1002/prot.24459] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Revised: 10/18/2013] [Accepted: 10/22/2013] [Indexed: 12/30/2022]
Abstract
Protein structure refinement aims to perform a set of operations given a predicted structure to improve model quality and accuracy with respect to the native in a blind fashion. Despite the numerous computational approaches to the protein refinement problem reported in the previous three CASPs, an overwhelming majority of methods degrade models rather than improve them. We initially developed a method tested using blind predictions during CASP10 which was officially ranked in 5th place among all methods in the refinement category. Here, we present Princeton_TIGRESS, which when benchmarked on all CASP 7,8,9, and 10 refinement targets, simultaneously increased GDT_TS 76% of the time with an average improvement of 0.83 GDT_TS points per structure. The method was additionally benchmarked on models produced by top performing three-dimensional structure prediction servers during CASP10. The robustness of the Princeton_TIGRESS protocol was also tested for different random seeds. We make the Princeton_TIGRESS refinement protocol freely available as a web server at http://atlas.princeton.edu/refinement. Using this protocol, one can consistently refine a prediction to help bridge the gap between a predicted structure and the actual native structure.
Collapse
Affiliation(s)
- George A Khoury
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, 08540
| | | | | | | | | | | |
Collapse
|