1
|
Hsieh YC, Delarue M, Orland H, Koehl P. Analyzing the Geometry and Dynamics of Viral Structures: A Review of Computational Approaches Based on Alpha Shape Theory, Normal Mode Analysis, and Poisson-Boltzmann Theories. Viruses 2023; 15:1366. [PMID: 37376665 DOI: 10.3390/v15061366] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 06/05/2023] [Accepted: 06/09/2023] [Indexed: 06/29/2023] Open
Abstract
The current SARS-CoV-2 pandemic highlights our fragility when we are exposed to emergent viruses either directly or through zoonotic diseases. Fortunately, our knowledge of the biology of those viruses is improving. In particular, we have more and more structural information on virions, i.e., the infective form of a virus that includes its genomic material and surrounding protective capsid, and on their gene products. It is important to have methods that enable the analyses of structural information on such large macromolecular systems. We review some of those methods in this paper. We focus on understanding the geometry of virions and viral structural proteins, their dynamics, and their energetics, with the ambition that this understanding can help design antiviral agents. We discuss those methods in light of the specificities of those structures, mainly that they are huge. We focus on three of our own methods based on the alpha shape theory for computing geometry, normal mode analyses to study dynamics, and modified Poisson-Boltzmann theories to study the organization of ions and co-solvent and solvent molecules around biomacromolecules. The corresponding software has computing times that are compatible with the use of regular desktop computers. We show examples of their applications on some outer shells and structural proteins of the West Nile Virus.
Collapse
Affiliation(s)
- Yin-Chen Hsieh
- Institute for Arctic and Marine Biology, Department of Biosciences, Fisheries, and Economics, UiT The Arctic University of Norway, 9037 Tromso, Norway
| | - Marc Delarue
- Institut Pasteur, Université Paris-Cité and CNRS, UMR 3528, Unité Architecture et Dynamique des Macromolécules Biologiques, 75015 Paris, France
| | - Henri Orland
- Institut de Physique Théorique, CEA, CNRS, Université Paris-Saclay, 91191 Gif-sur-Yvette, France
| | - Patrice Koehl
- Department of Computer Science, University of California, Davis, CA 95616, USA
| |
Collapse
|
2
|
Mailhot O, Frappier V, Major F, Najmanovich RJ. Sequence-sensitive elastic network captures dynamical features necessary for miR-125a maturation. PLoS Comput Biol 2022; 18:e1010777. [PMID: 36516216 PMCID: PMC9797095 DOI: 10.1371/journal.pcbi.1010777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Revised: 12/28/2022] [Accepted: 11/29/2022] [Indexed: 12/15/2022] Open
Abstract
The Elastic Network Contact Model (ENCoM) is a coarse-grained normal mode analysis (NMA) model unique in its all-atom sensitivity to the sequence of the studied macromolecule and thus to the effect of mutations. We adapted ENCoM to simulate the dynamics of ribonucleic acid (RNA) molecules, benchmarked its performance against other popular NMA models and used it to study the 3D structural dynamics of human microRNA miR-125a, leveraging high-throughput experimental maturation efficiency data of over 26 000 sequence variants. We also introduce a novel way of using dynamical information from NMA to train multivariate linear regression models, with the purpose of highlighting the most salient contributions of dynamics to function. ENCoM has a similar performance profile on RNA than on proteins when compared to the Anisotropic Network Model (ANM), the most widely used coarse-grained NMA model; it has the advantage on predicting large-scale motions while ANM performs better on B-factors prediction. A stringent benchmark from the miR-125a maturation dataset, in which the training set contains no sequence information in common with the testing set, reveals that ENCoM is the only tested model able to capture signal beyond the sequence. This ability translates to better predictive power on a second benchmark in which sequence features are shared between the train and test sets. When training the linear regression model using all available data, the dynamical features identified as necessary for miR-125a maturation point to known patterns but also offer new insights into the biogenesis of microRNAs. Our novel approach combining NMA with multivariate linear regression is generalizable to any macromolecule for which relatively high-throughput mutational data is available.
Collapse
Affiliation(s)
- Olivier Mailhot
- Department of Biochemistry and Molecular Medicine, Université de Montréal, Montreal, QC, Canada
- Department of Computer Science and Operations Research, Université de Montréal, Montreal, QC, Canada
- Institute for Research in Immunology and Cancer, Université de Montréal, Montreal, QC, Canada
- Department of Pharmacology and Physiology, Université de Montréal, Montreal, QC, Canada
| | - Vincent Frappier
- Generate Biomedicines, Cambridge, Massachusetts, United States of America
| | - François Major
- Department of Computer Science and Operations Research, Université de Montréal, Montreal, QC, Canada
- Institute for Research in Immunology and Cancer, Université de Montréal, Montreal, QC, Canada
| | - Rafael J. Najmanovich
- Department of Pharmacology and Physiology, Université de Montréal, Montreal, QC, Canada
| |
Collapse
|
3
|
Mailhot O, Najmanovich R. The NRGTEN Python package: an extensible toolkit for coarse-grained normal mode analysis of proteins, nucleic acids, small molecules and their complexes. Bioinformatics 2021; 37:3369-3371. [PMID: 33742655 DOI: 10.1093/bioinformatics/btab189] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 02/24/2021] [Accepted: 03/17/2021] [Indexed: 01/11/2023] Open
Abstract
SUMMARY The Najmanovich Research Group Toolkit for Elastic Networks (NRGTEN) is a Python toolkit that implements four different NMA models in addition to popular and novel metrics to benchmark and measure properties from these models. Furthermore, the toolkit is available as a public Python package and is easily extensible for the development or implementation of additional NMA models. The inclusion of the ENCoM model (Elastic Network Contact Model) developed in our group within NRGTEN is noteworthy, owing to its account for the specific chemical nature of atomic interactions. AVAILABILITY AND IMPLEMENTATION https://github.com/gregorpatof/nrgten_package/.
Collapse
Affiliation(s)
- Olivier Mailhot
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal.,Institute for Research in Immunology and Cancer (IRIC), Faculty of Medicine, Université de Montréal
| | - Rafael Najmanovich
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal
| |
Collapse
|
4
|
Bose Majumdar A, Kim IJ, Na H. Effect of solvent on protein structure and dynamics. Phys Biol 2020; 17:036006. [DOI: 10.1088/1478-3975/ab74b3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
5
|
Bauer JA, Pavlović J, Bauerová-Hlinková V. Normal Mode Analysis as a Routine Part of a Structural Investigation. Molecules 2019; 24:E3293. [PMID: 31510014 PMCID: PMC6767145 DOI: 10.3390/molecules24183293] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Revised: 08/30/2019] [Accepted: 08/30/2019] [Indexed: 12/13/2022] Open
Abstract
Normal mode analysis (NMA) is a technique that can be used to describe the flexible states accessible to a protein about an equilibrium position. These states have been shown repeatedly to have functional significance. NMA is probably the least computationally expensive method for studying the dynamics of macromolecules, and advances in computer technology and algorithms for calculating normal modes over the last 20 years have made it nearly trivial for all but the largest systems. Despite this, it is still uncommon for NMA to be used as a component of the analysis of a structural study. In this review, we will describe NMA, outline its advantages and limitations, explain what can and cannot be learned from it, and address some criticisms and concerns that have been voiced about it. We will then review the most commonly used techniques for reducing the computational cost of this method and identify the web services making use of these methods. We will illustrate several of their possible uses with recent examples from the literature. We conclude by recommending that NMA become one of the standard tools employed in any structural study.
Collapse
Affiliation(s)
- Jacob A Bauer
- Institute of Molecular Biology, Slovak Academy of Sciences, Dúbravská cesta 21, 845 51 Bratislava, Slovakia.
| | - Jelena Pavlović
- Institute of Molecular Biology, Slovak Academy of Sciences, Dúbravská cesta 21, 845 51 Bratislava, Slovakia
| | - Vladena Bauerová-Hlinková
- Institute of Molecular Biology, Slovak Academy of Sciences, Dúbravská cesta 21, 845 51 Bratislava, Slovakia
| |
Collapse
|
6
|
Advances in coarse-grained modeling of macromolecular complexes. Curr Opin Struct Biol 2018; 52:119-126. [PMID: 30508766 DOI: 10.1016/j.sbi.2018.11.005] [Citation(s) in RCA: 101] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Revised: 11/05/2018] [Accepted: 11/17/2018] [Indexed: 01/12/2023]
Abstract
Recent progress in coarse-grained (CG) molecular modeling and simulation has facilitated an influx of computational studies on biological macromolecules and their complexes. Given the large separation of length-scales and time-scales that dictate macromolecular biophysics, CG modeling and simulation are well-suited to bridge the microscopic and mesoscopic or macroscopic details observed from all-atom molecular simulations and experiments, respectively. In this review, we first summarize recent innovations in the development of CG models, which broadly include structure-based, knowledge-based, and dynamics-based approaches. We then discuss recent applications of different classes of CG models to explore various macromolecular complexes. Finally, we conclude with an outlook for the future in this ever-growing field of biomolecular modeling.
Collapse
|
7
|
Coarse-grained dynamics of supramolecules: Conformational changes in outer shells of Dengue viruses. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2018; 143:20-37. [PMID: 30273615 DOI: 10.1016/j.pbiomolbio.2018.09.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 09/22/2018] [Accepted: 09/24/2018] [Indexed: 01/12/2023]
Abstract
While structural data on viruses are more and more common, information on their dynamics is much harder to obtain as those viruses form very large molecular complexes. In this paper, we propose a new method for computing the coarse-grained normal modes of such supra-molecules, NormalGo. A new formalism is developed to represent the Hessian of a quadratic potential using tensor products. This formalism is applied to the Tirion elastic potential, as well as to a Gō like potential. When combined with a fast method for computing a select set of eigenpairs of the Hessian, this new formalism enables the computation of thousands of normal modes of a full viral shell with more than one hundred thousand atoms in less than 2 h on a standard desktop computer. We then compare the two coarse-grained potentials. We show that, despite significant differences in their formulations, the Tirion and the Gō like potentials capture very similar dynamics characteristics of the molecule under study. However, we find that the Gō like potential should be preferred as it leads to less local deformations in the structure of the molecule during normal mode dynamics. Finally, we use NormalGo to characterize the structural transitions that occur when FAB fragments bind to the icosahedral outer shell of serotype 3 of the Dengue virus. We have identified residues at the surface of the outer shell that are important for the transition between the FAB-free and FAB-bound conformations, and therefore potentially useful for the design of antibodies to Dengue viruses.
Collapse
|
8
|
Koehl P. Large Eigenvalue Problems in Coarse-Grained Dynamic Analyses of Supramolecular Systems. J Chem Theory Comput 2018; 14:3903-3919. [DOI: 10.1021/acs.jctc.8b00338] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Patrice Koehl
- Department of Computer Sciences and Genome Center, University of California, Davis, California 95616, United States
| |
Collapse
|
9
|
Putz I, Brock O. Elastic network model of learned maintained contacts to predict protein motion. PLoS One 2017; 12:e0183889. [PMID: 28854238 PMCID: PMC5576689 DOI: 10.1371/journal.pone.0183889] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2016] [Accepted: 08/14/2017] [Indexed: 12/21/2022] Open
Abstract
We present a novel elastic network model, lmcENM, to determine protein motion even for localized functional motions that involve substantial changes in the protein's contact topology. Existing elastic network models assume that the contact topology remains unchanged throughout the motion and are thus most appropriate to simulate highly collective function-related movements. lmcENM uses machine learning to differentiate breaking from maintained contacts. We show that lmcENM accurately captures functional transitions unexplained by the classical ENM and three reference ENM variants, while preserving the simplicity of classical ENM. We demonstrate the effectiveness of our approach on a large set of proteins covering different motion types. Our results suggest that accurately predicting a "deformation-invariant" contact topology offers a promising route to increase the general applicability of ENMs. We also find that to correctly predict this contact topology a combination of several features seems to be relevant which may vary slightly depending on the protein. Additionally, we present case studies of two biologically interesting systems, Ferric Citrate membrane transporter FecA and Arachidonate 15-Lipoxygenase.
Collapse
Affiliation(s)
- Ines Putz
- Robotics and Biology Laboratory, Department of Computer Science and Electrical Engineering, Technische Universität Berlin, Berlin, Berlin, Germany
| | - Oliver Brock
- Robotics and Biology Laboratory, Department of Computer Science and Electrical Engineering, Technische Universität Berlin, Berlin, Berlin, Germany
| |
Collapse
|
10
|
Song G. Symmetry in normal modes and its strong dependence on symmetry in structure. J Mol Graph Model 2017; 75:32-41. [DOI: 10.1016/j.jmgm.2017.04.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 04/05/2017] [Accepted: 04/06/2017] [Indexed: 10/19/2022]
|
11
|
Frappier V, Chartier M, Najmanovich R. Applications of Normal Mode Analysis Methods in Computational Protein Design. Methods Mol Biol 2017; 1529:203-214. [PMID: 27914052 DOI: 10.1007/978-1-4939-6637-0_9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Recent advances in coarse-grained normal mode analysis methods make possible the large-scale prediction of the effect of mutations on protein stability and dynamics as well as the generation of biologically relevant conformational ensembles. Given the interplay between flexibility and enzymatic activity, the combined analysis of stability and dynamics using the Elastic Network Contact Model (ENCoM) method has ample applications in protein engineering in industrial and medical applications such as in computational antibody design. Here, we present a detailed tutorial on how to perform such calculations using ENCoM.
Collapse
Affiliation(s)
- Vincent Frappier
- Department of Biology, Massachusetts Institute of Technology, 77 Massachusetts avenue, Cambridge, MA, 02139, USA
- Faculty of Medicine and Health Sciences, Department of Biochemistry, University of Sherbrooke, 3001, 12 Av., NordSherbrooke, QCJ1H 5N4, Canada
| | - Matthieu Chartier
- Faculty of Medicine and Health Sciences, Department of Biochemistry, University of Sherbrooke, 3001, 12 Av., NordSherbrooke, QCJ1H 5N4, Canada
| | - Rafael Najmanovich
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montreal, Montreal, H3C 3J7, QC, Canada.
- Faculty of Medicine and Health Sciences, Department of Biochemistry, University of Sherbrooke, 3001, 12 Av., NordSherbrooke, QCJ1H 5N4, Canada.
| |
Collapse
|
12
|
Peterson L, Jamroz M, Kolinski A, Kihara D. Predicting Real-Valued Protein Residue Fluctuation Using FlexPred. Methods Mol Biol 2017; 1484:175-186. [PMID: 27787827 DOI: 10.1007/978-1-4939-6406-2_13] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The conventional view of a protein structure as static provides only a limited picture. There is increasing evidence that protein dynamics are often vital to protein function including interaction with partners such as other proteins, nucleic acids, and small molecules. Considering flexibility is also important in applications such as computational protein docking and protein design. While residue flexibility is partially indicated by experimental measures such as the B-factor from X-ray crystallography and ensemble fluctuation from nuclear magnetic resonance (NMR) spectroscopy as well as computational molecular dynamics (MD) simulation, these techniques are resource-intensive. In this chapter, we describe the web server and stand-alone version of FlexPred, which rapidly predicts absolute per-residue fluctuation from a three-dimensional protein structure. On a set of 592 nonredundant structures, comparing the fluctuations predicted by FlexPred to the observed fluctuations in MD simulations showed an average correlation coefficient of 0.669 and an average root mean square error of 1.07 Å. FlexPred is available at http://kiharalab.org/flexPred/ .
Collapse
Affiliation(s)
- Lenna Peterson
- Department of Biological Sciences, College of Science, Purdue University, 915 W. State Street, West Lafayette, IN, 47907-2054, USA
| | - Michal Jamroz
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Pasteura 1, Warszawa, 02-093, Poland
| | - Andrzej Kolinski
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Pasteura 1, Warszawa, 02-093, Poland
| | - Daisuke Kihara
- Department of Biological Sciences, College of Science, Purdue University, 915 W. State Street, West Lafayette, IN, 47907-2054, USA. .,Department of Computer Science, College of Science, Purdue University, 305 N. University Street, West Lafayette, IN, 47907-2107, USA.
| |
Collapse
|
13
|
|
14
|
Na H, Jernigan RL, Song G. Bridging between NMA and Elastic Network Models: Preserving All-Atom Accuracy in Coarse-Grained Models. PLoS Comput Biol 2015; 11:e1004542. [PMID: 26473491 PMCID: PMC4608564 DOI: 10.1371/journal.pcbi.1004542] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Accepted: 09/08/2015] [Indexed: 11/24/2022] Open
Abstract
Dynamics can provide deep insights into the functional mechanisms of proteins and protein complexes. For large protein complexes such as GroEL/GroES with more than 8,000 residues, obtaining a fine-grained all-atom description of its normal mode motions can be computationally prohibitive and is often unnecessary. For this reason, coarse-grained models have been used successfully. However, most existing coarse-grained models use extremely simple potentials to represent the interactions within the coarse-grained structures and as a result, the dynamics obtained for the coarse-grained structures may not always be fully realistic. There is a gap between the quality of the dynamics of the coarse-grained structures given by all-atom models and that by coarse-grained models. In this work, we resolve an important question in protein dynamics computations—how can we efficiently construct coarse-grained models whose description of the dynamics of the coarse-grained structures remains as accurate as that given by all-atom models? Our method takes advantage of the sparseness of the Hessian matrix and achieves a high efficiency with a novel iterative matrix projection approach. The result is highly significant since it can provide descriptions of normal mode motions at an all-atom level of accuracy even for the largest biomolecular complexes. The application of our method to GroEL/GroES offers new insights into the mechanism of this biologically important chaperonin, such as that the conformational transitions of this protein complex in its functional cycle are even more strongly connected to the first few lowest frequency modes than with other coarse-grained models. Proteins and other biomolecules are not static but are constantly in motion. Moreover, they possess intrinsic collective motion patterns that are tightly linked to their functions. Thus, an accurate and detailed description of their motions can provide deep insights into their functional mechanisms. For large protein complexes with hundreds of thousands of atoms or more, an atomic level description of the motions can be computationally prohibitive, and so coarse-grained models with fewer structural details are often used instead. However, there can be a big gap between the quality of motions derived from atomic models and those from coarse-grained models. In this work, we solve an important problem in protein dynamics studies: how to preserve the atomic-level accuracy in describing molecular motions while using coarse-grained models? We accomplish this by developing a novel iterative matrix projection method that dramatically speeds up the computations. This method is significant since it promises accurate descriptions of protein motions approaching an all-atom level even for the largest biomolecular complexes. Results shown here for a large molecular chaperonin demonstrate how this can provide new insights into its functional process.
Collapse
Affiliation(s)
- Hyuntae Na
- Department of Computer Science, Iowa State University, Ames, Iowa, United States of America
| | - Robert L. Jernigan
- Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa, United States of America
- Program of Bioinformatics and Computational Biology, Iowa State University, Ames, Iowa, United States of America
- L. H. Baker Center for Bioinformatics and Biological Statistics, Iowa State University, Ames, Iowa, United States of America
| | - Guang Song
- Department of Computer Science, Iowa State University, Ames, Iowa, United States of America
- Program of Bioinformatics and Computational Biology, Iowa State University, Ames, Iowa, United States of America
- L. H. Baker Center for Bioinformatics and Biological Statistics, Iowa State University, Ames, Iowa, United States of America
- * E-mail:
| |
Collapse
|
15
|
Frappier V, Chartier M, Najmanovich RJ. ENCoM server: exploring protein conformational space and the effect of mutations on protein function and stability. Nucleic Acids Res 2015; 43:W395-400. [PMID: 25883149 PMCID: PMC4489264 DOI: 10.1093/nar/gkv343] [Citation(s) in RCA: 137] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2015] [Revised: 03/27/2015] [Accepted: 04/06/2015] [Indexed: 11/24/2022] Open
Abstract
ENCoM is a coarse-grained normal mode analysis method recently introduced that unlike previous such methods is unique in that it accounts for the nature of amino acids. The inclusion of this layer of information was shown to improve conformational space sampling and apply for the first time a coarse-grained normal mode analysis method to predict the effect of single point mutations on protein dynamics and thermostability resulting from vibrational entropy changes. Here we present a web server that allows non-technical users to have access to ENCoM calculations to predict the effect of mutations on thermostability and dynamics as well as to generate geometrically realistic conformational ensembles. The server is accessible at: http://bcb.med.usherbrooke.ca/encom.
Collapse
Affiliation(s)
- Vincent Frappier
- Department of Biochemistry, Faculty of Medicine and Health Sciences, University of Sherbrooke, Sherbrooke, Quebec, J1H 5N4, Canada
| | - Matthieu Chartier
- Department of Biochemistry, Faculty of Medicine and Health Sciences, University of Sherbrooke, Sherbrooke, Quebec, J1H 5N4, Canada
| | - Rafael J Najmanovich
- Department of Biochemistry, Faculty of Medicine and Health Sciences, University of Sherbrooke, Sherbrooke, Quebec, J1H 5N4, Canada
| |
Collapse
|
16
|
Sim J, Sim J, Park E, Lee J. Method for identification of rigid domains and hinge residues in proteins based on exhaustive enumeration. Proteins 2015; 83:1054-67. [DOI: 10.1002/prot.24799] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2014] [Revised: 02/28/2015] [Accepted: 03/10/2015] [Indexed: 11/07/2022]
Affiliation(s)
- Jaehyun Sim
- Department of Oral Microbiology and Immunology; School of Dentistry, Seoul National University; Seoul 110-749 Korea
| | - Jun Sim
- Department of Bioinformatics and Life Science; Soongsil University; Seoul 156-743 Korea
| | - Eunsung Park
- Administrative Service Division, Apsun Dental Hospital; Seoul 135-590 Korea
| | - Julian Lee
- Department of Oral Microbiology and Immunology; School of Dentistry, Seoul National University; Seoul 110-749 Korea
| |
Collapse
|
17
|
Na H, Song G. Conventional NMA as a better standard for evaluating elastic network models. Proteins 2014; 83:259-67. [DOI: 10.1002/prot.24735] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2014] [Revised: 11/14/2014] [Accepted: 11/26/2014] [Indexed: 11/08/2022]
Affiliation(s)
- Hyuntae Na
- Department of Computer Science; Iowa State University; Ames Iowa 50011
| | - Guang Song
- Department of Computer Science; Iowa State University; Ames Iowa 50011
- Program of Bioinformatics and Computational Biology, Iowa State University; Ames Iowa 50011
- L.H. Baker Center for Bioinformatics and Biological Statistics, Iowa State University; Ames Iowaa 50011
| |
Collapse
|
18
|
Zimmermann MT, Jernigan RL. Elastic network models capture the motions apparent within ensembles of RNA structures. RNA (NEW YORK, N.Y.) 2014; 20:792-804. [PMID: 24759093 PMCID: PMC4024634 DOI: 10.1261/rna.041269.113] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
The role of structure and dynamics in mechanisms for RNA becomes increasingly important. Computational approaches using simple dynamics models have been successful at predicting the motions of proteins and are often applied to ribonucleo-protein complexes but have not been thoroughly tested for well-packed nucleic acid structures. In order to characterize a true set of motions, we investigate the apparent motions from 16 ensembles of experimentally determined RNA structures. These indicate a relatively limited set of motions that are captured by a small set of principal components (PCs). These limited motions closely resemble the motions computed from low frequency normal modes from elastic network models (ENMs), either at atomic or coarse-grained resolution. Various ENM model types, parameters, and structure representations are tested here against the experimental RNA structural ensembles, exposing differences between models for proteins and for folded RNAs. Differences in performance are seen, depending on the structure alignment algorithm used to generate PCs, modulating the apparent utility of ENMs but not significantly impacting their ability to generate functional motions. The loss of dynamical information upon coarse-graining is somewhat larger for RNAs than for globular proteins, indicating, perhaps, the lower cooperativity of the less densely packed RNA. However, the RNA structures show less sensitivity to the elastic network model parameters than do proteins. These findings further demonstrate the utility of ENMs and the appropriateness of their application to well-packed RNA-only structures, justifying their use for studying the dynamics of ribonucleo-proteins, such as the ribosome and regulatory RNAs.
Collapse
Affiliation(s)
- Michael T. Zimmermann
- Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa 50011, USA
- Bioinformatics and Computational Biology Interdepartmental Graduate Program, Iowa State University, Ames, Iowa 50011, USA
| | - Robert L. Jernigan
- Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa 50011, USA
- Bioinformatics and Computational Biology Interdepartmental Graduate Program, Iowa State University, Ames, Iowa 50011, USA
- Corresponding authorE-mail
| |
Collapse
|
19
|
Na H, Song G. Bridging between normal mode analysis and elastic network models. Proteins 2014; 82:2157-68. [DOI: 10.1002/prot.24571] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2013] [Revised: 03/06/2014] [Accepted: 03/21/2014] [Indexed: 11/10/2022]
Affiliation(s)
- Hyuntae Na
- Department of Computer Science; Iowa State University; Ames Iowa 50011
| | - Guang Song
- Department of Computer Science; Iowa State University; Ames Iowa 50011
- Program of Bioinformatics and Computational Biology; Iowa State University; Ames Iowa 50011
- L. H. Baker Center for Bioinformatics and Biological Statistics, Iowa State University; Ames Iowa 50011
| |
Collapse
|
20
|
Na H, Song G. A natural unification of GNM and ANM and the role of inter-residue forces. Phys Biol 2014; 11:036002. [DOI: 10.1088/1478-3975/11/3/036002] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
21
|
Frappier V, Najmanovich RJ. A coarse-grained elastic network atom contact model and its use in the simulation of protein dynamics and the prediction of the effect of mutations. PLoS Comput Biol 2014; 10:e1003569. [PMID: 24762569 PMCID: PMC3998880 DOI: 10.1371/journal.pcbi.1003569] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Accepted: 02/25/2014] [Indexed: 11/18/2022] Open
Abstract
Normal mode analysis (NMA) methods are widely used to study dynamic aspects of protein structures. Two critical components of NMA methods are coarse-graining in the level of simplification used to represent protein structures and the choice of potential energy functional form. There is a trade-off between speed and accuracy in different choices. In one extreme one finds accurate but slow molecular-dynamics based methods with all-atom representations and detailed atom potentials. On the other extreme, fast elastic network model (ENM) methods with Cα-only representations and simplified potentials that based on geometry alone, thus oblivious to protein sequence. Here we present ENCoM, an Elastic Network Contact Model that employs a potential energy function that includes a pairwise atom-type non-bonded interaction term and thus makes it possible to consider the effect of the specific nature of amino-acids on dynamics within the context of NMA. ENCoM is as fast as existing ENM methods and outperforms such methods in the generation of conformational ensembles. Here we introduce a new application for NMA methods with the use of ENCoM in the prediction of the effect of mutations on protein stability. While existing methods are based on machine learning or enthalpic considerations, the use of ENCoM, based on vibrational normal modes, is based on entropic considerations. This represents a novel area of application for NMA methods and a novel approach for the prediction of the effect of mutations. We compare ENCoM to a large number of methods in terms of accuracy and self-consistency. We show that the accuracy of ENCoM is comparable to that of the best existing methods. We show that existing methods are biased towards the prediction of destabilizing mutations and that ENCoM is less biased at predicting stabilizing mutations.
Collapse
Affiliation(s)
- Vincent Frappier
- Department of Biochemistry, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, Quebec, Canada
| | - Rafael J Najmanovich
- Department of Biochemistry, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, Quebec, Canada
| |
Collapse
|
22
|
Jamroz M, Kolinski A, Kihara D. Structural features that predict real-value fluctuations of globular proteins. Proteins 2012; 80:1425-35. [PMID: 22328193 DOI: 10.1002/prot.24040] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2011] [Revised: 01/03/2012] [Accepted: 01/11/2012] [Indexed: 12/20/2022]
Abstract
It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics (MD) trajectories of nonhomologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real value of residue fluctuations using the support vector regression (SVR). It was found that some structural features have higher correlation than crystallographic B-factors with fluctuations observed in MD trajectories. Moreover, SVR that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson's correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed in predictions by the Gaussian network model (GNM). An advantage of the developed method over the GNMs is that the former predicts the real value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins.
Collapse
Affiliation(s)
- Michal Jamroz
- Laboratory of Theory of Biopolymers, Faculty of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warszawa, Poland
| | | | | |
Collapse
|
23
|
Zimmermann MT, Kloczkowski A, Jernigan RL. MAVENs: motion analysis and visualization of elastic networks and structural ensembles. BMC Bioinformatics 2011; 12:264. [PMID: 21711533 PMCID: PMC3213244 DOI: 10.1186/1471-2105-12-264] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2011] [Accepted: 06/28/2011] [Indexed: 11/16/2022] Open
Abstract
Background The ability to generate, visualize, and analyze motions of biomolecules has made a significant impact upon modern biology. Molecular Dynamics has gained substantial use, but remains computationally demanding and difficult to setup for many biologists. Elastic network models (ENMs) are an alternative and have been shown to generate the dominant equilibrium motions of biomolecules quickly and efficiently. These dominant motions have been shown to be functionally relevant and also to indicate the likely direction of conformational changes. Most structures have a small number of dominant motions. Comparing computed motions to the structure's conformational ensemble derived from a collection of static structures or frames from an MD trajectory is an important way to understand functional motions as well as evaluate the models. Modes of motion computed from ENMs can be visualized to gain functional and mechanistic understanding and to compute useful quantities such as average positional fluctuations, internal distance changes, collectiveness of motions, and directional correlations within the structure. Results Our new software, MAVEN, aims to bring ENMs and their analysis to a broader audience by integrating methods for their generation and analysis into a user friendly environment that automates many of the steps. Models can be constructed from raw PDB files or density maps, using all available atomic coordinates or by employing various coarse-graining procedures. Visualization can be performed either with our software or exported to molecular viewers. Mixed resolution models allow one to study atomic effects on the system while retaining much of the computational speed of the coarse-grained ENMs. Analysis options are available to further aid the user in understanding the computed motions and their importance for its function. Conclusion MAVEN has been developed to simplify ENM generation, allow for diverse models to be used, and facilitate useful analyses, all on the same platform. This represents an integrated approach that incorporates all four levels of the modeling process - generation, evaluation, analysis, visualization - and also brings to bear multiple ENM types. The intension is to provide a versatile modular suite of programs to a broader audience. MAVEN is available for download at http://maven.sourceforge.net.
Collapse
Affiliation(s)
- Michael T Zimmermann
- LH Baker Center for Bioinformatics and Biological Statistics, Iowa State University, Ames, IA 50011, USA
| | | | | |
Collapse
|