1
|
Ribeiro RP, Goßen J, Rossetti G, Giorgetti A. Structural Systems Biology Toolkit (SSBtoolkit): From Molecular Structure to Subcellular Signaling Pathways. J Chem Inf Model 2025. [PMID: 40248991 DOI: 10.1021/acs.jcim.5c00165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2025]
Abstract
Here, we introduce the Structural Systems Biology (SSB) toolkit, a Python library that integrates structural macromolecular data with systems biology simulations to model signal-transduction pathways of G-protein-coupled receptors (GPCRs). Our framework streamlines simulation and analysis of the mathematical models of GPCRs cellular pathways, facilitating the exploration of the signal-transduction kinetics induced by ligand-GPCR interactions: the dose-response of the ligand can be modeled, along with the corresponding change in the concentration of other signaling molecular species over time, like for instance [Ca2+] or [cAMP]. SSB toolkit brings to light the possibility of easily investigating the subcellular effects of ligand binding on receptor activation, even in the presence of genetic mutations, thereby enhancing our understanding of the intricate relationship between ligand-target interactions at the molecular level and the higher-level cellular and (patho)physiological response mechanisms.
Collapse
Affiliation(s)
- Rui Pedro Ribeiro
- Department of Biotechnology, University of Verona, Verona 37129,Italy
| | - Jonas Goßen
- IAS-5/INM-9, Forschungszentrum, 52425 Jülich, Germany
- Faculty of Mathematics, Computer Science and Natural Sciences RWTH Aachen University, 52062 Aachen, Germany
| | - Giulia Rossetti
- IAS-5/INM-9, Forschungszentrum, 52425 Jülich, Germany
- Jülich Supercomputing Center, Forschungszentrum, 52425 Jülich, Germany
- Department of Neurology, University Hospital RWTH Aachen, 52074 Aachen, Germany
| | - Alejandro Giorgetti
- Department of Biotechnology, University of Verona, Verona 37129,Italy
- IAS-5/INM-9, Forschungszentrum, 52425 Jülich, Germany
| |
Collapse
|
2
|
Eriksson O, Bhalla US, Blackwell KT, Crook SM, Keller D, Kramer A, Linne ML, Saudargienė A, Wade RC, Hellgren Kotaleski J. Combining hypothesis- and data-driven neuroscience modeling in FAIR workflows. eLife 2022; 11:e69013. [PMID: 35792600 PMCID: PMC9259018 DOI: 10.7554/elife.69013] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 05/13/2022] [Indexed: 12/22/2022] Open
Abstract
Modeling in neuroscience occurs at the intersection of different points of view and approaches. Typically, hypothesis-driven modeling brings a question into focus so that a model is constructed to investigate a specific hypothesis about how the system works or why certain phenomena are observed. Data-driven modeling, on the other hand, follows a more unbiased approach, with model construction informed by the computationally intensive use of data. At the same time, researchers employ models at different biological scales and at different levels of abstraction. Combining these models while validating them against experimental data increases understanding of the multiscale brain. However, a lack of interoperability, transparency, and reusability of both models and the workflows used to construct them creates barriers for the integration of models representing different biological scales and built using different modeling philosophies. We argue that the same imperatives that drive resources and policy for data - such as the FAIR (Findable, Accessible, Interoperable, Reusable) principles - also support the integration of different modeling approaches. The FAIR principles require that data be shared in formats that are Findable, Accessible, Interoperable, and Reusable. Applying these principles to models and modeling workflows, as well as the data used to constrain and validate them, would allow researchers to find, reuse, question, validate, and extend published models, regardless of whether they are implemented phenomenologically or mechanistically, as a few equations or as a multiscale, hierarchical system. To illustrate these ideas, we use a classical synaptic plasticity model, the Bienenstock-Cooper-Munro rule, as an example due to its long history, different levels of abstraction, and implementation at many scales.
Collapse
Affiliation(s)
- Olivia Eriksson
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
| | - Upinder Singh Bhalla
- National Center for Biological Sciences, Tata Institute of Fundamental ResearchBangaloreIndia
| | - Kim T Blackwell
- Department of Bioengineering, Volgenau School of Engineering, George Mason UniversityFairfaxUnited States
| | - Sharon M Crook
- School of Mathematical and Statistical Sciences, Arizona State UniversityTempeUnited States
| | - Daniel Keller
- Blue Brain Project, École Polytechnique Fédérale de LausanneLausanneSwitzerland
| | - Andrei Kramer
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| | - Marja-Leena Linne
- Faculty of Medicine and Health Technology, Tampere UniversityTampereFinland
| | - Ausra Saudargienė
- Neuroscience Institute, Lithuanian University of Health SciencesKaunasLithuania
- Department of Informatics, Vytautas Magnus UniversityKaunasLithuania
| | - Rebecca C Wade
- Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS)HeidelbergGermany
- Center for Molecular Biology (ZMBH), ZMBH-DKFZ Alliance, University of HeidelbergHeidelbergGermany
- Interdisciplinary Center for Scientific Computing (IWR), Heidelberg UniversityHeidelbergGermany
| | - Jeanette Hellgren Kotaleski
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| |
Collapse
|
3
|
van Keulen SC, Martin J, Colizzi F, Frezza E, Trpevski D, Diaz NC, Vidossich P, Rothlisberger U, Hellgren Kotaleski J, Wade RC, Carloni P. Multiscale molecular simulations to investigate adenylyl cyclase‐based signaling in the brain. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1623] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Siri C. van Keulen
- Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Science for Life, Faculty of Science – Chemistry Utrecht University Utrecht The Netherlands
| | - Juliette Martin
- CNRS, UMR 5086 Molecular Microbiology and Structural Biochemistry University of Lyon Lyon France
| | - Francesco Colizzi
- Molecular Ocean Laboratory, Department of Marine Biology and Oceanography Institute of Marine Sciences, ICM‐CSIC Barcelona Spain
| | - Elisa Frezza
- Université Paris Cité, CiTCoM, CNRS Paris France
| | - Daniel Trpevski
- Science for Life Laboratory, School of Electrical Engineering and Computer Science KTH Royal Institute of Technology Stockholm
| | - Nuria Cirauqui Diaz
- CNRS, UMR 5086 Molecular Microbiology and Structural Biochemistry University of Lyon Lyon France
| | - Pietro Vidossich
- Molecular Modeling and Drug Discovery Lab Istituto Italiano di Tecnologia Genoa Italy
| | - Ursula Rothlisberger
- Laboratory of Computational Chemistry and Biochemistry Ecole Polytechnique Fédérale de Lausanne (EPFL) Lausanne
| | - Jeanette Hellgren Kotaleski
- Science for Life Laboratory, School of Electrical Engineering and Computer Science KTH Royal Institute of Technology Stockholm
- Department of Neuroscience Karolinska Institute Stockholm
| | - Rebecca C. Wade
- Molecular and Cellular Modeling Group Heidelberg Institute for Theoretical Studies (HITS) Heidelberg Germany
- Center for Molecular Biology (ZMBH), DKFZ‐ZMBH Alliance, and Interdisciplinary Center for Scientific Computing (IWR) Heidelberg University Heidelberg Germany
| | - Paolo Carloni
- Institute for Neuroscience and Medicine (INM‐9) and Institute for Advanced Simulations (IAS‐5) “Computational biomedicine” Forschungszentrum Jülich Jülich Germany
- INM‐11 JARA‐Institute: Molecular Neuroscience and Neuroimaging Forschungszentrum Jülich Jülich Germany
| |
Collapse
|
4
|
Swanson JMJ. Multiscale kinetic analysis of proteins. Curr Opin Struct Biol 2022; 72:169-175. [PMID: 34923310 PMCID: PMC9288830 DOI: 10.1016/j.sbi.2021.11.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 10/30/2021] [Accepted: 11/01/2021] [Indexed: 02/03/2023]
Abstract
The stochasticity of molecular motion results in the existence of multiple kinetically relevant pathways in many biomolecular mechanisms. Because it is highly demanding to characterize them for complex systems, mechanisms are often described with a single-pathway perspective. However, kinetic network analysis and sub-ensemble experimental insight are increasingly demonstrating not only the existence of competing pathways but also the importance of kinetic selection in biology. This review focuses on advances in multiscale kinetic analysis of proteins, which connects molecular level information from simulations to macroscopic data to characterize mechanistic reaction networks and the reactive flux through them. We describe a range of methods used and highlight several examples where kinetic modeling has revealed functional importance of pathway heterogeneity.
Collapse
|
5
|
Bruce NJ, Narzi D, Trpevski D, van Keulen SC, Nair AG, Röthlisberger U, Wade RC, Carloni P, Hellgren Kotaleski J. Regulation of adenylyl cyclase 5 in striatal neurons confers the ability to detect coincident neuromodulatory signals. PLoS Comput Biol 2019; 15:e1007382. [PMID: 31665146 PMCID: PMC6821081 DOI: 10.1371/journal.pcbi.1007382] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Accepted: 09/05/2019] [Indexed: 02/04/2023] Open
Abstract
Long-term potentiation and depression of synaptic activity in response to stimuli is a key factor in reinforcement learning. Strengthening of the corticostriatal synapses depends on the second messenger cAMP, whose synthesis is catalysed by the enzyme adenylyl cyclase 5 (AC5), which is itself regulated by the stimulatory Gαolf and inhibitory Gαi proteins. AC isoforms have been suggested to act as coincidence detectors, promoting cellular responses only when convergent regulatory signals occur close in time. However, the mechanism for this is currently unclear, and seems to lie in their diverse regulation patterns. Despite attempts to isolate the ternary complex, it is not known if Gαolf and Gαi can bind to AC5 simultaneously, nor what activity the complex would have. Using protein structure-based molecular dynamics simulations, we show that this complex is stable and inactive. These simulations, along with Brownian dynamics simulations to estimate protein association rates constants, constrain a kinetic model that shows that the presence of this ternary inactive complex is crucial for AC5's ability to detect coincident signals, producing a synergistic increase in cAMP. These results reveal some of the prerequisites for corticostriatal synaptic plasticity, and explain recent experimental data on cAMP concentrations following receptor activation. Moreover, they provide insights into the regulatory mechanisms that control signal processing by different AC isoforms.
Collapse
Affiliation(s)
- Neil J. Bruce
- Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS), Schloss-Heidelberg, Germany
| | - Daniele Narzi
- Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), CH-1015 Lausanne, Switzerland
| | - Daniel Trpevski
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Siri C. van Keulen
- Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), CH-1015 Lausanne, Switzerland
- Department of Computer Science, Stanford University, Stanford, California, United States of America
| | - Anu G. Nair
- Institute of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Ursula Röthlisberger
- Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), CH-1015 Lausanne, Switzerland
| | - Rebecca C. Wade
- Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS), Schloss-Heidelberg, Germany
- Center for Molecular Biology (ZMBH), DKFZ-ZMBH Alliance, Heidelberg University, Heidelberg, Germany
- Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Heidelberg, Germany
| | - Paolo Carloni
- Department of Physics and Department of Neurobiology, RWTH Aachen University,Aachen, Germany
- Institute for Neuroscience and Medicine (INM)-11, Forschungszentrum Jülich, Jülich, Germany
- Institute of Neuroscience and Medicine (INM-9), Forschungszentrum Jülich, Jülich, Germany
- Institute for Advanced Simulation (IAS-5), Forschungszentrum Jülich, Jülich, Germany
| | - Jeanette Hellgren Kotaleski
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
- Department of Neuroscience, Karolinska Institutet, Solna, Sweden
| |
Collapse
|
6
|
Jordan EJ, Patil K, Suresh K, Park JH, Mosse YP, Lemmon MA, Radhakrishnan R. Computational algorithms for in silico profiling of activating mutations in cancer. Cell Mol Life Sci 2019; 76:2663-2679. [PMID: 30982079 PMCID: PMC6589134 DOI: 10.1007/s00018-019-03097-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Revised: 04/01/2019] [Accepted: 04/08/2019] [Indexed: 12/17/2022]
Abstract
Methods to catalog and computationally assess the mutational landscape of proteins in human cancers are desirable. One approach is to adapt evolutionary or data-driven methods developed for predicting whether a single-nucleotide polymorphism (SNP) is deleterious to protein structure and function. In cases where understanding the mechanism of protein activation and regulation is desired, an alternative approach is to employ structure-based computational approaches to predict the effects of point mutations. Through a case study of mutations in kinase domains of three proteins, namely, the anaplastic lymphoma kinase (ALK) in pediatric neuroblastoma patients, serine/threonine-protein kinase B-Raf (BRAF) in melanoma patients, and erythroblastic oncogene B 2 (ErbB2 or HER2) in breast cancer patients, we compare the two approaches above. We find that the structure-based method is most appropriate for developing a binary classification of several different mutations, especially infrequently occurring ones, concerning the activation status of the given target protein. This approach is especially useful if the effects of mutations on the interactions of inhibitors with the target proteins are being sought. However, many patients will present with mutations spread across different target proteins, making structure-based models computationally demanding to implement and execute. In this situation, data-driven methods-including those based on machine learning techniques and evolutionary methods-are most appropriate for recognizing and illuminate mutational patterns. We show, however, that, in the present status of the field, the two methods have very different accuracies and confidence values, and hence, the optimal choice of their deployment is context-dependent.
Collapse
Affiliation(s)
- E Joseph Jordan
- Graduate Group in Biochemistry and Molecular Biophysics, University of Pennsylvania, Philadelphia, PA, USA
| | - Keshav Patil
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, USA
| | - Krishna Suresh
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, USA
| | - Jin H Park
- Department of Pharmacology, Yale University, New Haven, CT, USA
- Cancer Biology Institute, Yale University, West Haven, CT, USA
| | - Yael P Mosse
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Mark A Lemmon
- Department of Pharmacology, Yale University, New Haven, CT, USA
- Cancer Biology Institute, Yale University, West Haven, CT, USA
| | - Ravi Radhakrishnan
- Graduate Group in Biochemistry and Molecular Biophysics, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
7
|
Wang B, Xie ZR, Chen J, Wu Y. Integrating Structural Information to Study the Dynamics of Protein-Protein Interactions in Cells. Structure 2018; 26:1414-1424.e3. [PMID: 30174150 DOI: 10.1016/j.str.2018.07.010] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2018] [Revised: 06/12/2018] [Accepted: 07/24/2018] [Indexed: 02/07/2023]
Abstract
The information of how two proteins interact is embedded in the atomic details of their binding interfaces. These interactions, spatial-temporally coordinating each other as a network in a variable cytoplasmic environment, dominate almost all biological functions. A feasible and reliable computational model is highly demanded to realistically simulate these cellular processes and unravel the complexities beneath them. We therefore present a multiscale framework that integrates simulations on two different scales. The higher-resolution model incorporates structural information of proteins and energetics of their binding, while the lower-resolution model uses a highly simplified representation of proteins to capture the long-time-scale dynamics of a system with multiple proteins. Through a systematic benchmark test and two practical applications of biomolecular systems with specific cellular functions, we demonstrated that this method could be a powerful approach to understand molecular mechanisms of dynamic interactions between biomolecules and their functional impacts with high computational efficiency.
Collapse
Affiliation(s)
- Bo Wang
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, New York, NY 10461, USA
| | - Zhong-Ru Xie
- College of Engineering, University of Georgia, Athens, GA 30602, USA
| | - Jiawen Chen
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, New York, NY 10461, USA
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, New York, NY 10461, USA.
| |
Collapse
|
8
|
Eccleston RC, Wan S, Dalchau N, Coveney PV. The Role of Multiscale Protein Dynamics in Antigen Presentation and T Lymphocyte Recognition. Front Immunol 2017; 8:797. [PMID: 28740497 PMCID: PMC5502259 DOI: 10.3389/fimmu.2017.00797] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2016] [Accepted: 06/22/2017] [Indexed: 12/15/2022] Open
Abstract
T lymphocytes are stimulated when they recognize short peptides bound to class I proteins of the major histocompatibility complex (MHC) protein, as peptide-MHC complexes. Due to the diversity in T-cell receptor (TCR) molecules together with both the peptides and MHC proteins they bind to, it has been difficult to design vaccines and treatments based on these interactions. Machine learning has made some progress in trying to predict the immunogenicity of peptide sequences in the context of specific MHC class I alleles but, as such approaches cannot integrate temporal information and lack explanatory power, their scope will always be limited. Here, we advocate a mechanistic description of antigen presentation and TCR activation which is explanatory, predictive, and quantitative, drawing on modeling approaches that collectively span several length and time scales, being capable of furnishing reliable biological descriptions that are difficult for experimentalists to provide. It is a form of multiscale systems biology. We propose the use of chemical rate equations to describe the time evolution of the foreign and host proteins to explain how the original proteins end up being presented on the cell surface as peptide fragments, while we invoke molecular dynamics to describe the key binding processes on the molecular level, including those of peptide-MHC complexes with TCRs which lie at the heart of the immune response. On each level, complementary methods based on machine learning are available, and we discuss the relationship between these divergent approaches. The pursuit of predictive mechanistic modeling approaches requires experimentalists to adapt their work so as to acquire, store, and expose data that can be used to verify and validate such models.
Collapse
Affiliation(s)
- R Charlotte Eccleston
- Centre for Computational Science, Department of Chemistry, University College London, London, United Kingdom
| | - Shunzhou Wan
- Centre for Computational Science, Department of Chemistry, University College London, London, United Kingdom
| | | | - Peter V Coveney
- Centre for Computational Science, Department of Chemistry, University College London, London, United Kingdom
| |
Collapse
|
9
|
Davtyan A, Voth GA, Andersen HC. Dynamic force matching: Construction of dynamic coarse-grained models with realistic short time dynamics and accurate long time dynamics. J Chem Phys 2016; 145:224107. [DOI: 10.1063/1.4971430] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Affiliation(s)
- Aram Davtyan
- Department of Chemistry, The James Franck Institute and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Gregory A. Voth
- Department of Chemistry, The James Franck Institute and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, USA
| | - Hans C. Andersen
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
10
|
Davtyan A, Dama JF, Voth GA, Andersen HC. Dynamic force matching: A method for constructing dynamical coarse-grained models with realistic time dependence. J Chem Phys 2016; 142:154104. [PMID: 25903863 DOI: 10.1063/1.4917454] [Citation(s) in RCA: 73] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Coarse-grained (CG) models of molecular systems, with fewer mechanical degrees of freedom than an all-atom model, are used extensively in chemical physics. It is generally accepted that a coarse-grained model that accurately describes equilibrium structural properties (as a result of having a well constructed CG potential energy function) does not necessarily exhibit appropriate dynamical behavior when simulated using conservative Hamiltonian dynamics for the CG degrees of freedom on the CG potential energy surface. Attempts to develop accurate CG dynamic models usually focus on replacing Hamiltonian motion by stochastic but Markovian dynamics on that surface, such as Langevin or Brownian dynamics. However, depending on the nature of the system and the extent of the coarse-graining, a Markovian dynamics for the CG degrees of freedom may not be appropriate. In this paper, we consider the problem of constructing dynamic CG models within the context of the Multi-Scale Coarse-graining (MS-CG) method of Voth and coworkers. We propose a method of converting a MS-CG model into a dynamic CG model by adding degrees of freedom to it in the form of a small number of fictitious particles that interact with the CG degrees of freedom in simple ways and that are subject to Langevin forces. The dynamic models are members of a class of nonlinear systems interacting with special heat baths that were studied by Zwanzig [J. Stat. Phys. 9, 215 (1973)]. The properties of the fictitious particles can be inferred from analysis of the dynamics of all-atom simulations of the system of interest. This is analogous to the fact that the MS-CG method generates the CG potential from analysis of equilibrium structures observed in all-atom simulation data. The dynamic models generate a non-Markovian dynamics for the CG degrees of freedom, but they can be easily simulated using standard molecular dynamics programs. We present tests of this method on a series of simple examples that demonstrate that the method provides realistic dynamical CG models that have non-Markovian or close to Markovian behavior that is consistent with the actual dynamical behavior of the all-atom system used to construct the CG model. Both the construction and the simulation of such a dynamic CG model have computational requirements that are similar to those of the corresponding MS-CG model and are good candidates for CG modeling of very large systems.
Collapse
Affiliation(s)
- Aram Davtyan
- Department of Chemistry, The James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - James F Dama
- Department of Chemistry, The James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Gregory A Voth
- Department of Chemistry, The James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Hans C Andersen
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
11
|
Samish I, Bourne PE, Najmanovich RJ. Achievements and challenges in structural bioinformatics and computational biophysics. Bioinformatics 2014; 31:146-50. [PMID: 25488929 PMCID: PMC4271151 DOI: 10.1093/bioinformatics/btu769] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Motivation: The field of structural bioinformatics and computational biophysics has undergone a revolution in the last 10 years. Developments that are captured annually through the 3DSIG meeting, upon which this article reflects. Results: An increase in the accessible data, computational resources and methodology has resulted in an increase in the size and resolution of studied systems and the complexity of the questions amenable to research. Concomitantly, the parameterization and efficiency of the methods have markedly improved along with their cross-validation with other computational and experimental results. Conclusion: The field exhibits an ever-increasing integration with biochemistry, biophysics and other disciplines. In this article, we discuss recent achievements along with current challenges within the field. Contact:Rafael.Najmanovich@USherbrooke.ca
Collapse
Affiliation(s)
- Ilan Samish
- Department of Plant Sciences, Weizmann Institute of Science, Rehovot, 76100, Israel, Ort Braude College, Karmiel, 2161002, Israel, Office of the Director, National Institutes of Health, Bethesda, MD 20814, USA and Department of Biochemistry, University of Sherbrooke, Sherbrooke, J1H 5N4, Canada Department of Plant Sciences, Weizmann Institute of Science, Rehovot, 76100, Israel, Ort Braude College, Karmiel, 2161002, Israel, Office of the Director, National Institutes of Health, Bethesda, MD 20814, USA and Department of Biochemistry, University of Sherbrooke, Sherbrooke, J1H 5N4, Canada
| | - Philip E Bourne
- Department of Plant Sciences, Weizmann Institute of Science, Rehovot, 76100, Israel, Ort Braude College, Karmiel, 2161002, Israel, Office of the Director, National Institutes of Health, Bethesda, MD 20814, USA and Department of Biochemistry, University of Sherbrooke, Sherbrooke, J1H 5N4, Canada
| | - Rafael J Najmanovich
- Department of Plant Sciences, Weizmann Institute of Science, Rehovot, 76100, Israel, Ort Braude College, Karmiel, 2161002, Israel, Office of the Director, National Institutes of Health, Bethesda, MD 20814, USA and Department of Biochemistry, University of Sherbrooke, Sherbrooke, J1H 5N4, Canada
| |
Collapse
|
12
|
Davtyan A, Dama JF, Sinitskiy AV, Voth GA. The Theory of Ultra-Coarse-Graining. 2. Numerical Implementation. J Chem Theory Comput 2014; 10:5265-75. [PMID: 26583210 DOI: 10.1021/ct500834t] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The increasing interest in the modeling of complex macromolecular systems in recent years has spurred the development of numerous coarse-graining (CG) techniques. However, many of the CG models are constructed assuming that all details beneath the resolution of CG degrees of freedom are fast and average out, which sets limits on the resolution of feasible coarse-grainings and on the range of applications of the CG models. Ultra-coarse-graining (UCG) makes it possible to construct models at any desired resolution while accounting for discrete conformational or chemical changes within the CG sites that can modulate the interactions between them. Here, we discuss the UCG methodology and its numerical implementation. We pay particular attention to the numerical mechanism for including state transitions between different conformations within CG sites because this has not been discussed previously. Using a simple example of 1,2-dichloroethane, we demonstrate the ability of the UCG model to reproduce the multiconfigurational behavior of this molecular liquid, even when each molecule is modeled with only one CG site. The methodology can also be applied to other molecular liquids and macromolecular systems with time scale separation between conformational transitions and other intramolecular motions and rotations.
Collapse
Affiliation(s)
- Aram Davtyan
- Department of Chemistry, The James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago , Chicago, Illinois 60637, United States
| | - James F Dama
- Department of Chemistry, The James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago , Chicago, Illinois 60637, United States
| | - Anton V Sinitskiy
- Department of Chemistry, The James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago , Chicago, Illinois 60637, United States
| | - Gregory A Voth
- Department of Chemistry, The James Franck Institute, Institute for Biophysical Dynamics, and Computation Institute, The University of Chicago , Chicago, Illinois 60637, United States
| |
Collapse
|
13
|
Xie L, Ge X, Tan H, Xie L, Zhang Y, Hart T, Yang X, Bourne PE. Towards structural systems pharmacology to study complex diseases and personalized medicine. PLoS Comput Biol 2014; 10:e1003554. [PMID: 24830652 PMCID: PMC4022462 DOI: 10.1371/journal.pcbi.1003554] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Genome-Wide Association Studies (GWAS), whole genome sequencing, and high-throughput omics techniques have generated vast amounts of genotypic and molecular phenotypic data. However, these data have not yet been fully explored to improve the effectiveness and efficiency of drug discovery, which continues along a one-drug-one-target-one-disease paradigm. As a partial consequence, both the cost to launch a new drug and the attrition rate are increasing. Systems pharmacology and pharmacogenomics are emerging to exploit the available data and potentially reverse this trend, but, as we argue here, more is needed. To understand the impact of genetic, epigenetic, and environmental factors on drug action, we must study the structural energetics and dynamics of molecular interactions in the context of the whole human genome and interactome. Such an approach requires an integrative modeling framework for drug action that leverages advances in data-driven statistical modeling and mechanism-based multiscale modeling and transforms heterogeneous data from GWAS, high-throughput sequencing, structural genomics, functional genomics, and chemical genomics into unified knowledge. This is not a small task, but, as reviewed here, progress is being made towards the final goal of personalized medicines for the treatment of complex diseases.
Collapse
Affiliation(s)
- Lei Xie
- Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
- Ph.D. Program in Computer Science, Biology, and Biochemistry, The Graduate Center, The City University of New York, New York, New York, United States of America
- * E-mail:
| | - Xiaoxia Ge
- Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
| | - Hepan Tan
- Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
| | - Li Xie
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California, United States of America
| | - Yinliang Zhang
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California, United States of America
| | - Thomas Hart
- Department of Biological Sciences, Hunter College, The City University of New York, New York, New York, United States of America
| | - Xiaowei Yang
- School of Public Health, Hunter College, The City University of New York, New York, New York, United States of America
| | - Philip E. Bourne
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California, United States of America
| |
Collapse
|
14
|
Moal IH, Torchala M, Bates PA, Fernández-Recio J. The scoring of poses in protein-protein docking: current capabilities and future directions. BMC Bioinformatics 2013; 14:286. [PMID: 24079540 PMCID: PMC3850738 DOI: 10.1186/1471-2105-14-286] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2013] [Accepted: 09/25/2013] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Protein-protein docking, which aims to predict the structure of a protein-protein complex from its unbound components, remains an unresolved challenge in structural bioinformatics. An important step is the ranking of docked poses using a scoring function, for which many methods have been developed. There is a need to explore the differences and commonalities of these methods with each other, as well as with functions developed in the fields of molecular dynamics and homology modelling. RESULTS We present an evaluation of 115 scoring functions on an unbound docking decoy benchmark covering 118 complexes for which a near-native solution can be found, yielding top 10 success rates of up to 58%. Hierarchical clustering is performed, so as to group together functions which identify near-natives in similar subsets of complexes. Three set theoretic approaches are used to identify pairs of scoring functions capable of correctly scoring different complexes. This shows that functions in different clusters capture different aspects of binding and are likely to work together synergistically. CONCLUSIONS All functions designed specifically for docking perform well, indicating that functions are transferable between sampling methods. We also identify promising methods from the field of homology modelling. Further, differential success rates by docking difficulty and solution quality suggest a need for flexibility-dependent scoring. Investigating pairs of scoring functions, the set theoretic measures identify known scoring strategies as well as a number of novel approaches, indicating promising augmentations of traditional scoring methods. Such augmentation and parameter combination strategies are discussed in the context of the learning-to-rank paradigm.
Collapse
Affiliation(s)
- Iain H Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Super computing Center, Barcelona 08034, Spain
| | - Mieczyslaw Torchala
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London WC2A 3LY, UK
| | - Paul A Bates
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London WC2A 3LY, UK
| | - Juan Fernández-Recio
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Super computing Center, Barcelona 08034, Spain
| |
Collapse
|
15
|
Agius R, Torchala M, Moal IH, Fernández-Recio J, Bates PA. Characterizing changes in the rate of protein-protein dissociation upon interface mutation using hotspot energy and organization. PLoS Comput Biol 2013; 9:e1003216. [PMID: 24039569 PMCID: PMC3764008 DOI: 10.1371/journal.pcbi.1003216] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2013] [Accepted: 07/25/2013] [Indexed: 12/21/2022] Open
Abstract
Predicting the effects of mutations on the kinetic rate constants of protein-protein interactions is central to both the modeling of complex diseases and the design of effective peptide drug inhibitors. However, while most studies have concentrated on the determination of association rate constants, dissociation rates have received less attention. In this work we take a novel approach by relating the changes in dissociation rates upon mutation to the energetics and architecture of hotspots and hotregions, by performing alanine scans pre- and post-mutation. From these scans, we design a set of descriptors that capture the change in hotspot energy and distribution. The method is benchmarked on 713 kinetically characterized mutations from the SKEMPI database. Our investigations show that, with the use of hotspot descriptors, energies from single-point alanine mutations may be used for the estimation of off-rate mutations to any residue type and also multi-point mutations. A number of machine learning models are built from a combination of molecular and hotspot descriptors, with the best models achieving a Pearson's Correlation Coefficient of 0.79 with experimental off-rates and a Matthew's Correlation Coefficient of 0.6 in the detection of rare stabilizing mutations. Using specialized feature selection models we identify descriptors that are highly specific and, conversely, broadly important to predicting the effects of different classes of mutations, interface regions and complexes. Our results also indicate that the distribution of the critical stability regions across protein-protein interfaces is a function of complex size more strongly than interface area. In addition, mutations at the rim are critical for the stability of small complexes, but consistently harder to characterize. The relationship between hotregion size and the dissociation rate is also investigated and, using hotspot descriptors which model cooperative effects within hotregions, we show how the contribution of hotregions of different sizes, changes under different cooperative effects.
Collapse
Affiliation(s)
- Rudi Agius
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | - Mieczyslaw Torchala
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | - Iain H. Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, Spain
| | - Juan Fernández-Recio
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, Spain
| | - Paul A. Bates
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| |
Collapse
|
16
|
Feig M, Sugita Y. Reaching new levels of realism in modeling biological macromolecules in cellular environments. J Mol Graph Model 2013; 45:144-56. [PMID: 24036504 DOI: 10.1016/j.jmgm.2013.08.017] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2013] [Revised: 08/14/2013] [Accepted: 08/19/2013] [Indexed: 12/21/2022]
Abstract
An increasing number of studies are aimed at modeling cellular environments in a comprehensive and realistic fashion. A major challenge in these efforts is how to bridge spatial and temporal scales over many orders of magnitude. Furthermore, there are additional challenges in integrating different aspects ranging from questions about biomolecular stability in crowded environments to the description of reactive processes on cellular scales. In this review, recent studies with models of biomolecules in cellular environments at different levels of detail are discussed in terms of their strengths and weaknesses. In particular, atomistic models, implicit representations of cellular environments, coarse-grained and spheroidal models of biomolecules, as well as the inclusion of reactive processes via reaction-diffusion models are described. Furthermore, strategies for integrating the different models into a comprehensive description of cellular environments are discussed.
Collapse
Affiliation(s)
- Michael Feig
- Department of Biochemistry & Molecular Biology and Department of Chemistry, Michigan State University, 603 Wilson Road, BCH 218, East Lansing, MI 48824, United States; RIKEN Quantitative Biology Center, International Medical Device Alliance (IMDA) 6F, 1-6-5 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.
| | | |
Collapse
|
17
|
Cheng TMK, Goehring L, Jeffery L, Lu YE, Hayles J, Novák B, Bates PA. A structural systems biology approach for quantifying the systemic consequences of missense mutations in proteins. PLoS Comput Biol 2012; 8:e1002738. [PMID: 23093928 PMCID: PMC3475653 DOI: 10.1371/journal.pcbi.1002738] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2012] [Accepted: 08/23/2012] [Indexed: 12/27/2022] Open
Abstract
Gauging the systemic effects of non-synonymous single nucleotide polymorphisms (nsSNPs) is an important topic in the pursuit of personalized medicine. However, it is a non-trivial task to understand how a change at the protein structure level eventually affects a cell's behavior. This is because complex information at both the protein and pathway level has to be integrated. Given that the idea of integrating both protein and pathway dynamics to estimate the systemic impact of missense mutations in proteins remains predominantly unexplored, we investigate the practicality of such an approach by formulating mathematical models and comparing them with experimental data to study missense mutations. We present two case studies: (1) interpreting systemic perturbation for mutations within the cell cycle control mechanisms (G2 to mitosis transition) for yeast; (2) phenotypic classification of neuron-related human diseases associated with mutations within the mitogen-activated protein kinase (MAPK) pathway. We show that the application of simplified mathematical models is feasible for understanding the effects of small sequence changes on cellular behavior. Furthermore, we show that the systemic impact of missense mutations can be effectively quantified as a combination of protein stability change and pathway perturbation.
Collapse
Affiliation(s)
- Tammy M. K. Cheng
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | - Lucas Goehring
- Max Planck Institute for Dynamics and Self-Organization, Göttingen, Germany
| | - Linda Jeffery
- Cell Cycle Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | - Yu-En Lu
- Computer Laboratory, University of Cambridge, Cambridge, United Kingdom
| | - Jacqueline Hayles
- Cell Cycle Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | - Béla Novák
- Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| | - Paul A. Bates
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| |
Collapse
|
18
|
Moal IH, Bates PA. Kinetic rate constant prediction supports the conformational selection mechanism of protein binding. PLoS Comput Biol 2012; 8:e1002351. [PMID: 22253587 PMCID: PMC3257286 DOI: 10.1371/journal.pcbi.1002351] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2011] [Accepted: 11/29/2011] [Indexed: 12/24/2022] Open
Abstract
The prediction of protein-protein kinetic rate constants provides a fundamental test of our understanding of molecular recognition, and will play an important role in the modeling of complex biological systems. In this paper, a feature selection and regression algorithm is applied to mine a large set of molecular descriptors and construct simple models for association and dissociation rate constants using empirical data. Using separate test data for validation, the predicted rate constants can be combined to calculate binding affinity with accuracy matching that of state of the art empirical free energy functions. The models show that the rate of association is linearly related to the proportion of unbound proteins in the bound conformational ensemble relative to the unbound conformational ensemble, indicating that the binding partners must adopt a geometry near to that of the bound prior to binding. Mirroring the conformational selection and population shift mechanism of protein binding, the models provide a strong separate line of evidence for the preponderance of this mechanism in protein-protein binding, complementing structural and theoretical studies.
Collapse
Affiliation(s)
- Iain H. Moal
- Protein Interactions and Docking Laboratory, Life Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain
| | - Paul A. Bates
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| |
Collapse
|
19
|
Kamerlin SCL, Vicatos S, Dryga A, Warshel A. Coarse-grained (multiscale) simulations in studies of biophysical and chemical systems. Annu Rev Phys Chem 2011; 62:41-64. [PMID: 21034218 DOI: 10.1146/annurev-physchem-032210-103335] [Citation(s) in RCA: 151] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Recent years have witnessed an explosion in computational power, leading to attempts to model ever more complex systems. Nevertheless, there remain cases for which the use of brute-force computer simulations is clearly not the solution. In such cases, great benefit can be obtained from the use of physically sound simplifications. The introduction of such coarse graining can be traced back to the early usage of a simplified model in studies of proteins. Since then, the field has progressed tremendously. In this review, we cover both key developments in the field and potential future directions. Additionally, particular emphasis is given to two general approaches, namely the renormalization and reference potential approaches, which allow one to move back and forth between the coarse-grained (CG) and full models, as these approaches provide the foundation for CG modeling of complex systems.
Collapse
Affiliation(s)
- Shina C L Kamerlin
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, USA
| | | | | | | |
Collapse
|
20
|
Qu Z, Garfinkel A, Weiss JN, Nivala M. Multi-scale modeling in biology: how to bridge the gaps between scales? PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2011; 107:21-31. [PMID: 21704063 DOI: 10.1016/j.pbiomolbio.2011.06.004] [Citation(s) in RCA: 95] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2011] [Accepted: 06/11/2011] [Indexed: 11/25/2022]
Abstract
Human physiological functions are regulated across many orders of magnitude in space and time. Integrating the information and dynamics from one scale to another is critical for the understanding of human physiology and the treatment of diseases. Multi-scale modeling, as a computational approach, has been widely adopted by researchers in computational and systems biology. A key unsolved issue is how to represent appropriately the dynamical behaviors of a high-dimensional model of a lower scale by a low-dimensional model of a higher scale, so that it can be used to investigate complex dynamical behaviors at even higher scales of integration. In the article, we first review the widely-used different modeling methodologies and their applications at different scales. We then discuss the gaps between different modeling methodologies and between scales, and discuss potential methods for bridging the gaps between scales.
Collapse
Affiliation(s)
- Zhilin Qu
- Department of Medicine (Cardiology), David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA.
| | | | | | | |
Collapse
|
21
|
Bai H, Yang K, Yu D, Zhang C, Chen F, Lai L. Predicting kinetic constants of protein-protein interactions based on structural properties. Proteins 2010; 79:720-34. [PMID: 21287608 DOI: 10.1002/prot.22904] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2010] [Revised: 07/24/2010] [Accepted: 08/23/2010] [Indexed: 02/01/2023]
Abstract
Elucidating kinetic processes of protein-protein interactions (PPI) helps to understand how basic building blocks affect overall behavior of living systems. In this study, we used structure-based properties to build predictive models for kinetic constants of PPI. A highly diverse PPI dataset, protein-protein kinetic interaction data and structures (PPKIDS), was built. PPKIDS contains 62 PPI with complex structures and kinetic constants measured experimentally. The influence of structural properties on kinetics of PPI was studied using 35 structure-based features, describing different aspects of complex structures. Linear models for the prediction of kinetic constants were built by fitting with selected subsets of structure-based features. The models gave correlation coefficients of 0.801, 0.732, and 0.770 for k(off), k(on), and K(d), respectively, in leave-one-out cross validations. The predictive models reported here use only protein complex structures as input and can be generally applied in PPI studies as well as systems biology modeling. Our study confirmed that different properties play different roles in the kinetic process of PPI. For example, k(on) was affected by overall structural features of complexes, such as the composition of secondary structures, the change of translational and rotational entropy, and the electrostatic interaction; while k(off) was determined by interfacial properties, such as number of contacted atom pairs per 100 Ų. This information provides useful hints for PPI design.
Collapse
Affiliation(s)
- Hongjun Bai
- Beijing National Laboratory for Molecular Sciences, State Key Laboratory of Structural Chemistry for Stable and Unstable Species, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | | | | | | | | | | |
Collapse
|
22
|
Bridging the gap: linking molecular simulations and systemic descriptions of cellular compartments. PLoS One 2010; 5:e14070. [PMID: 21124924 PMCID: PMC2989909 DOI: 10.1371/journal.pone.0014070] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2010] [Accepted: 10/21/2010] [Indexed: 12/31/2022] Open
Abstract
Metabolic processes in biological cells are commonly either characterized at the level of individual enzymes and metabolites or at the network level. Often these two paradigms are considered as mutually exclusive because concepts from neither side are suited to describe the complete range of scales. Additionally, when modeling metabolic or regulatory cellular systems, often a large fraction of the required kinetic parameters are unknown. This even applies to such simple and extensively studied systems like the photosynthetic apparatus of purple bacteria. Using the chromatophore vesicles of Rhodobacter sphaeroides as a model system, we show that a consistent kinetic model emerges when fitting the dynamics of a molecular stochastic simulation to a set of time dependent experiments even though about two thirds of the kinetic parameters in this system are not known from experiment. Those kinetic parameters that were previously known all came out in the expected range. The simulation model was built from independent protein units composed of elementary reactions processing single metabolites. This pools-and-proteins approach naturally compiles the wealth of available molecular biological data into a systemic model and can easily be extended to describe other systems by adding new protein or nucleic acid types. The automated parameter optimization, performed with an evolutionary algorithm, reveals the sensitivity of the model to the value of each parameter and the relative importances of the experiments used. Such an analysis identifies the crucial system parameters and guides the setup of new experiments that would add most knowledge for a systemic understanding of cellular compartments. The successful combination of the molecular model and the systemic parametrization presented here on the example of the simple machinery for bacterial photosynthesis shows that it is actually possible to combine molecular and systemic modeling. This framework can now straightforwardly be applied to other currently less well characterized but biologically more relevant systems.
Collapse
|
23
|
Gurtovenko AA, Anwar J, Vattulainen I. Defect-Mediated Trafficking across Cell Membranes: Insights from in Silico Modeling. Chem Rev 2010; 110:6077-103. [DOI: 10.1021/cr1000783] [Citation(s) in RCA: 158] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Andrey A. Gurtovenko
- Institute of Macromolecular Compounds, Russian Academy of Sciences, Bolshoi Prospect 31, V.O., St. Petersburg, 199004 Russia, Computational Laboratory, Institute of Pharmaceutical Innovation, University of Bradford, Bradford, West Yorkshire BD7 1DP, U.K., Department of Physics, Tampere University of Technology, P.O. Box 692, FI-33101 Tampere, Finland, Aalto University, School of Science and Technology, Finland, and MEMPHYS—Center for Biomembrane Physics, University of Southern Denmark, Odense, Denmark
| | - Jamshed Anwar
- Institute of Macromolecular Compounds, Russian Academy of Sciences, Bolshoi Prospect 31, V.O., St. Petersburg, 199004 Russia, Computational Laboratory, Institute of Pharmaceutical Innovation, University of Bradford, Bradford, West Yorkshire BD7 1DP, U.K., Department of Physics, Tampere University of Technology, P.O. Box 692, FI-33101 Tampere, Finland, Aalto University, School of Science and Technology, Finland, and MEMPHYS—Center for Biomembrane Physics, University of Southern Denmark, Odense, Denmark
| | - Ilpo Vattulainen
- Institute of Macromolecular Compounds, Russian Academy of Sciences, Bolshoi Prospect 31, V.O., St. Petersburg, 199004 Russia, Computational Laboratory, Institute of Pharmaceutical Innovation, University of Bradford, Bradford, West Yorkshire BD7 1DP, U.K., Department of Physics, Tampere University of Technology, P.O. Box 692, FI-33101 Tampere, Finland, Aalto University, School of Science and Technology, Finland, and MEMPHYS—Center for Biomembrane Physics, University of Southern Denmark, Odense, Denmark
| |
Collapse
|
24
|
Pierri CL, Parisi G, Porcelli V. Computational approaches for protein function prediction: a combined strategy from multiple sequence alignment to molecular docking-based virtual screening. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2010; 1804:1695-712. [PMID: 20433957 DOI: 10.1016/j.bbapap.2010.04.008] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2010] [Revised: 03/04/2010] [Accepted: 04/14/2010] [Indexed: 12/12/2022]
Abstract
The functional characterization of proteins represents a daily challenge for biochemical, medical and computational sciences. Although finally proved on the bench, the function of a protein can be successfully predicted by computational approaches that drive the further experimental assays. Current methods for comparative modeling allow the construction of accurate 3D models for proteins of unknown structure, provided that a crystal structure of a homologous protein is available. Binding regions can be proposed by using binding site predictors, data inferred from homologous crystal structures, and data provided from a careful interpretation of the multiple sequence alignment of the investigated protein and its homologs. Once the location of a binding site has been proposed, chemical ligands that have a high likelihood of binding can be identified by using ligand docking and structure-based virtual screening of chemical libraries. Most docking algorithms allow building a list sorted by energy of the lowest energy docking configuration for each ligand of the library. In this review the state-of-the-art of computational approaches in 3D protein comparative modeling and in the study of protein-ligand interactions is provided. Furthermore a possible combined/concerted multistep strategy for protein function prediction, based on multiple sequence alignment, comparative modeling, binding region prediction, and structure-based virtual screening of chemical libraries, is described by using suitable examples. As practical examples, Abl-kinase molecular modeling studies, HPV-E6 protein multiple sequence alignment analysis, and some other model docking-based characterization reports are briefly described to highlight the importance of computational approaches in protein function prediction.
Collapse
Affiliation(s)
- Ciro Leonardo Pierri
- Department of Pharmaco-Biology, Laboratory of Biochemistry and Molecular Biology, University of Bari, Va E. Orabona, 4 - 70125 Bari, Italy.
| | | | | |
Collapse
|
25
|
Modelling the molecular mechanisms of synaptic plasticity using systems biology approaches. Nat Rev Neurosci 2010; 11:239-51. [PMID: 20300102 DOI: 10.1038/nrn2807] [Citation(s) in RCA: 145] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Synaptic plasticity is thought to underlie learning and memory, but the complexity of the interactions between the ion channels, enzymes and genes that are involved in synaptic plasticity impedes a deep understanding of this phenomenon. Computer modelling has been used to investigate the information processing that is performed by the signalling pathways involved in synaptic plasticity in principal neurons of the hippocampus, striatum and cerebellum. In the past few years, new software developments that combine computational neuroscience techniques with systems biology techniques have allowed large-scale, kinetic models of the molecular mechanisms underlying long-term potentiation and long-term depression. We highlight important advancements produced by these quantitative modelling efforts and introduce promising approaches that use advancements in live-cell imaging.
Collapse
|
26
|
Post-reductionist protein science, or putting Humpty Dumpty back together again. Nat Chem Biol 2010; 5:774-7. [PMID: 19841622 DOI: 10.1038/nchembio.241] [Citation(s) in RCA: 95] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
27
|
Abstract
Activities of many biological macromolecules involve large conformational transitions for which crystallography can specify atomic details of alternative end states, but the course of transitions is often beyond the reach of computations based on full-atomic potential functions. We have developed a coarse-grained force field for molecular mechanics calculations based on the virtual interactions of C alpha atoms in protein molecules. This force field is parameterized based on the statistical distribution of the energy terms extracted from crystallographic data, and it is formulated to capture features dependent on secondary structure and on residue-specific contact information. The resulting force field is applied to energy minimization and normal mode analysis of several proteins. We find robust convergence in minimizations to low energies and energy gradients with low degrees of structural distortion, and atomic fluctuations calculated from the normal mode analyses correlate well with the experimental B-factors obtained from high-resolution crystal structures. These findings suggest that the virtual atom force field is a suitable tool for various molecular mechanics applications on large macromolecular systems undergoing large conformational changes.
Collapse
|
28
|
Abstract
Proteins mediate transmission of signals along intercellular and intracellular pathways and between the exterior and the interior of a cell. The dynamic properties of signaling proteins are crucial to their functions. We discuss emerging paradigms for the role of protein dynamics in signaling. A central tenet is that proteins fluctuate among many states on evolutionarily selected energy landscapes. Upstream signals remodel this landscape, causing signaling proteins to transmit information to downstream partners. New methods provide insight into the dynamic properties of signaling proteins at the atomic scale. The next stages in the signaling hierarchy-how multiple signals are integrated and how cellular signaling pathways are organized in space and time-present exciting challenges for the future, requiring bold multidisciplinary approaches.
Collapse
Affiliation(s)
- Robert G. Smock
- Department of Biochemistry and Molecular Biology and Department of Chemistry, University of Massachusetts, Amherst, MA 01003, USA
| | - Lila M. Gierasch
- Department of Biochemistry and Molecular Biology and Department of Chemistry, University of Massachusetts, Amherst, MA 01003, USA
| |
Collapse
|
29
|
The role of predictive modelling in rationally re-engineering biological systems. Nat Rev Microbiol 2009; 7:297-305. [PMID: 19252506 DOI: 10.1038/nrmicro2107] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Technologies to synthesize and transplant a complete genome into a cell have opened limitless potential to redesign organisms for complex, specialized tasks. However, large-scale re-engineering of a biological circuit will require systems-level optimization that will come from a deep understanding of operational relationships among all the constituent parts of a cell. The integrated framework necessary for conducting such complex bioengineering requires the convergence of systems and synthetic biology. Here, we review the status of these rapidly developing interdisciplinary fields of biology and provide a perspective on plausible venues for their merger.
Collapse
|
30
|
Loewe L. A framework for evolutionary systems biology. BMC SYSTEMS BIOLOGY 2009; 3:27. [PMID: 19239699 PMCID: PMC2663779 DOI: 10.1186/1752-0509-3-27] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2008] [Accepted: 02/24/2009] [Indexed: 12/02/2022]
Abstract
BACKGROUND Many difficult problems in evolutionary genomics are related to mutations that have weak effects on fitness, as the consequences of mutations with large effects are often simple to predict. Current systems biology has accumulated much data on mutations with large effects and can predict the properties of knockout mutants in some systems. However experimental methods are too insensitive to observe small effects. RESULTS Here I propose a novel framework that brings together evolutionary theory and current systems biology approaches in order to quantify small effects of mutations and their epistatic interactions in silico. Central to this approach is the definition of fitness correlates that can be computed in some current systems biology models employing the rigorous algorithms that are at the core of much work in computational systems biology. The framework exploits synergies between the realism of such models and the need to understand real systems in evolutionary theory. This framework can address many longstanding topics in evolutionary biology by defining various 'levels' of the adaptive landscape. Addressed topics include the distribution of mutational effects on fitness, as well as the nature of advantageous mutations, epistasis and robustness. Combining corresponding parameter estimates with population genetics models raises the possibility of testing evolutionary hypotheses at a new level of realism. CONCLUSION EvoSysBio is expected to lead to a more detailed understanding of the fundamental principles of life by combining knowledge about well-known biological systems from several disciplines. This will benefit both evolutionary theory and current systems biology. Understanding robustness by analysing distributions of mutational effects and epistasis is pivotal for drug design, cancer research, responsible genetic engineering in synthetic biology and many other practical applications.
Collapse
Affiliation(s)
- Laurence Loewe
- Centre for Systems Biology at Edinburgh, The University of Edinburgh, Edinburgh, Scotland, UK.
| |
Collapse
|
31
|
Dell'Orco D. Fast predictions of thermodynamics and kinetics of protein-protein recognition from structures: from molecular design to systems biology. MOLECULAR BIOSYSTEMS 2009; 5:323-34. [PMID: 19396368 DOI: 10.1039/b821580d] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The increasing call for an overall picture of the interactions between the components of a biological system that give rise to the observed function is often summarized by the expression systems biology. Both the interpretative and predictive capabilities of holistic models of biochemical systems, however, depend to a large extent on the level of physico-chemical knowledge of the individual molecular interactions making up the network. This review is focused on the structure-based quantitative characterization of protein-protein interactions, ubiquitous in any biochemical pathway. Recently developed, fast and effective computational methods are reviewed, which allow the assessment of kinetic and thermodynamic features of the association-dissociation processes of protein complexes, both in water soluble and membrane environments. The performance and the accuracy of fast and semi-empirical structure-based methods have reached comparable levels with respect to the classical and more elegant molecular simulations. Nevertheless, the broad accessibility and lower computational cost provide the former methods with the advantageous possibility to perform systems-level analyses including extensive in silico mutagenesis screenings and large-scale structural predictions of multiprotein complexes.
Collapse
Affiliation(s)
- Daniele Dell'Orco
- Department of Chemistry, University of Modena and Reggio Emilia, Via Campi 183, 41100, Modena, Italy.
| |
Collapse
|
32
|
Heath AP, Kavraki LE. Computational challenges in systems biology. COMPUTER SCIENCE REVIEW 2009; 3:1-17. [DOI: 10.1016/j.cosrev.2009.01.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
33
|
United we stand: combining structural methods. Curr Opin Struct Biol 2008; 18:617-22. [DOI: 10.1016/j.sbi.2008.07.004] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2008] [Accepted: 07/29/2008] [Indexed: 01/20/2023]
|
34
|
Abstract
Enzyme kinetic parameters can differ between different species and isoenzymes for the same catalysed reaction. Computational approaches to calculate enzymatic kinetic parameters from the three-dimensional structures of proteins will be reviewed briefly here. Enzyme kinetic parameters may be derived by modelling and simulating the rate-determining process. An alternative, approximate, but more computationally efficient approach is the comparison of molecular interaction fields for experimentally characterized enzymes and those for which parameters should be determined. A correlation between differences in interaction fields and experimentally determined kinetic parameters can be used to determine parameters for orthologous enzymes from other species. The estimation of enzymatic kinetic parameters is an important step in setting up mathematical models of biochemical pathways in systems biology.
Collapse
|
35
|
Dodson GG, Lane DP, Verma CS. Molecular simulations of protein dynamics: new windows on mechanisms in biology. EMBO Rep 2008; 9:144-50. [PMID: 18246106 PMCID: PMC2246404 DOI: 10.1038/sj.embor.7401160] [Citation(s) in RCA: 109] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2007] [Accepted: 12/10/2007] [Indexed: 11/08/2022] Open
Abstract
Recent advances in computer hardware and software have led to the development of increasingly successful molecular simulations of protein structural dynamics that are intrinsic to biological processes. These simulations have resulted in models that increasingly agree with experimental observations, suggest new experiments and provide insights into biological mechanisms. Used in combination with data obtained with sophisticated experimental techniques, simulations are helping us to understand biological complexity at the atomic and molecular levels and are giving promising insights into the genetic, thermodynamic and functional/mechanistic behaviour of biological processes. Here, we highlight some examples of such approaches that illustrate the current state and potential of the field of molecular simulation.
Collapse
Affiliation(s)
- Guy G Dodson
- York Structural Biology Laboratory, University of York, York YO10 5YW, UK
- National Institute for Medical Research, The
Ridgeway, Mill Hill, London NW7 1AA, UK
| | - David P Lane
- Institute of Molecular and Cell Biology, (A*STAR) Agency for Science, Technology and Research, 61 Biopolis Drive, Proteos, 138673 Singapore
| | - Chandra S Verma
- Bioinformatics Institute, (A*STAR) Agency for Science, Technology and Research, 30 Biopolis Way, #071-01 Matrix, 138671 Singapore
| |
Collapse
|