1
|
Pengmei Z, Lorpaiboon C, Guo SC, Weare J, Dinner AR. Using pretrained graph neural networks with token mixers as geometric featurizers for conformational dynamics. J Chem Phys 2025; 162:044107. [PMID: 39873278 PMCID: PMC11779506 DOI: 10.1063/5.0244453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2024] [Accepted: 12/30/2024] [Indexed: 01/30/2025] Open
Abstract
Identifying informative low-dimensional features that characterize dynamics in molecular simulations remains a challenge, often requiring extensive manual tuning and system-specific knowledge. Here, we introduce geom2vec, in which pretrained graph neural networks (GNNs) are used as universal geometric featurizers. By pretraining equivariant GNNs on a large dataset of molecular conformations with a self-supervised denoising objective, we obtain transferable structural representations that are useful for learning conformational dynamics without further fine-tuning. We show how the learned GNN representations can capture interpretable relationships between structural units (tokens) by combining them with expressive token mixers. Importantly, decoupling training the GNNs from training for downstream tasks enables analysis of larger molecular graphs (that can represent small proteins at all-atom resolution) with limited computational resources. In these ways, geom2vec eliminates the need for manual feature selection and increases the robustness of simulation analyses.
Collapse
Affiliation(s)
- Zihan Pengmei
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Chatipat Lorpaiboon
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Spencer C. Guo
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
2
|
Pengmei Z, Lorpaiboon C, Guo SC, Weare J, Dinner AR. Using pretrained graph neural networks with token mixers as geometric featurizers for conformational dynamics. ARXIV 2024:arXiv:2409.19838v2. [PMID: 39801625 PMCID: PMC11722521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/21/2025]
Abstract
Identifying informative low-dimensional features that characterize dynamics in molecular simulations remains a challenge, often requiring extensive manual tuning and system-specific knowledge. Here, we introduce geom2vec, in which pretrained graph neural networks (GNNs) are used as universal geometric featurizers. By pretraining equivariant GNNs on a large dataset of molecular conformations with a self-supervised denoising objective, we obtain transferable structural representations that are useful for learning conformational dynamics without further fine-tuning. We show how the learned GNN representations can capture interpretable relationships between structural units (tokens) by combining them with expressive token mixers. Importantly, decoupling training the GNNs from training for downstream tasks enables analysis of larger molecular graphs (such as small proteins at all-atom resolution) with limited computational resources. In these ways, geom2vec eliminates the need for manual feature selection and increases the robustness of simulation analyses.
Collapse
Affiliation(s)
- Zihan Pengmei
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, United States
| | - Chatipat Lorpaiboon
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, United States
| | - Spencer C. Guo
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, United States
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
3
|
Jeong K, Guo SC, Allaw S, Dinner AR. Analysis of the Dynamics of a Complex, Multipathway Reaction: Insulin Dimer Dissociation. J Phys Chem B 2024; 128:12728-12740. [PMID: 39670451 DOI: 10.1021/acs.jpcb.4c06933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2024]
Abstract
The protein hormone insulin forms a homodimer that must dissociate to bind to its receptor. Understanding the kinetics and mechanism of dissociation is essential for the rational design of therapeutic analogs. In addition to its physiological importance, this dissociation process serves as a paradigm for coupled (un)folding and (un)binding. Based on previous free energy simulations, insulin dissociation is thought to involve multiple pathways with comparable free energy barriers. Here, we analyze the mechanism of insulin dimer dissociation using a recently developed computational framework for estimating kinetic statistics from short-trajectory data. These statistics indicate that the likelihood of dissociation (the committor) closely tracks the decrease in the number of (native and nonnative) intermonomer contacts and the increase in the number of water contacts at the dimer interface; the transition state with equal likelihood of association and dissociation corresponds to an encounter complex with relatively few native contacts and many nonnative contacts. We identify four pathways out of the dimer state and quantify their contributions to the rate as well as their exchange by computing reactive fluxes. We show that both the pathways and their extents of exchange can be understood in terms of rotations around three axes of the dimer structure. Our results provide insights into the kinetics of insulin analogs and, more generally, how to characterize complex, multipathway processes.
Collapse
Affiliation(s)
- Kwanghoon Jeong
- Department of Chemistry, the University of Chicago, Chicago, Illinois 60637, United States
| | - Spencer C Guo
- Department of Chemistry, the University of Chicago, Chicago, Illinois 60637, United States
| | - Sammy Allaw
- Department of Chemistry, the University of Chicago, Chicago, Illinois 60637, United States
| | - Aaron R Dinner
- Department of Chemistry, the University of Chicago, Chicago, Illinois 60637, United States
- James Franck Institute, the University of Chicago, Chicago, Illinois 60637, United States
- Institute for Biophysical Dynamics, the University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
4
|
Zhang N, Sood D, Guo SC, Chen N, Antoszewski A, Marianchuk T, Dey S, Xiao Y, Hong L, Peng X, Baxa M, Partch C, Wang LP, Sosnick TR, Dinner AR, LiWang A. Temperature-dependent fold-switching mechanism of the circadian clock protein KaiB. Proc Natl Acad Sci U S A 2024; 121:e2412327121. [PMID: 39671178 DOI: 10.1073/pnas.2412327121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 10/24/2024] [Indexed: 12/14/2024] Open
Abstract
The oscillator of the cyanobacterial circadian clock relies on the ability of the KaiB protein to switch reversibly between a stable ground-state fold (gsKaiB) and an unstable fold-switched fold (fsKaiB). Rare fold-switching events by KaiB provide a critical delay in the negative feedback loop of this posttranslational oscillator. In this study, we experimentally and computationally investigate the temperature dependence of fold switching and its mechanism. We demonstrate that the stability of gsKaiB increases with temperature compared to fsKaiB and that the Q10 value for the gsKaiB → fsKaiB transition is nearly three times smaller than that for the reverse transition in a construct optimized for NMR studies. Simulations and native-state hydrogen-deuterium exchange NMR experiments suggest that fold switching can involve both partially and completely unfolded intermediates. The simulations predict that the transition state for fold switching coincides with isomerization of conserved prolines in the most rapidly exchanging region, and we confirm experimentally that proline isomerization is a rate-limiting step for fold switching. We explore the implications of our results for temperature compensation, a hallmark of circadian clocks, through a kinetic model.
Collapse
Affiliation(s)
- Ning Zhang
- Department of Chemistry and Biochemistry, University of California, Merced, CA 95343
| | - Damini Sood
- Department of Chemistry and Biochemistry, University of California, Merced, CA 95343
| | - Spencer C Guo
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, IL 60637
| | - Nanhao Chen
- Department of Chemistry, University of California, Davis, CA 95616
| | - Adam Antoszewski
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, IL 60637
| | - Tegan Marianchuk
- Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL 60637
| | - Supratim Dey
- Department of Chemistry and Biochemistry, University of California, Merced, CA 95343
| | - Yunxian Xiao
- Department of Chemistry and Biochemistry, University of California, Merced, CA 95343
| | - Lu Hong
- Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL 60637
| | - Xiangda Peng
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL 60637
| | - Michael Baxa
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL 60637
| | - Carrie Partch
- Department of Chemistry and Biochemistry, University of California, Santa Cruz, CA 95064
| | - Lee-Ping Wang
- Department of Chemistry, University of California, Davis, CA 95616
| | - Tobin R Sosnick
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL 60637
| | - Aaron R Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, IL 60637
| | - Andy LiWang
- Department of Chemistry and Biochemistry, University of California, Merced, CA 95343
- Center for Cellular and Biomolecular Machines, University of California, Merced, CA 95343
| |
Collapse
|
5
|
Mitchell AR, Rotskoff GM. Committor Guided Estimates of Molecular Transition Rates. J Chem Theory Comput 2024; 20:9378-9393. [PMID: 39420582 DOI: 10.1021/acs.jctc.4c00997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
The probability that a configuration of a physical system reacts, or transitions from one metastable state to another, is quantified by the committor function. This function contains richly detailed mechanistic information about transition pathways, but a full parametrization of the committor requires the construction of a high-dimensional function, a generically challenging task. Recent efforts to leverage neural networks as a means to solve high-dimensional partial differential equations, often called "physics-informed" machine learning, have brought the committor into computational reach. Here, we build on the semigroup approach to learning the committor and assess its utility for predicting dynamical quantities such as transition rates. We show that a careful reframing of the objective function and improved adaptive sampling strategies provide highly accurate representations of the committor. Furthermore, by directly applying the Hill relation, we show that these committors provide accurate transition rates for molecular systems.
Collapse
Affiliation(s)
- Andrew R Mitchell
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| | - Grant M Rotskoff
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
6
|
Jeong K, Guo SC, Allaw S, Dinner AR. Analysis of the dynamics of a complex, multipathway reaction: Insulin dimer dissociation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.08.617297. [PMID: 39416150 PMCID: PMC11482781 DOI: 10.1101/2024.10.08.617297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
The protein hormone insulin forms a homodimer that must dissociate to bind to its receptor. Understanding the kinetics and mechanism of dissociation is essential for rational design of therapeutic analogs. In addition to its physiological importance, this dissociation process serves as a paradigm for coupled (un)folding and (un)binding. Based on previous free energy simulations, insulin dissociation is thought to involve multiple pathways with comparable free energy barriers. Here, we analyze the mechanism of insulin dimer dissociation using a recently developed computational framework for estimating kinetic statistics from short-trajectory data. These statistics indicate that the likelihood of dissociation (the committor) closely tracks the decrease in the number of (native and nonnative) intermonomer contacts and the increase in the number of water contacts at the dimer interface; the transition state with equal likelihood of association and dissociation corresponds to an encounter complex with relatively few native contacts and many nonnative contacts. We identify four pathways out of the dimer state and quantify their contributions to the rate, as well as their exchange, by computing reactive fluxes. We show that both the pathways and their extents of exchange can be understood in terms of rotations around three axes of the dimer structure. Our results provide insights into the kinetics of insulin analogues and, more generally, how to characterize complex, multipathway processes.
Collapse
|
7
|
Strahan J, Lorpaiboon C, Weare J, Dinner AR. BAD-NEUS: Rapidly converging trajectory stratification. J Chem Phys 2024; 161:084109. [PMID: 39185846 PMCID: PMC11349377 DOI: 10.1063/5.0215975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 07/25/2024] [Indexed: 08/27/2024] Open
Abstract
An issue for molecular dynamics simulations is that events of interest often involve timescales that are much longer than the simulation time step, which is set by the fastest timescales of the model. Because of this timescale separation, direct simulation of many events is prohibitively computationally costly. This issue can be overcome by aggregating information from many relatively short simulations that sample segments of trajectories involving events of interest. This is the strategy of Markov state models (MSMs) and related approaches, but such methods suffer from approximation error because the variables defining the states generally do not capture the dynamics fully. By contrast, once converged, the weighted ensemble (WE) method aggregates information from trajectory segments so as to yield unbiased estimates of both thermodynamic and kinetic statistics. Unfortunately, errors decay no faster than unbiased simulation in WE as originally formulated and commonly deployed. Here, we introduce a theoretical framework for describing WE that shows that the introduction of an approximate stationary distribution on top of the stratification, as in nonequilibrium umbrella sampling (NEUS), accelerates convergence. Building on ideas from MSMs and related methods, we generalize the NEUS approach in such a way that the approximation error can be reduced systematically. We show that the improved algorithm can decrease the simulation time required to achieve the desired precision by orders of magnitude.
Collapse
Affiliation(s)
- John Strahan
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Chatipat Lorpaiboon
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
8
|
Zhang N, Sood D, Guo SC, Chen N, Antoszewski A, Marianchuk T, Chavan A, Dey S, Xiao Y, Hong L, Peng X, Baxa M, Partch C, Wang LP, Sosnick TR, Dinner AR, LiWang A. Temperature-Dependent Fold-Switching Mechanism of the Circadian Clock Protein KaiB. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.21.594594. [PMID: 38826295 PMCID: PMC11142059 DOI: 10.1101/2024.05.21.594594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]
Abstract
The oscillator of the cyanobacterial circadian clock relies on the ability of the KaiB protein to switch reversibly between a stable ground-state fold (gsKaiB) and an unstable fold-switched fold (fsKaiB). Rare fold-switching events by KaiB provide a critical delay in the negative feedback loop of this post-translational oscillator. In this study, we experimentally and computationally investigate the temperature dependence of fold switching and its mechanism. We demonstrate that the stability of gsKaiB increases with temperature compared to fsKaiB and that the Q10 value for the gsKaiB → fsKaiB transition is nearly three times smaller than that for the reverse transition. Simulations and native-state hydrogen-deuterium exchange NMR experiments suggest that fold switching can involve both subglobally and near-globally unfolded intermediates. The simulations predict that the transition state for fold switching coincides with isomerization of conserved prolines in the most rapidly exchanging region, and we confirm experimentally that proline isomerization is a rate-limiting step for fold switching. We explore the implications of our results for temperature compensation, a hallmark of circadian clocks, through a kinetic model.
Collapse
|
9
|
Lorpaiboon C, Guo SC, Strahan J, Weare J, Dinner AR. Accurate estimates of dynamical statistics using memory. J Chem Phys 2024; 160:084108. [PMID: 38391020 PMCID: PMC10898919 DOI: 10.1063/5.0187145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Accepted: 01/29/2024] [Indexed: 02/24/2024] Open
Abstract
Many chemical reactions and molecular processes occur on time scales that are significantly longer than those accessible by direct simulations. One successful approach to estimating dynamical statistics for such processes is to use many short time series of observations of the system to construct a Markov state model, which approximates the dynamics of the system as memoryless transitions between a set of discrete states. The dynamical Galerkin approximation (DGA) is a closely related framework for estimating dynamical statistics, such as committors and mean first passage times, by approximating solutions to their equations with a projection onto a basis. Because the projected dynamics are generally not memoryless, the Markov approximation can result in significant systematic errors. Inspired by quasi-Markov state models, which employ the generalized master equation to encode memory resulting from the projection, we reformulate DGA to account for memory and analyze its performance on two systems: a two-dimensional triple well and the AIB9 peptide. We demonstrate that our method is robust to the choice of basis and can decrease the time series length required to obtain accurate kinetics by an order of magnitude.
Collapse
Affiliation(s)
- Chatipat Lorpaiboon
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Spencer C. Guo
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - John Strahan
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
10
|
Guo SC, Shen R, Roux B, Dinner AR. Dynamics of activation in the voltage-sensing domain of Ciona intestinalis phosphatase Ci-VSP. Nat Commun 2024; 15:1408. [PMID: 38360718 PMCID: PMC10869754 DOI: 10.1038/s41467-024-45514-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 01/25/2024] [Indexed: 02/17/2024] Open
Abstract
The Ciona intestinalis voltage-sensing phosphatase (Ci-VSP) is a membrane protein containing a voltage-sensing domain (VSD) that is homologous to VSDs from voltage-gated ion channels responsible for cellular excitability. Previously published crystal structures of Ci-VSD in putative resting and active conformations suggested a helical-screw voltage sensing mechanism in which the S4 helix translocates and rotates to enable exchange of salt-bridge partners, but the microscopic details of the transition between the resting and active conformations remained unknown. Here, by combining extensive molecular dynamics simulations with a recently developed computational framework based on dynamical operators, we elucidate the microscopic mechanism of the resting-active transition at physiological membrane potential. Sparse regression reveals a small set of coordinates that distinguish intermediates that are hidden from electrophysiological measurements. The intermediates arise from a noncanonical helical-screw mechanism in which translocation, rotation, and side-chain movement of the S4 helix are only loosely coupled. These results provide insights into existing experimental and computational findings on voltage sensing and suggest ways of further probing its mechanism.
Collapse
Affiliation(s)
- Spencer C Guo
- Department of Chemistry, The University of Chicago, Chicago, IL, 60637, USA
- James Franck Institute, The University of Chicago, Chicago, IL, 60637, USA
| | - Rong Shen
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| | - Benoît Roux
- Department of Chemistry, The University of Chicago, Chicago, IL, 60637, USA.
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA.
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, 60637, USA.
| | - Aaron R Dinner
- Department of Chemistry, The University of Chicago, Chicago, IL, 60637, USA.
- James Franck Institute, The University of Chicago, Chicago, IL, 60637, USA.
- Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL, 60637, USA.
| |
Collapse
|
11
|
Maruyama Y, Mitsutake A. Effect of Main and Side Chains on the Folding Mechanism of the Trp-Cage Miniprotein. ACS OMEGA 2023; 8:43827-43835. [PMID: 38027385 PMCID: PMC10666239 DOI: 10.1021/acsomega.3c05809] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 09/19/2023] [Accepted: 10/27/2023] [Indexed: 12/01/2023]
Abstract
Proteins that do not fold into their functional native state have been linked to diseases. In this study, the influence of the main and side chains of individual amino acids on the folding of the tryptophan cage (Trp-cage), a designed 20-residue miniprotein, was analyzed. For this purpose, we calculated the solvation free energy (SFE) contributions of individual atoms by using the 3D-reference interaction site model with the atomic decomposition method. The mechanism by which the Trp-cage is stabilized during the folding process was examined by calculating the total energy, which is the sum of the conformational energy and SFE. The folding process of the Trp-cage resulted in a stable native state, with a total energy that was 62.4 kcal/mol lower than that of the unfolded state. The solvation entropy, which is considered to be responsible for the hydrophobic effect, contributed 31.3 kcal/mol to structural stabilization. In other words, the contribution of the solvation entropy accounted for approximately half of the total contribution to Trp-cage folding. The hydrophobic core centered on Trp6 contributed 15.6 kcal/mol to the total energy, whereas the solvation entropy contribution was 6.3 kcal/mol. The salt bridge formed by the hydrophilic side chains of Asp9 and Arg16 contributed 10.9 and 5.0 kcal/mol, respectively. This indicates that not only the hydrophobic core but also the salt bridge of the hydrophilic side chains gain solvation entropy and contribute to stabilizing the native structure of the Trp-cage.
Collapse
Affiliation(s)
- Yutaka Maruyama
- Data
Science Center for Creative Design and Manufacturing, The Institute of Statistical Mathematics, 10-3 Midori-cho, Tachikawa, Tokyo 190-8562, Japan
- Department
of Physics, School of Science and Technology, Meiji University, 1-1-1
Higashi-Mita, Tama-ku, Kawasaki-shi, Kanagawa 214-8571, Japan
| | - Ayori Mitsutake
- Department
of Physics, School of Science and Technology, Meiji University, 1-1-1
Higashi-Mita, Tama-ku, Kawasaki-shi, Kanagawa 214-8571, Japan
| |
Collapse
|
12
|
Bajpai S, Petkov BK, Tong M, Abreu CRA, Nair NN, Tuckerman ME. An interoperable implementation of collective-variable based enhanced sampling methods in extended phase space within the OpenMM package. J Comput Chem 2023; 44:2166-2183. [PMID: 37464902 DOI: 10.1002/jcc.27182] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 05/30/2023] [Accepted: 06/06/2023] [Indexed: 07/20/2023]
Abstract
Collective variable (CV)-based enhanced sampling techniques are widely used today for accelerating barrier-crossing events in molecular simulations. A class of these methods, which includes temperature accelerated molecular dynamics (TAMD)/driven-adiabatic free energy dynamics (d-AFED), unified free energy dynamics (UFED), and temperature accelerated sliced sampling (TASS), uses an extended variable formalism to achieve quick exploration of conformational space. These techniques are powerful, as they enhance the sampling of a large number of CVs simultaneously compared to other techniques. Extended variables are kept at a much higher temperature than the physical temperature by ensuring adiabatic separation between the extended and physical subsystems and employing rigorous thermostatting. In this work, we present a computational platform to perform extended phase space enhanced sampling simulations using the open-source molecular dynamics engine OpenMM. The implementation allows users to have interoperability of sampling techniques, as well as employ state-of-the-art thermostats and multiple time-stepping. This work also presents protocols for determining the critical parameters and procedures for reconstructing high-dimensional free energy surfaces. As a demonstration, we present simulation results on the high dimensional conformational landscapes of the alanine tripeptide in vacuo, tetra-N-methylglycine (tetra-sarcosine) peptoid in implicit solvent, and the Trp-cage mini protein in explicit water.
Collapse
Affiliation(s)
- Shitanshu Bajpai
- Department of Chemistry, Indian Institute of Technology Kanpur (IITK), Kanpur, India
| | - Brian K Petkov
- Department of Chemistry, New York University (NYU), New York, New York, USA
| | - Muchen Tong
- Department of Chemistry, New York University (NYU), New York, New York, USA
| | - Charlles R A Abreu
- Chemical Engineering Department, Escola de Química, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Nisanth N Nair
- Department of Chemistry, Indian Institute of Technology Kanpur (IITK), Kanpur, India
| | - Mark E Tuckerman
- Department of Chemistry, New York University (NYU), New York, New York, USA
- Courant Institute of Mathematical Sciences, New York University (NYU), New York, New York, USA
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, China
- Simons Center for Computational Physical Chemistry, New York University, New York, New York, USA
| |
Collapse
|
13
|
Strahan J, Finkel J, Dinner AR, Weare J. Predicting rare events using neural networks and short-trajectory data. JOURNAL OF COMPUTATIONAL PHYSICS 2023; 488:112152. [PMID: 37332834 PMCID: PMC10270692 DOI: 10.1016/j.jcp.2023.112152] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
Estimating the likelihood, timing, and nature of events is a major goal of modeling stochastic dynamical systems. When the event is rare in comparison with the timescales of simulation and/or measurement needed to resolve the elemental dynamics, accurate prediction from direct observations becomes challenging. In such cases a more effective approach is to cast statistics of interest as solutions to Feynman-Kac equations (partial differential equations). Here, we develop an approach to solve Feynman-Kac equations by training neural networks on short-trajectory data. Our approach is based on a Markov approximation but otherwise avoids assumptions about the underlying model and dynamics. This makes it applicable to treating complex computational models and observational data. We illustrate the advantages of our method using a low-dimensional model that facilitates visualization, and this analysis motivates an adaptive sampling strategy that allows on-the-fly identification of and addition of data to regions important for predicting the statistics of interest. Finally, we demonstrate that we can compute accurate statistics for a 75-dimensional model of sudden stratospheric warming. This system provides a stringent test bed for our method.
Collapse
Affiliation(s)
- John Strahan
- Department of Chemistry and James Franck Institute, the University of Chicago, Chicago, IL 60637
| | - Justin Finkel
- Department of Earth, Atmospheric, and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, the University of Chicago, Chicago, IL 60637
- Committee on Computational and Applied Mathematics, the University of Chicago, Chicago, IL 60637
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012
| |
Collapse
|
14
|
Strahan J, Guo SC, Lorpaiboon C, Dinner AR, Weare J. Inexact iterative numerical linear algebra for neural network-based spectral estimation and rare-event prediction. J Chem Phys 2023; 159:014110. [PMID: 37409704 PMCID: PMC10328561 DOI: 10.1063/5.0151309] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Accepted: 06/02/2023] [Indexed: 07/07/2023] Open
Abstract
Understanding dynamics in complex systems is challenging because there are many degrees of freedom, and those that are most important for describing events of interest are often not obvious. The leading eigenfunctions of the transition operator are useful for visualization, and they can provide an efficient basis for computing statistics, such as the likelihood and average time of events (predictions). Here, we develop inexact iterative linear algebra methods for computing these eigenfunctions (spectral estimation) and making predictions from a dataset of short trajectories sampled at finite intervals. We demonstrate the methods on a low-dimensional model that facilitates visualization and a high-dimensional model of a biomolecular system. Implications for the prediction problem in reinforcement learning are discussed.
Collapse
Affiliation(s)
- John Strahan
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Spencer C. Guo
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Chatipat Lorpaiboon
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| |
Collapse
|
15
|
Abstract
The treatment of slow and rare transitions in the simulation of complex systems poses a great computational challenge. A powerful approach to tackle this challenge is the string method, which represents the transition path as a one-dimensional curve in a multidimensional space of collective variables. Commonly used strategies for pathway optimization include aligning the tangent of the string to the local mean force or to the mean drift determined from swarms of short trajectories. Here, a novel strategy is proposed, allowing the string to be optimized based on a variational principle involving the unidirectional reactive flux expressed in terms of the time-correlation function of the committor. The method is illustrated with model systems and then probed with the alanine dipeptide and a coarse-grained model of the barstar-barnase protein complex. Successive iterations variationally refine the string toward an optimal transition pathway following the gradient of the committor between two metastable states.
Collapse
Affiliation(s)
- Ziwei He
- Department of Chemistry, The University of Chicago, 5735 S. Ellis Avenue, Chicago60637, Illinois, United States
| | - Christophe Chipot
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche No. 7019, Université de Lorraine, B.P. 70239, Vandœuvre-lès-Nancy cedex54506, France
| | - Benoît Roux
- Department of Chemistry, The University of Chicago, 5735 S. Ellis Avenue, Chicago60637, Illinois, United States
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago60637, IllinoisUnited States
| |
Collapse
|
16
|
Lorpaiboon C, Weare J, Dinner AR. Augmented transition path theory for sequences of events. J Chem Phys 2022; 157:094115. [PMID: 36075728 PMCID: PMC9458294 DOI: 10.1063/5.0098587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 07/29/2022] [Indexed: 11/14/2022] Open
Abstract
Transition path theory provides a statistical description of the dynamics of a reaction in terms of local spatial quantities. In its original formulation, it is limited to reactions that consist of trajectories flowing from a reactant set A to a product set B. We extend the basic concepts and principles of transition path theory to reactions in which trajectories exhibit a specified sequence of events and illustrate the utility of this generalization on examples.
Collapse
Affiliation(s)
- Chatipat Lorpaiboon
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
17
|
Vani BP, Weare J, Dinner AR. Computing transition path theory quantities with trajectory stratification. J Chem Phys 2022; 157:034106. [PMID: 35868925 PMCID: PMC9296190 DOI: 10.1063/5.0087058] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 06/23/2022] [Indexed: 11/15/2022] Open
Abstract
Transition path theory computes statistics from ensembles of reactive trajectories. A common strategy for sampling reactive trajectories is to control the branching and pruning of trajectories so as to enhance the sampling of low probability segments. However, it can be challenging to apply transition path theory to data from such methods because determining whether configurations and trajectory segments are part of reactive trajectories requires looking backward and forward in time. Here, we show how this issue can be overcome efficiently by introducing simple data structures. We illustrate the approach in the context of nonequilibrium umbrella sampling, but the strategy is general and can be used to obtain transition path theory statistics from other methods that sample segments of unbiased trajectories.
Collapse
Affiliation(s)
- Bodhi P. Vani
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
18
|
Kapakayala AB, Nair NN. Boosting the conformational sampling by combining replica exchange with solute tempering and well-sliced metadynamics. J Comput Chem 2021; 42:2233-2240. [PMID: 34585768 DOI: 10.1002/jcc.26752] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2021] [Revised: 08/30/2021] [Accepted: 09/12/2021] [Indexed: 01/22/2023]
Abstract
Methods that combine collective variable (CV) based enhanced sampling and global tempering approaches are used in speeding-up the conformational sampling and free energy calculation of large and soft systems with a plethora of energy minima. In this paper, a new method of this kind is proposed in which the well-sliced metadynamics approach (WSMTD) is united with replica exchange with solute tempering (REST2) method. WSMTD employs a divide-and-conquer strategy wherein high-dimensional slices of a free energy surface are independently sampled and combined. The method enables one to accomplish a controlled exploration of the CV-space with a restraining bias as in umbrella sampling, and enhance-sampling of one or more orthogonal CVs using a metadynamics like bias. The new hybrid method proposed here enables boosting the sampling of more slow degrees of freedom in WSMTD simulations, without the need to specify associated CVs, through a replica exchange scheme within the framework of REST2. The high-dimensional slices of the probability distributions of CVs computed from the united WSMTD and REST2 simulations are subsequently combined using the weighted histogram analysis method to obtain the free energy surface. We show that the new method proposed here is accurate, improves the conformational sampling, and achieves quick convergence in free energy estimates. We demonstrate this by computing the conformational free energy landscapes of solvated alanine tripeptide and Trp-cage mini protein in explicit water.
Collapse
Affiliation(s)
- Anji Babu Kapakayala
- Department of Chemistry, Indian Institute of Technology Kanpur, Kanpur, India.,School of Pharmacy and Biomedical Sciences, Curtin University, Perth, Australia
| | - Nisanth N Nair
- Department of Chemistry, Indian Institute of Technology Kanpur, Kanpur, India
| |
Collapse
|
19
|
Antoszewski A, Lorpaiboon C, Strahan J, Dinner AR. Kinetics of Phenol Escape from the Insulin R 6 Hexamer. J Phys Chem B 2021; 125:11637-11649. [PMID: 34648712 DOI: 10.1021/acs.jpcb.1c06544] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Therapeutic preparations of insulin often contain phenolic molecules, which can impact both pharmacokinetics and shelf life. Thus, understanding the interactions of insulin and phenolic molecules can aid in designing improved therapeutics. In this study, we use molecular dynamics to investigate phenol release from the insulin hexamer. Leveraging recent advances in methods for analyzing molecular dynamics data, we expand on existing simulation studies to identify and quantitatively characterize six phenol binding/unbinding pathways for wild-type and A10 Ile → Val and B13 Glu → Gln mutant insulins. A number of these pathways involve large-scale opening of the primary escape channel, suggesting that the hexamer is much more dynamic than previously appreciated. We show that phenol unbinding is a multipathway process, with no single pathway representing more than 50% of the reactive current and all pathways representing at least 10%. We use the mutant simulations to show how the contributions of specific pathways can be rationally manipulated. Predicting the net effects of mutations is more challenging because the kinetics depend on all of the pathways, demanding quantitatively accurate simulations and experiments.
Collapse
Affiliation(s)
- Adam Antoszewski
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - Chatipat Lorpaiboon
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - John Strahan
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States
| | - Aaron R Dinner
- Department of Chemistry, The University of Chicago, Chicago, Illinois 60637, United States.,James Franck Institute, The University of Chicago, Chicago, Illinois 60637, United States.,Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|