1
|
Duraisamy B, Pramanik D. Influence of DNA Sequences on the Thermodynamic and Structural Stability of the ZTA Transcription Factor─DNA Complex: An All-Atom Molecular Dynamics Study. J Phys Chem B 2025; 129:4282-4297. [PMID: 40266646 DOI: 10.1021/acs.jpcb.4c07713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/24/2025]
Abstract
The Epstein-Barr virus (EBV) is one of the cancer-causing gamma-type viruses. Although more than 90% of people are infected by this virus at some point, it remains in the body in a latent state, typically causing only minor symptoms. Our current understanding is that a known transcription factor (TF), the ZTA protein, binds with dsDNA (double-stranded DNA) and plays a crucial role in mediating the viral latent-to-lytic cycle through binding of specific ZTA-responsive elements (ZREs). However, there is no clear understanding of the effect of DNA sequences on the structural stability and quantitative estimation of the binding affinity between ZTA TF and DNA, along with their mechanistic details. In this study, we employed classical all-atom molecular dynamics and enhanced sampling simulations to study the ZTA-dsDNA structural properties, thermodynamics, and mechanistic details for the ZTA protein and for two different dsDNA systems: the core motif and the core motif with flanking end sequences. We conducted residue-level and nucleic acid-level analyses to assess the important protein residues and DNA bases forming interactions between the ZTA and dsDNA systems. We also explored the effect of adding flanking end sequences to the core motif on DNA groove lengths and interstrand hydrogen bonds. Our results indicate that the flanking sequences surrounding the core motif significantly influence the structural stability and binding affinity of the ZTA-dsDNA complex. Among ZRE 1, ZRE 2, and ZRE 3, particularly when paired with their naturally occurring flanking ends, ZRE 3 exhibits higher stability and binding affinity. These findings provide insights into the molecular mechanisms underlying EBV pathogenesis and may indicate potential targets for therapeutic intervention. A detailed explanation of the binding mechanisms will allow for the design of better-targeted therapies against EBV-associated cancers. This study will serve as a holistic benchmark for future studies of these viral protein interactions.
Collapse
Affiliation(s)
- Boobalan Duraisamy
- Department of Physics, SRM University AP, Amaravati 522 240, Andhra Pradesh, India
| | - Debabrata Pramanik
- Department of Physics, SRM University AP, Amaravati 522 240, Andhra Pradesh, India
- Centre for Computational and Integrative Sciences, SRM University AP, Amaravati 522 240, Andhra Pradesh, India
| |
Collapse
|
2
|
Mondal K, Klauda JB. Physically interpretable performance metrics for clustering. J Chem Phys 2024; 161:244106. [PMID: 39723706 DOI: 10.1063/5.0241122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Accepted: 11/21/2024] [Indexed: 12/28/2024] Open
Abstract
Clustering is a type of machine learning technique, which is used to group huge amounts of data based on their similarity into separate groups or clusters. Clustering is a very important task that is nowadays used to analyze the huge and diverse amount of data coming out of molecular dynamics (MD) simulations. Typically, the data from the MD simulations in terms of their various frames in the trajectory are clustered into different groups and a representative element from each group is studied separately. Now, a very important question coming in this process is: what is the quality of the clusters that are obtained? There are several performance metrics that are available in the literature such as the silhouette index and the Davies-Bouldin Index that are often used to analyze the quality of clustering. However, most of these metrics focus on the overlap or the similarity of the clusters in the reduced dimension that is used for clustering and do not focus on the physically important properties or the parameters of the system. To address this issue, we have developed two physically interpretable scoring metrics that focus on the physical parameters of the system that we are analyzing. We have used and tested our algorithm on three different systems: (1) Ising model, (2) peptide folding and unfolding of WT HP35, (3) a protein-ligand trajectory of an enzyme and substrate, and (4) a protein-ligand dissociated trajectory. We show that the scoring metrics provide us clusters that match with our physical intuition about the systems.
Collapse
Affiliation(s)
- Kinjal Mondal
- Institute for Physical Science and Technology, Biophysics Program, University of Maryland, College Park, Maryland 20742, USA
| | - Jeffery B Klauda
- Institute for Physical Science and Technology, Biophysics Program, University of Maryland, College Park, Maryland 20742, USA
- Department of Chemical and Biomolecular Engineering, University of Maryland, College Park, Maryland 20742, USA
| |
Collapse
|
3
|
Kumar A, Kukal S, Marepalli A, Kumar S, Govindarajan S, Pramanik D. Probing the Molecular Interactions of A22 with Prokaryotic Actin MreB and Eukaryotic Actin: A Computational and Experimental Study. J Phys Chem B 2024; 128:10553-10564. [PMID: 39413431 DOI: 10.1021/acs.jpcb.4c02963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2024]
Abstract
Actin is a major cytoskeletal system that mediates the intricate organization of macromolecules within cells. The bacterial cytoskeletal protein MreB is a prokaryotic actin-like protein governing the cell shape and intracellular organization in many rod-shaped bacteria, including pathogens. MreB stands as a target for antibiotic development, and compounds like A22 and its analogue, MP265, are identified as potent inhibitors of MreB. The bacterial actin MreB shares structural homology with eukaryotic actin despite lacking sequence similarity. It is currently not clear whether small molecules that inhibit MreB can act on eukaryotic actin due to their structural similarity. In this study, we investigate the molecular interactions between A22 and its analogue MP265 with MreB and eukaryotic actin through a molecular dynamics approach. Employing MD simulations and free energy calculations with an all-atom model, we unveil the robust interaction of A22 and MP265 with MreB, and substantial binding affinity is observed for A22 and MP265 with eukaryotic actin. Experimental assays reveal A22's toxicity to eukaryotic cells, including yeast and human glioblastoma cells. Microscopy analysis demonstrates the profound effects of A22 on actin organization in human glioblastoma cells. This integrative computational and experimental study provides new insights into A22's mode of action, highlighting its potential as a versatile tool for probing the dynamics of both prokaryotic and eukaryotic actins.
Collapse
Affiliation(s)
- Anuj Kumar
- Department of Physics, SRM University - AP, Amaravati, Andhra Pradesh 522 240, India
| | - Samiksha Kukal
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi, Hauz Khas 110016, India
| | - Anusha Marepalli
- Department of Biological Sciences, SRM University - AP, Amaravati, Andhra Pradesh 522 240, India
| | - Saran Kumar
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi, Hauz Khas 110016, India
| | - Sutharsan Govindarajan
- Department of Biological Sciences, SRM University - AP, Amaravati, Andhra Pradesh 522 240, India
| | - Debabrata Pramanik
- Department of Physics, SRM University - AP, Amaravati, Andhra Pradesh 522 240, India
- Centre for Computational and Integrative Sciences, SRM University - AP, Amaravati, Andhra Pradesh 522 240, India
| |
Collapse
|
4
|
Ray D, Parrinello M. Data-driven classification of ligand unbinding pathways. Proc Natl Acad Sci U S A 2024; 121:e2313542121. [PMID: 38412121 PMCID: PMC10927508 DOI: 10.1073/pnas.2313542121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 01/26/2024] [Indexed: 02/29/2024] Open
Abstract
Studying the pathways of ligand-receptor binding is essential to understand the mechanism of target recognition by small molecules. The binding free energy and kinetics of protein-ligand complexes can be computed using molecular dynamics (MD) simulations, often in quantitative agreement with experiments. However, only a qualitative picture of the ligand binding/unbinding paths can be obtained through a conventional analysis of the MD trajectories. Besides, the higher degree of manual effort involved in analyzing pathways limits its applicability in large-scale drug discovery. Here, we address this limitation by introducing an automated approach for analyzing molecular transition paths with a particular focus on protein-ligand dissociation. Our method is based on the dynamic time-warping algorithm, originally designed for speech recognition. We accurately classified molecular trajectories using a very generic descriptor set of contacts or distances. Our approach outperforms manual classification by distinguishing between parallel dissociation channels, within the pathways identified by visual inspection. Most notably, we could compute exit-path-specific ligand-dissociation kinetics. The unbinding timescale along the fastest path agrees with the experimental residence time, providing a physical interpretation to our entirely data-driven protocol. In combination with appropriate enhanced sampling algorithms, this technique can be used for the initial exploration of ligand-dissociation pathways as well as for calculating path-specific thermodynamic and kinetic properties.
Collapse
Affiliation(s)
- Dhiman Ray
- Simulations Research Line, Italian Institute of Technology, Via Enrico Melen 83, GenovaGE16152, Italy
| | - Michele Parrinello
- Simulations Research Line, Italian Institute of Technology, Via Enrico Melen 83, GenovaGE16152, Italy
| |
Collapse
|
5
|
Fu H, Liu H, Xing J, Zhao T, Shao X, Cai W. Deep-Learning-Assisted Enhanced Sampling for Exploring Molecular Conformational Changes. J Phys Chem B 2023; 127:9926-9935. [PMID: 37947397 DOI: 10.1021/acs.jpcb.3c05284] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023]
Abstract
We present a novel strategy to explore conformational changes and identify stable states of molecular objects, eliminating the need for a priori knowledge. The approach applies a deep learning method to extract information about the movement modes of the molecular object from a short, high-dimensional, and parameter-free preliminary enhanced-sampling simulation. The gathered information is described by a small set of deep-learning-based collective variables (dCVs), which steer the production-enhanced-sampling simulation. Considering the challenge of adequately exploring the configurational space using the low-dimensional, suboptimal dCVs, we incorporate a method designed for ergodic sampling, namely, Gaussian-accelerated molecular dynamics (MD), into the framework of CV-based enhanced sampling. MD simulations on both toy models and nontrivial examples demonstrate the remarkable computational efficiency of the strategy in capturing the conformational changes of molecular objects without a priori knowledge. Specifically, we achieved the blind folding of two fast folders, chignolin and villin, within a time scale of hundreds of nanoseconds and successfully reconstructed the free-energy landscapes that characterize their reversible folding. All in all, the presented strategy holds significant promise for investigating conformational changes in macromolecules, and it is anticipated to find extensive applications in the fields of chemistry and biology.
Collapse
Affiliation(s)
- Haohao Fu
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Han Liu
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Jingya Xing
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Tong Zhao
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
| | - Xueguang Shao
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Wensheng Cai
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| |
Collapse
|
6
|
Ray D, Parrinello M. Kinetics from Metadynamics: Principles, Applications, and Outlook. J Chem Theory Comput 2023; 19:5649-5670. [PMID: 37585703 DOI: 10.1021/acs.jctc.3c00660] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Metadynamics is a popular enhanced sampling algorithm for computing the free energy landscape of rare events by using molecular dynamics simulation. Ten years ago, Tiwary and Parrinello introduced the infrequent metadynamics approach for calculating the kinetics of transitions across free energy barriers. Since then, metadynamics-based methods for obtaining rate constants have attracted significant attention in computational molecular science. Such methods have been applied to study a wide range of problems, including protein-ligand binding, protein folding, conformational transitions, chemical reactions, catalysis, and nucleation. Here, we review the principles of elucidating kinetics from metadynamics-like approaches, subsequent methodological developments in this area, and successful applications on chemical, biological, and material systems. We also highlight the challenges of reconstructing accurate kinetics from enhanced sampling simulations and the scope of future developments.
Collapse
Affiliation(s)
- Dhiman Ray
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Michele Parrinello
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| |
Collapse
|
7
|
Wong CF. 15 Years of molecular simulation of drug-binding kinetics. Expert Opin Drug Discov 2023; 18:1333-1348. [PMID: 37789731 PMCID: PMC10926948 DOI: 10.1080/17460441.2023.2264770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Accepted: 09/26/2023] [Indexed: 10/05/2023]
Abstract
INTRODUCTION Drug-binding kinetics has been increasingly recognized as an important factor to be considered in drug discovery. Long residence time could prolong the action of some drugs while produce toxicity on others. Early evaluation of the binding kinetics of drug candidates could reduce attrition rate late in the drug discovery process. Computational prediction of drug-binding kinetics is useful as compounds can be evaluated even before they are made. However, simulation of drug-binding kinetics is a challenging problem because of the long-time scale involved. Nevertheless, significant progress has been made. AREAS COVERED This review illustrates the rapid evolution of qualitative to quantitative molecular dynamics-based methods that have been developed over the last 15 years. EXPERT OPINION The development of new methods based on molecular dynamics simulations now enables computation of absolute association/dissociation rate constants. Cheaper methods capable of identifying candidates with fast or slow binding kinetics, or rank-ordering rate constants are also available. Together, these methods have generated useful insights into the molecular mechanisms of drug-binding kinetics, and the design of drug candidates with therapeutically favorable kinetics. Although predicting absolute rate constants is still expensive and challenging, rapid improvement is expected in the coming years with the continuing refinement of current technologies, development of new methodologies, and the utilization of machine learning.
Collapse
Affiliation(s)
- Chung F Wong
- Department of Chemistry and Biochemistry, University of Missouri-St. Louis, St. Louis, MO, USA
| |
Collapse
|
8
|
Chennakesavalu S, Toomer DJ, Rotskoff GM. Ensuring thermodynamic consistency with invertible coarse-graining. J Chem Phys 2023; 158:124126. [PMID: 37003724 DOI: 10.1063/5.0141888] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023] Open
Abstract
Coarse-grained models are a core computational tool in theoretical chemistry and biophysics. A judicious choice of a coarse-grained model can yield physical insights by isolating the essential degrees of freedom that dictate the thermodynamic properties of a complex, condensed-phase system. The reduced complexity of the model typically leads to lower computational costs and more efficient sampling compared with atomistic models. Designing "good" coarse-grained models is an art. Generally, the mapping from fine-grained configurations to coarse-grained configurations itself is not optimized in any way; instead, the energy function associated with the mapped configurations is. In this work, we explore the consequences of optimizing the coarse-grained representation alongside its potential energy function. We use a graph machine learning framework to embed atomic configurations into a low-dimensional space to produce efficient representations of the original molecular system. Because the representation we obtain is no longer directly interpretable as a real-space representation of the atomic coordinates, we also introduce an inversion process and an associated thermodynamic consistency relation that allows us to rigorously sample fine-grained configurations conditioned on the coarse-grained sampling. We show that this technique is robust, recovering the first two moments of the distribution of several observables in proteins such as chignolin and alanine dipeptide.
Collapse
Affiliation(s)
| | - David J Toomer
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| | - Grant M Rotskoff
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
9
|
Predicting efficacy of drug-carrier nanoparticle designs for cancer treatment: a machine learning-based solution. Sci Rep 2023; 13:547. [PMID: 36631637 PMCID: PMC9834306 DOI: 10.1038/s41598-023-27729-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 01/06/2023] [Indexed: 01/13/2023] Open
Abstract
Molecular Dynamic (MD) simulations are very effective in the discovery of nanomedicines for treating cancer, but these are computationally expensive and time-consuming. Existing studies integrating machine learning (ML) into MD simulation to enhance the process and enable efficient analysis cannot provide direct insights without the complete simulation. In this study, we present an ML-based approach for predicting the solvent accessible surface area (SASA) of a nanoparticle (NP), denoting its efficacy, from a fraction of the MD simulations data. The proposed framework uses a time series model for simulating the MD, resulting in an intermediate state, and a second model to calculate the SASA in that state. Empirically, the solution can predict the SASA value 260 timesteps ahead 7.5 times faster with a very low average error of 1956.93. We also introduce the use of an explainability technique to validate the predictions. This work can reduce the computational expense of both processing and data size greatly while providing reliable solutions for the nanomedicine design process.
Collapse
|
10
|
Abstract
The treatment of slow and rare transitions in the simulation of complex systems poses a great computational challenge. A powerful approach to tackle this challenge is the string method, which represents the transition path as a one-dimensional curve in a multidimensional space of collective variables. Commonly used strategies for pathway optimization include aligning the tangent of the string to the local mean force or to the mean drift determined from swarms of short trajectories. Here, a novel strategy is proposed, allowing the string to be optimized based on a variational principle involving the unidirectional reactive flux expressed in terms of the time-correlation function of the committor. The method is illustrated with model systems and then probed with the alanine dipeptide and a coarse-grained model of the barstar-barnase protein complex. Successive iterations variationally refine the string toward an optimal transition pathway following the gradient of the committor between two metastable states.
Collapse
Affiliation(s)
- Ziwei He
- Department of Chemistry, The University of Chicago, 5735 S. Ellis Avenue, Chicago60637, Illinois, United States
| | - Christophe Chipot
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche No. 7019, Université de Lorraine, B.P. 70239, Vandœuvre-lès-Nancy cedex54506, France
| | - Benoît Roux
- Department of Chemistry, The University of Chicago, 5735 S. Ellis Avenue, Chicago60637, Illinois, United States
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago60637, IllinoisUnited States
| |
Collapse
|
11
|
Pramanik D, Pawar AB, Roy S, Singh JK. Mechanistic insights of key host proteins and potential repurposed inhibitors regulating SARS-CoV-2 pathway. J Comput Chem 2022; 43:1237-1250. [PMID: 35535951 PMCID: PMC9348233 DOI: 10.1002/jcc.26888] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 04/03/2022] [Accepted: 04/22/2022] [Indexed: 12/16/2022]
Abstract
The emergence of pandemic situations originated from severe acute respiratory syndrome (SARS)‐CoV‐2 and its new variants created worldwide medical emergencies. Due to the non‐availability of efficient drugs and vaccines at these emergency hours, repurposing existing drugs can effectively treat patients critically infected by SARS‐CoV‐2. Finding a suitable repurposing drug with inhibitory efficacy to a host‐protein is challenging. A detailed mechanistic understanding of the kinetics, (dis)association pathways, key protein residues facilitating the entry–exit of the drugs with targets are fundamental in selecting these repurposed drugs. Keeping this target as the goal of the paper, the potential repurposing drugs, Nafamostat, Camostat, Silmitasertib, Valproic acid, and Zotatifin with host‐proteins HDAC2, CSK22, eIF4E2 are studied to elucidate energetics, kinetics, and dissociation pathways. From an ensemble of independent simulations, we observed the presence of single or multiple dissociation pathways with varying host‐proteins‐drug systems and quantitatively estimated the probability of unbinding through these specific pathways. We also explored the crucial gateway residues facilitating these dissociation mechanisms. Interestingly, the residues we obtained for HDAC2 and CSK22 are also involved in the catalytic activity. Our results demonstrate how these potential drugs interact with the host machinery and the specific target residues, showing involvement in the mechanism. Most of these drugs are in the preclinical phase, and some are already being used to treat severe COVID‐19 patients. Hence, the mechanistic insight presented in this study is envisaged to support further findings of clinical studies and eventually develop efficient inhibitors to treat SARS‐CoV‐2.
Collapse
Affiliation(s)
- Debabrata Pramanik
- Department of Chemical Engineering, Indian Institute of Technology Kanpur, Kanpur, India
| | | | - Sudip Roy
- Prescience Insilico Private Limited, Bangalore, India
| | - Jayant Kumar Singh
- Department of Chemical Engineering, Indian Institute of Technology Kanpur, Kanpur, India.,Prescience Insilico Private Limited, Bangalore, India
| |
Collapse
|
12
|
Abstract
The kinetics of a dynamical system dominated by two metastable states is examined from the perspective of the activated-dynamics reactive flux formalism, Markov state eigenvalue spectral decomposition, and committor-based transition path theory. Analysis shows that the different theoretical formulations are consistent, clarifying the significance of the inherent microscopic lag-times that are implicated, and that the most meaningful one-dimensional reaction coordinate in the region of the transition state is along the gradient of the committor in the multidimensional subspace of collective variables. It is shown that the familiar reactive flux activated dynamics formalism provides an effective route to calculate the transition rate in the case of a narrow sharp barrier but much less so in the case of a broad flat barrier. In this case, the standard reactive flux correlation function decays very slowly to the plateau value that corresponds to the transmission coefficient. Treating the committor function as a reaction coordinate does not alleviate all issues caused by the slow relaxation of the reactive flux correlation function. A more efficient activated dynamics simulation algorithm may be achieved from a modified reactive flux weighted by the committor. Simulation results on simple systems are used to illustrate the various conceptual points.
Collapse
Affiliation(s)
- Benoît Roux
- Department of Biochemistry and Molecular Biology, Department of Chemistry, The University of Chicago, 5735 S Ellis Ave., Chicago, Illinois 60637, USA
| |
Collapse
|
13
|
Chen H, Ogden D, Pant S, Cai W, Tajkhorshid E, Moradi M, Roux B, Chipot C. A Companion Guide to the String Method with Swarms of Trajectories: Characterization, Performance, and Pitfalls. J Chem Theory Comput 2022; 18:1406-1422. [PMID: 35138832 PMCID: PMC8904302 DOI: 10.1021/acs.jctc.1c01049] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The string method with swarms of trajectories (SMwST) is an algorithm that identifies a physically meaningful transition pathway─a one-dimensional curve, embedded within a high-dimensional space of selected collective variables. The SMwST algorithm leans on a series of short, unbiased molecular dynamics simulations spawned at different locations of the discretized path, from whence an average dynamic drift is determined to evolve the string toward an optimal pathway. However conceptually simple in both its theoretical formulation and practical implementation, the SMwST algorithm is computationally intensive and requires a careful choice of parameters for optimal cost-effectiveness in applications to challenging problems in chemistry and biology. In this contribution, the SMwST algorithm is presented in a self-contained manner, discussing with a critical eye its theoretical underpinnings, applicability, inherent limitations, and use in the context of path-following free-energy calculations and their possible extension to kinetics modeling. Through multiple simulations of a prototypical polypeptide, combining the search of the transition pathway and the computation of the potential of mean force along it, several practical aspects of the methodology are examined with the objective of optimizing the computational effort, yet without sacrificing accuracy. In light of the results reported here, we propose some general guidelines aimed at improving the efficiency and reliability of the computed pathways and free-energy profiles underlying the conformational transitions at hand.
Collapse
Affiliation(s)
- Haochuan Chen
- Research Center for Analytical Sciences, College of Chemistry, Tianjin Key Laboratory of Biosensing and Molecular Recognition, Nankai University, Tianjin 300071, China
- Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche no 7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy Cedex, France
| | - Dylan Ogden
- Department of Chemistry and Biochemistry, University of Arkansas, Fayetteville, Arkansas 72701, United States
| | - Shashank Pant
- Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Wensheng Cai
- Research Center for Analytical Sciences, College of Chemistry, Tianjin Key Laboratory of Biosensing and Molecular Recognition, Nankai University, Tianjin 300071, China
| | - Emad Tajkhorshid
- Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Biochemistry and Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Mahmoud Moradi
- Department of Chemistry and Biochemistry, University of Arkansas, Fayetteville, Arkansas 72701, United States
| | - Benoît Roux
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637, United States
| | - Christophe Chipot
- Theoretical and Computational Biophysics Group, NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Laboratoire International Associé Centre National de la Recherche Scientifique et University of Illinois at Urbana-Champaign, Unité Mixte de Recherche no 7019, Université de Lorraine, B.P. 70239, 54506 Vandœuvre-lès-Nancy Cedex, France
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
14
|
Tsai ST, Smith Z, Tiwary P. SGOOP-d: Estimating Kinetic Distances and Reaction Coordinate Dimensionality for Rare Event Systems from Biased/Unbiased Simulations. J Chem Theory Comput 2021; 17:6757-6765. [PMID: 34662516 DOI: 10.1021/acs.jctc.1c00431] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Understanding kinetics including reaction pathways and associated transition rates is an important yet difficult problem in numerous chemical and biological systems, especially in situations with multiple competing pathways. When these high-dimensional systems are projected on low-dimensional coordinates, which are often needed for enhanced sampling or for interpretation of simulations and experiments, one can end up losing the kinetic connectivity of the underlying high-dimensional landscape. Thus, in the low-dimensional projection, metastable states might appear closer or further than they actually are. To deal with this issue, in this work, we develop a formalism that learns a multidimensional yet minimally complex reaction coordinate (RC) for generic high-dimensional systems. When projected along this RC, all possible kinetically relevant pathways can be demarcated and the true high-dimensional connectivity is maintained. One of the defining attributes of our method lies in that it can work on long unbiased simulations as well as biased simulations often needed for rare event systems. We demonstrate the utility of the method by studying a range of model systems including conformational transitions in a small peptide Ace-Ala3-Nme, where we show how two-dimensional and three-dimensional RCs found by our previously published spectral gap optimization method "SGOOP" [Tiwary, P. and Berne, B. J. Proc. Natl. Acad. Sci. 2016, 113, 2839] can capture the kinetics for 23 and all 28 out of the 28 dominant state-to-state transitions, respectively.
Collapse
Affiliation(s)
- Sun-Ting Tsai
- Department of Physics and Institute for Physical Science and Technology, University of Maryland, College Park 20742, Maryland, United States
| | - Zachary Smith
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, Maryland, United States
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park 20742, Maryland, United States
| |
Collapse
|
15
|
Smith Z, Tiwary P. Making High-Dimensional Molecular Distribution Functions Tractable through Belief Propagation on Factor Graphs. J Phys Chem B 2021; 125:11150-11158. [PMID: 34586819 DOI: 10.1021/acs.jpcb.1c05717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Molecular dynamics (MD) simulations provide a wealth of high-dimensional data at all-atom and femtosecond resolution but deciphering mechanistic information from this data is an ongoing challenge in physical chemistry and biophysics. Theoretically speaking, joint probabilities of the equilibrium distribution contain all thermodynamic information, but they prove increasingly difficult to compute and interpret as the dimensionality increases. Here, inspired by tools in probabilistic graphical modeling, we develop a factor graph trained through belief propagation that helps factorize the joint probability into an approximate tractable form that can be easily visualized and used. We validate the study through the analysis of the conformational dynamics of two small peptides with five and nine residues. Our validations include testing the conditional dependency predictions through an intervention scheme inspired by Judea Pearl. Second, we directly use the belief propagation-based approximate probability distribution as a high-dimensional static bias for enhanced sampling, where we achieve spontaneous back-and-forth motion between metastable states that is up to 350 times faster than unbiased MD. We believe this work opens up useful ways to thinking about and dealing with high-dimensional molecular simulations.
Collapse
Affiliation(s)
- Zachary Smith
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
| |
Collapse
|
16
|
Roux B. String Method with Swarms-of-Trajectories, Mean Drifts, Lag Time, and Committor. J Phys Chem A 2021; 125:7558-7571. [PMID: 34406010 PMCID: PMC8419867 DOI: 10.1021/acs.jpca.1c04110] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2021] [Revised: 07/26/2021] [Indexed: 11/29/2022]
Abstract
The kinetics of a dynamical system comprising two metastable states is formulated in terms of a finite-time propagator in phase space (position and velocity) adapted to the underdamped Langevin equation. Dimensionality reduction to a subspace of collective variables yields familiar expressions for the propagator, committor, and steady-state flux. A quadratic expression for the steady-state flux between the two metastable states can serve as a robust variational principle to determine an optimal approximate committor expressed in terms of a set of collective variables. The theoretical formulation is exploited to clarify the foundation of the string method with swarms-of-trajectories, which relies on the mean drift of short trajectories to determine the optimal transition pathway. It is argued that the conditions for Markovity within a subspace of collective variables may not be satisfied with an arbitrary short time-step and that proper kinetic behaviors appear only when considering the effective propagator for longer lag times. The effective propagator with finite lag time is amenable to an eigenvalue-eigenvector spectral analysis, as elaborated previously in the context of position-based Markov models. The time-correlation functions calculated by swarms-of-trajectories along the string pathway constitutes a natural extension of these developments. The present formulation provides a powerful theoretical framework to characterize the optimal pathway between two metastable states of a system.
Collapse
Affiliation(s)
- Benoît Roux
- Department
of Biochemistry and Molecular Biology, The
University of Chicago, Chicago, Illinois 60637, United States
- Department
of Chemistry, The University of Chicago, 5735 S. Ellis Avenue, Chicago, Illinois 60637, United States
| |
Collapse
|
17
|
Blow KE, Quigley D, Sosso GC. The seven deadly sins: When computing crystal nucleation rates, the devil is in the details. J Chem Phys 2021; 155:040901. [PMID: 34340373 DOI: 10.1063/5.0055248] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The formation of crystals has proven to be one of the most challenging phase transformations to quantitatively model-let alone to actually understand-be it by means of the latest experimental technique or the full arsenal of enhanced sampling approaches at our disposal. One of the most crucial quantities involved with the crystallization process is the nucleation rate, a single elusive number that is supposed to quantify the average probability for a nucleus of critical size to occur within a certain volume and time span. A substantial amount of effort has been devoted to attempt a connection between the crystal nucleation rates computed by means of atomistic simulations and their experimentally measured counterparts. Sadly, this endeavor almost invariably fails to some extent, with the venerable classical nucleation theory typically blamed as the main culprit. Here, we review some of the recent advances in the field, focusing on a number of perhaps more subtle details that are sometimes overlooked when computing nucleation rates. We believe it is important for the community to be aware of the full impact of aspects, such as finite size effects and slow dynamics, that often introduce inconspicuous and yet non-negligible sources of uncertainty into our simulations. In fact, it is key to obtain robust and reproducible trends to be leveraged so as to shed new light on the kinetics of a process, that of crystal nucleation, which is involved into countless practical applications, from the formulation of pharmaceutical drugs to the manufacturing of nano-electronic devices.
Collapse
Affiliation(s)
- Katarina E Blow
- Department of Physics, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - David Quigley
- Department of Physics, University of Warwick, Coventry CV4 7AL, United Kingdom
| | - Gabriele C Sosso
- Department of Chemistry, University of Warwick, Coventry CV4 7AL, United Kingdom
| |
Collapse
|
18
|
Giberti F, Tribello GA, Ceriotti M. Global Free-Energy Landscapes as a Smoothly Joined Collection of Local Maps. J Chem Theory Comput 2021; 17:3292-3308. [PMID: 34003008 DOI: 10.1021/acs.jctc.0c01177] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Enhanced sampling techniques have become an essential tool in computational chemistry and physics, where they are applied to sample activated processes that occur on a time scale that is inaccessible to conventional simulations. Despite their popularity, it is well known that they have constraints that hinder their application to complex problems. The core issue lies in the need to describe the system using a small number of collective variables (CVs). Any slow degree of freedom that is not properly described by the chosen CVs will hinder sampling efficiency. However, the exploration of configuration space is also hampered by including variables that are not relevant for the activated process under study. This paper presents the Adaptive Topography of Landscape for Accelerated Sampling (ATLAS), a new biasing method capable of working with many CVs. The root idea of ATLAS is to apply a divide-and-conquer strategy, where the high-dimensional CVs space is divided into basins, each of which is described by an automatically determined, low-dimensional set of variables. A well-tempered metadynamics-like bias is constructed as a function of these local variables. Indicator functions associated with the basins switch on and off the local biases so that the sampling is performed on a collection of low-dimensional CV spaces that are smoothly combined to generate an effectively high-dimensional bias. The unbiased Boltzmann distribution is recovered through reweighing, making the evaluation of conformational and thermodynamic properties straightforward. The decomposition of the free-energy landscape in local basins can be updated iteratively as the simulation discovers new (meta)stable states.
Collapse
Affiliation(s)
- F Giberti
- Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - G A Tribello
- Atomistic Simulation Centre, School of Mathematics and Physics, Queen's University Belfast, Belfast BT14 7EN, United Kingdom
| | - M Ceriotti
- Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
19
|
Pant S, Smith Z, Wang Y, Tajkhorshid E, Tiwary P. Confronting pitfalls of AI-augmented molecular dynamics using statistical physics. J Chem Phys 2020; 153:234118. [PMID: 33353347 PMCID: PMC7863682 DOI: 10.1063/5.0030931] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 11/29/2020] [Indexed: 12/31/2022] Open
Abstract
Artificial intelligence (AI)-based approaches have had indubitable impact across the sciences through the ability to extract relevant information from raw data. Recently, AI has also found use in enhancing the efficiency of molecular simulations, wherein AI derived slow modes are used to accelerate the simulation in targeted ways. However, while typical fields where AI is used are characterized by a plethora of data, molecular simulations, per construction, suffer from limited sampling and thus limited data. As such, the use of AI in molecular simulations can suffer from a dangerous situation where the AI-optimization could get stuck in spurious regimes, leading to incorrect characterization of the reaction coordinate (RC) for the problem at hand. When such an incorrect RC is then used to perform additional simulations, one could start to deviate progressively from the ground truth. To deal with this problem of spurious AI-solutions, here, we report a novel and automated algorithm using ideas from statistical mechanics. It is based on the notion that a more reliable AI-solution will be one that maximizes the timescale separation between slow and fast processes. To learn this timescale separation even from limited data, we use a maximum caliber-based framework. We show the applicability of this automatic protocol for three classic benchmark problems, namely, the conformational dynamics of a model peptide, ligand-unbinding from a protein, and folding/unfolding energy landscape of the C-terminal domain of protein G. We believe that our work will lead to increased and robust use of trustworthy AI in molecular simulations of complex systems.
Collapse
Affiliation(s)
- Shashank Pant
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | | | | | - Emad Tajkhorshid
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | | |
Collapse
|
20
|
Ray D, Gokey T, Mobley DL, Andricioaei I. Kinetics and free energy of ligand dissociation using weighted ensemble milestoning. J Chem Phys 2020; 153:154117. [PMID: 33092382 DOI: 10.1063/5.0021953] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We consider the recently developed weighted ensemble milestoning (WEM) scheme [D. Ray and I. Andricioaei, J. Chem. Phys. 152, 234114 (2020)] and test its capability of simulating ligand-receptor dissociation dynamics. We performed WEM simulations on the following host-guest systems: Na+/Cl- ion pair and 4-hydroxy-2-butanone ligand with FK506 binding protein. As a proof of principle, we show that the WEM formalism reproduces the Na+/Cl- ion pair dissociation timescale and the free energy profile obtained from long conventional MD simulation. To increase the accuracy of WEM calculations applied to kinetics and thermodynamics in protein-ligand binding, we introduced a modified WEM scheme called weighted ensemble milestoning with restraint release (WEM-RR), which can increase the number of starting points per milestone without adding additional computational cost. WEM-RR calculations obtained a ligand residence time and binding free energy in agreement with experimental and previous computational results. Moreover, using the milestoning framework, the binding time and rate constants, dissociation constants, and committor probabilities could also be calculated at a low computational cost. We also present an analytical approach for estimating the association rate constant (kon) when binding is primarily diffusion driven. We show that the WEM method can efficiently calculate multiple experimental observables describing ligand-receptor binding/unbinding and is a promising candidate for computer-aided inhibitor design.
Collapse
Affiliation(s)
- Dhiman Ray
- Department of Chemistry, University of California Irvine, Irvine, California 92697, USA
| | - Trevor Gokey
- Department of Chemistry, University of California Irvine, Irvine, California 92697, USA
| | - David L Mobley
- Department of Chemistry, University of California Irvine, Irvine, California 92697, USA
| | - Ioan Andricioaei
- Department of Chemistry, University of California Irvine, Irvine, California 92697, USA
| |
Collapse
|
21
|
Tsai ST, Kuo EJ, Tiwary P. Learning molecular dynamics with simple language model built upon long short-term memory neural network. Nat Commun 2020; 11:5115. [PMID: 33037228 PMCID: PMC7547727 DOI: 10.1038/s41467-020-18959-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 09/23/2020] [Indexed: 12/04/2022] Open
Abstract
Recurrent neural networks have led to breakthroughs in natural language processing and speech recognition. Here we show that recurrent networks, specifically long short-term memory networks can also capture the temporal evolution of chemical/biophysical trajectories. Our character-level language model learns a probabilistic model of 1-dimensional stochastic trajectories generated from higher-dimensional dynamics. The model captures Boltzmann statistics and also reproduces kinetics across a spectrum of timescales. We demonstrate how training the long short-term memory network is equivalent to learning a path entropy, and that its embedding layer, instead of representing contextual meaning of characters, here exhibits a nontrivial connectivity between different metastable states in the underlying physical system. We demonstrate our model's reliability through different benchmark systems and a force spectroscopy trajectory for multi-state riboswitch. We anticipate that our work represents a stepping stone in the understanding and use of recurrent neural networks for understanding the dynamics of complex stochastic molecular systems.
Collapse
Affiliation(s)
- Sun-Ting Tsai
- Department of Physics and Institute for Physical Science and Technology, University of Maryland, College Park, MD, 20742, USA
| | - En-Jui Kuo
- Department of Physics and Joint Quantum Institute, University of Maryland, College Park, MD, 20742, USA
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, MD, 20742, USA.
| |
Collapse
|
22
|
Smith Z, Ravindra P, Wang Y, Cooley R, Tiwary P. Discovering Protein Conformational Flexibility through Artificial-Intelligence-Aided Molecular Dynamics. J Phys Chem B 2020; 124:8221-8229. [DOI: 10.1021/acs.jpcb.0c03985] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
23
|
Abstract
Ever since Clausius in 1865 and Boltzmann in 1877, the concepts of entropy and of its maximization have been the foundations for predicting how material equilibria derive from microscopic properties. But, despite much work, there has been no equally satisfactory general variational principle for nonequilibrium situations. However, in 1980, a new avenue was opened by E.T. Jaynes and by Shore and Johnson. We review here maximum caliber, which is a maximum-entropy-like principle that can infer distributions of flows over pathways, given dynamical constraints. This approach is providing new insights, particularly into few-particle complex systems, such as gene circuits, protein conformational reaction coordinates, network traffic, bird flocking, cell motility, and neuronal firing.
Collapse
Affiliation(s)
- Kingshuk Ghosh
- Department of Physics and Astronomy, University of Denver, Denver, Colorado 80209, USA
| | - Purushottam D. Dixit
- Department of Systems Biology, Columbia University, New York, NY 10032, USA,Department of Physics, University of Florida, Gainesville, Florida 32611, USA
| | - Luca Agozzino
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, USA
| | - Ken A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, USA
| |
Collapse
|
24
|
Armacost KA, Riniker S, Cournia Z. Novel Directions in Free Energy Methods and Applications. J Chem Inf Model 2020; 60:1-5. [DOI: 10.1021/acs.jcim.9b01174] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Affiliation(s)
- Kira A. Armacost
- Computational and Structural Chemistry, MRL, Merck & Co., Inc. West Point, Pennsylvania 19486, United States
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Zoe Cournia
- Biomedical Research Foundation Academy of Athens, Soranou Ephessiou 4, 11527 Athens, Greece
| |
Collapse
|
25
|
Wang Y, Lamim Ribeiro JM, Tiwary P. Machine learning approaches for analyzing and enhancing molecular dynamics simulations. Curr Opin Struct Biol 2020; 61:139-145. [PMID: 31972477 DOI: 10.1016/j.sbi.2019.12.016] [Citation(s) in RCA: 141] [Impact Index Per Article: 28.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Revised: 12/16/2019] [Accepted: 12/26/2019] [Indexed: 10/25/2022]
Abstract
Molecular dynamics (MD) has become a powerful tool for studying biophysical systems, due to increasing computational power and availability of software. Although MD has made many contributions to better understanding these complex biophysical systems, there remain methodological difficulties to be surmounted. First, how to make the deluge of data generated in running even a microsecond long MD simulation human comprehensible. Second, how to efficiently sample the underlying free energy surface and kinetics. In this short perspective, we summarize machine learning based ideas that are solving both of these limitations, with a focus on their key theoretical underpinnings and remaining challenges.
Collapse
Affiliation(s)
- Yihang Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, USA
| | - João Marcelo Lamim Ribeiro
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, Box 1677, New York, NY 10029, USA
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
26
|
Affiliation(s)
- Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
- Department of Physics, Freie Universität Berlin, Berlin, Germany
| | - Edina Rosta
- Department of Chemistry, Kings College London, London, England
| |
Collapse
|
27
|
Tsai ST, Smith Z, Tiwary P. Reaction coordinates and rate constants for liquid droplet nucleation: Quantifying the interplay between driving force and memory. J Chem Phys 2019; 151:154106. [PMID: 31640371 DOI: 10.1063/1.5124385] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
In this work, we revisit the classic problem of homogeneous nucleation of a liquid droplet in a supersaturated vapor phase. We consider this at different extents of the driving force, or equivalently the supersaturation, and calculate a reaction coordinate (RC) for nucleation as the driving force is varied. The RC is constructed as a linear combination of three order parameters, where one accounts for the number of liquidlike atoms and the other two for local density fluctuations. The RC is calculated from biased and unbiased molecular dynamics (MD) simulations using the spectral gap optimization approach "SGOOP" [P. Tiwary and B. J. Berne, Proc. Natl. Acad. Sci. U. S. A. 113, 2839 (2016)]. Our key finding is that as the supersaturation decreases, the RC ceases to simply be the number of liquidlike atoms, and instead, it becomes important to explicitly consider local density fluctuations that correlate with shape and density variations in the nucleus. All three order parameters are found to have similar barriers in their respective potentials of mean force; however, as the supersaturation decreases, the density fluctuations decorrelate slower and thus carry longer memory. Thus, at lower supersaturations, density fluctuations are non-Markovian and cannot be simply ignored from the RC by virtue of being noise. Finally, we use this optimized RC to calculate nucleation rates in the infrequent metadynamics framework and show that it leads to a more accurate estimate of the nucleation rate with four orders of magnitude acceleration relative to unbiased MD.
Collapse
Affiliation(s)
- Sun-Ting Tsai
- Department of Physics and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
| | - Zachary Smith
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
| |
Collapse
|
28
|
Past-future information bottleneck for sampling molecular reaction coordinate simultaneously with thermodynamics and kinetics. Nat Commun 2019; 10:3573. [PMID: 31395868 PMCID: PMC6687748 DOI: 10.1038/s41467-019-11405-4] [Citation(s) in RCA: 99] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 07/10/2019] [Indexed: 02/06/2023] Open
Abstract
The ability to rapidly learn from high-dimensional data to make reliable bets about the future is crucial in many contexts. This could be a fly avoiding predators, or the retina processing gigabytes of data to guide human actions. In this work we draw parallels between these and the efficient sampling of biomolecules with hundreds of thousands of atoms. For this we use the Predictive Information Bottleneck framework used for the first two problems, and re-formulate it for the sampling of biomolecules, especially when plagued with rare events. Our method uses a deep neural network to learn the minimally complex yet most predictive aspects of a given biomolecular trajectory. This information is used to perform iteratively biased simulations that enhance the sampling and directly obtain associated thermodynamic and kinetic information. We demonstrate the method on two test-pieces, studying processes slower than milliseconds, calculating free energies, kinetics and critical mutations. Efficient sampling of rare events in all-atom molecular dynamics simulations remains a challenge. Here, the authors adapt the Predictive Information Bottleneck framework to sample biomolecular structure and dynamics through iterative rounds of biased simulations and deep learning.
Collapse
|
29
|
Pramanik D, Smith Z, Kells A, Tiwary P. Can One Trust Kinetic and Thermodynamic Observables from Biased Metadynamics Simulations?: Detailed Quantitative Benchmarks on Millimolar Drug Fragment Dissociation. J Phys Chem B 2019; 123:3672-3678. [DOI: 10.1021/acs.jpcb.9b01813] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Debabrata Pramanik
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
| | - Zachary Smith
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
| | - Adam Kells
- Department of Chemistry, King’s College London, SE1 1DB, London, U.K
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
| |
Collapse
|
30
|
Dixit PD, Dill KA. Building Markov state models using optimal transport theory. J Chem Phys 2019; 150:054105. [DOI: 10.1063/1.5086681] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Affiliation(s)
- Purushottam D. Dixit
- Department of Systems Biology, Columbia University, New York, New York 10032, USA
| | - Ken A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794, USA
- Department of Chemistry, Stony Brook University, Stony Brook, New York 11794, USA
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, New York 11794, USA
| |
Collapse
|
31
|
Lamim Ribeiro JM, Tiwary P. Toward Achieving Efficient and Accurate Ligand-Protein Unbinding with Deep Learning and Molecular Dynamics through RAVE. J Chem Theory Comput 2018; 15:708-719. [PMID: 30525598 DOI: 10.1021/acs.jctc.8b00869] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
In this work, we demonstrate how to leverage our recent iterative deep learning-all atom molecular dynamics (MD) technique "Reweighted autoencoded variational Bayes for enhanced sampling (RAVE)" (Ribeiro, Bravo, Wang, Tiwary, J. Chem. Phys. 2018, 149, 072301) for investigating ligand-protein unbinding mechanisms and calculating absolute binding free energies, Δ Gb, when plagued with difficult to sample rare events. In order to do so, we introduce a simple but powerful extension to RAVE that allows learning a reaction coordinate expressed as a piecewise function that is linear over all intervals. Such an approach allows us to retain the physical interpretation of a RAVE-derived reaction coordinate while making the method more applicable to a wider range of complex biophysical problems. As we will demonstrate, using as our test-case the slow dissociation of benzene from the L99A variant of lysozyme, the RAVE extension led to observing an unbinding event in 100% of the independent all-atom MD simulations, all within 3-50 ns for a process that takes on an average close to few hundred milliseconds, which reflects a 7 orders of magnitude acceleration relative to straightforward MD. Furthermore, we will show that without the use of time-dependent biasing, clear back-and-forth movement between metastable intermediates was achieved during the various simulations, demonstrating the caliber of the RAVE-derived piecewise reaction coordinate and bias potential, which together drive efficient and accurate sampling of the ligand-protein dissociation event. Last, we report the results for Δ Gb, which via very short MD simulations, can form a strict lower-bound that is ∼2-3 kcal/mol off from experiments. We believe that RAVE, together with its multidimensional extension that we introduce here, will be a useful tool for simulating the slow unbinding process of practical ligand-protein complexes in an automated manner with minimal use of human intuition.
Collapse
Affiliation(s)
- João Marcelo Lamim Ribeiro
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology , University of Maryland , Maryland , College Park 20742 , United States
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology , University of Maryland , Maryland , College Park 20742 , United States
| |
Collapse
|