1
|
Bose S, Kilinc C, Dickson A. Markov State Models with Weighted Ensemble Simulation: How to Eliminate the Trajectory Merging Bias. J Chem Theory Comput 2025; 21:1805-1816. [PMID: 39933004 PMCID: PMC11866749 DOI: 10.1021/acs.jctc.4c01141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2024] [Revised: 01/20/2025] [Accepted: 01/30/2025] [Indexed: 02/13/2025]
Abstract
The weighted ensemble (WE) algorithm is gaining popularity as a rare event method for studying long timescale processes with molecular dynamics. WE is particularly useful for determining kinetic properties, such as rates of protein (un)folding and ligand (un)binding, where transition rates can be calculated from the flux of trajectories into a target basin of interest. However, this flux depends exponentially on the number of splitting events that a given trajectory experiences before reaching the target state and can vary by orders of magnitude between WE replicates. Markov state models (MSMs) are helpful tools to aggregate information across multiple WE simulations and have previously been shown to provide more accurate transition rates than WE alone. Discrete-time MSMs are models that coarsely describe the evolution of the system from one discrete state to the next using a discrete lag time, τ. When an MSM is built using conventional MD data, longer values of τ typically provide more accurate results. Combining WE simulations with Markov state modeling presents some additional challenges, especially when using a value of τ that exceeds the lag time between resampling steps in the WE algorithm, τWE. Here, we identify a source of bias that occurs when τ > τWE, which we refer to as "merging bias". We also propose an algorithm to eliminate the merging bias, which results in merging bias-corrected MSMs, or "MBC-MSMs". Using a simple model system, as well as a complex biomolecular example, we show that MBC-MSMs significantly outperform both τ = τWE MSMs and uncorrected MSMs at longer lag times.
Collapse
Affiliation(s)
- Samik Bose
- Department
of Biochemistry and Molecular Biology, Michigan
State University, East Lansing, Michigan 48824, United States
- Department
of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, United States
| | - Ceren Kilinc
- Department
of Biochemistry and Molecular Biology, Michigan
State University, East Lansing, Michigan 48824, United States
| | - Alex Dickson
- Department
of Biochemistry and Molecular Biology, Michigan
State University, East Lansing, Michigan 48824, United States
- Department
of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, United States
| |
Collapse
|
2
|
Mitra S, Biswas R, Chakrabarty S. WeTICA: A directed search weighted ensemble based enhanced sampling method to estimate rare event kinetics in a reduced dimensional space. J Chem Phys 2025; 162:034106. [PMID: 39812249 DOI: 10.1063/5.0239713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Accepted: 12/30/2024] [Indexed: 01/16/2025] Open
Abstract
Estimating rare event kinetics from molecular dynamics simulations is a non-trivial task despite the great advances in enhanced sampling methods. Weighted Ensemble (WE) simulation, a special class of enhanced sampling techniques, offers a way to directly calculate kinetic rate constants from biased trajectories without the need to modify the underlying energy landscape using bias potentials. Conventional WE algorithms use different binning schemes to partition the collective variable (CV) space separating the two metastable states of interest. In this work, we have developed a new "binless" WE simulation algorithm to bypass the hurdles of optimizing binning procedures. Our proposed protocol (WeTICA) uses a low-dimensional CV space to drive the WE simulation toward the specified target state. We have applied this new algorithm to recover the unfolding kinetics of three proteins: (A) TC5b Trp-cage mutant, (B) TC10b Trp-cage mutant, and (C) Protein G, with unfolding times spanning the range between 3 and 40 μs using projections along predefined fixed Time-lagged Independent Component Analysis (TICA) eigenvectors as CVs. Calculated unfolding times converge to the reported values with good accuracy with more than one order of magnitude less cumulative WE simulation time than the unfolding time scales with or without a priori knowledge of the CVs that can capture unfolding. Our algorithm can be used with other linear CVs, not limited to TICA. Moreover, the new walker selection criteria for resampling employed in this algorithm can be used on more sophisticated nonlinear CV space for further improvements of binless WE methods.
Collapse
Affiliation(s)
- Sudipta Mitra
- Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Block-JD, Sector-III, Salt Lake, Kolkata 700106, India
| | - Ranjit Biswas
- Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Block-JD, Sector-III, Salt Lake, Kolkata 700106, India
| | - Suman Chakrabarty
- Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Block-JD, Sector-III, Salt Lake, Kolkata 700106, India
| |
Collapse
|
3
|
Wang D, Tiwary P. Augmenting Human Expertise in Weighted Ensemble Simulations through Deep Learning-Based Information Bottleneck. J Chem Theory Comput 2024; 20:10371-10383. [PMID: 39589127 DOI: 10.1021/acs.jctc.4c00919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2024]
Abstract
The weighted ensemble (WE) method stands out as a widely used segment-based sampling technique renowned for its rigorous treatment of kinetics. The WE framework typically involves initially mapping the configuration space onto a low-dimensional collective variable (CV) space and then partitioning it into bins. The efficacy of WE simulations heavily depends on the selection of CVs and binning schemes. The recently proposed state predictive information bottleneck (SPIB) method has emerged as a promising tool for automatically constructing CVs from data and guiding enhanced sampling through an iterative manner. In this work, we advance this data-driven pipeline by incorporating prior expert knowledge. Our hybrid approach combines SPIB-learned CVs to enhance sampling in explored regions with expert-based CVs to guide exploration in regions of interest, synergizing the strengths of both methods. Through benchmarking on alanine dipeptide and chignolin systems, we demonstrate that our hybrid approach effectively guides WE simulations to sample states of interest and reduces run-to-run variances. Moreover, our integration of the SPIB model also enhances the analysis and interpretation of WE simulation data by effectively identifying metastable states and pathways and offering direct visualization of dynamics.
Collapse
Affiliation(s)
- Dedi Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
- University of Maryland Institute for Health Computing, Bethesda, Maryland 20852, United States
| |
Collapse
|
4
|
Cummins D, Longstreth C, McCarty J. Analysis of transition rates from variational flooding using analytical theory. J Chem Phys 2024; 161:194104. [PMID: 39555758 DOI: 10.1063/5.0238289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Accepted: 10/31/2024] [Indexed: 11/19/2024] Open
Abstract
Variational flooding is an enhanced sampling method for obtaining kinetic rates from molecular dynamics simulations. This method is inspired by the idea of conformational flooding that employs a boost potential acting along a chosen reaction coordinate to accelerate rare events. In this work, we show how the empirical distribution of crossing times from variational flooding simulations can be modeled with analytical Kramers' time-dependent rate (KTR) theory. An optimized bias potential that fills metastable free energy basins is constructed from the variationally enhanced sampling (VES) method. This VES-derived flooding potential is then augmented by a switching function that determines the fill level of the boost. Having a prescribed time-dependent fill rate of the flooding potential gives an analytical expression for the distribution of crossing times from KTR theory that is used to extract unbiased rates. In the case of a static boost potential, the distribution of barrier crossing times follows an expected exponential distribution, and unbiased rates are extracted from a series of boosted simulations at discrete fill levels. Introducing a time-dependent boost that increases the fill level gradually over the simulation time leads to a simplified procedure for fitting the biased distribution of crossing times to analytical theory. We demonstrate the approach for the paradigmatic cases of alanine dipeptide in vacuum, the asymmetric SN2 reaction, and the folding of chignolin in explicit solvent.
Collapse
Affiliation(s)
- David Cummins
- Department of Chemistry, Western Washington University, Bellingham, Washington 98225, USA
| | - Carter Longstreth
- Department of Chemistry, Western Washington University, Bellingham, Washington 98225, USA
| | - James McCarty
- Department of Chemistry, Western Washington University, Bellingham, Washington 98225, USA
| |
Collapse
|
5
|
Wang D, Tiwary P. Augmenting Human Expertise in Weighted Ensemble Simulations through Deep Learning based Information Bottleneck. ARXIV 2024:arXiv:2406.14839v2. [PMID: 38947925 PMCID: PMC11213147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
The weighted ensemble (WE) method stands out as a widely used segment-based sampling technique renowned for its rigorous treatment of kinetics. The WE framework typically involves initially mapping the configuration space onto a low-dimensional collective variable (CV) space and then partitioning it into bins. The efficacy of WE simulations heavily depends on the selection of CVs and binning schemes. The recently proposed State Predictive Information Bottleneck (SPIB) method has emerged as a promising tool for automatically constructing CVs from data and guiding enhanced sampling through an iterative manner. In this work, we advance this data-driven pipeline by incorporating prior expert knowledge. Our hybrid approach combines SPIB-learned CVs to enhance sampling in explored regions with expert-based CVs to guide exploration in regions of interest, synergizing the strengths of both methods. Through benchmarking on alanine dipeptide and chignoin systems, we demonstrate that our hybrid approach effectively guides WE simulations to sample states of interest, and reduces run-to-run variances. Moreover, our integration of the SPIB model also enhances the analysis and interpretation of WE simulation data by effectively identifying metastable states and pathways, and offering direct visualization of dynamics.
Collapse
Affiliation(s)
- Dedi Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
- University of Maryland Institute for Health Computing, Bethesda 20852, USA
| |
Collapse
|
6
|
Strahan J, Lorpaiboon C, Weare J, Dinner AR. BAD-NEUS: Rapidly converging trajectory stratification. J Chem Phys 2024; 161:084109. [PMID: 39185846 PMCID: PMC11349377 DOI: 10.1063/5.0215975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 07/25/2024] [Indexed: 08/27/2024] Open
Abstract
An issue for molecular dynamics simulations is that events of interest often involve timescales that are much longer than the simulation time step, which is set by the fastest timescales of the model. Because of this timescale separation, direct simulation of many events is prohibitively computationally costly. This issue can be overcome by aggregating information from many relatively short simulations that sample segments of trajectories involving events of interest. This is the strategy of Markov state models (MSMs) and related approaches, but such methods suffer from approximation error because the variables defining the states generally do not capture the dynamics fully. By contrast, once converged, the weighted ensemble (WE) method aggregates information from trajectory segments so as to yield unbiased estimates of both thermodynamic and kinetic statistics. Unfortunately, errors decay no faster than unbiased simulation in WE as originally formulated and commonly deployed. Here, we introduce a theoretical framework for describing WE that shows that the introduction of an approximate stationary distribution on top of the stratification, as in nonequilibrium umbrella sampling (NEUS), accelerates convergence. Building on ideas from MSMs and related methods, we generalize the NEUS approach in such a way that the approximation error can be reduced systematically. We show that the improved algorithm can decrease the simulation time required to achieve the desired precision by orders of magnitude.
Collapse
Affiliation(s)
- John Strahan
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Chatipat Lorpaiboon
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
7
|
Yang D, Chong LT. WEDAP: A Python Package for Streamlined Plotting of Molecular Simulation Data. J Chem Inf Model 2024; 64:5749-5755. [PMID: 39013164 PMCID: PMC11323263 DOI: 10.1021/acs.jcim.4c00867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Revised: 07/01/2024] [Accepted: 07/06/2024] [Indexed: 07/18/2024]
Abstract
Given the growing interest in path sampling methods for extending the time scales of molecular dynamics (MD) simulations, there has been great interest in software tools that streamline the generation of plots for monitoring the progress of large-scale simulations. Here, we present the WEDAP Python package for simplifying the analysis of data generated from either conventional MD simulations or the weighted ensemble (WE) path sampling method, as implemented in the widely used WESTPA software package. WEDAP facilitates (i) the parsing of WE simulation data stored in highly compressed, hierarchical HDF5 files and (ii) incorporates trajectory weights from WE simulations into all generated plots. Our Python package consists of multiple user-friendly interfaces: a command-line interface, a graphical user interface, and a Python application programming interface. We demonstrate the plotting features of WEDAP through a series of examples using data from WE and conventional MD simulations that focus on the HIV-1 capsid protein's C-terminal domain dimer as a showcase system. The source code for WEDAP is freely available on GitHub at https://github.com/chonglab-pitt/wedap.
Collapse
Affiliation(s)
- Darian
T. Yang
- Molecular
Biophysics and Structural Biology Graduate Program, University of Pittsburgh and Carnegie Mellon University, Pittsburgh, Pennsylvania 15260, United States
- Department
of Structural Biology, University of Pittsburgh
School of Medicine, Pittsburgh, Pennsylvania 15260, United States
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Lillian T. Chong
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
8
|
Banerjee S, Chowdhury D, Chakraborty S, Haldar S. Force-regulated chaperone activity of BiP/ERdj3 is opposite to their homologs DnaK/DnaJ. Protein Sci 2024; 33:e5068. [PMID: 38864739 PMCID: PMC11168073 DOI: 10.1002/pro.5068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 04/17/2024] [Accepted: 05/20/2024] [Indexed: 06/13/2024]
Abstract
Polypeptide chains experience mechanical tension while translocating through cellular tunnels, which are subsequently folded by molecular chaperones. However, interactions between tunnel-associated chaperones and these emerging polypeptides under force is not completely understood. Our investigation focused on mechanical chaperone activity of two tunnel-associated chaperones, BiP and ERdj3 both with and without mechanical constraints and comparing them with their cytoplasmic homologs: DnaK and DnaJ. While BiP/ERdj3 have been observed to exhibit robust foldase activity under force, DnaK/DnaJ showed holdase function. Importantly, the tunnel-associated chaperones (BiP/ERdj3) transitioned to a holdase state in the absence of force, indicating a force-dependent chaperone behavior. This chaperone-driven folding event in the tunnel generated an additional mechanical energy of up to 54 zJ, potentially aiding protein translocation. Our findings align with strain theory, where chaperones with higher intrinsic deformability act as mechanical foldases (BiP, ERdj3), while those with lower deformability serve as holdases (DnaK and DnaJ). This study thus elucidates the differential mechanically regulated chaperoning activity and introduces a novel perspective on co-translocational protein folding.
Collapse
Affiliation(s)
- Souradeep Banerjee
- Department of BiologyTrivedi School of Biosciences, Ashoka UniversitySonepatHaryanaIndia
| | - Debojyoti Chowdhury
- Department of Chemical and Biological SciencesS.N. Bose National Center for Basic SciencesKolkataWest BengalIndia
| | - Soham Chakraborty
- Department of BiologyTrivedi School of Biosciences, Ashoka UniversitySonepatHaryanaIndia
| | - Shubhasis Haldar
- Department of BiologyTrivedi School of Biosciences, Ashoka UniversitySonepatHaryanaIndia
- Department of Chemical and Biological SciencesS.N. Bose National Center for Basic SciencesKolkataWest BengalIndia
- Technical Research Centre, S.N. Bose National Centre for Basic SciencesKolkataWest BengalIndia
| |
Collapse
|
9
|
Bose S, Lotz SD, Deb I, Shuck M, Lee KSS, Dickson A. How Robust Is the Ligand Binding Transition State? J Am Chem Soc 2023; 145:25318-25331. [PMID: 37943667 PMCID: PMC11059145 DOI: 10.1021/jacs.3c08940] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023]
Abstract
For many drug targets, it has been shown that the kinetics of drug binding (e.g., on rate and off rate) is more predictive of drug efficacy than thermodynamic quantities alone. This motivates the development of predictive computational models that can be used to optimize compounds on the basis of their kinetics. The structural details underpinning these computational models are found not only in the bound state but also in the short-lived ligand binding transition states. Although transition states cannot be directly observed experimentally due to their extremely short lifetimes, recent successes have demonstrated that modeling the ligand binding transition state is possible with the help of enhanced sampling molecular dynamics methods. Previously, we generated unbinding paths for an inhibitor of soluble epoxide hydrolase (sEH) with a residence time of 11 min. Here, we computationally modeled unbinding events with the weighted ensemble method REVO (resampling of ensembles by variation optimization) for five additional inhibitors of sEH with residence times ranging from 14.25 to 31.75 min, with average prediction accuracy within an order of magnitude. The unbinding ensembles are analyzed in detail, focusing on features of the ligand binding transition state ensembles (TSEs). We find that ligands with similar bound poses can show significant differences in their ligand binding TSEs, in terms of their spatial distribution and protein-ligand interactions. However, we also find similarities across the TSEs when examining more general features such as ligand degrees of freedom. Together these findings show significant challenges for rational, kinetics-based drug design.
Collapse
Affiliation(s)
- Samik Bose
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Samuel D Lotz
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Indrajit Deb
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Megan Shuck
- Department of Pharmacology and Toxicology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Kin Sing Stephen Lee
- Department of Pharmacology and Toxicology, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, United States
- Institute of Integrative Toxicology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Alex Dickson
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, United States
| |
Collapse
|
10
|
Ray D, Parrinello M. Kinetics from Metadynamics: Principles, Applications, and Outlook. J Chem Theory Comput 2023; 19:5649-5670. [PMID: 37585703 DOI: 10.1021/acs.jctc.3c00660] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Metadynamics is a popular enhanced sampling algorithm for computing the free energy landscape of rare events by using molecular dynamics simulation. Ten years ago, Tiwary and Parrinello introduced the infrequent metadynamics approach for calculating the kinetics of transitions across free energy barriers. Since then, metadynamics-based methods for obtaining rate constants have attracted significant attention in computational molecular science. Such methods have been applied to study a wide range of problems, including protein-ligand binding, protein folding, conformational transitions, chemical reactions, catalysis, and nucleation. Here, we review the principles of elucidating kinetics from metadynamics-like approaches, subsequent methodological developments in this area, and successful applications on chemical, biological, and material systems. We also highlight the challenges of reconstructing accurate kinetics from enhanced sampling simulations and the scope of future developments.
Collapse
Affiliation(s)
- Dhiman Ray
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Michele Parrinello
- Atomistic Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| |
Collapse
|
11
|
Mostofian B, Martin HJ, Razavi A, Patel S, Allen B, Sherman W, Izaguirre JA. Targeted Protein Degradation: Advances, Challenges, and Prospects for Computational Methods. J Chem Inf Model 2023; 63:5408-5432. [PMID: 37602861 PMCID: PMC10498452 DOI: 10.1021/acs.jcim.3c00603] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Indexed: 08/22/2023]
Abstract
The therapeutic approach of targeted protein degradation (TPD) is gaining momentum due to its potentially superior effects compared with protein inhibition. Recent advancements in the biotech and pharmaceutical sectors have led to the development of compounds that are currently in human trials, with some showing promising clinical results. However, the use of computational tools in TPD is still limited, as it has distinct characteristics compared with traditional computational drug design methods. TPD involves creating a ternary structure (protein-degrader-ligase) responsible for the biological function, such as ubiquitination and subsequent proteasomal degradation, which depends on the spatial orientation of the protein of interest (POI) relative to E2-loaded ubiquitin. Modeling this structure necessitates a unique blend of tools initially developed for small molecules (e.g., docking) and biologics (e.g., protein-protein interaction modeling). Additionally, degrader molecules, particularly heterobifunctional degraders, are generally larger than conventional small molecule drugs, leading to challenges in determining drug-like properties like solubility and permeability. Furthermore, the catalytic nature of TPD makes occupancy-based modeling insufficient. TPD consists of multiple interconnected yet distinct steps, such as POI binding, E3 ligase binding, ternary structure interactions, ubiquitination, and degradation, along with traditional small molecule properties. A comprehensive set of tools is needed to address the dynamic nature of the induced proximity ternary complex and its implications for ubiquitination. In this Perspective, we discuss the current state of computational tools for TPD. We start by describing the series of steps involved in the degradation process and the experimental methods used to characterize them. Then, we delve into a detailed analysis of the computational tools employed in TPD. We also present an integrative approach that has proven successful for degrader design and its impact on project decisions. Finally, we examine the future prospects of computational methods in TPD and the areas with the greatest potential for impact.
Collapse
Affiliation(s)
- Barmak Mostofian
- OpenEye, Cadence Molecular Sciences, Boston, Massachusetts 02114 United States
| | - Holli-Joi Martin
- Laboratory
for Molecular Modeling, Division of Chemical Biology and Medicinal
Chemistry, Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599 United States
| | - Asghar Razavi
- ENKO
Chem, Inc, Mystic, Connecticut 06355 United States
| | - Shivam Patel
- Psivant
Therapeutics, Boston, Massachusetts 02210 United States
| | - Bryce Allen
- Differentiated
Therapeutics, San Diego, California 92056 United States
| | - Woody Sherman
- Psivant
Therapeutics, Boston, Massachusetts 02210 United States
| | - Jesus A Izaguirre
- Differentiated
Therapeutics, San Diego, California 92056 United States
- Atommap
Corporation, New York, New York 10013 United States
| |
Collapse
|
12
|
Mondal A, Lenz S, MacCallum JL, Perez A. Hybrid computational methods combining experimental information with molecular dynamics. Curr Opin Struct Biol 2023; 81:102609. [PMID: 37224642 DOI: 10.1016/j.sbi.2023.102609] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/12/2023] [Accepted: 04/23/2023] [Indexed: 05/26/2023]
Abstract
A goal of structural biology is to understand how macromolecules carry out their biological roles by identifying their metastable states, mechanisms of action, pathways leading to conformational changes, and the thermodynamic and kinetic relationships between those states. Integrative modeling brings structural insights into systems where traditional structure determination approaches cannot help. We focus on the synergies and challenges of integrative modeling combining experimental data with molecular dynamics simulations.
Collapse
Affiliation(s)
- Arup Mondal
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK. https://twitter.com/@amondal_chem
| | - Stefan Lenz
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada
| | - Justin L MacCallum
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada. https://twitter.com/@jlmaccal
| | - Alberto Perez
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK.
| |
Collapse
|
13
|
Dominic AJ, Cao S, Montoya-Castillo A, Huang X. Memory Unlocks the Future of Biomolecular Dynamics: Transformative Tools to Uncover Physical Insights Accurately and Efficiently. J Am Chem Soc 2023; 145:9916-9927. [PMID: 37104720 DOI: 10.1021/jacs.3c01095] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
Conformational changes underpin function and encode complex biomolecular mechanisms. Gaining atomic-level detail of how such changes occur has the potential to reveal these mechanisms and is of critical importance in identifying drug targets, facilitating rational drug design, and enabling bioengineering applications. While the past two decades have brought Markov state model techniques to the point where practitioners can regularly use them to glimpse the long-time dynamics of slow conformations in complex systems, many systems are still beyond their reach. In this Perspective, we discuss how including memory (i.e., non-Markovian effects) can reduce the computational cost to predict the long-time dynamics in these complex systems by orders of magnitude and with greater accuracy and resolution than state-of-the-art Markov state models. We illustrate how memory lies at the heart of successful and promising techniques, ranging from the Fokker-Planck and generalized Langevin equations to deep-learning recurrent neural networks and generalized master equations. We delineate how these techniques work, identify insights that they can offer in biomolecular systems, and discuss their advantages and disadvantages in practical settings. We show how generalized master equations can enable the investigation of, for example, the gate-opening process in RNA polymerase II and demonstrate how our recent advances tame the deleterious influence of statistical underconvergence of the molecular dynamics simulations used to parameterize these techniques. This represents a significant leap forward that will enable our memory-based techniques to interrogate systems that are currently beyond the reach of even the best Markov state models. We conclude by discussing some current challenges and future prospects for how exploiting memory will open the door to many exciting opportunities.
Collapse
Affiliation(s)
- Anthony J Dominic
- Department of Chemistry, University of Colorado Boulder, Boulder, Colorado 80309, USA
| | - Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
14
|
Ojha AA, Thakur S, Ahn SH, Amaro RE. DeepWEST: Deep Learning of Kinetic Models with the Weighted Ensemble Simulation Toolkit for Enhanced Sampling. J Chem Theory Comput 2023; 19:1342-1359. [PMID: 36719802 DOI: 10.1021/acs.jctc.2c00282] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Recent advances in computational power and algorithms have enabled molecular dynamics (MD) simulations to reach greater time scales. However, for observing conformational transitions associated with biomolecular processes, MD simulations still have limitations. Several enhanced sampling techniques seek to address this challenge, including the weighted ensemble (WE) method, which samples transitions between metastable states using many weighted trajectories to estimate kinetic rate constants. However, initial sampling of the potential energy surface has a significant impact on the performance of WE, i.e., convergence and efficiency. We therefore introduce deep-learned kinetic modeling approaches that extract statistically relevant information from short MD trajectories to provide a well-sampled initial state distribution for WE simulations. This hybrid approach overcomes any statistical bias to the system, as it runs short unbiased MD trajectories and identifies meaningful metastable states of the system. It is shown to provide a more refined free energy landscape closer to the steady state that could efficiently sample kinetic properties such as rate constants.
Collapse
Affiliation(s)
- Anupam Anand Ojha
- Department of Chemistry, University of California San Diego, La Jolla, California92093, United States
| | - Saumya Thakur
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai, Maharashtra400076, India
| | - Surl-Hee Ahn
- Department of Chemical Engineering, University of California Davis, Davis, California95616, United States
| | - Rommie E Amaro
- Department of Chemistry, University of California San Diego, La Jolla, California92093, United States
| |
Collapse
|
15
|
Aristoff D, Copperman J, Simpson G, Webber RJ, Zuckerman DM. Weighted ensemble: Recent mathematical developments. J Chem Phys 2023; 158:014108. [PMID: 36610976 PMCID: PMC9822651 DOI: 10.1063/5.0110873] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Accepted: 12/04/2022] [Indexed: 12/12/2022] Open
Abstract
Weighted ensemble (WE) is an enhanced sampling method based on periodically replicating and pruning trajectories generated in parallel. WE has grown increasingly popular for computational biochemistry problems due, in part, to improved hardware and accessible software implementations. Algorithmic and analytical improvements have played an important role, and progress has accelerated in recent years. Here, we discuss and elaborate on the WE method from a mathematical perspective, highlighting recent results that enhance the computational efficiency. The mathematical theory reveals a new strategy for optimizing trajectory management that approaches the best possible variance while generalizing to systems of arbitrary dimension.
Collapse
Affiliation(s)
- D. Aristoff
- Mathematics, Colorado State University, Fort Collins, CO 80521 USA
| | - J. Copperman
- Biomedical Engineering, Oregon Health and Science University, Portland, OR 97239 USA
| | - G. Simpson
- Mathematics, Drexel University, Philadelphia, Pennsylvania 19104 USA
| | - R. J. Webber
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California 91125 USA
| | - D. M. Zuckerman
- Biomedical Engineering, Oregon Health and Science University, Portland, OR 97239 USA
| |
Collapse
|
16
|
Bogetti AT, Leung JMG, Russo JD, Zhang S, Thompson JP, Saglam AS, Ray D, Mostofian B, Pratt AJ, Abraham RC, Harrison PO, Dudek M, Torrillo PA, DeGrave AJ, Adhikari U, Faeder JR, Andricioaei I, Adelman JL, Zwier MC, LeBard DN, Zuckerman DM, Chong LT. A Suite of Tutorials for the WESTPA 2.0 Rare-Events Sampling Software [Article v2.0]. LIVING JOURNAL OF COMPUTATIONAL MOLECULAR SCIENCE 2023; 5:1655. [PMID: 37200895 PMCID: PMC10191340 DOI: 10.33011/livecoms.5.1.1655] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The weighted ensemble (WE) strategy has been demonstrated to be highly efficient in generating pathways and rate constants for rare events such as protein folding and protein binding using atomistic molecular dynamics simulations. Here we present two sets of tutorials instructing users in the best practices for preparing, carrying out, and analyzing WE simulations for various applications using the WESTPA software. The first set of more basic tutorials describes a range of simulation types, from a molecular association process in explicit solvent to more complex processes such as host-guest association, peptide conformational sampling, and protein folding. The second set ecompasses six advanced tutorials instructing users in the best practices of using key new features and plugins/extensions of the WESTPA 2.0 software package, which consists of major upgrades for larger systems and/or slower processes. The advanced tutorials demonstrate the use of the following key features: (i) a generalized resampler module for the creation of "binless" schemes, (ii) a minimal adaptive binning scheme for more efficient surmounting of free energy barriers, (iii) streamlined handling of large simulation datasets using an HDF5 framework, (iv) two different schemes for more efficient rate-constant estimation, (v) a Python API for simplified analysis of WE simulations, and (vi) plugins/extensions for Markovian Weighted Ensemble Milestoning and WE rule-based modeling for systems biology models. Applications of the advanced tutorials include atomistic and non-spatial models, and consist of complex processes such as protein folding and the membrane permeability of a drug-like molecule. Users are expected to already have significant experience with running conventional molecular dynamics or systems biology simulations.
Collapse
Affiliation(s)
| | | | - John D. Russo
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | | | | | - Ali S. Saglam
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA
| | - Dhiman Ray
- Department of Chemistry, University of California Irvine, Irvine, CA
| | - Barmak Mostofian
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - AJ Pratt
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Rhea C. Abraham
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Page O. Harrison
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Max Dudek
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Paul A. Torrillo
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Alex J. DeGrave
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Upendra Adhikari
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - James R. Faeder
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA
| | - Ioan Andricioaei
- Department of Chemistry, University of California Irvine, Irvine, CA
| | - Joshua L. Adelman
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA
| | | | | | - Daniel M. Zuckerman
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| |
Collapse
|
17
|
Dommer A, Casalino L, Kearns F, Rosenfeld M, Wauer N, Ahn SH, Russo J, Oliveira S, Morris C, Bogetti A, Trifan A, Brace A, Sztain T, Clyde A, Ma H, Chennubhotla C, Lee H, Turilli M, Khalid S, Tamayo-Mendoza T, Welborn M, Christensen A, Smith DG, Qiao Z, Sirumalla SK, O'Connor M, Manby F, Anandkumar A, Hardy D, Phillips J, Stern A, Romero J, Clark D, Dorrell M, Maiden T, Huang L, McCalpin J, Woods C, Gray A, Williams M, Barker B, Rajapaksha H, Pitts R, Gibbs T, Stone J, Zuckerman DM, Mulholland AJ, Miller T, Jha S, Ramanathan A, Chong L, Amaro RE. #COVIDisAirborne: AI-enabled multiscale computational microscopy of delta SARS-CoV-2 in a respiratory aerosol. THE INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 2023; 37:28-44. [PMID: 36647365 PMCID: PMC9527558 DOI: 10.1177/10943420221128233] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
We seek to completely revise current models of airborne transmission of respiratory viruses by providing never-before-seen atomic-level views of the SARS-CoV-2 virus within a respiratory aerosol. Our work dramatically extends the capabilities of multiscale computational microscopy to address the significant gaps that exist in current experimental methods, which are limited in their ability to interrogate aerosols at the atomic/molecular level and thus obscure our understanding of airborne transmission. We demonstrate how our integrated data-driven platform provides a new way of exploring the composition, structure, and dynamics of aerosols and aerosolized viruses, while driving simulation method development along several important axes. We present a series of initial scientific discoveries for the SARS-CoV-2 Delta variant, noting that the full scientific impact of this work has yet to be realized.
Collapse
Affiliation(s)
| | | | | | | | | | | | - John Russo
- Oregon Health & Science University, Portland, OR, USA
| | | | | | | | - Anda Trifan
- Argonne National Laboratory, Lemont, IL, USA
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Alexander Brace
- Argonne National Laboratory, Lemont, IL, USA
- University of Chicago, Chicago, IL, USA
| | - Terra Sztain
- UC San Diego, La Jolla, CA, USA
- Freie Universitat Berlin
| | - Austin Clyde
- Argonne National Laboratory, Lemont, IL, USA
- University of Chicago, Chicago, IL, USA
| | - Heng Ma
- Argonne National Laboratory, Lemont, IL, USA
| | | | - Hyungro Lee
- Brookhaven National Lab and Rutgers University
| | | | | | | | | | | | | | - Zhuoran Qiao
- California Institute of Technology, Pasadena, CA, USA
| | | | | | | | - Anima Anandkumar
- California Institute of Technology, Pasadena, CA, USA
- NVIDIA Corp, Santa Clara, CA, USA
| | - David Hardy
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - James Phillips
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | | | | | | | | | - Tom Maiden
- Pittsburgh Supercomputing Center, Pittsburgh, PA, USA
| | - Lei Huang
- Texas Advanced Computing Center, Austin, TX, USA
| | | | | | | | | | | | | | | | | | - John Stone
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
- NVIDIA Corp, Santa Clara, CA, USA
| | | | | | - Thomas Miller
- Entos, Inc., San Diego, CA, USA
- California Institute of Technology, Pasadena, CA, USA
| | | | | | | | | |
Collapse
|
18
|
Chakraborty S, Chaudhuri D, Chaudhuri D, Singh V, Banerjee S, Chowdhury D, Haldar S. Connecting conformational stiffness of the protein with energy landscape by a single experiment. NANOSCALE 2022; 14:7659-7673. [PMID: 35546109 DOI: 10.1039/d1nr07582a] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The structure-function dynamics of a protein as a flexible polymer is essential to describe its biological functions. Here, using single-molecule magnetic tweezers, we have studied the effect of ionic strength on the folding mechanics of protein L, and probed its folding-associated physical properties by re-measuring the same protein in a range of ammonium sulfate concentrations from 150 mM to 650 mM. We observed an electrolyte-dependent conformational dynamics and folding landscape of the protein in a single experiment. Salt increases the refolding kinetics, while decreasing the unfolding kinetics under force, which in turn modifies the barrier heights towards the folded state. Additionally, salt enhances the molecular compaction by decreasing the Kuhn length of the protein polymer from 1.18 nm to 0.58 nm, which we have explained by modifying the freely jointed chain model. Finally, we correlated polymer chain physics to the folding dynamics, and thus provided an analytical framework for understanding compaction-induced folding mechanics across a range of ionic strengths from a single experiment.
Collapse
Affiliation(s)
- Soham Chakraborty
- Department of Biological Sciences, Ashoka University, Sonepat, Haryana 131029, India.
| | - Deep Chaudhuri
- Department of Biological Sciences, Ashoka University, Sonepat, Haryana 131029, India.
| | - Dyuti Chaudhuri
- Department of Biological Sciences, Ashoka University, Sonepat, Haryana 131029, India.
| | - Vihan Singh
- Department of Biological Sciences, Ashoka University, Sonepat, Haryana 131029, India.
| | - Souradeep Banerjee
- Department of Biological Sciences, Ashoka University, Sonepat, Haryana 131029, India.
| | - Debojyoti Chowdhury
- Department of Biological Sciences, Ashoka University, Sonepat, Haryana 131029, India.
| | - Shubhasis Haldar
- Department of Biological Sciences, Ashoka University, Sonepat, Haryana 131029, India.
| |
Collapse
|
19
|
Russo JD, Zhang S, Leung JMG, Bogetti AT, Thompson JP, DeGrave AJ, Torrillo PA, Pratt AJ, Wong KF, Xia J, Copperman J, Adelman JL, Zwier MC, LeBard DN, Zuckerman DM, Chong LT. WESTPA 2.0: High-Performance Upgrades for Weighted Ensemble Simulations and Analysis of Longer-Timescale Applications. J Chem Theory Comput 2022; 18:638-649. [PMID: 35043623 PMCID: PMC8825686 DOI: 10.1021/acs.jctc.1c01154] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The weighted ensemble (WE) family of methods is one of several statistical mechanics-based path sampling strategies that can provide estimates of key observables (rate constants and pathways) using a fraction of the time required by direct simulation methods such as molecular dynamics or discrete-state stochastic algorithms. WE methods oversee numerous parallel trajectories using intermittent overhead operations at fixed time intervals, enabling facile interoperability with any dynamics engine. Here, we report on the major upgrades to the WESTPA software package, an open-source, high-performance framework that implements both basic and recently developed WE methods. These upgrades offer substantial improvements over traditional WE methods. The key features of the new WESTPA 2.0 software enhance the efficiency and ease of use: an adaptive binning scheme for more efficient surmounting of large free energy barriers, streamlined handling of large simulation data sets, exponentially improved analysis of kinetics, and developer-friendly tools for creating new WE methods, including a Python API and resampler module for implementing both binned and "binless" WE strategies.
Collapse
Affiliation(s)
- John D Russo
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, Oregon 97239-3098, United States
| | - She Zhang
- OpenEye Scientific, Santa Fe, New Mexico 87508, United States
| | - Jeremy M G Leung
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Anthony T Bogetti
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Jeff P Thompson
- OpenEye Scientific, Santa Fe, New Mexico 87508, United States
| | - Alex J DeGrave
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Paul A Torrillo
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - A J Pratt
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Kim F Wong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Junchao Xia
- OpenEye Scientific, Santa Fe, New Mexico 87508, United States
| | - Jeremy Copperman
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, Oregon 97239-3098, United States
| | - Joshua L Adelman
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Matthew C Zwier
- Department of Chemistry, Drake University, Des Moines, Iowa 50311-4505, United States
| | - David N LeBard
- OpenEye Scientific, Santa Fe, New Mexico 87508, United States
| | - Daniel M Zuckerman
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, Oregon 97239-3098, United States
| | - Lillian T Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
20
|
Abstract
Abstract
We study weighted ensemble, an interacting particle method for sampling distributions of Markov chains that has been used in computational chemistry since the 1990s. Many important applications of weighted ensemble require the computation of long time averages. We establish the consistency of weighted ensemble in this setting by proving an ergodic theorem for time averages. As part of the proof, we derive explicit variance formulas that could be useful for optimizing the method.
Collapse
|
21
|
Santhouse JR, Leung JMG, Chong LT, Horne WS. Implications of the unfolded state in the folding energetics of heterogeneous-backbone protein mimetics. Chem Sci 2022; 13:11798-11806. [PMID: 36320921 PMCID: PMC9580521 DOI: 10.1039/d2sc04427g] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 09/19/2022] [Indexed: 12/28/2022] Open
Abstract
Sequence-encoded folding is the foundation of protein structure and is also possible in synthetic chains of artificial chemical composition. In natural proteins, the characteristics of the unfolded state are as important as those of the folded state in determining folding energetics. While much is known about folded structures adopted by artificial protein-like chains, corresponding information about the unfolded states of these molecules is lacking. Here, we report the consequences of altered backbone composition on the structure, stability, and dynamics of the folded and unfolded states of a compact helix-rich protein. Characterization through a combination of biophysical experiments and atomistic simulation reveals effects of backbone modification that depend on both the type of artificial monomers employed and where they are applied in sequence. In general, introducing artificial connectivity in a way that reinforces characteristics of the unfolded state ensemble of the prototype natural protein minimizes the impact of chemical changes on folded stability. These findings have implications in the design of protein mimetics and provide an atomically detailed picture of the unfolded state of a natural protein and artificial analogues under non-denaturing conditions. Biophysical experiments and atomistic simulation reveal impacts of protein backbone alteration on the ensemble that defines the unfolded state. These effects have implications on folded stability of protein mimetics.![]()
Collapse
Affiliation(s)
| | - Jeremy M. G. Leung
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, 15211, USA
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, 15211, USA
| | - W. Seth Horne
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, 15211, USA
| |
Collapse
|
22
|
Ray D, Stone SE, Andricioaei I. Markovian Weighted Ensemble Milestoning (M-WEM): Long-Time Kinetics from Short Trajectories. J Chem Theory Comput 2021; 18:79-95. [PMID: 34910499 DOI: 10.1021/acs.jctc.1c00803] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We introduce a rare-event sampling scheme, named Markovian Weighted Ensemble Milestoning (M-WEM), which inlays a weighted ensemble framework within a Markovian milestoning theory to efficiently calculate thermodynamic and kinetic properties of long-time-scale biomolecular processes from short atomistic molecular dynamics simulations. M-WEM is tested on the Müller-Brown potential model, the conformational switching in alanine dipeptide, and the millisecond time-scale protein-ligand unbinding in a trypsin-benzamidine complex. Not only can M-WEM predict the kinetics of these processes with quantitative accuracy but it also allows for a scheme to reconstruct a multidimensional free-energy landscape along additional degrees of freedom, which are not part of the milestoning progress coordinate. For the ligand-receptor system, the experimental residence time, association and dissociation kinetics, and binding free energy could be reproduced using M-WEM within a simulation time of a few hundreds of nanoseconds, which is a fraction of the computational cost of other currently available methods, and close to 4 orders of magnitude less than the experimental residence time. Due to the high accuracy and low computational cost, the M-WEM approach can find potential applications in kinetics and free-energy-based computational drug design.
Collapse
Affiliation(s)
- Dhiman Ray
- Department of Chemistry, University of California Irvine, Irvine, California 92697, United States
| | - Sharon Emily Stone
- Department of Chemistry, University of California Irvine, Irvine, California 92697, United States
| | - Ioan Andricioaei
- Department of Chemistry, University of California Irvine, Irvine, California 92697, United States.,Department of Physics and Astronomy, University of California Irvine, Irvine, California 92697, United States
| |
Collapse
|
23
|
Ahn SH, Ojha AA, Amaro RE, McCammon JA. Gaussian-Accelerated Molecular Dynamics with the Weighted Ensemble Method: A Hybrid Method Improves Thermodynamic and Kinetic Sampling. J Chem Theory Comput 2021; 17:7938-7951. [PMID: 34844409 DOI: 10.1021/acs.jctc.1c00770] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Gaussian-accelerated molecular dynamics (GaMD) is a well-established enhanced sampling method for molecular dynamics simulations that effectively samples the potential energy landscape of the system by adding a boost potential, which smoothens the surface and lowers the energy barriers between states. GaMD is unable to give time-dependent properties such as kinetics directly. On the other hand, the weighted ensemble (WE) method can efficiently sample transitions between states with its many weighted trajectories, which directly yield rates and pathways. However, convergence to equilibrium conditions remains a challenge for the WE method. Hence, we have developed a hybrid method that combines the two methods, wherein GaMD is first used to sample the potential energy landscape of the system and WE is subsequently used to further sample the potential energy landscape and kinetic properties of interest. We show that the hybrid method can sample both thermodynamic and kinetic properties more accurately and quickly compared to using either method alone.
Collapse
Affiliation(s)
- Surl-Hee Ahn
- Department of Chemistry, University of California San Diego, La Jolla 92093, California, United States
| | - Anupam A Ojha
- Department of Chemistry, University of California San Diego, La Jolla 92093, California, United States
| | - Rommie E Amaro
- Department of Chemistry, University of California San Diego, La Jolla 92093, California, United States
| | - J Andrew McCammon
- Department of Chemistry, University of California San Diego, La Jolla 92093, California, United States.,Department of Pharmacology, University of California San Diego, La Jolla 92093, California, United States
| |
Collapse
|
24
|
Dommer A, Casalino L, Kearns F, Rosenfeld M, Wauer N, Ahn SH, Russo J, Oliveira S, Morris C, Bogetti A, Trifan A, Brace A, Sztain T, Clyde A, Ma H, Chennubhotla C, Lee H, Turilli M, Khalid S, Tamayo-Mendoza T, Welborn M, Christensen A, Smith DGA, Qiao Z, Sirumalla SK, O'Connor M, Manby F, Anandkumar A, Hardy D, Phillips J, Stern A, Romero J, Clark D, Dorrell M, Maiden T, Huang L, McCalpin J, Woods C, Gray A, Williams M, Barker B, Rajapaksha H, Pitts R, Gibbs T, Stone J, Zuckerman D, Mulholland A, Miller T, Jha S, Ramanathan A, Chong L, Amaro R. #COVIDisAirborne: AI-Enabled Multiscale Computational Microscopy of Delta SARS-CoV-2 in a Respiratory Aerosol. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2021.11.12.468428. [PMID: 34816263 PMCID: PMC8609898 DOI: 10.1101/2021.11.12.468428] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/12/2023]
Abstract
We seek to completely revise current models of airborne transmission of respiratory viruses by providing never-before-seen atomic-level views of the SARS-CoV-2 virus within a respiratory aerosol. Our work dramatically extends the capabilities of multiscale computational microscopy to address the significant gaps that exist in current experimental methods, which are limited in their ability to interrogate aerosols at the atomic/molecular level and thus ob-scure our understanding of airborne transmission. We demonstrate how our integrated data-driven platform provides a new way of exploring the composition, structure, and dynamics of aerosols and aerosolized viruses, while driving simulation method development along several important axes. We present a series of initial scientific discoveries for the SARS-CoV-2 Delta variant, noting that the full scientific impact of this work has yet to be realized. ACM REFERENCE FORMAT Abigail Dommer 1† , Lorenzo Casalino 1† , Fiona Kearns 1† , Mia Rosenfeld 1 , Nicholas Wauer 1 , Surl-Hee Ahn 1 , John Russo, 2 Sofia Oliveira 3 , Clare Morris 1 , AnthonyBogetti 4 , AndaTrifan 5,6 , Alexander Brace 5,7 , TerraSztain 1,8 , Austin Clyde 5,7 , Heng Ma 5 , Chakra Chennubhotla 4 , Hyungro Lee 9 , Matteo Turilli 9 , Syma Khalid 10 , Teresa Tamayo-Mendoza 11 , Matthew Welborn 11 , Anders Christensen 11 , Daniel G. A. Smith 11 , Zhuoran Qiao 12 , Sai Krishna Sirumalla 11 , Michael O'Connor 11 , Frederick Manby 11 , Anima Anandkumar 12,13 , David Hardy 6 , James Phillips 6 , Abraham Stern 13 , Josh Romero 13 , David Clark 13 , Mitchell Dorrell 14 , Tom Maiden 14 , Lei Huang 15 , John McCalpin 15 , Christo- pherWoods 3 , Alan Gray 13 , MattWilliams 3 , Bryan Barker 16 , HarindaRajapaksha 16 , Richard Pitts 16 , Tom Gibbs 13 , John Stone 6 , Daniel Zuckerman 2 *, Adrian Mulholland 3 *, Thomas MillerIII 11,12 *, ShantenuJha 9 *, Arvind Ramanathan 5 *, Lillian Chong 4 *, Rommie Amaro 1 *. 2021. #COVIDisAirborne: AI-Enabled Multiscale Computational Microscopy ofDeltaSARS-CoV-2 in a Respiratory Aerosol. In Supercomputing '21: International Conference for High Perfor-mance Computing, Networking, Storage, and Analysis . ACM, New York, NY, USA, 14 pages. https://doi.org/finalDOI.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Anda Trifan
- Argonne National Laboratory
- University of Illinois at Urbana-Champaign
| | | | | | - Austin Clyde
- Argonne National Laboratory
- University of Chicago
| | | | | | - Hyungro Lee
- Brookhaven National Lab & Rutgers University
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - John Stone
- University of Illinois at Urbana-Champaign
| | | | | | | | | | | | | | | |
Collapse
|
25
|
Zuckerman DM, Russo JD. A gentle introduction to the non-equilibrium physics of trajectories: Theory, algorithms, and biomolecular applications. AMERICAN JOURNAL OF PHYSICS 2021; 89:1048-1061. [PMID: 35530173 PMCID: PMC9075726 DOI: 10.1119/10.0005603] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 06/25/2021] [Indexed: 06/14/2023]
Abstract
Despite the importance of non-equilibrium statistical mechanics in modern physics and related fields, the topic is often omitted from undergraduate and core-graduate curricula. Key aspects of non-equilibrium physics, however, can be understood with a minimum of formalism based on a rigorous trajectory picture. The fundamental object is the ensemble of trajectories, a set of independent time-evolving systems, which easily can be visualized or simulated (e.g., for protein folding) and which can be analyzed rigorously in analogy to an ensemble of static system configurations. The trajectory picture provides a straightforward basis for understanding first-passage times, "mechanisms" in complex systems, and fundamental constraints on the apparent reversibility of complex processes. Trajectories make concrete the physics underlying the diffusion and Fokker-Planck partial differential equations. Last but not least, trajectory ensembles underpin some of the most important algorithms that have provided significant advances in biomolecular studies of protein conformational and binding processes.
Collapse
|
26
|
Bogetti AT, Presti MF, Loh SN, Chong LT. The Next Frontier for Designing Switchable Proteins: Rational Enhancement of Kinetics. J Phys Chem B 2021; 125:9069-9077. [PMID: 34324338 PMCID: PMC8826494 DOI: 10.1021/acs.jpcb.1c04082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Designing proteins that can switch between active (ON) and inactive (OFF) conformations in response to signals such as ligand binding and incident light has been a tantalizing endeavor in protein engineering for over a decade. While such designs have yielded novel biosensors, therapeutic agents, and smart biomaterials, the response times (times for switching ON and OFF) of many switches have been too slow to be of practical use. Among the defining properties of such switches, the kinetics of switching has been the most challenging to optimize. This is largely due to the difficulty of characterizing the structures of transient states, which are required for manipulating the height of the effective free energy barrier between the ON and OFF states. We share our perspective of the most promising new experimental and computational strategies over the past several years for tackling this next frontier for designing switchable proteins.
Collapse
Affiliation(s)
- Anthony T Bogetti
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Maria F Presti
- Department of Biochemistry and Molecular Biology, State University of New York Upstate Medical University, Syracuse, New York 13210, United States
| | - Stewart N Loh
- Department of Biochemistry and Molecular Biology, State University of New York Upstate Medical University, Syracuse, New York 13210, United States
| | - Lillian T Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
27
|
DeGrave AJ, Bogetti AT, Chong LT. The RED scheme: Rate-constant estimation from pre-steady state weighted ensemble simulations. J Chem Phys 2021; 154:114111. [PMID: 33752378 PMCID: PMC7972523 DOI: 10.1063/5.0041278] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 02/11/2021] [Indexed: 12/13/2022] Open
Abstract
We present the Rate from Event Durations (RED) scheme, a new scheme that more efficiently calculates rate constants using the weighted ensemble path sampling strategy. This scheme enables rate-constant estimation from shorter trajectories by incorporating the probability distribution of event durations, or barrier-crossing times, from a simulation. We have applied the RED scheme to weighted ensemble simulations of a variety of rare-event processes that range in complexity: residue-level simulations of protein conformational switching, atomistic simulations of Na+/Cl- association in explicit solvent, and atomistic simulations of protein-protein association in explicit solvent. Rate constants were estimated with up to 50% greater efficiency than the original weighted ensemble scheme. Importantly, our scheme accounts for the systematic error that results from statistical bias toward the observation of events with short durations and reweights the event duration distribution accordingly. The RED scheme is relevant to any simulation strategy that involves unbiased trajectories of similar length to the most probable event duration, including weighted ensemble, milestoning, and standard simulations as well as the construction of Markov state models.
Collapse
Affiliation(s)
| | - Anthony T. Bogetti
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, USA
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, USA
| |
Collapse
|