1
|
Wilke K, Tao S, Calero S, Lervik A, van Erp TS. NaCl Dissociation Explored Through Predictive Power Path Sampling Analysis. J Chem Theory Comput 2025; 21:4604-4614. [PMID: 40310238 PMCID: PMC12079800 DOI: 10.1021/acs.jctc.5c00054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2025] [Revised: 04/17/2025] [Accepted: 04/21/2025] [Indexed: 05/02/2025]
Abstract
Utilizing the replica exchange transition interface sampling (RETIS) technique, we simulated the dynamics of sodium chloride dissociation in water. Subsequently, the resulting trajectories were analyzed using predictive power analysis (PPA), enabling the identification and quantification of collective variables (CVs) capable of forecasting the reaction occurrence. We improved the robustness of the PPA method by incorporating the Savitzky-Golay (SG) filter on integrated histograms, effectively avoiding the limitations associated with binning. Applying this adapted PPA method, the previously designed solvent parameters and distances from the index invariant distance matrix were assessed. This revealed that the sixth closest oxygen to sodium serves as an equally effective predictor as the best complex solvent parameter. The latter, however, required more knowledge and human intuition as an input for its design, while the former provided such intuition purely as an output. Through a comparable analysis, the chloride solvation shell appears to contain less predictive information. Employing a linear combination of several CVs can further enhance predictability, albeit at the expense of a reduced human interpretability.
Collapse
Affiliation(s)
- Konrad Wilke
- Materials
Simulation & Modelling, Department of Applied Physics and Science
Education, Eindhoven University of Technology, 5600 MB, Eindhoven, The Netherlands
| | - Shuxia Tao
- Materials
Simulation & Modelling, Department of Applied Physics and Science
Education, Eindhoven University of Technology, 5600 MB, Eindhoven, The Netherlands
| | - Sofia Calero
- Materials
Simulation & Modelling, Department of Applied Physics and Science
Education, Eindhoven University of Technology, 5600 MB, Eindhoven, The Netherlands
| | - Anders Lervik
- Department
of Chemistry, Faculty of Natural Sciences and Technology, NTNU, Norwegian University of Science and Technology, 7941 Trondheim, Norway
| | - Titus S. van Erp
- Department
of Chemistry, Faculty of Natural Sciences and Technology, NTNU, Norwegian University of Science and Technology, 7941 Trondheim, Norway
| |
Collapse
|
2
|
Ianeselli A, Howard J, Gerstein MB. A discard-and-restart MD algorithm for the sampling of protein intermediate states. Biophys J 2025:S0006-3495(25)00198-5. [PMID: 40156184 DOI: 10.1016/j.bpj.2025.03.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2024] [Revised: 01/20/2025] [Accepted: 03/25/2025] [Indexed: 04/01/2025] Open
Abstract
We introduce a discard-and-restart molecular dynamics (MD) algorithm tailored for the sampling of realistic protein intermediate states. It aids computational structure-based drug discovery by reducing the simulation times to compute a "quick sketch" of folding pathways by up to 2000×. The algorithm iteratively performs short MD simulations and measures their proximity to a target state via a collective variable loss, which can be defined in a flexible fashion, locally or globally. Using the loss, if the trajectory proceeds toward the target, the MD simulation continues. Otherwise, it is discarded, and a new MD simulation is restarted, with new initial velocities randomly drawn from a Maxwell-Boltzmann distribution. The discard-and-restart algorithm demonstrates efficacy and atomistic accuracy in capturing the folding pathways in several contexts: 1) fast-folding small protein domains, 2) the folding intermediate of the prion protein PrP, and 3) the spontaneous partial unfolding of α-tubulin, a crucial event for microtubule severing. During each iteration of the algorithm, we can perform AI-based analysis of the transitory conformations to find potential binding pockets, which could represent druggable sites. Overall, our algorithm enables systematic and computationally efficient exploration of conformational landscapes, enhancing the design of ligands targeting dynamic protein states.
Collapse
Affiliation(s)
- Alan Ianeselli
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut
| | - Jonathon Howard
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut
| | - Mark B Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut; Department of Computer Science, Yale University, New Haven, Connecticut; Department of Statistics and Data Science, Yale University, New Haven, Connecticut; Department of Biomedical Informatics & Data Science, Yale University, New Haven, Connecticut.
| |
Collapse
|
3
|
Zhang BW, Fajer M, Chen W, Moraca F, Wang L. Leveraging the Thermodynamics of Protein Conformations in Drug Discovery. J Chem Inf Model 2025; 65:252-264. [PMID: 39681511 DOI: 10.1021/acs.jcim.4c01612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2024]
Abstract
As the name implies, structure-based drug design requires confidence in the holo complex structure. The ability to clarify which protein conformation to use when ambiguity arises would be incredibly useful. We present a large scale validation of the computational method Protein Reorganization Free Energy Perturbation (PReorg-FEP) and demonstrate its quantitative accuracy in selecting the correct protein conformation among candidate models in apo or ligand induced states for 14 different systems. These candidate conformations are pulled from various drug discovery related campaigns: cryptic conformations induced by novel hits in lead identification, binding site rearrangement during lead optimization, and conflicting structural biology models. We also show an example of a pH-dependent conformational change, relevant to protein design.
Collapse
Affiliation(s)
- Bin W Zhang
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| | - Mikolai Fajer
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| | - Wei Chen
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| | - Francesca Moraca
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| | - Lingle Wang
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| |
Collapse
|
4
|
Ikizawa S, Hori T, Wijaya TN, Kono H, Bai Z, Kimizono T, Lu W, Tran DP, Kitao A. PaCS-Toolkit: Optimized Software Utilities for Parallel Cascade Selection Molecular Dynamics (PaCS-MD) Simulations and Subsequent Analyses. J Phys Chem B 2024; 128:3631-3642. [PMID: 38578072 PMCID: PMC11033871 DOI: 10.1021/acs.jpcb.4c01271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 03/26/2024] [Accepted: 03/26/2024] [Indexed: 04/06/2024]
Abstract
Parallel cascade selection molecular dynamics (PaCS-MD) is an enhanced conformational sampling method conducted as a "repetition of time leaps in parallel worlds", comprising cycles of multiple molecular dynamics (MD) simulations performed in parallel and selection of the initial structures of MDs for the next cycle. We developed PaCS-Toolkit, an optimized software utility enabling the use of different MD software and trajectory analysis tools to facilitate the execution of the PaCS-MD simulation and analyze the obtained trajectories, including the preparation for the subsequent construction of the Markov state model. PaCS-Toolkit is coded with Python, is compatible with various computing environments, and allows for easy customization by editing the configuration file and specifying the MD software and analysis tools to be used. We present the software design of PaCS-Toolkit and demonstrate applications of PaCS-MD variations: original targeted PaCS-MD to peptide folding; rmsdPaCS-MD to protein domain motion; and dissociation PaCS-MD to ligand dissociation from adenosine A2A receptor.
Collapse
Affiliation(s)
- Shinji Ikizawa
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Tatsuki Hori
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Tegar Nurwahyu Wijaya
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
- Department
of Chemistry, Universitas Pertamina, Jl. Teuku Nyak Arief, Simprug, Jakarta 12220, Indonesia
| | - Hiroshi Kono
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Zhen Bai
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Tatsuhiro Kimizono
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Wenbo Lu
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Duy Phuoc Tran
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Akio Kitao
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| |
Collapse
|
5
|
Halder R, Warshel A. Energetic and structural insights behind calcium induced conformational transition in calmodulin. Proteins 2024; 92:384-394. [PMID: 37915244 PMCID: PMC10872638 DOI: 10.1002/prot.26620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 10/01/2023] [Accepted: 10/10/2023] [Indexed: 11/03/2023]
Abstract
Calmodulin (CaM) is a key signaling protein that triggers several cellular and physiological processes inside the cell. Upon binding with calcium ion, CaM undergoes large scale conformational transition from a closed state to an open state that facilitates its interaction with various target protein and regulates their activity. This work explores the origin of the energetic and structural variation of the wild type and mutated CaM and explores the molecular origin for the structural differences between them. We first calculated the sequential calcium binding energy to CaM using the PDLD/S-LRA/β approach. This study shows a very good correlation with experimental calcium binding energies. Next we calculated the calcium binding energies to the wild type CaM and several mutated CaM systems which were reported experimentally. On the structural aspect, it has been reported experimentally that certain mutation (Q41L-K75I) in calcium bound CaM leads to complete conformational transition from an open to a closed state. By using equilibrium molecular dynamics simulation, free energy calculation and contact frequency map analysis, we have shown that the formation of a cluster of long-range hydrophobic contacts, initiated by the Q41L-K75I CaM variant is the driving force behind its closing motion. This study unravels the energetics and structural aspects behind calcium ion induced conformational changes in wild type CaM and its variant.
Collapse
Affiliation(s)
- Ritaban Halder
- Department of Chemistry, University of Southern California, Los Angeles, California, USA
| | - Arieh Warshel
- Department of Chemistry, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
6
|
Hellemann E, Durrant JD. Worth the Weight: Sub-Pocket EXplorer (SubPEx), a Weighted Ensemble Method to Enhance Binding-Pocket Conformational Sampling. J Chem Theory Comput 2023; 19:5677-5689. [PMID: 37585617 PMCID: PMC10500992 DOI: 10.1021/acs.jctc.3c00478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Indexed: 08/18/2023]
Abstract
Structure-based virtual screening (VS) is an effective method for identifying potential small-molecule ligands, but traditional VS approaches consider only a single binding-pocket conformation. Consequently, they struggle to identify ligands that bind to alternate conformations. Ensemble docking helps address this issue by incorporating multiple conformations into the docking process, but it depends on methods that can thoroughly explore pocket flexibility. We here introduce Sub-Pocket EXplorer (SubPEx), an approach that uses weighted ensemble (WE) path sampling to accelerate binding-pocket sampling. As proof of principle, we apply SubPEx to three proteins relevant to drug discovery: heat shock protein 90, influenza neuraminidase, and yeast hexokinase 2. SubPEx is available free of charge without registration under the terms of the open-source MIT license: http://durrantlab.com/subpex/.
Collapse
Affiliation(s)
- Erich Hellemann
- Department of Biological
Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Jacob D. Durrant
- Department of Biological
Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
7
|
Hellemann E, Durrant JD. Worth the weight: Sub-Pocket EXplorer (SubPEx), a weighted-ensemble method to enhance binding-pocket conformational sampling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.03.539330. [PMID: 37251500 PMCID: PMC10214482 DOI: 10.1101/2023.05.03.539330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Structure-based virtual screening (VS) is an effective method for identifying potential small-molecule ligands, but traditional VS approaches consider only a single binding-pocket conformation. Consequently, they struggle to identify ligands that bind to alternate conformations. Ensemble docking helps address this issue by incorporating multiple conformations into the docking process, but it depends on methods that can thoroughly explore pocket flexibility. We here introduce Sub-Pocket EXplorer (SubPEx), an approach that uses weighted ensemble (WE) path sampling to accelerate binding-pocket sampling. As proof of principle, we apply SubPEx to three proteins relevant to drug discovery: heat shock protein 90, influenza neuraminidase, and yeast hexokinase 2. SubPEx is available free of charge without registration under the terms of the open-source MIT license: http://durrantlab.com/subpex/.
Collapse
Affiliation(s)
- Erich Hellemann
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, 15260, United States
| | - Jacob D. Durrant
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, 15260, United States
| |
Collapse
|
8
|
Aristoff D, Copperman J, Simpson G, Webber RJ, Zuckerman DM. Weighted ensemble: Recent mathematical developments. J Chem Phys 2023; 158:014108. [PMID: 36610976 PMCID: PMC9822651 DOI: 10.1063/5.0110873] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Accepted: 12/04/2022] [Indexed: 12/12/2022] Open
Abstract
Weighted ensemble (WE) is an enhanced sampling method based on periodically replicating and pruning trajectories generated in parallel. WE has grown increasingly popular for computational biochemistry problems due, in part, to improved hardware and accessible software implementations. Algorithmic and analytical improvements have played an important role, and progress has accelerated in recent years. Here, we discuss and elaborate on the WE method from a mathematical perspective, highlighting recent results that enhance the computational efficiency. The mathematical theory reveals a new strategy for optimizing trajectory management that approaches the best possible variance while generalizing to systems of arbitrary dimension.
Collapse
Affiliation(s)
- D. Aristoff
- Mathematics, Colorado State University, Fort Collins, CO 80521 USA
| | - J. Copperman
- Biomedical Engineering, Oregon Health and Science University, Portland, OR 97239 USA
| | - G. Simpson
- Mathematics, Drexel University, Philadelphia, Pennsylvania 19104 USA
| | - R. J. Webber
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California 91125 USA
| | - D. M. Zuckerman
- Biomedical Engineering, Oregon Health and Science University, Portland, OR 97239 USA
| |
Collapse
|
9
|
Bogetti AT, Leung JMG, Russo JD, Zhang S, Thompson JP, Saglam AS, Ray D, Mostofian B, Pratt AJ, Abraham RC, Harrison PO, Dudek M, Torrillo PA, DeGrave AJ, Adhikari U, Faeder JR, Andricioaei I, Adelman JL, Zwier MC, LeBard DN, Zuckerman DM, Chong LT. A Suite of Tutorials for the WESTPA 2.0 Rare-Events Sampling Software [Article v2.0]. LIVING JOURNAL OF COMPUTATIONAL MOLECULAR SCIENCE 2023; 5:1655. [PMID: 37200895 PMCID: PMC10191340 DOI: 10.33011/livecoms.5.1.1655] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The weighted ensemble (WE) strategy has been demonstrated to be highly efficient in generating pathways and rate constants for rare events such as protein folding and protein binding using atomistic molecular dynamics simulations. Here we present two sets of tutorials instructing users in the best practices for preparing, carrying out, and analyzing WE simulations for various applications using the WESTPA software. The first set of more basic tutorials describes a range of simulation types, from a molecular association process in explicit solvent to more complex processes such as host-guest association, peptide conformational sampling, and protein folding. The second set ecompasses six advanced tutorials instructing users in the best practices of using key new features and plugins/extensions of the WESTPA 2.0 software package, which consists of major upgrades for larger systems and/or slower processes. The advanced tutorials demonstrate the use of the following key features: (i) a generalized resampler module for the creation of "binless" schemes, (ii) a minimal adaptive binning scheme for more efficient surmounting of free energy barriers, (iii) streamlined handling of large simulation datasets using an HDF5 framework, (iv) two different schemes for more efficient rate-constant estimation, (v) a Python API for simplified analysis of WE simulations, and (vi) plugins/extensions for Markovian Weighted Ensemble Milestoning and WE rule-based modeling for systems biology models. Applications of the advanced tutorials include atomistic and non-spatial models, and consist of complex processes such as protein folding and the membrane permeability of a drug-like molecule. Users are expected to already have significant experience with running conventional molecular dynamics or systems biology simulations.
Collapse
Affiliation(s)
| | | | - John D. Russo
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | | | | | - Ali S. Saglam
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA
| | - Dhiman Ray
- Department of Chemistry, University of California Irvine, Irvine, CA
| | - Barmak Mostofian
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - AJ Pratt
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Rhea C. Abraham
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Page O. Harrison
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Max Dudek
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Paul A. Torrillo
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Alex J. DeGrave
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Upendra Adhikari
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - James R. Faeder
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA
| | - Ioan Andricioaei
- Department of Chemistry, University of California Irvine, Irvine, CA
| | - Joshua L. Adelman
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA
| | | | | | - Daniel M. Zuckerman
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| |
Collapse
|
10
|
Vani BP, Weare J, Dinner AR. Computing transition path theory quantities with trajectory stratification. J Chem Phys 2022; 157:034106. [PMID: 35868925 PMCID: PMC9296190 DOI: 10.1063/5.0087058] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 06/23/2022] [Indexed: 11/15/2022] Open
Abstract
Transition path theory computes statistics from ensembles of reactive trajectories. A common strategy for sampling reactive trajectories is to control the branching and pruning of trajectories so as to enhance the sampling of low probability segments. However, it can be challenging to apply transition path theory to data from such methods because determining whether configurations and trajectory segments are part of reactive trajectories requires looking backward and forward in time. Here, we show how this issue can be overcome efficiently by introducing simple data structures. We illustrate the approach in the context of nonequilibrium umbrella sampling, but the strategy is general and can be used to obtain transition path theory statistics from other methods that sample segments of unbiased trajectories.
Collapse
Affiliation(s)
- Bodhi P. Vani
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
11
|
Guan X, Tan C, Li W, Wang W, Thirumalai D. Role of water-bridged interactions in metal ion coupled protein allostery. PLoS Comput Biol 2022; 18:e1010195. [PMID: 35653400 PMCID: PMC9197054 DOI: 10.1371/journal.pcbi.1010195] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 06/14/2022] [Accepted: 05/10/2022] [Indexed: 11/18/2022] Open
Abstract
Allosteric communication between distant parts of proteins controls many cellular functions, in which metal ions are widely utilized as effectors to trigger the allosteric cascade. Due to the involvement of strong coordination interactions, the energy landscape dictating the metal ion binding is intrinsically rugged. How metal ions achieve fast binding by overcoming the landscape ruggedness and thereby efficiently mediate protein allostery is elusive. By performing molecular dynamics simulations for the Ca2+ binding mediated allostery of the calmodulin (CaM) domains, each containing two Ca2+ binding helix-loop-helix motifs (EF-hands), we revealed the key role of water-bridged interactions in Ca2+ binding and protein allostery. The bridging water molecules between Ca2+ and binding residue reduces the ruggedness of ligand exchange landscape by acting as a lubricant, facilitating the Ca2+ coupled protein allostery. Calcium-induced rotation of the helices in the EF-hands, with the hydrophobic core serving as the pivot, leads to exposure of hydrophobic sites for target binding. Intriguingly, despite being structurally similar, the response of the two symmetrically arranged EF-hands upon Ca2+ binding is asymmetric. Breakage of symmetry is needed for efficient allosteric communication between the EF-hands. The key roles that water molecules play in driving allosteric transitions are likely to be general in other metal ion mediated protein allostery. Natural proteins often utilize allostery in executing a variety of functions. Metal ions are typical cofactors to trigger the allosteric cascade. In this work, using the Ca2+ sensor protein calmodulin as the model system, we revealed crucial roles of water-bridged interactions in the metal ion coupled protein allostery. The coordination of the Ca2+ to the binding site involves an intermediate in which the water molecule bridges the Ca2+ and the liganding residue. The bridging water reduces the free energy barrier height of ligand exchange, therefore facilitating the ligand exchange and allosteric coupling by acting as a lubricant. We also showed that the response of the two symmetrically arranged EF-hand motifs of CaM domains upon Ca2+ binding is asymmetric, which is directly attributed to the differing dehydration process of the Ca2+ ions and is needed for efficient allosteric communication.
Collapse
Affiliation(s)
- Xingyue Guan
- Department of Physics, National Laboratory of Solid State Microstructure, Nanjing University, Nanjing, China
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, China
| | - Cheng Tan
- Department of Physics, National Laboratory of Solid State Microstructure, Nanjing University, Nanjing, China
| | - Wenfei Li
- Department of Physics, National Laboratory of Solid State Microstructure, Nanjing University, Nanjing, China
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang, China
- * E-mail: (WL); (WW); (DT)
| | - Wei Wang
- Department of Physics, National Laboratory of Solid State Microstructure, Nanjing University, Nanjing, China
- * E-mail: (WL); (WW); (DT)
| | - D. Thirumalai
- Department of Chemistry, University of Texas, Texas, United States of America
- * E-mail: (WL); (WW); (DT)
| |
Collapse
|
12
|
Abstract
Abstract
We study weighted ensemble, an interacting particle method for sampling distributions of Markov chains that has been used in computational chemistry since the 1990s. Many important applications of weighted ensemble require the computation of long time averages. We establish the consistency of weighted ensemble in this setting by proving an ergodic theorem for time averages. As part of the proof, we derive explicit variance formulas that could be useful for optimizing the method.
Collapse
|
13
|
Kania S, Oztekin A, Cheng X, Zhang XF, Webb E. Flow-regulated nucleation protrusion theory for collapsed polymers. Phys Rev E 2021; 104:054504. [PMID: 34942837 DOI: 10.1103/physreve.104.054504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 10/28/2021] [Indexed: 11/07/2022]
Abstract
The globular-stretch transition of a collapsed polymer in low strain rate elongational flow is studied using polymeric protrusion kinetics scaling laws and numerical simulation. Results demonstrate the influence of fluid flow on the occurrence probability of long-length thermally nucleated polymeric protrusions, which regulate collapsed polymer unfolding in low strain rate flows. Further, we estimate that the globular-stretch transition rate (k_{s}) in low strain rate (∈[over ̇]) elongational flows varies as k_{s}∼e^{-α∈[over ̇]^{-1}}. Results here reveal that the existing approach of neglecting the effects of fluid flow on thermally nucleated protrusions distribution is not valid for analyzing polymer unfolding behavior in low strain rate flows. Neglecting such an effect overestimates the constant α in the scaling law of transition rate (k_{s}∼e^{-α∈[over ̇]^{-1}}) by a factor of 2.
Collapse
Affiliation(s)
- Sagar Kania
- Department of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, Pennsylvania 18015, USA
| | - Alparslan Oztekin
- Department of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, Pennsylvania 18015, USA
| | - Xuanhong Cheng
- Department of Material Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, USA.,Department of Bioengineering, Lehigh University, Bethlehem, Pennsylvania 18015, USA
| | - X Frank Zhang
- Department of Bioengineering, Lehigh University, Bethlehem, Pennsylvania 18015, USA
| | - Edmund Webb
- Department of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, Pennsylvania 18015, USA
| |
Collapse
|
14
|
Clerc I, Sagar A, Barducci A, Sibille N, Bernadó P, Cortés J. The diversity of molecular interactions involving intrinsically disordered proteins: A molecular modeling perspective. Comput Struct Biotechnol J 2021; 19:3817-3828. [PMID: 34285781 PMCID: PMC8273358 DOI: 10.1016/j.csbj.2021.06.031] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 06/17/2021] [Accepted: 06/21/2021] [Indexed: 01/15/2023] Open
Abstract
Intrinsically Disordered Proteins and Regions (IDPs/IDRs) are key components of a multitude of biological processes. Conformational malleability enables IDPs/IDRs to perform very specialized functions that cannot be accomplished by globular proteins. The functional role for most of these proteins is related to the recognition of other biomolecules to regulate biological processes or as a part of signaling pathways. Depending on the extent of disorder, the number of interacting sites and the type of partner, very different architectures for the resulting assemblies are possible. More recently, molecular condensates with liquid-like properties composed of multiple copies of IDPs and nucleic acids have been proven to regulate key processes in eukaryotic cells. The structural and kinetic details of disordered biomolecular complexes are difficult to unveil experimentally due to their inherent conformational heterogeneity. Computational approaches, alone or in combination with experimental data, have emerged as unavoidable tools to understand the functional mechanisms of this elusive type of assemblies. The level of description used, all-atom or coarse-grained, strongly depends on the size of the molecular systems and on the timescale of the investigated mechanism. In this mini-review, we describe the most relevant architectures found for molecular interactions involving IDPs/IDRs and the computational strategies applied for their investigation.
Collapse
Affiliation(s)
- Ilinka Clerc
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| | - Amin Sagar
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Alessandro Barducci
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Nathalie Sibille
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Pau Bernadó
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| |
Collapse
|
15
|
Arasteh S, Zhang BW, Levy RM. Protein Loop Conformational Free Energy Changes via an Alchemical Path without Reaction Coordinates. J Phys Chem Lett 2021; 12:4368-4377. [PMID: 33938761 PMCID: PMC8170697 DOI: 10.1021/acs.jpclett.1c00778] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
We introduce a method called restrain-free energy perturbation-release 2.0 (R-FEP-R 2.0) to estimate conformational free energy changes of protein loops via an alchemical path. R-FEP-R 2.0 is a generalization of the method called restrain-free energy perturbation-release (R-FEP-R) that can only estimate conformational free energy changes of protein side chains but not loops. The reorganization of protein loops is a central feature of many biological processes. Unlike other advanced sampling algorithms such as umbrella sampling and metadynamics, R-FEP-R and R-FEP-R 2.0 do not require predetermined collective coordinates and transition pathways that connect the two endpoint conformational states. The R-FEP-R 2.0 method was applied to estimate the conformational free energy change of a β-turn flip in the protein ubiquitin. The result obtained by R-FEP-R 2.0 agrees with the benchmarks very well. We also comment on problems commonly encountered when applying umbrella sampling to calculate protein conformational free energy changes.
Collapse
Affiliation(s)
- Shima Arasteh
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Bin W Zhang
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Ronald M Levy
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| |
Collapse
|
16
|
Kania S, Oztekin A, Cheng X, Zhang XF, Webb E. Predicting pathological von Willebrand factor unraveling in elongational flow. Biophys J 2021; 120:1903-1915. [PMID: 33737157 DOI: 10.1016/j.bpj.2021.03.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Revised: 03/05/2021] [Accepted: 03/10/2021] [Indexed: 10/21/2022] Open
Abstract
The globular-to-unraveled conformation transition of von Willebrand factor (vWF), a large polymeric glycoprotein in human blood plasma, is a crucial step in the process of clotting at sites of vascular injury. However, unraveling of vWF multimers in uninjured vasculature can lead to pathology (i.e., thrombus formation or degradation of vWF proteins by enzyme ADAMTS13, making them nonfunctional). To identify blood flow conditions that might induce pathological unraveling of vWF multimers, here we have computed the globular-to-unraveled transition rate of vWF multimers subjected to varying strain rate elongational flow by employing an enhanced sampling technique, the weighted ensemble method. Weighted ensemble sampling was employed instead of standard brute-force simulations because pathological blood flow conditions can induce undesired vWF unraveling on timescales potentially inaccessible to standard simulation methods. Results here indicate that brief but periodic exposure of vWF to the elongational flow of strain rate greater than or equal to 2500 s-1 represents a source of possible pathology caused by the undesired unraveling of vWF multimers.
Collapse
Affiliation(s)
- Sagar Kania
- Department of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, Pennsylvania
| | - Alparslan Oztekin
- Department of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, Pennsylvania
| | - Xuanhong Cheng
- Department of Material Science and Engineering, Lehigh University, Bethlehem, Pennsylvania; Department of Bioengineering, Lehigh University, Bethlehem, Pennsylvania
| | - X Frank Zhang
- Department of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, Pennsylvania; Department of Bioengineering, Lehigh University, Bethlehem, Pennsylvania
| | - Edmund Webb
- Department of Mechanical Engineering and Mechanics, Lehigh University, Bethlehem, Pennsylvania.
| |
Collapse
|
17
|
Lotz S, Dickson A. Wepy: A Flexible Software Framework for Simulating Rare Events with Weighted Ensemble Resampling. ACS OMEGA 2020; 5:31608-31623. [PMID: 33344813 PMCID: PMC7745226 DOI: 10.1021/acsomega.0c03892] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 10/15/2020] [Indexed: 05/03/2023]
Abstract
Here, we introduce the open-source software framework wepy (https://github.com/ADicksonLab/wepy) which is a toolkit for running and analyzing weighted ensemble (WE) simulations. The wepy toolkit is in pure Python and as such is highly portable and extensible, making it an excellent platform to develop and use new WE resampling algorithms such as WExplore, REVO, and others while leveraging the entire Python ecosystem. In addition, wepy simplifies WE-specific analyses by defining out-of-core tree-like data structures using the cross-platform HDF5 file format. In this paper, we discuss the motivations and challenges for simulating rare events in biomolecular systems. As has previously been shown, high-dimensional WE resampling algorithms such as WExplore and REVO have been successful at these tasks, especially for rare events that are difficult to describe by one or two collective variables. We explain in detail how wepy facilitates implementation of these algorithms, as well as aids in analyzing the unique structure of WE simulation results. To explain how wepy and WE work in general, we describe the mathematical formalism of WE, an overview of the architecture of wepy, and provide code examples of how to construct, run, and analyze simulation results for a protein-ligand system (T4 Lysozyme in an implicit solvent). This paper is written with a variety of readers in mind, including (1) those curious about how to leverage WE rare-event simulations for their domain, (2) current WE users who want to begin using new high-dimensional resamplers such as WExplore and REVO, and (3) expert users who would like to prototype or implement their own algorithms that can be easily adopted by others.
Collapse
Affiliation(s)
- Samuel
D. Lotz
- Department
of Biochemistry & Molecular Biology, Michigan State University, East Lansing 48824, Michigan, United States
| | - Alex Dickson
- Department
of Biochemistry & Molecular Biology, Michigan State University, East Lansing 48824, Michigan, United States
- Department
of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing 48824, Michigan, United States
| |
Collapse
|
18
|
Roussey NM, Dickson A. Enhanced Jarzynski free energy calculations using weighted ensemble. J Chem Phys 2020; 153:134116. [PMID: 33032408 PMCID: PMC7544513 DOI: 10.1063/5.0020600] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 09/16/2020] [Indexed: 02/07/2023] Open
Abstract
The free energy of transitions between stable states is the key thermodynamic quantity that governs the relative probabilities of the forward and reverse reactions and the ratio of state probabilities at equilibrium. The binding free energy of a drug and its receptor is of particular interest, as it serves as an optimization function for drug design. Over the years, many computational methods have been developed to calculate binding free energies, and while many of these methods have a long history, issues such as convergence of free energy estimates and the projection of a binding process onto order parameters remain. Over 20 years ago, the Jarzynski equality was derived with the promise to calculate equilibrium free energies by measuring the work applied to short nonequilibrium trajectories. However, these calculations were found to be dominated by trajectories with low applied work that occur with extremely low probability. Here, we examine the combination of weighted ensemble algorithms with the Jarzynski equality. In this combined method, an ensemble of nonequilibrium trajectories are run in parallel, and cloning and merging operations are used to preferentially sample low-work trajectories that dominate the free energy calculations. Two additional methods are also examined: (i) a novel weighted ensemble resampler that samples trajectories directly according to their importance to the work of work and (ii) the diffusion Monte Carlo method using the applied work as the selection potential. We thoroughly examine both the accuracy and efficiency of unbinding free energy calculations for a series of model Lennard-Jones atom pairs with interaction strengths ranging from 2 kcal/mol to 20 kcal/mol. We find that weighted ensemble calculations can more efficiently determine accurate binding free energies, especially for deeper Lennard-Jones well depths.
Collapse
Affiliation(s)
- Nicole M. Roussey
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48823, USA
| | - Alex Dickson
- Author to whom correspondence should be addressed:
| |
Collapse
|
19
|
Sharpe DJ, Wales DJ. Efficient and exact sampling of transition path ensembles on Markovian networks. J Chem Phys 2020; 153:024121. [DOI: 10.1063/5.0012128] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Affiliation(s)
- Daniel J. Sharpe
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - David J. Wales
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
20
|
Aristoff D, Zuckerman DM. OPTIMIZING WEIGHTED ENSEMBLE SAMPLING OF STEADY STATES. MULTISCALE MODELING & SIMULATION : A SIAM INTERDISCIPLINARY JOURNAL 2020; 18:646-673. [PMID: 34421402 PMCID: PMC8378190 DOI: 10.1137/18m1212100] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
We propose parameter optimization techniques for weighted ensemble sampling of Markov chains in the steady-state regime. Weighted ensemble consists of replicas of a Markov chain, each carrying a weight, that are periodically resampled according to their weights inside of each of a number of bins that partition state space. We derive, from first principles, strategies for optimizing the choices of weighted ensemble parameters, in particular the choice of bins and the number of replicas to maintain in each bin. In a simple numerical example, we compare our new strategies with more traditional ones and with direct Monte Carlo.
Collapse
|
21
|
Bogetti AT, Mostofian B, Dickson A, Pratt AJ, Saglam AS, Harrison PO, Adelman JL, Dudek M, Torrillo PA, DeGrave AJ, Adhikari U, Zwier MC, Zuckerman DM, Chong LT. A Suite of Tutorials for the WESTPA Rare-Events Sampling Software [Article v1.0]. ACTA ACUST UNITED AC 2019; 1. [PMID: 32395705 DOI: 10.33011/livecoms.1.2.10607] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The weighted ensemble (WE) strategy has been demonstrated to be highly efficient in generating pathways and rate constants for rare events such as protein folding and protein binding using atomistic molecular dynamics simulations. Here we present five tutorials instructing users in the best practices for preparing, carrying out, and analyzing WE simulations for various applications using the WESTPA software. Users are expected to already have significant experience with running standard molecular dynamics simulations using the underlying dynamics engine of interest (e.g. Amber, Gromacs, OpenMM). The tutorials range from a molecular association process in explicit solvent to more complex processes such as host-guest association, peptide conformational sampling, and protein folding.
Collapse
Affiliation(s)
| | - Barmak Mostofian
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - Alex Dickson
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI
| | - A J Pratt
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Ali S Saglam
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Page O Harrison
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Joshua L Adelman
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA; currently unaffiliated
| | - Max Dudek
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Paul A Torrillo
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Alex J DeGrave
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA.,Current address: Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA
| | - Upendra Adhikari
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR.,Current address: Department of Chemistry, Missouri Valley College, Marshall, MO
| | | | - Daniel M Zuckerman
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - Lillian T Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| |
Collapse
|
22
|
Srivastava A. Conformational transitions of bio-molecular systems studied using adaptive bond bending elastic network model. J Chem Phys 2019. [DOI: 10.1063/1.5102135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Amit Srivastava
- School of Computational Sciences, Korea Institute for Advanced Study, Seoul 02455, South Korea
| |
Collapse
|
23
|
Vergadou N, Theodorou DN. Molecular Modeling Investigations of Sorption and Diffusion of Small Molecules in Glassy Polymers. MEMBRANES 2019; 9:E98. [PMID: 31398889 PMCID: PMC6723301 DOI: 10.3390/membranes9080098] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 07/22/2019] [Accepted: 07/23/2019] [Indexed: 11/16/2022]
Abstract
With a wide range of applications, from energy and environmental engineering, such as in gas separations and water purification, to biomedical engineering and packaging, glassy polymeric materials remain in the core of novel membrane and state-of the art barrier technologies. This review focuses on molecular simulation methodologies implemented for the study of sorption and diffusion of small molecules in dense glassy polymeric systems. Basic concepts are introduced and systematic methods for the generation of realistic polymer configurations are briefly presented. Challenges related to the long length and time scale phenomena that govern the permeation process in the glassy polymer matrix are described and molecular simulation approaches developed to address the multiscale problem at hand are discussed.
Collapse
Affiliation(s)
- Niki Vergadou
- Molecular Thermodynamics and Modelling of Materials Laboratory, Institute of Nanoscience and Nanotechnology, National Center for Scientific Research Demokritos, Aghia Paraskevi Attikis, GR-15310 Athens, Greece.
| | - Doros N Theodorou
- School of Chemical Engineering, National Technical University of Athens, GR 15780 Athens, Greece
| |
Collapse
|
24
|
Spiriti J, Subramanian SR, Palli R, Wu M, Zuckerman DM. Middle-way flexible docking: Pose prediction using mixed-resolution Monte Carlo in estrogen receptor α. PLoS One 2019; 14:e0215694. [PMID: 31013302 PMCID: PMC6478315 DOI: 10.1371/journal.pone.0215694] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 04/06/2019] [Indexed: 12/17/2022] Open
Abstract
There is a vast gulf between the two primary strategies for simulating protein-ligand interactions. Docking methods significantly limit or eliminate protein flexibility to gain great speed at the price of uncontrolled inaccuracy, whereas fully flexible atomistic molecular dynamics simulations are expensive and often suffer from limited sampling. We have developed a flexible docking approach geared especially for highly flexible or poorly resolved targets based on mixed-resolution Monte Carlo (MRMC), which is intended to offer a balance among speed, protein flexibility, and sampling power. The binding region of the protein is treated with a standard atomistic force field, while the remainder of the protein is modeled at the residue level with a Gō model that permits protein flexibility while saving computational cost. Implicit solvation is used. Here we assess three facets of the MRMC approach with implications for other docking studies: (i) the role of receptor flexibility in cross-docking pose prediction; (ii) the use of non-equilibrium candidate Monte Carlo (NCMC) and (iii) the use of pose-clustering in scoring. We examine 61 co-crystallized ligands of estrogen receptor α, an important cancer target known for its flexibility. We also compare the performance of the MRMC approach with Autodock smina. Adding protein flexibility, not surprisingly, leads to significantly lower total energies and stronger interactions between protein and ligand, but notably we document the important role of backbone flexibility in the improvement. The improved backbone flexibility also leads to improved performance relative to smina. Somewhat unexpectedly, our implementation of NCMC leads to only modestly improved sampling of ligand poses. Overall, the addition of protein flexibility improves the performance of docking, as measured by energy-ranked poses, but we do not find significant improvements based on cluster information or the use of NCMC. We discuss possible improvements for the model including alternative coarse-grained force fields, improvements to the treatment of solvation, and adding additional types of NCMC moves.
Collapse
Affiliation(s)
- Justin Spiriti
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR 97239, United States of America
| | - Sundar Raman Subramanian
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, United States of America
| | - Rohith Palli
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, United States of America
| | - Maria Wu
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, United States of America
| | - Daniel M. Zuckerman
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR 97239, United States of America
- * E-mail:
| |
Collapse
|
25
|
Recent Advances in Computational Protocols Addressing Intrinsically Disordered Proteins. Biomolecules 2019; 9:biom9040146. [PMID: 30979035 PMCID: PMC6523529 DOI: 10.3390/biom9040146] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Revised: 04/09/2019] [Accepted: 04/10/2019] [Indexed: 01/09/2023] Open
Abstract
Intrinsically disordered proteins (IDP) are abundant in the human genome and have recently emerged as major therapeutic targets for various diseases. Unlike traditional proteins that adopt a definitive structure, IDPs in free solution are disordered and exist as an ensemble of conformations. This enables the IDPs to signal through multiple signaling pathways and serve as scaffolds for multi-protein complexes. The challenge in studying IDPs experimentally stems from their disordered nature. Nuclear magnetic resonance (NMR), circular dichroism, small angle X-ray scattering, and single molecule Förster resonance energy transfer (FRET) can give the local structural information and overall dimension of IDPs, but seldom provide a unified picture of the whole protein. To understand the conformational dynamics of IDPs and how their structural ensembles recognize multiple binding partners and small molecule inhibitors, knowledge-based and physics-based sampling techniques are utilized in-silico, guided by experimental structural data. However, efficient sampling of the IDP conformational ensemble requires traversing the numerous degrees of freedom in the IDP energy landscape, as well as force-fields that accurately model the protein and solvent interactions. In this review, we have provided an overview of the current state of computational methods for studying IDP structure and dynamics and discussed the major challenges faced in this field.
Collapse
|
26
|
Nunes-Alves A, Zuckerman DM, Arantes GM. Escape of a Small Molecule from Inside T4 Lysozyme by Multiple Pathways. Biophys J 2019. [PMID: 29539393 DOI: 10.1016/j.bpj.2018.01.014] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
The T4 lysozyme L99A mutant is often used as a model system to study small-molecule binding to proteins, but pathways for ligand entry and exit from the buried binding site and the associated protein conformational changes have not been fully resolved. Here, molecular dynamics simulations were employed to model benzene exit from its binding cavity using the weighted ensemble (WE) approach to enhance sampling of low-probability unbinding trajectories. Independent WE simulations revealed four pathways for benzene exit, which correspond to transient tunnels spontaneously formed in previous simulations of apo T4 lysozyme. Thus, benzene unbinding occurs through multiple pathways partially created by intrinsic protein structural fluctuations. Motions of several α-helices and side chains were involved in ligand escape from metastable microstates. WE simulations also provided preliminary estimates of rate constants for each exit pathway. These results complement previous works and provide a semiquantitative characterization of pathway heterogeneity for binding of small molecules to proteins.
Collapse
Affiliation(s)
- Ariane Nunes-Alves
- Department of Biochemistry, Instituto de Química, Universidade de São Paulo, São Paulo, Brazil
| | - Daniel M Zuckerman
- Department of Biomedical Engineering, School of Medicine, Oregon Health & Science University, Portland, Oregon.
| | | |
Collapse
|
27
|
Zhu Q, Yuan Y, Ma J, Dong H. A Data‐Driven Accelerated Sampling Method for Searching Functional States of Proteins. ADVANCED THEORY AND SIMULATIONS 2019. [DOI: 10.1002/adts.201800171] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- Qiang Zhu
- Key Laboratory of Mesoscopic Chemistry of Ministry of EducationInstitute of Theoretical and Computational Chemistry School of Chemistry and Chemical EngineeringNanjing University Nanjing 210023 P. R. China
- Kuang Yaming Honors SchoolNanjing University Nanjing 210023 P. R. China
| | - Yigao Yuan
- Kuang Yaming Honors SchoolNanjing University Nanjing 210023 P. R. China
| | - Jing Ma
- Key Laboratory of Mesoscopic Chemistry of Ministry of EducationInstitute of Theoretical and Computational Chemistry School of Chemistry and Chemical EngineeringNanjing University Nanjing 210023 P. R. China
| | - Hao Dong
- Kuang Yaming Honors SchoolNanjing University Nanjing 210023 P. R. China
| |
Collapse
|
28
|
Zhao L, Lai L, Zhang Z. How calcium ion binding induces the conformational transition of the calmodulin N-terminal domain—an atomic level characterization. Phys Chem Chem Phys 2019; 21:19795-19804. [DOI: 10.1039/c9cp03917a] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The Ca2+binding and triggering conformation transition of nCaM were detected in unbiased molecular dynamics simulations.
Collapse
Affiliation(s)
- Likun Zhao
- College of Life Science
- University of Chinese Academy of Sciences
- Beijing
- China
| | - Luhua Lai
- BNLMS, and Peking-Tsinghua Center for Life Sciences at the College of Chemistry and Molecular Engineering
- Peking University
- Beijing
- China
- Center for Quantitative Biology
| | - Zhuqing Zhang
- College of Life Science
- University of Chinese Academy of Sciences
- Beijing
- China
| |
Collapse
|
29
|
Residual Structure Accelerates Binding of Intrinsically Disordered ACTR by Promoting Efficient Folding upon Encounter. J Mol Biol 2018; 431:422-432. [PMID: 30528464 DOI: 10.1016/j.jmb.2018.12.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2018] [Revised: 11/10/2018] [Accepted: 12/03/2018] [Indexed: 11/21/2022]
Abstract
Intrinsically disordered proteins (IDPs) often fold into stable structures upon specific binding. The roles of residual structure of unbound IDPs in coupling binding and folding have been under much debate. While many studies emphasize the importance of conformational flexibility for IDP recognition, it was recently demonstrated that stabilization the N-terminal helix of intrinsically disordered ACTR accelerated its binding to another IDP, NCBD of the CREB-binding protein. To understand how enhancing ACTR helicity accelerates binding, we derived a series of topology-based coarse-grained models that mimicked various ACTR mutants with increasing helical contents and reproduced their NCBD binding affinities. Molecular dynamics simulations were then performed to sample hundreds of reversible coupled binding and folding transitions. The results show that increasing ACTR helicity does not alter the baseline mechanism of synergistic folding, which continues to follow "extended conformational selection" with multiple stages of selection and induced folding. Importantly, these coarse-grained models, while only calibrated based on binding thermodynamics, recapitulate the observed kinetic acceleration with increasing ACTR helicity. However, the residual helices do not enhance the association kinetics via more efficient seeding of productive collisions. Instead, they allow the nonspecific collision complexes to evolve more efficiently into the final bound and folded state, which is the primary source of accelerated association kinetics. Meanwhile, reduced dissociation kinetics with increasing ACTR helicity can be directly attributed to smaller entropic cost of forming the bound state. Altogether, this study provides important mechanistic insights into how residual structure may modulate thermodynamics and kinetics of IDP interactions.
Collapse
|
30
|
Chu WT, Wang J. Quantifying the Intrinsic Conformation Energy Landscape Topography of Proteins with Large-Scale Open-Closed Transition. ACS CENTRAL SCIENCE 2018; 4:1015-1022. [PMID: 30159398 PMCID: PMC6107866 DOI: 10.1021/acscentsci.8b00274] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Indexed: 06/08/2023]
Abstract
Large-scale conformational changes of proteins, including the open-closed transitions, are crucial for a variety of protein functions. These open-closed transitions are often associated with ligand binding. However, the understandings of the underlying mechanisms of the conformational changes within proteins during the open-closed transitions are still challenging at present. In this study, we quantified the intrinsic underlying conformational energy landscapes of five different proteins with large-scale open-closed transitions. This is realized by exploring the underlying density of states and the intrinsic conformational energy landscape topography measure Λ. Λ is a dimensionless ratio of conformational energy gap δE versus conformational energy roughness δE and configurational entropy S or size of the intrinsic conformational energy landscape. By quantifying the Λ of intrinsic open-closed conformational (Λoc) and intrinsic global folding (Λglobal) energy landscapes, we show that both intrinsic open-closed conformation energy and entropy landscapes are funneled toward the closed state. Furthermore, our results indicate the strong correlations between Λ and thermodynamics (conformational state transition temperature against trapping temperature) as well as between Λ and kinetics (open-closed kinetic time) of these proteins. This shows that the intrinsic conformational landscape topography determines both the conformational thermodynamic stability and kinetic speed of the conformational dynamics. Our investigations provide important insights for understanding the fundamental mechanisms of the protein conformational dynamics in a physical and global way.
Collapse
Affiliation(s)
- Wen-Ting Chu
- State
Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin 130022, China
| | - Jin Wang
- State
Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin 130022, China
- Department
of Chemistry & Physics, State University
of New York at Stony Brook, Stony
Brook, New York 11794, United States
| |
Collapse
|
31
|
Ahn SH, Grate JW, Darve EF. Investigating the role of non-covalent interactions in conformation and assembly of triazine-based sequence-defined polymers. J Chem Phys 2018; 149:072330. [DOI: 10.1063/1.5024552] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Affiliation(s)
- Surl-Hee Ahn
- Chemistry Department, Stanford University, Stanford, California 94305, USA
| | - Jay W. Grate
- Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Eric F. Darve
- Mechanical Engineering Department, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
32
|
He P, Zhang BW, Arasteh S, Wang L, Abel R, Levy RM. Conformational Free Energy Changes via an Alchemical Path without Reaction Coordinates. J Phys Chem Lett 2018; 9:4428-4435. [PMID: 30024165 PMCID: PMC6092130 DOI: 10.1021/acs.jpclett.8b01851] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
We introduce a novel method called restrain-free energy perturbation-release (R-FEP-R) to estimate conformational free energy changes via an alchemical path, which for some conformational landscapes like those associated with cellular signaling proteins in the kinase family is more direct and readily converged than the corresponding free energy changes along the physical path. The R-FEP-R method was developed from the dual topology free energy perturbation method that is widely applied to estimate the binding free energy difference between two ligands. In R-FEP-R, the free energy change between two conformational basins is calculated by free energy perturbations that remove those atoms involved in the conformational change from their initial conformational basin while simultaneously growing them back according to the final conformational basin. Both the initial and final dual topology states are unphysical, but they are designed in a way such that the unphysical contributions to the initial and final partition functions cancel. Compared with other advanced sampling algorithms such as umbrella sampling and metadynamics, the R-FEP-R method does not require predetermined transition pathways or reaction coordinates that connect the two conformational states. As a first illustration, the R-FEP-R method was applied to calculate the free energy change between conformational basins for alanine dipeptide in solution and for a side chain in the binding pocket of T4 lysozyme. The results obtained by R-FEP-R agree with the benchmarks very well.
Collapse
Affiliation(s)
- Peng He
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Bin W. Zhang
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Shima Arasteh
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Lingle Wang
- Schrödinger, Inc., 120 West 45th Street, New York, New York 10036, United States
| | - Robert Abel
- Schrödinger, Inc., 120 West 45th Street, New York, New York 10036, United States
| | - Ronald M. Levy
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University, Philadelphia, Pennsylvania 19122, United States
| |
Collapse
|
33
|
Ahn SH, Grate JW, Darve EF. Efficiently sampling conformations and pathways using the concurrent adaptive sampling (CAS) algorithm. J Chem Phys 2018; 147:074115. [PMID: 28830168 DOI: 10.1063/1.4999097] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Molecular dynamics simulations are useful in obtaining thermodynamic and kinetic properties of bio-molecules, but they are limited by the time scale barrier. That is, we may not obtain properties' efficiently because we need to run microseconds or longer simulations using femtosecond time steps. To overcome this time scale barrier, we can use the weighted ensemble (WE) method, a powerful enhanced sampling method that efficiently samples thermodynamic and kinetic properties. However, the WE method requires an appropriate partitioning of phase space into discrete macrostates, which can be problematic when we have a high-dimensional collective space or when little is known a priori about the molecular system. Hence, we developed a new WE-based method, called the "Concurrent Adaptive Sampling (CAS) algorithm," to tackle these issues. The CAS algorithm is not constrained to use only one or two collective variables, unlike most reaction coordinate-dependent methods. Instead, it can use a large number of collective variables and adaptive macrostates to enhance the sampling in the high-dimensional space. This is especially useful for systems in which we do not know what the right reaction coordinates are, in which case we can use many collective variables to sample conformations and pathways. In addition, a clustering technique based on the committor function is used to accelerate sampling the slowest process in the molecular system. In this paper, we introduce the new method and show results from two-dimensional models and bio-molecules, specifically penta-alanine and a triazine trimer.
Collapse
Affiliation(s)
- Surl-Hee Ahn
- Chemistry Department, Stanford University, Stanford, California 94305, USA
| | - Jay W Grate
- Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Eric F Darve
- Mechanical Engineering Department, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
34
|
Bacci M, Langini C, Vymětal J, Caflisch A, Vitalis A. Focused conformational sampling in proteins. J Chem Phys 2018; 147:195102. [PMID: 29166086 DOI: 10.1063/1.4996879] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
A detailed understanding of the conformational dynamics of biological molecules is difficult to obtain by experimental techniques due to resolution limitations in both time and space. Computer simulations avoid these in theory but are often too short to sample rare events reliably. Here we show that the progress index-guided sampling (PIGS) protocol can be used to enhance the sampling of rare events in selected parts of biomolecules without perturbing the remainder of the system. The method is very easy to use as it only requires as essential input a set of several features representing the parts of interest sufficiently. In this feature space, new states are discovered by spontaneous fluctuations alone and in unsupervised fashion. Because there are no energetic biases acting on phase space variables or projections thereof, the trajectories PIGS generates can be analyzed directly in the framework of transition networks. We demonstrate the possibility and usefulness of such focused explorations of biomolecules with two loops that are part of the binding sites of bromodomains, a family of epigenetic "reader" modules. This real-life application uncovers states that are structurally and kinetically far away from the initial crystallographic structures and are also metastable. Representative conformations are intended to be used in future high-throughput virtual screening campaigns.
Collapse
Affiliation(s)
- Marco Bacci
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Cassiano Langini
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Jiří Vymětal
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Amedeo Caflisch
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Andreas Vitalis
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| |
Collapse
|
35
|
Sapin E, De Jong KA, Shehu A. From Optimization to Mapping: An Evolutionary Algorithm for Protein Energy Landscapes. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:719-731. [PMID: 28113951 DOI: 10.1109/tcbb.2016.2628745] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Stochastic search is often the only viable option to address complex optimization problems. Recently, evolutionary algorithms have been shown to handle challenging continuous optimization problems related to protein structure modeling. Building on recent work in our laboratories, we propose an evolutionary algorithm for efficiently mapping the multi-basin energy landscapes of dynamic proteins that switch between thermodynamically stable or semi-stable structural states to regulate their biological activity in the cell. The proposed algorithm balances computational resources between exploration and exploitation of the nonlinear, multimodal landscapes that characterize multi-state proteins via a novel combination of global and local search to generate a dynamically-updated, information-rich map of a protein's energy landscape. This new mapping-oriented EA is applied to several dynamic proteins and their disease-implicated variants to illustrate its ability to map complex energy landscapes in a computationally feasible manner. We further show that, given the availability of such maps, comparison between the maps of wildtype and variants of a protein allows for the formulation of a structural and thermodynamic basis for the impact of sequence mutations on dysfunction that may prove useful in guiding further wet-laboratory investigations of dysfunction and molecular interventions.
Collapse
|
36
|
Abstract
The design of protein conformational switches—or proteins that change conformations in response to a signal such as ligand binding—has great potential for developing novel biosensors, diagnostic tools, and therapeutic agents. Among the defining properties of such switches, the response time has been the most challenging to optimize. Here we apply a computational design strategy in synergistic combination with biophysical experiments to rationally improve the response time of an engineered protein-based Ca2+-sensor in which the switching process occurs via mutually exclusive folding of two alternate frames. Notably, our strategy identifies mutations that increase switching rates by as much as 32-fold, achieving response times on the order of fast physiological Ca2+ fluctuations. Our computational design strategy is general and may aid in optimizing the kinetics of other protein conformational switches. The rational optimization of response times of protein conformational switches is a major challenge for biomolecular switch design. Here the authors present a generally applicable computational design strategy that in combination with biophysical experiments can improve response times using a Ca2+-sensor as an example.
Collapse
|
37
|
Li W. Equipartition terms in transition path ensemble: Insights from molecular dynamics simulations of alanine dipeptide. J Chem Phys 2018; 148:084105. [PMID: 29495774 DOI: 10.1063/1.5010408] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Transition path ensemble consists of reactive trajectories and possesses all the information necessary for the understanding of the mechanism and dynamics of important condensed phase processes. However, quantitative description of the properties of the transition path ensemble is far from being established. Here, with numerical calculations on a model system, the equipartition terms defined in thermal equilibrium were for the first time estimated in the transition path ensemble. It was not surprising to observe that the energy was not equally distributed among all the coordinates. However, the energies distributed on a pair of conjugated coordinates remained equal. Higher energies were observed to be distributed on several coordinates, which are highly coupled to the reaction coordinate, while the rest were almost equally distributed. In addition, the ensemble-averaged energy on each coordinate as a function of time was also quantified. These quantitative analyses on energy distributions provided new insights into the transition path ensemble.
Collapse
Affiliation(s)
- Wenjin Li
- Institute for Advanced Study, Shenzhen University, Shenzhen, China
| |
Collapse
|
38
|
Zhang BW, Deng N, Tan Z, Levy RM. Stratified UWHAM and Its Stochastic Approximation for Multicanonical Simulations Which Are Far from Equilibrium. J Chem Theory Comput 2017; 13:4660-4674. [PMID: 28902500 PMCID: PMC5897113 DOI: 10.1021/acs.jctc.7b00651] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We describe a new analysis tool called Stratified unbinned Weighted Histogram Analysis Method (Stratified-UWHAM), which can be used to compute free energies and expectations from a multicanonical ensemble when a subset of the parallel simulations is far from being equilibrated because of barriers between free energy basins which are only rarely (or never) crossed at some states. The Stratified-UWHAM equations can be obtained in the form of UWHAM equations but with an expanded set of states. We also provide a stochastic solver, Stratified RE-SWHAM, for Stratified-UWHAM to remove its computational bottleneck. Stratified-UWHAM and Stratified RE-SWHAM are applied to study three test topics: the free energy landscape of alanine dipeptide, the binding affinity of a host-guest binding complex, and path sampling for a two-dimensional double well potential. The examples show that when some of the parallel simulations are only locally equilibrated, the estimates of free energies and equilibrium distributions provided by the conventional UWHAM (or MBAR) solutions exhibit considerable biases, but the estimates provided by Stratified-UWHAM and Stratified RE-SWHAM agree with the benchmark very well. Lastly, we discuss features of the Stratified-UWHAM approach which is based on coarse-graining in relation to two other maximum likelihood-based methods which were proposed recently, that also coarse-grain the multicanonical data.
Collapse
|
39
|
Zhu J, Likhtman AE, Wang Z. Arm retraction dynamics of entangled star polymers: A forward flux sampling method study. J Chem Phys 2017; 147:044907. [DOI: 10.1063/1.4995422] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Affiliation(s)
- Jian Zhu
- Department of Mathematics and Statistics, University of Reading, Reading RG6 6AX, United Kingdom
| | - Alexei E. Likhtman
- Department of Mathematics and Statistics, University of Reading, Reading RG6 6AX, United Kingdom
| | - Zuowei Wang
- Department of Mathematics and Statistics, University of Reading, Reading RG6 6AX, United Kingdom
| |
Collapse
|
40
|
Abstract
The weighted ensemble (WE) methodology orchestrates quasi-independent parallel simulations run with intermittent communication that can enhance sampling of rare events such as protein conformational changes, folding, and binding. The WE strategy can achieve superlinear scaling-the unbiased estimation of key observables such as rate constants and equilibrium state populations to greater precision than would be possible with ordinary parallel simulation. WE software can be used to control any dynamics engine, such as standard molecular dynamics and cell-modeling packages. This article reviews the theoretical basis of WE and goes on to describe successful applications to a number of complex biological processes-protein conformational transitions, (un)binding, and assembly processes, as well as cell-scale processes in systems biology. We furthermore discuss the challenges that need to be overcome in the next phase of WE methodological development. Overall, the combined advances in WE methodology and software have enabled the simulation of long-timescale processes that would otherwise not be practical on typical computing resources using standard simulation.
Collapse
Affiliation(s)
- Daniel M Zuckerman
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, Oregon 97239;
| | - Lillian T Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260;
| |
Collapse
|
41
|
Tan Z, Xia J, Zhang BW, Levy RM. Locally weighted histogram analysis and stochastic solution for large-scale multi-state free energy estimation. J Chem Phys 2016; 144:034107. [PMID: 26801020 DOI: 10.1063/1.4939768] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
The weighted histogram analysis method (WHAM) including its binless extension has been developed independently in several different contexts, and widely used in chemistry, physics, and statistics, for computing free energies and expectations from multiple ensembles. However, this method, while statistically efficient, is computationally costly or even infeasible when a large number, hundreds or more, of distributions are studied. We develop a locally WHAM (local WHAM) from the perspective of simulations of simulations (SOS), using generalized serial tempering (GST) to resample simulated data from multiple ensembles. The local WHAM equations based on one jump attempt per GST cycle can be solved by optimization algorithms orders of magnitude faster than standard implementations of global WHAM, but yield similarly accurate estimates of free energies to global WHAM estimates. Moreover, we propose an adaptive SOS procedure for solving local WHAM equations stochastically when multiple jump attempts are performed per GST cycle. Such a stochastic procedure can lead to more accurate estimates of equilibrium distributions than local WHAM with one jump attempt per cycle. The proposed methods are broadly applicable when the original data to be "WHAMMED" are obtained properly by any sampling algorithm including serial tempering and parallel tempering (replica exchange). To illustrate the methods, we estimated absolute binding free energies and binding energy distributions using the binding energy distribution analysis method from one and two dimensional replica exchange molecular dynamics simulations for the beta-cyclodextrin-heptanoate host-guest system. In addition to the computational advantage of handling large datasets, our two dimensional WHAM analysis also demonstrates that accurate results similar to those from well-converged data can be obtained from simulations for which sampling is limited and not fully equilibrated.
Collapse
Affiliation(s)
- Zhiqiang Tan
- Department of Statistics, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Junchao Xia
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University, Philadelphia, Pennsylvania 19122, USA
| | - Bin W Zhang
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University, Philadelphia, Pennsylvania 19122, USA
| | - Ronald M Levy
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University, Philadelphia, Pennsylvania 19122, USA
| |
Collapse
|
42
|
Redner GS, Wagner CG, Baskaran A, Hagan MF. Classical Nucleation Theory Description of Active Colloid Assembly. PHYSICAL REVIEW LETTERS 2016; 117:148002. [PMID: 27740811 DOI: 10.1103/physrevlett.117.148002] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2016] [Indexed: 06/06/2023]
Abstract
Nonaligning self-propelled particles with purely repulsive excluded volume interactions undergo athermal motility-induced phase separation into a dilute gas and a dense cluster phase. Here, we use enhanced sampling computational methods and analytic theory to examine the kinetics of formation of the dense phase. Despite the intrinsically nonequilibrium nature of the phase transition, we show that the kinetics can be described using an approach analogous to equilibrium classical nucleation theory, governed by an effective free energy of cluster formation with identifiable bulk and surface terms. The theory captures the location of the binodal, nucleation rates as a function of supersaturation, and the cluster size distributions below the binodal, while discrepancies in the metastable region reveal additional physics about the early stages of active crystal formation. The success of the theory shows that a framework similar to equilibrium thermodynamics can be obtained directly from the microdynamics of an active system, and can be used to describe the kinetics of evolution toward nonequilibrium steady states.
Collapse
Affiliation(s)
- Gabriel S Redner
- Martin Fisher School of Physics, Brandeis University, Waltham, Massachusetts 02453, USA
| | - Caleb G Wagner
- Martin Fisher School of Physics, Brandeis University, Waltham, Massachusetts 02453, USA
| | - Aparna Baskaran
- Martin Fisher School of Physics, Brandeis University, Waltham, Massachusetts 02453, USA
| | - Michael F Hagan
- Martin Fisher School of Physics, Brandeis University, Waltham, Massachusetts 02453, USA
| |
Collapse
|
43
|
Zwier MC, Pratt AJ, Adelman JL, Kaus JW, Zuckerman DM, Chong LT. Efficient Atomistic Simulation of Pathways and Calculation of Rate Constants for a Protein-Peptide Binding Process: Application to the MDM2 Protein and an Intrinsically Disordered p53 Peptide. J Phys Chem Lett 2016; 7:3440-5. [PMID: 27532687 PMCID: PMC5008990 DOI: 10.1021/acs.jpclett.6b01502] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
The characterization of protein binding processes - with all of the key conformational changes - has been a grand challenge in the field of biophysics. Here, we have used the weighted ensemble path sampling strategy to orchestrate molecular dynamics simulations, yielding atomistic views of protein-peptide binding pathways involving the MDM2 oncoprotein and an intrinsically disordered p53 peptide. A total of 182 independent, continuous binding pathways were generated, yielding a kon that is in good agreement with experiment. These pathways were generated in 15 days using 3500 cores of a supercomputer, substantially faster than would be possible with "brute force" simulations. Many of these pathways involve the anchoring of p53 residue F19 into the MDM2 binding cleft when forming the metastable encounter complex, indicating that F19 may be a kinetically important residue. Our study demonstrates that it is now practical to generate pathways and calculate rate constants for protein binding processes using atomistic simulation on typical computing resources.
Collapse
Affiliation(s)
- Matthew C. Zwier
- Department of Chemistry, Drake University, Des Moines, Iowa 50311, United States
| | - Adam J. Pratt
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Joshua L. Adelman
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Joseph W. Kaus
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Daniel M. Zuckerman
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
- Institute of Biochemistry and Biotechnology, Martin-Luther Universität Halle-Wittenberg, Halle 06120, Germany
- Corresponding Author:
| |
Collapse
|
44
|
Sapin E, Carr DB, De Jong KA, Shehu A. Computing energy landscape maps and structural excursions of proteins. BMC Genomics 2016; 17 Suppl 4:546. [PMID: 27535545 PMCID: PMC5001232 DOI: 10.1186/s12864-016-2798-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Structural excursions of a protein at equilibrium are key to biomolecular recognition and function modulation. Protein modeling research is driven by the need to aid wet laboratories in characterizing equilibrium protein dynamics. In principle, structural excursions of a protein can be directly observed via simulation of its dynamics, but the disparate temporal scales involved in such excursions make this approach computationally impractical. On the other hand, an informative representation of the structure space available to a protein at equilibrium can be obtained efficiently via stochastic optimization, but this approach does not directly yield information on equilibrium dynamics. METHODS We present here a novel methodology that first builds a multi-dimensional map of the energy landscape that underlies the structure space of a given protein and then queries the computed map for energetically-feasible excursions between structures of interest. An evolutionary algorithm builds such maps with a practical computational budget. Graphical techniques analyze a computed multi-dimensional map and expose interesting features of an energy landscape, such as basins and barriers. A path searching algorithm then queries a nearest-neighbor graph representation of a computed map for energetically-feasible basin-to-basin excursions. RESULTS Evaluation is conducted on intrinsically-dynamic proteins of importance in human biology and disease. Visual statistical analysis of the maps of energy landscapes computed by the proposed methodology reveals features already captured in the wet laboratory, as well as new features indicative of interesting, unknown thermodynamically-stable and semi-stable regions of the equilibrium structure space. Comparison of maps and structural excursions computed by the proposed methodology on sequence variants of a protein sheds light on the role of equilibrium structure and dynamics in the sequence-function relationship. CONCLUSIONS Applications show that the proposed methodology is effective at locating basins in complex energy landscapes and computing basin-basin excursions of a protein with a practical computational budget. While the actual temporal scales spanned by a structural excursion cannot be directly obtained due to the foregoing of simulation of dynamics, hypotheses can be formulated regarding the impact of sequence mutations on protein function. These hypotheses are valuable in instigating further research in wet laboratories.
Collapse
Affiliation(s)
- Emmanuel Sapin
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA
| | - Daniel B Carr
- Department of Statistics, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA
| | - Kenneth A De Jong
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA.,Krasnow Institute for Advanced Study, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA
| | - Amarda Shehu
- Department of Computer Science, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA. amarda.@gmu.edu.,Department of Bioengineering, George Mason University, 4400 University Drive, Fairfax, 22030, VA, USA. amarda.@gmu.edu.,School of Systems Biology, George Mason University, 10900 University Boulevard, Manassas, 20110, VA, USA. amarda.@gmu.edu
| |
Collapse
|
45
|
Jungblut S, Dellago C. Pathways to self-organization: Crystallization via nucleation and growth. THE EUROPEAN PHYSICAL JOURNAL. E, SOFT MATTER 2016; 39:77. [PMID: 27498980 DOI: 10.1140/epje/i2016-16077-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2016] [Accepted: 07/13/2016] [Indexed: 06/06/2023]
Abstract
Crystallization, a prototypical self-organization process during which a disordered state spontaneously transforms into a crystal characterized by a regular arrangement of its building blocks, usually proceeds by nucleation and growth. In the initial stages of the transformation, a localized nucleus of the new phase forms in the old one due to a random fluctuation. Most of these nuclei disappear after a short time, but rarely a crystalline embryo may reach a critical size after which further growth becomes thermodynamically favorable and the entire system is converted into the new phase. In this article, we will discuss several theoretical concepts and computational methods to study crystallization. More specifically, we will address the rare event problem arising in the simulation of nucleation processes and explain how to calculate nucleation rates accurately. Particular attention is directed towards discussing statistical tools to analyze crystallization trajectories and identify the transition mechanism.
Collapse
Affiliation(s)
- S Jungblut
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Wien, Austria
| | - C Dellago
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Wien, Austria.
| |
Collapse
|
46
|
Perkett MR, Mirijanian DT, Hagan MF. The allosteric switching mechanism in bacteriophage MS2. J Chem Phys 2016; 145:035101. [PMID: 27448905 PMCID: PMC4947040 DOI: 10.1063/1.4955187] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 06/07/2016] [Indexed: 01/16/2023] Open
Abstract
We use all-atom simulations to elucidate the mechanisms underlying conformational switching and allostery within the coat protein of the bacteriophage MS2. Assembly of most icosahedral virus capsids requires that the capsid protein adopts different conformations at precise locations within the capsid. It has been shown that a 19 nucleotide stem loop (TR) from the MS2 genome acts as an allosteric effector, guiding conformational switching of the coat protein during capsid assembly. Since the principal conformational changes occur far from the TR binding site, it is important to understand the molecular mechanism underlying this allosteric communication. To this end, we use all-atom simulations with explicit water combined with a path sampling technique to sample the MS2 coat protein conformational transition, in the presence and absence of TR-binding. The calculations find that TR binding strongly alters the transition free energy profile, leading to a switch in the favored conformation. We discuss changes in molecular interactions responsible for this shift. We then identify networks of amino acids with correlated motions to reveal the mechanism by which effects of TR binding span the protein. We find that TR binding strongly affects residues located at the 5-fold and quasi-sixfold interfaces in the assembled capsid, suggesting a mechanism by which the TR binding could direct formation of the native capsid geometry. The analysis predicts amino acids whose substitution by mutagenesis could alter populations of the conformational substates or their transition rates.
Collapse
Affiliation(s)
- Matthew R Perkett
- Martin Fisher School of Physics, Brandeis University, Waltham, Massachusetts 02474, USA
| | - Dina T Mirijanian
- Martin Fisher School of Physics, Brandeis University, Waltham, Massachusetts 02474, USA
| | - Michael F Hagan
- Martin Fisher School of Physics, Brandeis University, Waltham, Massachusetts 02474, USA
| |
Collapse
|
47
|
Suárez E, Adelman JL, Zuckerman DM. Accurate Estimation of Protein Folding and Unfolding Times: Beyond Markov State Models. J Chem Theory Comput 2016; 12:3473-81. [PMID: 27340835 PMCID: PMC5022777 DOI: 10.1021/acs.jctc.6b00339] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Because standard molecular dynamics (MD) simulations are unable to access time scales of interest in complex biomolecular systems, it is common to "stitch together" information from multiple shorter trajectories using approximate Markov state model (MSM) analysis. However, MSMs may require significant tuning and can yield biased results. Here, by analyzing some of the longest protein MD data sets available (>100 μs per protein), we show that estimators constructed based on exact non-Markovian (NM) principles can yield significantly improved mean first-passage times (MFPTs) for protein folding and unfolding. In some cases, MSM bias of more than an order of magnitude can be corrected when identical trajectory data are reanalyzed by non-Markovian approaches. The NM analysis includes "history" information, higher order time correlations compared to MSMs, that is available in every MD trajectory. The NM strategy is insensitive to fine details of the states used and works well when a fine time-discretization (i.e., small "lag time") is used.
Collapse
Affiliation(s)
- Ernesto Suárez
- Department of Computational and Systems Biology, University of Pittsburgh , Pittsburgh, Pennsylvania 15260, United States
| | - Joshua L Adelman
- Department of Biological Sciences, University of Pittsburgh , Pittsburgh, Pennsylvania 15260, United States
| | - Daniel M Zuckerman
- Department of Computational and Systems Biology, University of Pittsburgh , Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
48
|
Dickson A, Lotz SD. Ligand Release Pathways Obtained with WExplore: Residence Times and Mechanisms. J Phys Chem B 2016; 120:5377-85. [DOI: 10.1021/acs.jpcb.6b04012] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Alex Dickson
- Department of Biochemistry & Molecular Biology and ‡Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, United States
| | - Samuel D. Lotz
- Department of Biochemistry & Molecular Biology and ‡Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, United States
| |
Collapse
|
49
|
Zhang BW, Dai W, Gallicchio E, He P, Xia J, Tan Z, Levy RM. Simulating Replica Exchange: Markov State Models, Proposal Schemes, and the Infinite Swapping Limit. J Phys Chem B 2016; 120:8289-301. [PMID: 27079355 DOI: 10.1021/acs.jpcb.6b02015] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Replica exchange molecular dynamics is a multicanonical simulation technique commonly used to enhance the sampling of solvated biomolecules on rugged free energy landscapes. While replica exchange is relatively easy to implement, there are many unanswered questions about how to use this technique most efficiently, especially because it is frequently the case in practice that replica exchange simulations are not fully converged. A replica exchange cycle consists of a series of molecular dynamics steps of a set of replicas moving under different Hamiltonians or at different thermodynamic states followed by one or more replica exchange attempts to swap replicas among the different states. How the replica exchange cycle is constructed affects how rapidly the system equilibrates. We have constructed a Markov state model of replica exchange (MSMRE) using long molecular dynamics simulations of a host-guest binding system as an example, in order to study how different implementations of the replica exchange cycle can affect the sampling efficiency. We analyze how the number of replica exchange attempts per cycle, the number of MD steps per cycle, and the interaction between the two parameters affects the largest implied time scale of the MSMRE simulation. The infinite swapping limit is an important concept in replica exchange. We show how to estimate the infinite swapping limit from the diagonal elements of the exchange transition matrix constructed from MSMRE "simulations of simulations" as well as from relatively short runs of the actual replica exchange simulations.
Collapse
Affiliation(s)
- Bin W Zhang
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University , Philadelphia, Pennsylvania 19122, United States
| | - Wei Dai
- Department of Physics and Astronomy, Rutgers, the State University of New Jersey , Piscataway, New Jersey 08854, United States
| | - Emilio Gallicchio
- Department of Chemistry, Brooklyn College of the City University of New York , Brooklyn, New York 11210, United States
| | - Peng He
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University , Philadelphia, Pennsylvania 19122, United States
| | - Junchao Xia
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University , Philadelphia, Pennsylvania 19122, United States
| | - Zhiqiang Tan
- Department of Statistics, Rutgers, the State University of New Jersey , Piscataway, New Jersey 08854, United States
| | - Ronald M Levy
- Center for Biophysics and Computational Biology, Department of Chemistry and Institute for Computational Molecular Science, Temple University , Philadelphia, Pennsylvania 19122, United States
| |
Collapse
|
50
|
Maximova T, Moffatt R, Ma B, Nussinov R, Shehu A. Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics. PLoS Comput Biol 2016; 12:e1004619. [PMID: 27124275 PMCID: PMC4849799 DOI: 10.1371/journal.pcbi.1004619] [Citation(s) in RCA: 144] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Investigation of macromolecular structure and dynamics is fundamental to understanding how macromolecules carry out their functions in the cell. Significant advances have been made toward this end in silico, with a growing number of computational methods proposed yearly to study and simulate various aspects of macromolecular structure and dynamics. This review aims to provide an overview of recent advances, focusing primarily on methods proposed for exploring the structure space of macromolecules in isolation and in assemblies for the purpose of characterizing equilibrium structure and dynamics. In addition to surveying recent applications that showcase current capabilities of computational methods, this review highlights state-of-the-art algorithmic techniques proposed to overcome challenges posed in silico by the disparate spatial and time scales accessed by dynamic macromolecules. This review is not meant to be exhaustive, as such an endeavor is impossible, but rather aims to balance breadth and depth of strategies for modeling macromolecular structure and dynamics for a broad audience of novices and experts.
Collapse
Affiliation(s)
- Tatiana Maximova
- Department of Computer Science, George Mason University, Fairfax, Virginia, United States of America
| | - Ryan Moffatt
- Department of Computer Science, George Mason University, Fairfax, Virginia, United States of America
| | - Buyong Ma
- Basic Science Program, Leidos Biomedical Research, Inc. Cancer and Inflammation Program, National Cancer Institute, Frederick, Maryland, United States of America
| | - Ruth Nussinov
- Basic Science Program, Leidos Biomedical Research, Inc. Cancer and Inflammation Program, National Cancer Institute, Frederick, Maryland, United States of America
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Amarda Shehu
- Department of Computer Science, George Mason University, Fairfax, Virginia, United States of America
- Department of Biongineering, George Mason University, Fairfax, Virginia, United States of America
- School of Systems Biology, George Mason University, Manassas, Virginia, United States of America
| |
Collapse
|