1
|
Zhou M, Shao X, Cai W, Chipot C, Fu H. Zooming across the Alchemical Space. J Phys Chem Lett 2025:4419-4427. [PMID: 40273944 DOI: 10.1021/acs.jpclett.5c00525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2025]
Abstract
Alchemical transformations, whereby chemical species are modified seamlessly, represent a powerful tool in molecular simulations and free-energy calculations, with a broad range of applications. A general-extent, or alchemical parameter, λ ∈ [0,1], describes the gradual transition between the initial and final states of the transformation, and its discretization critically affects the reliability and efficiency of the free-energy calculations. For transformations involving large moieties, free-energy perturbation (FEP) and thermodynamic integration (TI) require numerous intermediates, or λ-states, to ensure appropriate overlap of the configurational ensembles and suitable convergence of the simulation, each state demanding extensive sampling, which burdens computational feasibility. To address this limitation, we combine λ-dynamics─treating λ as a dynamic variable─with the enhanced-sampling approach well-tempered metadynamics-extended adaptive biasing force (WTM-eABF), forming the basis of WTM-λABF. By handling λ as a continuously varying collective variable (CV) and applying a bin-discretized bias, WTM-λABF efficiently explores the λ-space, even when the latter is stratified in numerous intermediates. Calculations of free-energies of hydration, of protein-ligand binding, and of amino-acid mutations in proteins reveal that WTM-λABF consistently converges faster than standard FEP or λ-ABF, with its advantages becoming more pronounced as the number of intermediates rises. We find that WTM-λABF can handle alchemical transformations efficiently with as many as 1,000 intermediates, allowing transformations involving large moieties, or significant potential-energy changes, to be tackled with utmost accuracy. Additionally, its rapid exploration of the continuous λ-space accelerates sampling in the orthogonal space. We are confident that WTM-λABF has the potential to serve as a foundational method for routine applications relevant to chemistry and biophysics, ranging from drug discovery to protein engineering and design.
Collapse
Affiliation(s)
- Mengchen Zhou
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Xueguang Shao
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
- School of Materials Science and Engineering, Smart Sensing Interdisciplinary Science Center, Nankai University, Tianjin 300350, China
| | - Wensheng Cai
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
- School of Materials Science and Engineering, Smart Sensing Interdisciplinary Science Center, Nankai University, Tianjin 300350, China
| | - Christophe Chipot
- Laboratoire International Associé CNRS and University of Illinois at Urbana-Champaign, UMR no. 7019, Université de Lorraine, BP 70239, F-54506 Vandœuvre-lès-Nancy, France
- Department of Physics, University of Illinois at Urbana-Champaign, 1110 West Green Street, Urbana, Illinois 61801, United States
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, Illinois 60637, United States
| | - Haohao Fu
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
- School of Materials Science and Engineering, Smart Sensing Interdisciplinary Science Center, Nankai University, Tianjin 300350, China
| |
Collapse
|
2
|
Schoenmaker L, Jiskoot DA, Scheen J, Cheng E, Gapsys V, Hahn DF, Ries B, van Westen GJP, Mobley DL, Jespers W. IMERGE-FEP: Improving Relative Free Energy Calculation Convergence with Chemical Intermediates. J Phys Chem B 2025; 129:2370-2379. [PMID: 39976528 DOI: 10.1021/acs.jpcb.4c07156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/12/2025]
Abstract
Alchemical free energy calculations are becoming an increasingly prevalent tool in drug discovery efforts. Over the past decade, significant progress has been made in automating various aspects of this technique. However, one aspect hampering wider application is the construction of perturbation networks to connect ligands of interest. More specifically, ligand pairs with large dissimilarities should be avoided since they can lower convergence and decrease accuracy. Here, we propose a technique for automatic generation of intermediate molecules to break up problematic edges─calculations connecting two different ligands or molecules─into smaller perturbations. To this end, a modular tool was developed that generates intermediates for a molecule pair by enumerating R-group combinations called IMERGE-FEP (Intermediate MolEculaR GEnerator for Free Energy Perturbation). Intermediate enumeration of multiple, representative congeneric series showed that intermediates increase similarity regarding shared substructures, geometry, and LOMAP scores. Taken together, this tool eases integration of intermediate steps into free energy calculation protocols.
Collapse
Affiliation(s)
- Linde Schoenmaker
- Leiden Academic Centre for Drug Research, Leiden University, Einsteinweg 55, 2333 CC Leiden, The Netherlands
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, United States
| | - Daan A Jiskoot
- Leiden Academic Centre for Drug Research, Leiden University, Einsteinweg 55, 2333 CC Leiden, The Netherlands
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, United States
| | - Jenke Scheen
- Open Molecular Software Foundation, Davis, California 95618, United States
| | - Evien Cheng
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, United States
| | - Vytautas Gapsys
- Computational Chemistry, Janssen Research and Development, Janssen Pharmaceutica N. V., Turnhoutseweg 30, 2340 Beerse, Belgium
| | - David F Hahn
- Computational Chemistry, Janssen Research and Development, Janssen Pharmaceutica N. V., Turnhoutseweg 30, 2340 Beerse, Belgium
| | - Benjamin Ries
- Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co KG, Birkendorfer Str 65, 88397 Biberachan der Riss, Germany
- Open Free Energy, Open Molecular Software Foundation, Davis, California 95616, United States
| | - Gerard J P van Westen
- Leiden Academic Centre for Drug Research, Leiden University, Einsteinweg 55, 2333 CC Leiden, The Netherlands
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, United States
| | - Willem Jespers
- Leiden Academic Centre for Drug Research, Leiden University, Einsteinweg 55, 2333 CC Leiden, The Netherlands
| |
Collapse
|
3
|
Cui Q. Machine learning in molecular biophysics: Protein allostery, multi-level free energy simulations, and lipid phase transitions. BIOPHYSICS REVIEWS 2025; 6:011305. [PMID: 39957913 PMCID: PMC11825181 DOI: 10.1063/5.0248589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2024] [Accepted: 01/14/2025] [Indexed: 02/18/2025]
Abstract
Machine learning (ML) techniques have been making major impacts on all areas of science and engineering, including biophysics. In this review, we discuss several applications of ML to biophysical problems based on our recent research. The topics include the use of ML techniques to identify hotspot residues in allosteric proteins using deep mutational scanning data and to analyze how mutations of these hotspots perturb co-operativity in the framework of a statistical thermodynamic model, to improve the accuracy of free energy simulations by integrating data from different levels of potential energy functions, and to determine the phase transition temperature of lipid membranes. Through these examples, we illustrate the unique value of ML in extracting patterns or parameters from complex data sets, as well as the remaining limitations. By implementing the ML approaches in the context of physically motivated models or computational frameworks, we are able to gain a deeper mechanistic understanding or better convergence in numerical simulations. We conclude by briefly discussing how the introduced models can be further expanded to tackle more complex problems.
Collapse
Affiliation(s)
- Qiang Cui
- Author to whom correspondence should be addressed:
| |
Collapse
|
4
|
Wang M, Jiang H, Ryde U. Impact of Varying Velocities and Solvation Boxes on Alchemical Free-Energy Simulations. J Chem Inf Model 2025; 65:2107-2115. [PMID: 39887323 PMCID: PMC11863368 DOI: 10.1021/acs.jcim.4c02236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2024] [Revised: 01/22/2025] [Accepted: 01/23/2025] [Indexed: 02/01/2025]
Abstract
Alchemical free-energy perturbation (FEP) is an accurate and thermodynamically stringent way to estimate relative energies for the binding of small ligands to biological macromolecules. It has repeatedly been pointed out that a single simulation normally stays near the starting point in phase space and therefore underestimates the uncertainty of the results. Therefore, it is better to run an ensemble of independent simulations. Traditionally, such an ensemble has been generated by using different starting velocities. We argue that it is better to use also other random choices made during the setup of the simulations, in particular the solvation of the solute. We show here that such solvent-induced independent simulations (SIS) sometimes give a larger standard deviation and slightly different results for the binding of 42 ligands to five different proteins, viz. human N-terminal bromodomain 4, the Leu99Ala mutant of T4 lysozyme, dihydrofolate reductase, blood-clotting factor Xa, and ferritin. SIS does not involve any increase in the time consumption. Therefore, we strongly recommend the use of SIS (in addition to different velocities) to start independent simulations. Other random or uncertain choices in the setup of the simulated systems, e.g., the selection of residues with alternative conformations or positions of added protons, may also be used to enhance the variation in independent simulations.
Collapse
Affiliation(s)
- Meiting Wang
- School
of Medical Engineering & Xinxiang Key Laboratory of Biomedical
Information Research & Henan International Joint Laboratory of
Neural Information Analysis and Drug Intelligent Design & Xinxiang
Key Laboratory of Biomedical Information Research, Xinxiang Medical University, Xinxiang 453003, China
- Department
of Computational Chemistry, Lund University, Chemical Centre, P.O. Box 124, Lund SE-221 00, Sweden
| | - Hao Jiang
- Department
of Computational Chemistry, Lund University, Chemical Centre, P.O. Box 124, Lund SE-221 00, Sweden
| | - Ulf Ryde
- Department
of Computational Chemistry, Lund University, Chemical Centre, P.O. Box 124, Lund SE-221 00, Sweden
| |
Collapse
|
5
|
Rufa D, Fass J, Chodera JD. Fine-tuning molecular mechanics force fields to experimental free energy measurements. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.06.631610. [PMID: 39829785 PMCID: PMC11741335 DOI: 10.1101/2025.01.06.631610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/22/2025]
Abstract
Alchemical free energy methods using molecular mechanics (MM) force fields are essential tools for predicting thermodynamic properties of small molecules, especially via free energy calculations that can estimate quantities relevant for drug discovery such as affinities, selectivities, the impact of target mutations, and ADMET properties. While traditional MM forcefields rely on hand-crafted, discrete atom types and parameters, modern approaches based on graph neural networks (GNNs) learn continuous embedding vectors that represent chemical environments from which MM parameters can be generated. Excitingly, GNN parameterization approaches provide a fully end-to-end differentiable model that offers the possibility of systematically improving these models using experimental data. In this study, we treat a pretrained GNN force field-here, espaloma-0.3.2-as a foundation simulation model and fine-tune its charge model using limited quantities of experimental hydration free energy data, with the goal of assessing the degree to which this can systematically improve the prediction of other related free energies. We demonstrate that a highly efficient "one-shot fine-tuning" method using an exponential (Zwanzig) reweighting free energy estimator can improve prediction accuracy without the need to resimulate molecular configurations. To achieve this "one-shot" improvement, we demonstrate the importance of using effective sample size (ESS) regularization strategies to retain good overlap between initial and fine-tuned force fields. Moreover, we show that leveraging low-rank projections of embedding vectors can achieve comparable accuracy improvements as higher-dimensional approaches in a variety of data-size regimes. Our results demonstrate that linearly-perturbative fine-tuning of foundation model electrostatic parameters to limited experimental data offers a cost-effective strategy that achieves state-of-the-art performance in predicting hydration free energies on the FreeSolv dataset.
Collapse
Affiliation(s)
- Dominic Rufa
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
- Tri-Institutional PhD Program in Chemical Biology, Weill Cornell Graduate School of Medical Sciences, New York, NY 10065, USA
| | - Joshua Fass
- Computation, Relay Therapeutics, Cambridge, Massachusetts 02139, United States
| | - John D. Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065
| |
Collapse
|
6
|
Wang M, Mei Y, Ryde U. Convergence criteria for single-step free-energy calculations: the relation between the Π bias measure and the sample variance. Chem Sci 2024; 15:8786-8799. [PMID: 38873060 PMCID: PMC11168088 DOI: 10.1039/d4sc00140k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 05/08/2024] [Indexed: 06/15/2024] Open
Abstract
Free energy calculations play a crucial role in simulating chemical processes, enzymatic reactions, and drug design. However, assessing the reliability and convergence of these calculations remains a challenge. This study focuses on single-step free-energy calculations using thermodynamic perturbation. It explores how the sample distributions influence the estimated results and evaluates the reliability of various convergence criteria, including Kofke's bias measure Π and the standard deviation of the energy difference ΔU, σ ΔU . The findings reveal that for Gaussian distributions, there is a straightforward relationship between Π and σ ΔU , free energies can be accurately approximated using a second-order cumulant expansion, and reliable results are attainable for σ ΔU up to 25 kcal mol-1. However, interpreting non-Gaussian distributions is more complex. If the distribution is skewed towards more positive values than a Gaussian, converging the free energy becomes easier, rendering standard convergence criteria overly stringent. Conversely, distributions that are skewed towards more negative values than a Gaussian present greater challenges in achieving convergence, making standard criteria unreliable. We propose a practical approach to assess the convergence of estimated free energies.
Collapse
Affiliation(s)
- Meiting Wang
- School of Medical Engineering & Henan International Joint Laboratory of Neural Information Analysis and Drug Intelligent Design, Xinxiang Medical University Xinxiang 453003 China
- Department of Computational Chemistry, Lund University, Chemical Centre P.O. Box 124 SE-221 00 Lund Sweden
| | - Ye Mei
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University Shanghai 200241 China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai Shanghai 200062 China
- Collaborative Innovation Center of Extreme Optics, Shanxi University Taiyuan Shanxi 030006 China
| | - Ulf Ryde
- Department of Computational Chemistry, Lund University, Chemical Centre P.O. Box 124 SE-221 00 Lund Sweden
| |
Collapse
|
7
|
Zeng J, Qian Y. Adaptive lambda schemes for efficient relative binding free energy calculation. J Comput Chem 2024; 45:855-862. [PMID: 38153254 DOI: 10.1002/jcc.27287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 11/13/2023] [Accepted: 12/03/2023] [Indexed: 12/29/2023]
Abstract
The relative free energy perturbation (RFEP) calculation is one of the most theoretically sound computational chemistry approaches for the binding affinity prediction. However, its application is often hindered by the complexity of the calculation choices and the requirement of expertise in the field. Improper lambda scheme of RFEP may result in deviations from an accurate description of the perturbation process and is prone to erroneous affinity predictions. To address such challenges, an automated adaptive lambda method is proposed where the adaptive lambda schemes are obtained through a split-and-merge algorithm based on the pilot runs. The newly established workflow along with a series of improvements to the perturbation settings increases the consistency of the RFEP calculation results. Comparing the pilot and adaptive lambda schemes, the latter demonstrated improvements in convergence and reproducibility and lowered the mean unsigned error and the root-mean-square error. Overall, the adaptive lambda method is a reliable and robust choice to predict small molecule relative binding free energy and can be capitalized to benefit routine RFEP calculations for drug discovery projects.
Collapse
Affiliation(s)
- Jin Zeng
- AIxplorerBio Biotech Co., Ltd., Jiaxing, Zhejiang Province, China
| | - Yue Qian
- Viva Biotech (Shanghai) Ltd., Shanghai, China
| |
Collapse
|
8
|
Verteramo ML, Ignjatović MM, Kumar R, Wernersson S, Ekberg V, Wallerstein J, Carlström G, Chadimová V, Leffler H, Zetterberg F, Logan DT, Ryde U, Akke M, Nilsson UJ. Interplay of halogen bonding and solvation in protein-ligand binding. iScience 2024; 27:109636. [PMID: 38633000 PMCID: PMC11021960 DOI: 10.1016/j.isci.2024.109636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 02/13/2024] [Accepted: 03/26/2024] [Indexed: 04/19/2024] Open
Abstract
Halogen bonding is increasingly utilized in efforts to achieve high affinity and selectivity of molecules designed to bind proteins, making it paramount to understand the relationship between structure, dynamics, and thermodynamic driving forces. We present a detailed analysis addressing this problem using a series of protein-ligand complexes involving single halogen substitutions - F, Cl, Br, and I - and nearly identical structures. Isothermal titration calorimetry reveals an increasingly favorable binding enthalpy from F to I that correlates with the halogen size and σ-hole electropositive character, but is partially counteracted by unfavorable entropy, which is constant from F to Cl and Br, but worse for I. Consequently, the binding free energy is roughly equal for Cl, Br, and I. QM and solvation-free-energy calculations reflect an intricate balance between halogen bonding, hydrogen bonds, and solvation. These advances have the potential to aid future drug design initiatives involving halogenated compounds.
Collapse
Affiliation(s)
| | | | - Rohit Kumar
- Department of Chemistry, Lund University, Lund, Sweden
| | | | | | | | | | | | - Hakon Leffler
- Microbiology, Immunology, and Glycobiology, Department of Experimental Medicine, Lund University, Lund, Sweden
| | | | | | - Ulf Ryde
- Department of Chemistry, Lund University, Lund, Sweden
| | - Mikael Akke
- Department of Chemistry, Lund University, Lund, Sweden
| | | |
Collapse
|
9
|
York DM. Modern Alchemical Free Energy Methods for Drug Discovery Explained. ACS PHYSICAL CHEMISTRY AU 2023; 3:478-491. [PMID: 38034038 PMCID: PMC10683484 DOI: 10.1021/acsphyschemau.3c00033] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 09/12/2023] [Accepted: 09/13/2023] [Indexed: 12/02/2023]
Abstract
This Perspective provides a contextual explanation of the current state-of-the-art alchemical free energy methods and their role in drug discovery as well as highlights select emerging technologies. The narrative attempts to answer basic questions about what goes on "under the hood" in free energy simulations and provide general guidelines for how to run simulations and analyze the results. It is the hope that this work will provide a valuable introduction to students and scientists in the field.
Collapse
Affiliation(s)
- Darrin M. York
- Laboratory for Biomolecular
Simulation Research, Institute for Quantitative Biomedicine, and Department
of Chemistry and Chemical Biology, Rutgers
University, Piscataway, New Jersey 08854, United States
| |
Collapse
|
10
|
Nadkarni I, Wu H, Aluru NR. Data-Driven Approach to Coarse-Graining Simple Liquids in Confinement. J Chem Theory Comput 2023; 19:7358-7370. [PMID: 37791529 DOI: 10.1021/acs.jctc.3c00633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
We propose a data-driven framework for identifying coarse-grained (CG) Lennard-Jones (LJ) potential parameters in confined systems for simple liquids. Our approach involves the use of a Deep Neural Network (DNN) that is trained to approximate the solution of the Inverse Liquid State (ILST) problem for confined systems. The DNN model inherently incorporates essential physical characteristics specific to confined fluids, enabling an accurate prediction of inhomogeneity effects. By utilizing transfer learning, we predict single-site LJ potentials of simple multiatomic liquids confined in a slit-like channel, which effectively replicate both the fluid structure and molecular force of the target All-Atom (AA) system when the electrostatic interactions are not dominant. In addition, we showcase the synergy between the data-driven approach and the well-known Bottom-Up coarse-graining method utilizing Relative-Entropy (RE) Minimization. Through the sequential utilization of these two methods, the robustness of the iterative RE method is significantly augmented, leading to a remarkable enhancement in convergence.
Collapse
Affiliation(s)
- Ishan Nadkarni
- Walker Department of Mechanical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Haiyi Wu
- Oden Institute for Computational Engineering and Sciences, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Narayana R Aluru
- Walker Department of Mechanical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
- Oden Institute for Computational Engineering and Sciences, The University of Texas at Austin, Austin, Texas 78712, United States
| |
Collapse
|
11
|
Willow SY, Kang L, Minh DDL. Learned mappings for targeted free energy perturbation between peptide conformations. J Chem Phys 2023; 159:124104. [PMID: 38127367 PMCID: PMC10517865 DOI: 10.1063/5.0164662] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 09/04/2023] [Indexed: 12/23/2023] Open
Abstract
Targeted free energy perturbation uses an invertible mapping to promote configuration space overlap and the convergence of free energy estimates. However, developing suitable mappings can be challenging. Wirnsberger et al. [J. Chem. Phys. 153, 144112 (2020)] demonstrated the use of machine learning to train deep neural networks that map between Boltzmann distributions for different thermodynamic states. Here, we adapt their approach to the free energy differences of a flexible bonded molecule, deca-alanine, with harmonic biases and different spring centers. When the neural network is trained until "early stopping"-when the loss value of the test set increases-we calculate accurate free energy differences between thermodynamic states with spring centers separated by 1 Å and sometimes 2 Å. For more distant thermodynamic states, the mapping does not produce structures representative of the target state, and the method does not reproduce reference calculations.
Collapse
Affiliation(s)
- Soohaeng Yoo Willow
- Department of Chemistry, Illinois Institute of Technology, Chicago, Illinois 60616, USA
| | - Lulu Kang
- Department of Applied Mathematics, Illinois Institute of Technology, Chicago, Illinois 60616, USA
| | - David D. L. Minh
- Department of Chemistry, Department of Biology, and Center for Interdisciplinary Scientific Computation, Illinois Institute of Technology, Chicago, Illinois 60616, USA
| |
Collapse
|
12
|
Yuan Y, Cui Q. Accurate and Efficient Multilevel Free Energy Simulations with Neural Network-Assisted Enhanced Sampling. J Chem Theory Comput 2023; 19:5394-5406. [PMID: 37527495 PMCID: PMC10810721 DOI: 10.1021/acs.jctc.3c00591] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/03/2023]
Abstract
Free energy differences (ΔF) are essential to quantitative characterization and understanding of chemical and biological processes. Their direct estimation with an accurate quantum mechanical potential is of great interest and yet impractical due to high computational cost and incompatibility with typical alchemical free energy protocols. One promising solution is the multilevel free energy simulation in which the estimate of ΔF at an inexpensive low level of theory is combined with the correction toward a higher level of theory. The poor configurational overlap generally expected between the two levels of theory, however, presents a major challenge. We overcome this challenge by using a deep neural network model and enhanced sampling simulations. An adversarial autoencoder is used to identify a low-dimensional (latent) space that compactly represents the degrees of freedom that encode the distinct distributions at the two levels of theory. Enhanced sampling in this latent space is then used to drive the sampling of configurations that predominantly contribute to the free energy correction. Results for both gas phase and condensed phase systems demonstrate that this data-driven approach offers high accuracy and efficiency with great potential for scalability to complex systems.
Collapse
Affiliation(s)
- Yuchen Yuan
- Department of Chemistry, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States
| | - Qiang Cui
- Department of Chemistry, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States
- Department of Physics, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, Massachusetts 02215, United States
| |
Collapse
|
13
|
Snyder R, Kim B, Pan X, Shao Y, Pu J. Bridging semiempirical and ab initio QM/MM potentials by Gaussian process regression and its sparse variants for free energy simulation. J Chem Phys 2023; 159:054107. [PMID: 37530109 PMCID: PMC10400118 DOI: 10.1063/5.0156327] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 07/10/2023] [Indexed: 08/03/2023] Open
Abstract
Free energy simulations that employ combined quantum mechanical and molecular mechanical (QM/MM) potentials at ab initio QM (AI) levels are computationally highly demanding. Here, we present a machine-learning-facilitated approach for obtaining AI/MM-quality free energy profiles at the cost of efficient semiempirical QM/MM (SE/MM) methods. Specifically, we use Gaussian process regression (GPR) to learn the potential energy corrections needed for an SE/MM level to match an AI/MM target along the minimum free energy path (MFEP). Force modification using gradients of the GPR potential allows us to improve configurational sampling and update the MFEP. To adaptively train our model, we further employ the sparse variational GP (SVGP) and streaming sparse GPR (SSGPR) methods, which efficiently incorporate previous sample information without significantly increasing the training data size. We applied the QM-(SS)GPR/MM method to the solution-phase SN2 Menshutkin reaction, NH3+CH3Cl→CH3NH3++Cl-, using AM1/MM and B3LYP/6-31+G(d,p)/MM as the base and target levels, respectively. For 4000 configurations sampled along the MFEP, the iteratively optimized AM1-SSGPR-4/MM model reduces the energy error in AM1/MM from 18.2 to 4.4 kcal/mol. Although not explicitly fitting forces, our method also reduces the key internal force errors from 25.5 to 11.1 kcal/mol/Å and from 30.2 to 10.3 kcal/mol/Å for the N-C and C-Cl bonds, respectively. Compared to the uncorrected simulations, the AM1-SSGPR-4/MM method lowers the predicted free energy barrier from 28.7 to 11.7 kcal/mol and decreases the reaction free energy from -12.4 to -41.9 kcal/mol, bringing these results into closer agreement with their AI/MM and experimental benchmarks.
Collapse
Affiliation(s)
- Ryan Snyder
- Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis, 402 N Blackford St., Indianapolis, Indiana 46202, USA
| | - Bryant Kim
- Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis, 402 N Blackford St., Indianapolis, Indiana 46202, USA
| | - Xiaoliang Pan
- Department of Chemistry and Biochemistry, University of Oklahoma, 101 Stephenson Pkwy, Norman, Oklahoma 73019, USA
| | - Yihan Shao
- Department of Chemistry and Biochemistry, University of Oklahoma, 101 Stephenson Pkwy, Norman, Oklahoma 73019, USA
| | - Jingzhi Pu
- Department of Chemistry and Chemical Biology, Indiana University-Purdue University Indianapolis, 402 N Blackford St., Indianapolis, Indiana 46202, USA
| |
Collapse
|
14
|
Wang JN, Xue Y, Li P, Pan X, Wang M, Shao Y, Mo Y, Mei Y. Perspective: Reference-Potential Methods for the Study of Thermodynamic Properties in Chemical Processes: Theory, Applications, and Pitfalls. J Phys Chem Lett 2023; 14:4866-4875. [PMID: 37196031 PMCID: PMC10840091 DOI: 10.1021/acs.jpclett.3c00671] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
In silico investigations of enzymatic reactions and chemical reactions in condensed phases often suffer from formidable computational costs due to a large number of degrees of freedom and enormous important volume in phase space. Usually, accuracy must be compromised to trade for efficiency by lowering the reliability of the Hamiltonians employed or reducing the sampling time. Reference-potential methods (RPMs) offer an alternative approach to reaching high accuracy of simulation without much loss of efficiency. In this Perspective, we summarize the idea of RPMs and showcase some recent applications. Most importantly, the pitfalls of these methods are also discussed, and remedies to these pitfalls are presented.
Collapse
Affiliation(s)
- Jia-Ning Wang
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China
| | - Yuanfei Xue
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China
| | - Pengfei Li
- Single Particle, LLC, San Diego 92127, California, United States
| | - Xiaoliang Pan
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman 73019, Oklahoma, United States
| | - Meiting Wang
- School of Medical Engineering, Xinxiang Medical University, Xinxiang 453003, Henan, China
| | - Yihan Shao
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman 73019, Oklahoma, United States
| | - Yan Mo
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan 030006, Shanxi, China
| | - Ye Mei
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan 030006, Shanxi, China
| |
Collapse
|
15
|
Plé T, Mauger N, Adjoua O, Inizan TJ, Lagardère L, Huppert S, Piquemal JP. Routine Molecular Dynamics Simulations Including Nuclear Quantum Effects: From Force Fields to Machine Learning Potentials. J Chem Theory Comput 2023; 19:1432-1445. [PMID: 36856658 DOI: 10.1021/acs.jctc.2c01233] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2023]
Abstract
We report the implementation of a multi-CPU and multi-GPU massively parallel platform dedicated to the explicit inclusion of nuclear quantum effects (NQEs) in the Tinker-HP molecular dynamics (MD) package. The platform, denoted Quantum-HP, exploits two simulation strategies: the Ring-Polymer Molecular Dynamics (RPMD) that provides exact structural properties at the cost of a MD simulation in an extended space of multiple replicas and the adaptive Quantum Thermal Bath (adQTB) that imposes the quantum distribution of energy on a classical system via a generalized Langevin thermostat and provides computationally affordable and accurate (though approximate) NQEs. We discuss some implementation details, efficient numerical schemes, and parallelization strategies and quickly review the GPU acceleration of our code. Our implementation allows an efficient inclusion of NQEs in MD simulations for very large systems, as demonstrated by scaling tests on water boxes with more than 200,000 atoms (simulated using the AMOEBA polarizable force field). We test the compatibility of the approach with Tinker-HP's recently introduced Deep-HP machine learning potentials module by computing water properties using the DeePMD potential with adQTB thermostatting. Finally, we show that the platform is also compatible with the alchemical free energy estimation capabilities of Tinker-HP and fast enough to perform simulations. Therefore, we study how NQEs affect the hydration free energy of small molecules solvated with the recently developed Q-AMOEBA water force field. Overall, the Quantum-HP platform allows users to perform routine quantum MD simulations of large condensed-phase systems and will help to shed new light on the quantum nature of important interactions in biological matter.
Collapse
Affiliation(s)
- Thomas Plé
- Sorbonne Université, LCT, UMR 7616 CNRS, F-75005 Paris, France
| | - Nastasia Mauger
- Sorbonne Université, LCT, UMR 7616 CNRS, F-75005 Paris, France
| | - Olivier Adjoua
- Sorbonne Université, LCT, UMR 7616 CNRS, F-75005 Paris, France
| | | | - Louis Lagardère
- Sorbonne Université, LCT, UMR 7616 CNRS, F-75005 Paris, France
| | - Simon Huppert
- Institut des Nanosciences de Paris (INSP), CNRS UMR 7588, and Sorbonne Université, F-75005 Paris, France
| | - Jean-Philip Piquemal
- Sorbonne Université, LCT, UMR 7616 CNRS, F-75005 Paris, France.,Institut Universitaire de France, 75005 Paris, France.,Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| |
Collapse
|
16
|
Lee TS, Tsai HC, Ganguly A, York DM. ACES: Optimized Alchemically Enhanced Sampling. J Chem Theory Comput 2023; 19:10.1021/acs.jctc.2c00697. [PMID: 36630672 PMCID: PMC10333454 DOI: 10.1021/acs.jctc.2c00697] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
We present an alchemical enhanced sampling (ACES) method implemented in the GPU-accelerated AMBER free energy MD engine. The methods hinges on the creation of an "enhanced sampling state" by reducing or eliminating selected potential energy terms and interactions that lead to kinetic traps and conformational barriers while maintaining those terms that curtail the need to otherwise sample large volumes of phase space. For example, the enhanced sampling state might involve transforming regions of a ligand and/or protein side chain into a noninteracting "dummy state" with internal electrostatics and torsion angle terms turned off. The enhanced sampling state is connected to a real-state end point through a Hamiltonian replica exchange (HREMD) framework that is facilitated by newly developed alchemical transformation pathways and smoothstep softcore potentials. This creates a counterdiffusion of real and enhanced-sampling states along the HREMD network. The effect of a differential response of the environment to the real and enhanced-sampling states is minimized by leveraging the dual topology framework in AMBER to construct a counterbalancing HREMD network in the opposite alchemical direction with the same (or similar) real and enhanced sampling states at inverted end points. The method has been demonstrated in a series of test cases of increasing complexity where traditional MD, and in several cases alternative REST2-like enhanced sampling methods, are shown to fail. The hydration free energy for acetic acid was shown to be independent of the starting conformation, and the values for four additional edge case molecules from the FreeSolv database were shown to have a significantly closer agreement with experiment using ACES. The method was further able to handle different rotamer states in a Cdk2 ligand identified as fractionally occupied in crystal structures. Finally, ACES was applied to T4-lysozyme and demonstrated that the side chain distribution of V111χ1 could be reliably reproduced for the apo state, bound to p-xylene, and in p-xylene→ benzene transformations. In these cases, the ACES method is shown to be highly robust and superior to a REST2-like enhanced sampling implementation alone.
Collapse
Affiliation(s)
- Tai-Sung Lee
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Hsu-Chun Tsai
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Abir Ganguly
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
17
|
Tsai HC, Lee TS, Ganguly A, Giese TJ, Ebert MCCJC, Labute P, Merz KM, York DM. AMBER Free Energy Tools: A New Framework for the Design of Optimized Alchemical Transformation Pathways. J Chem Theory Comput 2023; 19:10.1021/acs.jctc.2c00725. [PMID: 36622640 PMCID: PMC10329732 DOI: 10.1021/acs.jctc.2c00725] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
We develop a framework for the design of optimized alchemical transformation pathways in free energy simulations using nonlinear mixing and a new functional form for so-called "softcore" potentials. We describe the implementation and testing of this framework in the GPU-accelerated AMBER software suite. The new optimized alchemical transformation pathways integrate a number of important features, including (1) the use of smoothstep functions to stabilize behavior near the transformation end points, (2) consistent power scaling of Coulomb and Lennard-Jones (LJ) interactions with unitless control parameters to maintain balance of electrostatic attractions and exchange repulsions, (3) pairwise form based on the LJ contact radius for the effective interaction distance with separation-shifted scaling, and (4) rigorous smoothing of the potential at the nonbonded cutoff boundary. The new softcore potential form is combined with smoothly transforming nonlinear λ weights for mixing specific potential energy terms, along with flexible λ-scheduling features, to enable robust and stable alchemical transformation pathways. The resulting pathways are demonstrated and tested, and shown to be superior to the traditional methods in terms of numerical stability and minimal variance of the free energy estimates for all cases considered. The framework presented here can be used to design new alchemical enhanced sampling methods, and leveraged in robust free energy workflows for large ligand data sets.
Collapse
Affiliation(s)
- Hsu-Chun Tsai
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Tai-Sung Lee
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Abir Ganguly
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Timothy J. Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Maximilian CCJC Ebert
- Congruence Therapeutics, 7171 Rue Frederick Banting #117, Saint-Laurent, Quebec, Canada H4S 1Z9
| | - Paul Labute
- Chemical Computing Group ULC, 910-1010 Sherbrooke West, Montreal, Quebec, Canada H3A 2R7
| | - Kenneth M. Merz
- Department of Chemistry and Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
18
|
Ranaudo A, Greco C, Moro G, Zucchi A, Mattiello S, Beverina L, Cosentino U. Partition of the Reactive Species of the Suzuki-Miyaura Reaction between Aqueous and Micellar Environments. J Phys Chem B 2022; 126:9408-9416. [PMID: 36330777 PMCID: PMC9677424 DOI: 10.1021/acs.jpcb.2c04591] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The Suzuki-Miyaura reaction between the aryl halide (1) and the phenyl boronic acid (2), in the presence of the palladium(0) complex (3) as catalyst, gives the cross-coupling product (4) in quantitative yield when performed in basic aqueous solution of the nonionic surfactant Kolliphor-EL (K-EL). The partition between the aqueous and micellar environments of the species of this reaction has been investigated by means of Molecular Dynamics (MD) simulations. Starting from the K-EL molecules dispersed in water, a micelle model has been generated by MD simulations, adopting the 2016H66 force field. Reagent and product species have been described with the same force field, once the reliability of this force field has been tested comparing the n-octanol/water partition free energies calculated from the MD and Free Energy Perturbation (FEP) method with those obtained from the quantum-mechanical SMD method. The potential of mean force for the transfer process between water and the micellar phase of the different species has been calculated by the MD simulations and the Umbrella Sampling (US) method. The overall picture that emerges from these results confirms that the molecular species involved in this reaction prefers the micellar environment and concentrates in different but close zones of the micelle. This supports the experimental evidence that the use of suitable surfactant agents promotes reactivity, allowing micelles to behave as nanoreactors in which reactive species are solubilized and enhance their local concentration.
Collapse
Affiliation(s)
- Anna Ranaudo
- Department
of Earth and Environmental Sciences, University
of Milano-Bicocca, Piazza della Scienza 1, Milan 20126, Italy,
| | - Claudio Greco
- Department
of Earth and Environmental Sciences, University
of Milano-Bicocca, Piazza della Scienza 1, Milan 20126, Italy
| | - Giorgio Moro
- Department
of Biotechnology and Biosciences, University
of Milano-Bicocca, Piazza della Scienza 2, Milan 20126, Italy
| | - Anita Zucchi
- Department
of Materials Science, University of Milano-Bicocca, Via Roberto Cozzi 55, Milan 20125, Italy
| | - Sara Mattiello
- Department
of Materials Science, University of Milano-Bicocca, Via Roberto Cozzi 55, Milan 20125, Italy
| | - Luca Beverina
- Department
of Materials Science, University of Milano-Bicocca, Via Roberto Cozzi 55, Milan 20125, Italy
| | - Ugo Cosentino
- Department
of Earth and Environmental Sciences, University
of Milano-Bicocca, Piazza della Scienza 1, Milan 20126, Italy
| |
Collapse
|
19
|
Hasyim MR, Batton CH, Mandadapu KK. Supervised learning and the finite-temperature string method for computing committor functions and reaction rates. J Chem Phys 2022; 157:184111. [DOI: 10.1063/5.0102423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
A central object in the computational studies of rare events is the committor function. Though costly to compute, the committor function encodes complete mechanistic information of the processes involving rare events, including reaction rates and transition-state ensembles. Under the framework of transition path theory, Rotskoff et al. [ Proceedings of the 2nd Mathematical and Scientific Machine Learning Conference, Proceedings of Machine Learning Research (PLMR, 2022), Vol. 145, pp. 757–780] proposes an algorithm where a feedback loop couples a neural network that models the committor function with importance sampling, mainly umbrella sampling, which collects data needed for adaptive training. In this work, we show additional modifications are needed to improve the accuracy of the algorithm. The first modification adds elements of supervised learning, which allows the neural network to improve its prediction by fitting to sample-mean estimates of committor values obtained from short molecular dynamics trajectories. The second modification replaces the committor-based umbrella sampling with the finite-temperature string (FTS) method, which enables homogeneous sampling in regions where transition pathways are located. We test our modifications on low-dimensional systems with non-convex potential energy where reference solutions can be found via analytical or finite element methods, and show how combining supervised learning and the FTS method yields accurate computation of committor functions and reaction rates. We also provide an error analysis for algorithms that use the FTS method, using which reaction rates can be accurately estimated during training with a small number of samples. The methods are then applied to a molecular system in which no reference solution is known, where accurate computations of committor functions and reaction rates can still be obtained.
Collapse
Affiliation(s)
- Muhammad R. Hasyim
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, Berkeley, California 94720, USA
| | - Clay H. Batton
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, Berkeley, California 94720, USA
| | - Kranthi K. Mandadapu
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, Berkeley, California 94720, USA
- Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| |
Collapse
|
20
|
Correa GB, Maciel JCSL, Tavares FW, Abreu CRA. A New Formulation for the Concerted Alchemical Calculation of van der Waals and Coulomb Components of Solvation Free Energies. J Chem Theory Comput 2022; 18:5876-5889. [PMID: 36189930 DOI: 10.1021/acs.jctc.2c00563] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Alchemical free energy calculations via molecular dynamics have been widely used to obtain thermodynamic properties related to protein-ligand binding and solute-solvent interactions. Although soft-core modeling is the most common approach, the linear basis function (LBF) methodology [Naden, L. N.; et al. J. Chem. Theory Comput.2014, 10 (3), 1128; 2015, 11 (6), 2536] has emerged as a suitable alternative. It overcomes the end-point singularity of the scaling method while maintaining essential advantages such as ease of implementation and high flexibility for postprocessing analysis. In the present work, we propose a simple LBF variant and formulate an efficient protocol for evaluating van der Waals and Coulomb components of an alchemical transformation in tandem, in contrast to the prevalent sequential evaluation mode. To validate our proposal, which results from a careful optimization study, we performed solvation free energy calculations and obtained octanol-water partition coefficients of small organic molecules. Comparisons with results obtained via the sequential mode using either another LBF approach or the soft-core model attest to the effectiveness and correctness of our method. In addition, we show that a reaction field model with an infinite dielectric constant can provide very accurate hydration free energies when used instead of a lattice-sum method to model solute-solvent electrostatics.
Collapse
Affiliation(s)
- Gabriela B Correa
- Chemical Engineering Program, Instituto Alberto Luiz Coimbra de Pós-Graduação e Pesquisa em Engenharia, Universidade Federal do Rio de Janeiro, 21941-909Rio de Janeiro, RJ, Brazil
| | - Jéssica C S L Maciel
- Chemical Engineering Department, Escola de Química, Universidade Federal do Rio de Janeiro, 21941-909Rio de Janeiro, RJ, Brazil
| | - Frederico W Tavares
- Chemical Engineering Program, Instituto Alberto Luiz Coimbra de Pós-Graduação e Pesquisa em Engenharia, Universidade Federal do Rio de Janeiro, 21941-909Rio de Janeiro, RJ, Brazil.,Chemical Engineering Department, Escola de Química, Universidade Federal do Rio de Janeiro, 21941-909Rio de Janeiro, RJ, Brazil
| | - Charlles R A Abreu
- Chemical Engineering Department, Escola de Química, Universidade Federal do Rio de Janeiro, 21941-909Rio de Janeiro, RJ, Brazil
| |
Collapse
|
21
|
Hudson PS, Aviat F, Meana-Pañeda R, Warrensford L, Pollard BC, Prasad S, Jones MR, Woodcock HL, Brooks BR. Obtaining QM/MM binding free energies in the SAMPL8 drugs of abuse challenge: indirect approaches. J Comput Aided Mol Des 2022; 36:263-277. [PMID: 35597880 PMCID: PMC9148874 DOI: 10.1007/s10822-022-00443-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 02/17/2022] [Indexed: 11/28/2022]
Abstract
Accurately predicting free energy differences is essential in realizing the full potential of rational drug design. Unfortunately, high levels of accuracy often require computationally expensive QM/MM Hamiltonians. Fortuitously, the cost of employing QM/MM approaches in rigorous free energy simulation can be reduced through the use of the so-called “indirect” approach to QM/MM free energies, in which the need for QM/MM simulations is avoided via a QM/MM “correction” at the classical endpoints of interest. Herein, we focus on the computation of QM/MM binding free energies in the context of the SAMPL8 Drugs of Abuse host–guest challenge. Of the 5 QM/MM correction coupled with force-matching submissions, PM6-D3H4/MM ranked submission proved the best overall QM/MM entry, with an RMSE from experimental results of 2.43 kcal/mol (best in ranked submissions), a Pearson’s correlation of 0.78 (second-best in ranked submissions), and a Kendall \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\tau$$\end{document}τ correlation of 0.52 (best in ranked submissions).
Collapse
Affiliation(s)
- Phillip S Hudson
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20852, USA.
| | - Félix Aviat
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20852, USA
| | - Rubén Meana-Pañeda
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20852, USA
| | - Luke Warrensford
- Department of Chemistry, University of South Florida, Tampa, FL, 33620, USA
| | - Benjamin C Pollard
- Department of Chemistry, University of South Florida, Tampa, FL, 33620, USA
| | - Samarjeet Prasad
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20852, USA
| | - Michael R Jones
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20852, USA
| | - H Lee Woodcock
- Department of Chemistry, University of South Florida, Tampa, FL, 33620, USA
| | - Bernard R Brooks
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20852, USA
| |
Collapse
|
22
|
Reinhardt M, Grubmüller H. Small-sample limit of the Bennett acceptance ratio method and the variationally derived intermediates. Phys Rev E 2021; 104:054133. [PMID: 34942806 DOI: 10.1103/physreve.104.054133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 10/28/2021] [Indexed: 11/07/2022]
Abstract
Free energy calculations based on atomistic Hamiltonians provide microscopic insight into the thermodynamic driving forces of biophysical or condensed matter systems. Many approaches use intermediate Hamiltonians interpolating between the two states for which the free energy difference is calculated. The Bennett acceptance ratio (BAR) and variationally derived intermediates (VI) methods are optimal estimator and intermediate states in that the mean-squared error of free energy calculations based on independent sampling is minimized. However, BAR and VI have been derived based on several approximations that do not hold for very few sample points. Analyzing one-dimensional test systems, we show that in such cases BAR and VI are suboptimal and that established uncertainty estimates are inaccurate. Whereas for VI to become optimal, less than seven samples per state suffice in all cases; for BAR the required number increases unboundedly with decreasing configuration space densities overlap of the end states. We show that for BAR, the required number of samples is related to the overlap through an inverse power law. Because this relation seems to hold universally and almost independent of other system properties, these findings can guide the proper choice of estimators for free energy calculations.
Collapse
Affiliation(s)
- Martin Reinhardt
- Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077 Göttingen, Germany
| | - Helmut Grubmüller
- Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077 Göttingen, Germany
| |
Collapse
|
23
|
Morado J, Mortenson PN, Nissink JWM, Verdonk ML, Ward RA, Essex JW, Skylaris CK. Generation of Quantum Configurational Ensembles Using Approximate Potentials. J Chem Theory Comput 2021; 17:7021-7042. [PMID: 34644088 DOI: 10.1021/acs.jctc.1c00532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Conformational analysis is of paramount importance in drug design: it is crucial to determine pharmacological properties, understand molecular recognition processes, and characterize the conformations of ligands when unbound. Molecular Mechanics (MM) simulation methods, such as Monte Carlo (MC) and molecular dynamics (MD), are usually employed to generate ensembles of structures due to their ability to extensively sample the conformational space of molecules. The accuracy of these MM-based schemes strongly depends on the functional form of the force field (FF) and its parametrization, components that often hinder their performance. High-level methods, such as ab initio MD, provide reliable structural information but are still too computationally expensive to allow for extensive sampling. Therefore, to overcome these limitations, we present a multilevel MC method that is capable of generating quantum configurational ensembles while keeping the computational cost at a minimum. We show that FF reparametrization is an efficient route to generate FFs that reproduce QM results more closely, which, in turn, can be used as low-cost models to achieve the gold standard QM accuracy. We demonstrate that the MC acceptance rate is strongly correlated with various phase space overlap measurements and that it constitutes a robust metric to evaluate the similarity between the MM and QM levels of theory. As a more advanced application, we present a self-parametrizing version of the algorithm, which combines sampling and FF parametrization in one scheme, and apply the methodology to generate the QM/MM distribution of a ligand in aqueous solution.
Collapse
Affiliation(s)
- João Morado
- School of Chemistry, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom
| | - Paul N Mortenson
- Astex Pharmaceuticals, 436 Cambridge Science Park, Milton Road, Cambridge CB4 0QA, United Kingdom
| | - J Willem M Nissink
- Medicinal Chemistry, Oncology R&D, AstraZeneca, Cambridge CB4 0WG, United Kingdom
| | - Marcel L Verdonk
- Astex Pharmaceuticals, 436 Cambridge Science Park, Milton Road, Cambridge CB4 0QA, United Kingdom
| | - Richard A Ward
- Medicinal Chemistry, Oncology R&D, AstraZeneca, Cambridge CB4 0WG, United Kingdom
| | - Jonathan W Essex
- School of Chemistry, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom
| | - Chris-Kriton Skylaris
- School of Chemistry, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom
| |
Collapse
|
24
|
Escobedo FA. On the calculation of free energies over Hamiltonian and order parameters via perturbation and thermodynamic integration. J Chem Phys 2021; 155:114112. [PMID: 34551542 DOI: 10.1063/5.0061541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
In this work, complementary formulas are presented to compute free-energy differences via perturbation (FEP) methods and thermodynamic integration (TI). These formulas are derived by selecting only the most statistically significant data from the information extractable from the simulated points involved. On the one hand, commonly used FEP techniques based on overlap sampling leverage the full information contained in the overlapping macrostate probability distributions. On the other hand, conventional TI methods only use information on the first moments of those distributions, as embodied by the first derivatives of the free energy. Since the accuracy of simulation data degrades considerably for high-order moments (for FEP) or free-energy derivatives (for TI), it is proposed to consider, consistently for both methods, data up to second-order moments/derivatives. This provides a compromise between the limiting strategies embodied by common FEP and TI and leads to simple, optimized expressions to evaluate free-energy differences. The proposed formulas are validated with an analytically solvable harmonic Hamiltonian (for assessing systematic errors), an atomistic system (for computing the potential of mean force with coordinate-dependent order parameters), and a binary-component coarse-grained model (for tracing a solid-liquid phase diagram in an ensemble sampled through alchemical transformations). It is shown that the proposed FEP and TI formulas are straightforward to implement, perform similarly well, and allow robust estimation of free-energy differences even when the spacing of successive points does not guarantee them to have proper overlapping in phase space.
Collapse
Affiliation(s)
- Fernando A Escobedo
- Robert Frederick Smith School of Chemistry and Biomolecular Engineering, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
25
|
Jorge M, Milne AW, Barrera MC, Gomes JR. New Force-Field for Organosilicon Molecules in the Liquid Phase. ACS PHYSICAL CHEMISTRY AU 2021; 1:54-69. [PMID: 34939073 PMCID: PMC8679648 DOI: 10.1021/acsphyschemau.1c00014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Indexed: 11/29/2022]
Abstract
In this paper, we present a new molecular model that can accurately predict thermodynamic liquid state and phase-change properties for organosilicon molecules including several functional groups (alkylsilane, alkoxysilane, siloxane, and silanol). These molecules are of great importance in geological processes, biological systems, and material science, yet no force field currently exists that is widely applicable to organosilicates. The model is parametrized according to the recent Polarization-Consistent Approach (PolCA), which allows for polarization effects to be incorporated into a nonpolarizable model through post facto correction terms and is therefore consistent with previous parametrizations of the PolCA force field. Alkyl groups are described by the United-Atom approach, bond and angle parameters were taken from previous literature studies, dihedral parameters were fitted to new quantum chemical energy profiles, point charges were calculated from quantum chemical optimizations in a continuum solvent, and Lennard-Jones dispersion/repulsion parameters were fitted to match the density and enthalpy of vaporization of a small number of selected compounds. Extensive validation efforts were carried out, after careful collection and curation of experimental data for organosilicates. Overall, the model performed quite well for the density, enthalpy of vaporization, dielectric constant, and self-diffusion coefficient, but it slightly overestimated the magnitude of self-solvation free energies. The modular and transferable nature of the PolCA force field allows for further extensions to other types of silicon-containing compounds.
Collapse
Affiliation(s)
- Miguel Jorge
- Department
of Chemical and Process Engineering, University
of Strathclyde, 75 Montrose Street, Glasgow G1 1XJ, United Kingdom,
| | - Andrew W. Milne
- Department
of Chemical and Process Engineering, University
of Strathclyde, 75 Montrose Street, Glasgow G1 1XJ, United Kingdom
| | - Maria Cecilia Barrera
- Department
of Chemical and Process Engineering, University
of Strathclyde, 75 Montrose Street, Glasgow G1 1XJ, United Kingdom
| | - José R.
B. Gomes
- CICECO
− Aveiro Institute of Materials, Department of Chemistry, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| |
Collapse
|
26
|
Ekberg V, Ryde U. On the Use of Interaction Entropy and Related Methods to Estimate Binding Entropies. J Chem Theory Comput 2021; 17:5379-5391. [PMID: 34254810 PMCID: PMC8389774 DOI: 10.1021/acs.jctc.1c00374] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Indexed: 11/29/2022]
Abstract
Molecular mechanics combined with Poisson-Boltzmann or generalized Born and solvent-accessible area solvation energies (MM/PBSA and MM/GBSA) are popular methods to estimate the free energy for the binding of small molecules to biomacromolecules. However, the estimation of the entropy has been problematic and time-consuming. Traditionally, normal-mode analysis has been used to estimate the entropy, but more recently, alternative approaches have been suggested. In particular, it has been suggested that exponential averaging of the electrostatic and Lennard-Jones interaction energies may provide much faster and more accurate entropies, the interaction entropy (IE) approach. In this study, we show that this exponential averaging is extremely poorly conditioned. Using stochastic simulations, assuming that the interaction energies follow a Gaussian distribution, we show that if the standard deviation of the interaction energies (σIE) is larger than 15 kJ/mol, it becomes practically impossible to converge the interaction entropies (more than 10 million energies are needed, and the number increases exponentially). A cumulant approximation to the second order of the exponential average shows a better convergence, but for σIE > 25 kJ/mol, it gives entropies that are unrealistically large. Moreover, in practical applications, both methods show a steady increase in the entropy with the number of energies considered.
Collapse
Affiliation(s)
- Vilhelm Ekberg
- Department of Theoretical Chemistry,
Chemical Centre, Lund University, P.O. Box 124, SE-221 00 Lund, Sweden
| | - Ulf Ryde
- Department of Theoretical Chemistry,
Chemical Centre, Lund University, P.O. Box 124, SE-221 00 Lund, Sweden
| |
Collapse
|
27
|
Gotzias A, Tocci E, Sapalidis A. On the Consistency of the Exfoliation Free Energy of Graphenes by Molecular Simulations. Int J Mol Sci 2021; 22:ijms22158291. [PMID: 34361058 PMCID: PMC8347420 DOI: 10.3390/ijms22158291] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Revised: 07/27/2021] [Accepted: 07/28/2021] [Indexed: 01/04/2023] Open
Abstract
Monolayer graphene is now produced at significant yields, by liquid phase exfoliation of graphites in solvents. This has increased the interest in molecular simulation studies to give new insights in the field. We use decoupling simulations to compute the exfoliation free energy of graphenes in a liquid environment. Starting from a bilayer graphene configuration, we decouple the Van der Waals interactions of a graphene monolayer in the presence of saline water. Then, we introduce the monolayer back into water by coupling its interactions with water molecules and ions. A different approach to compute the graphene exfoliation free energy is to use umbrella sampling. We apply umbrella sampling after pulling the graphene monolayer on the shear direction up to a distance from a bilayer. We show that the decoupling and umbrella methods give highly consistent free energy results for three bilayer graphene samples with different size. This strongly suggests that the systems in both methods remain closely in equilibrium as we move between the states before and after the exfoliation. Therefore, the amount of nonequilibrium work needed to peel the two layers apart is minimized efficiently.
Collapse
Affiliation(s)
- Anastasios Gotzias
- National Centre for Scientific Research “Demokritos”, Institute of Nanoscience and Nanotechnology INN, 15310 Athens, Greece;
- Correspondence: ; Tel.: +30-210-6503408
| | - Elena Tocci
- Institute on Membrane Technology ITM–CNR, National Research Council, 87036 Rende, Italy;
| | - Andreas Sapalidis
- National Centre for Scientific Research “Demokritos”, Institute of Nanoscience and Nanotechnology INN, 15310 Athens, Greece;
| |
Collapse
|
28
|
Baumann HM, Gapsys V, de Groot BL, Mobley DL. Challenges Encountered Applying Equilibrium and Nonequilibrium Binding Free Energy Calculations. J Phys Chem B 2021; 125:4241-4261. [PMID: 33905257 PMCID: PMC8240641 DOI: 10.1021/acs.jpcb.0c10263] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Binding free energy calculations have become increasingly valuable to drive decision making in drug discovery projects. However, among other issues, inadequate sampling can reduce accuracy, limiting the value of the technique. In this paper, we apply absolute binding free energy calculations to ligands binding to T4 lysozyme L99A and HSP90 using equilibrium and nonequilibrium approaches. We highlight sampling problems encountered in these systems, such as slow side chain rearrangements and slow changes of water placement upon ligand binding. These same types of challenges are also likely to show up in other protein-ligand systems, and we propose some strategies to diagnose and test for such problems in alchemical free energy calculations. We also explore similarities and differences in how the equilibrium and the nonequilibrium approaches handle these problems. Our results show the large amount of work still to be done to make free energy calculations robust and reliable and provide insight for future research in this area.
Collapse
Affiliation(s)
- Hannah M Baumann
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92617, United States
| | - Vytautas Gapsys
- Computational Biomolecular Dynamics Group, Department of Theoretical and Computational Biophysics, Max Planck Institute for Biophysical Chemistry, D-37077 Göttingen, Germany
| | - Bert L de Groot
- Computational Biomolecular Dynamics Group, Department of Theoretical and Computational Biophysics, Max Planck Institute for Biophysical Chemistry, D-37077 Göttingen, Germany
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92617, United States
- Department of Chemistry, University of California, Irvine, California 92617, United States
| |
Collapse
|
29
|
Schultz AJ, Kofke DA. Identifying and estimating bias in overlap-sampling free-energy calculations. MOLECULAR SIMULATION 2021. [DOI: 10.1080/08927022.2020.1758695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
- Andrew J. Schultz
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York, Buffalo, NY, USA
| | - David A. Kofke
- Department of Chemical and Biological Engineering, University at Buffalo, The State University of New York, Buffalo, NY, USA
| |
Collapse
|
30
|
Jin J, Han Y, Pak AJ, Voth GA. A new one-site coarse-grained model for water: Bottom-up many-body projected water (BUMPer). I. General theory and model. J Chem Phys 2021; 154:044104. [PMID: 33514116 PMCID: PMC7826168 DOI: 10.1063/5.0026651] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Accepted: 12/14/2020] [Indexed: 12/26/2022] Open
Abstract
Water is undoubtedly one of the most important molecules for a variety of chemical and physical systems, and constructing precise yet effective coarse-grained (CG) water models has been a high priority for computer simulations. To recapitulate important local correlations in the CG water model, explicit higher-order interactions are often included. However, the advantages of coarse-graining may then be offset by the larger computational cost in the model parameterization and simulation execution. To leverage both the computational efficiency of the CG simulation and the inclusion of higher-order interactions, we propose a new statistical mechanical theory that effectively projects many-body interactions onto pairwise basis sets. The many-body projection theory presented in this work shares similar physics from liquid state theory, providing an efficient approach to account for higher-order interactions within the reduced model. We apply this theory to project the widely used Stillinger-Weber three-body interaction onto a pairwise (two-body) interaction for water. Based on the projected interaction with the correct long-range behavior, we denote the new CG water model as the Bottom-Up Many-Body Projected Water (BUMPer) model, where the resultant CG interaction corresponds to a prior model, the iteratively force-matched model. Unlike other pairwise CG models, BUMPer provides high-fidelity recapitulation of pair correlation functions and three-body distributions, as well as N-body correlation functions. BUMPer extensively improves upon the existing bottom-up CG water models by extending the accuracy and applicability of such models while maintaining a reduced computational cost.
Collapse
Affiliation(s)
- Jaehyeok Jin
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Yining Han
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Alexander J. Pak
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
| | - Gregory A. Voth
- Department of Chemistry, Chicago Center for Theoretical Chemistry, Institute for Biophysical Dynamics, and James Franck Institute, The University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
31
|
Lee TS, Allen BK, Giese TJ, Guo Z, Li P, Lin C, McGee TD, Pearlman DA, Radak BK, Tao Y, Tsai HC, Xu H, Sherman W, York DM. Alchemical Binding Free Energy Calculations in AMBER20: Advances and Best Practices for Drug Discovery. J Chem Inf Model 2020; 60:5595-5623. [PMID: 32936637 PMCID: PMC7686026 DOI: 10.1021/acs.jcim.0c00613] [Citation(s) in RCA: 217] [Impact Index Per Article: 43.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Predicting protein-ligand binding affinities and the associated thermodynamics of biomolecular recognition is a primary objective of structure-based drug design. Alchemical free energy simulations offer a highly accurate and computationally efficient route to achieving this goal. While the AMBER molecular dynamics package has successfully been used for alchemical free energy simulations in academic research groups for decades, widespread impact in industrial drug discovery settings has been minimal because of the previous limitations within the AMBER alchemical code, coupled with challenges in system setup and postprocessing workflows. Through a close academia-industry collaboration we have addressed many of the previous limitations with an aim to improve accuracy, efficiency, and robustness of alchemical binding free energy simulations in industrial drug discovery applications. Here, we highlight some of the recent advances in AMBER20 with a focus on alchemical binding free energy (BFE) calculations, which are less computationally intensive than alternative binding free energy methods where full binding/unbinding paths are explored. In addition to scientific and technical advances in AMBER20, we also describe the essential practical aspects associated with running relative alchemical BFE calculations, along with recommendations for best practices, highlighting the importance not only of the alchemical simulation code but also the auxiliary functionalities and expertise required to obtain accurate and reliable results. This work is intended to provide a contemporary overview of the scientific, technical, and practical issues associated with running relative BFE simulations in AMBER20, with a focus on real-world drug discovery applications.
Collapse
Affiliation(s)
- Tai-Sung Lee
- Rutgers, the State University of New Jersey, Laboratory for Biomolecular Simulation Research, and Department of Chemistry and Chemical Biology, United States
| | - Bryce K. Allen
- Silicon Therapeutics, Boston, Massachusetts 02210, United States
| | - Timothy J. Giese
- Rutgers, the State University of New Jersey, Laboratory for Biomolecular Simulation Research, and Department of Chemistry and Chemical Biology, United States
| | - Zhenyu Guo
- Silicon Therapeutics, Boston, Massachusetts 02210, United States
| | - Pengfei Li
- Silicon Therapeutics, Boston, Massachusetts 02210, United States
| | - Charles Lin
- Silicon Therapeutics, Boston, Massachusetts 02210, United States
| | - T. Dwight McGee
- Silicon Therapeutics, Boston, Massachusetts 02210, United States
| | - David A. Pearlman
- QSimulate Incorporated, Cambridge, Massachusetts 02139, United States
| | - Brian K. Radak
- Silicon Therapeutics, Boston, Massachusetts 02210, United States
| | - Yujun Tao
- Rutgers, the State University of New Jersey, Laboratory for Biomolecular Simulation Research, and Department of Chemistry and Chemical Biology, United States
| | - Hsu-Chun Tsai
- Rutgers, the State University of New Jersey, Laboratory for Biomolecular Simulation Research, and Department of Chemistry and Chemical Biology, United States
| | - Huafeng Xu
- Silicon Therapeutics, Boston, Massachusetts 02210, United States
| | - Woody Sherman
- Silicon Therapeutics, Boston, Massachusetts 02210, United States
| | - Darrin M. York
- Rutgers, the State University of New Jersey, Laboratory for Biomolecular Simulation Research, and Department of Chemistry and Chemical Biology, United States
| |
Collapse
|
32
|
Hu W, Li P, Wang JN, Xue Y, Mo Y, Zheng J, Pan X, Shao Y, Mei Y. Accelerated Computation of Free Energy Profile at Ab Initio Quantum Mechanical/Molecular Mechanics Accuracy via a Semiempirical Reference Potential. 3. Gaussian Smoothing on Density-of-States. J Chem Theory Comput 2020; 16:6814-6822. [PMID: 32975951 PMCID: PMC7658029 DOI: 10.1021/acs.jctc.0c00794] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Calculations of the free energy profile, also known as potential of mean force (PMF), along a chosen collective variable (CV) are now routinely applied in the studies of chemical processes, such as enzymatic reactions and chemical reactions in condensed phases. However, if the ab initio quantum mechanical/molecular mechanics (QM/MM) level of accuracy is required for the PMF, it can be formidably demanding even with the most advanced enhanced sampling methods, such as umbrella sampling. To ameliorate this difficulty, we developed a novel method for the computation of the free energy profile based on the reference-potential method recently, in which a low-level reference Hamiltonian is employed for phase space sampling and the free energy profile can be corrected to the level of interest (the target Hamiltonian) by energy reweighting in a nonparametric way. However, when the reference Hamiltonian is very different from the target Hamiltonian, the calculated ensemble averages, including the PMF, often suffer from numerical instability, which mainly comes from the overestimation of the density-of-states (DoS) in the low-energy region. Stochastic samplings of these low-energy configurations are rare events, and some low-energy conformations may get oversampled in simulations of a finite length. In this work, an assumption of Gaussian distribution is applied to the DoS in each CV bin, and the weight of each configuration is rescaled according to the accumulated DoS. The results show that this smoothing process can remarkably reduce the ruggedness of the PMF and increase the reliability of the reference-potential method.
Collapse
Affiliation(s)
- Wenxin Hu
- The Computer Center, School of Data Science & Engineering, East China Normal University, Shanghai 200062, China
| | - Pengfei Li
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200062, China
| | - Jia-Ning Wang
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200062, China
| | - Yuanfei Xue
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200062, China
| | - Yan Mo
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200062, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi 030006, China
| | - Jun Zheng
- The Computer Center, School of Data Science & Engineering, East China Normal University, Shanghai 200062, China
| | - Xiaoliang Pan
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, Oklahoma 73019, United States
| | - Yihan Shao
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman, Oklahoma 73019, United States
| | - Ye Mei
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200062, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi 030006, China
| |
Collapse
|
33
|
König G, Glaser N, Schroeder B, Kubincová A, Hünenberger PH, Riniker S. An Alternative to Conventional λ-Intermediate States in Alchemical Free Energy Calculations: λ-Enveloping Distribution Sampling. J Chem Inf Model 2020; 60:5407-5423. [PMID: 32794763 DOI: 10.1021/acs.jcim.0c00520] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Alchemical free energy calculations typically rely on intermediate states to bridge between the relevant phase spaces of the two end states. These intermediate states are usually created by mixing the energies or parameters of the end states according to a coupling parameter λ. The choice of the procedure has a strong impact on the efficiency of the calculation, as it affects both the encountered energy barriers and the phase space overlap between the states. The present work builds on the connection between the minimum variance pathway (MVP) and enveloping distribution sampling (EDS). It is shown that both methods can be regarded as special cases of a common scheme referred to as λ-EDS, which can also reproduce the behavior of conventional λ-intermediate states. A particularly attractive feature of λ-EDS is its ability to emulate the use of soft core potentials (SCP) while avoiding the associated computational overhead when applying efficient free energy estimators such as the multistate Bennett's acceptance ratio (MBAR). The method is illustrated for both relative and absolute free energy calculations considering five benchmark systems. The first two systems (charge inversion and cavity creation in a dipolar solvent) demonstrate the use of λ-EDS as an alternative coupling scheme in the context of thermodynamic integration (TI). The three other systems (change of bond length, change of dihedral angles, and cavity creation in water) investigate the efficiency and optimal choice of parameters in the context of free energy perturbation (FEP) and Bennett's acceptance ratio (BAR). It is shown that λ-EDS allows larger steps along the alchemical pathway than conventional intermediate states.
Collapse
Affiliation(s)
- Gerhard König
- Laboratory of Physical Chemistry, Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Nina Glaser
- Laboratory of Physical Chemistry, Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Benjamin Schroeder
- Laboratory of Physical Chemistry, Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Alžbeta Kubincová
- Laboratory of Physical Chemistry, Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Philippe H Hünenberger
- Laboratory of Physical Chemistry, Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Sereina Riniker
- Laboratory of Physical Chemistry, Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
34
|
Mintis DG, Mavrantzas VG. Phase Boundary and Salt Partitioning in Coacervate Complexes Formed between Poly(acrylic acid) and Poly(N,N-dimethylaminoethyl methacrylate) from Detailed Atomistic Simulations Combined with Free Energy Perturbation and Thermodynamic Integration Calculations. Macromolecules 2020. [DOI: 10.1021/acs.macromol.0c00728] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Dimitris G. Mintis
- Department of Chemical Engineering, University of Patras & FORTH-ICE/HT, Patras GR26504, Greece
| | - Vlasis G. Mavrantzas
- Department of Chemical Engineering, University of Patras & FORTH-ICE/HT, Patras GR26504, Greece
- Particle Technology Laboratory, Department of Mechanical and Process Engineering, ETH Zürich, CH-8092 Zürich, Switzerland
| |
Collapse
|
35
|
Zhuang Y, Bureau HR, Quirk S, Hernandez R. Adaptive steered molecular dynamics of biomolecules. MOLECULAR SIMULATION 2020. [DOI: 10.1080/08927022.2020.1807542] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Yi Zhuang
- Department of Chemistry, Johns Hopkins University, Baltimore, MD, USA
| | - Hailey R. Bureau
- Department of Chemistry, Johns Hopkins University, Baltimore, MD, USA
| | | | | |
Collapse
|
36
|
Ito S, Cui Q. Multi-level free energy simulation with a staged transformation approach. J Chem Phys 2020; 153:044115. [PMID: 32752685 DOI: 10.1063/5.0012494] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Combining multiple levels of theory in free energy simulations to balance computational accuracy and efficiency is a promising approach for studying processes in the condensed phase. While the basic idea has been proposed and explored for quite some time, it remains challenging to achieve convergence for such multi-level free energy simulations as it requires a favorable distribution overlap between different levels of theory. Previous efforts focused on improving the distribution overlap by either altering the low-level of theory for the specific system of interest or ignoring certain degrees of freedom. Here, we propose an alternative strategy that first identifies the degrees of freedom that lead to gaps in the distributions of different levels of theory and then treats them separately with either constraints or restraints or by introducing an intermediate model that better connects the low and high levels of theory. As a result, the conversion from the low level to the high level model is done in a staged fashion that ensures a favorable distribution overlap along the way. Free energy components associated with different steps are mostly evaluated explicitly, and thus, the final result can be meaningfully compared to the rigorous free energy difference between the two levels of theory with limited and well-defined approximations. The additional free energy component calculations involve simulations at the low level of theory and therefore do not incur high computational costs. The approach is illustrated with two simple but non-trivial solution examples, and factors that dictate the reliability of the result are discussed.
Collapse
Affiliation(s)
- Shingo Ito
- Department of Chemistry, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Qiang Cui
- Departments of Chemistry, Physics and Biomedical Engineering, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| |
Collapse
|
37
|
Kuhn M, Firth-Clark S, Tosco P, Mey ASJS, Mackey M, Michel J. Assessment of Binding Affinity via Alchemical Free-Energy Calculations. J Chem Inf Model 2020; 60:3120-3130. [DOI: 10.1021/acs.jcim.0c00165] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Maximilian Kuhn
- Cresset, New Cambridge House, Bassingbourn Road, Litlington SG8 0SS, Cambridgeshire, U.K
- EaStCHEM School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, U.K
| | - Stuart Firth-Clark
- Cresset, New Cambridge House, Bassingbourn Road, Litlington SG8 0SS, Cambridgeshire, U.K
| | - Paolo Tosco
- Cresset, New Cambridge House, Bassingbourn Road, Litlington SG8 0SS, Cambridgeshire, U.K
| | - Antonia S. J. S. Mey
- EaStCHEM School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, U.K
| | - Mark Mackey
- Cresset, New Cambridge House, Bassingbourn Road, Litlington SG8 0SS, Cambridgeshire, U.K
| | - Julien Michel
- EaStCHEM School of Chemistry, University of Edinburgh, David Brewster Road, Edinburgh EH9 3FJ, U.K
| |
Collapse
|
38
|
Barrera M, Jorge M. A Polarization-Consistent Model for Alcohols to Predict Solvation Free Energies. J Chem Inf Model 2020; 60:1352-1367. [PMID: 31977210 PMCID: PMC7145284 DOI: 10.1021/acs.jcim.9b01005] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Indexed: 11/28/2022]
Abstract
Classical nonpolarizable models, normally based on a combination of Lennard-Jones sites and point charges, are extensively used to model thermodynamic properties of fluids, including solvation. An important shortcoming of these models is that they do not explicitly account for polarization effects, i.e., a description of how the electron density responds to changes in the molecular environment. Instead, polarization is implicitly included, in a mean-field sense, into the parameters of the model, usually by fitting to pure liquid properties (e.g., density). This causes problems when trying to describe thermodynamic properties that involve a change of phase (e.g., enthalpy of vaporization), that directly depend on the electronic response of the medium (e.g., dielectric constant), and that require mixing or solvation in different media (e.g., solvation free energies). Fully polarizable models present a natural route for addressing these limitations but at the price of a much higher computational cost. In this work, we combine the best of those two approaches by running fast simulations using nonpolarizable models and applying post facto corrections to the computed properties in order to account for the effects of polarization. By applying this new paradigm, a new united-atom force field for alcohols is developed that is able to predict both pure liquid properties, including dielectric constant, and solvation free energies in different solvents with a high degree of accuracy. This paves the way for the development of a generic classical nonpolarizable force field that can predict solvation of drug-like molecules in a variety of solvents.
Collapse
Affiliation(s)
- Maria
Cecilia Barrera
- Department of Chemical and
Process Engineering, University of Strathclyde, Glasgow G1 1XJ, United Kingdom
| | - Miguel Jorge
- Department of Chemical and
Process Engineering, University of Strathclyde, Glasgow G1 1XJ, United Kingdom
| |
Collapse
|
39
|
Mey ASJS, Allen BK, Macdonald HEB, Chodera JD, Hahn DF, Kuhn M, Michel J, Mobley DL, Naden LN, Prasad S, Rizzi A, Scheen J, Shirts MR, Tresadern G, Xu H. Best Practices for Alchemical Free Energy Calculations [Article v1.0]. LIVING JOURNAL OF COMPUTATIONAL MOLECULAR SCIENCE 2020; 2:18378. [PMID: 34458687 PMCID: PMC8388617 DOI: 10.33011/livecoms.2.1.18378] [Citation(s) in RCA: 147] [Impact Index Per Article: 29.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Alchemical free energy calculations are a useful tool for predicting free energy differences associated with the transfer of molecules from one environment to another. The hallmark of these methods is the use of "bridging" potential energy functions representing alchemical intermediate states that cannot exist as real chemical species. The data collected from these bridging alchemical thermodynamic states allows the efficient computation of transfer free energies (or differences in transfer free energies) with orders of magnitude less simulation time than simulating the transfer process directly. While these methods are highly flexible, care must be taken in avoiding common pitfalls to ensure that computed free energy differences can be robust and reproducible for the chosen force field, and that appropriate corrections are included to permit direct comparison with experimental data. In this paper, we review current best practices for several popular application domains of alchemical free energy calculations performed with equilibrium simulations, in particular relative and absolute small molecule binding free energy calculations to biomolecular targets.
Collapse
Affiliation(s)
- Antonia S. J. S. Mey
- EaStCHEM School of Chemistry, David Brewster Road, Joseph Black Building, The King’s Buildings, Edinburgh, EH9 3FJ, UK
| | | | - Hannah E. Bruce Macdonald
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York NY, USA
| | - John D. Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York NY, USA
| | - David F. Hahn
- Computational Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse B-2340, Belgium
| | - Maximilian Kuhn
- EaStCHEM School of Chemistry, David Brewster Road, Joseph Black Building, The King’s Buildings, Edinburgh, EH9 3FJ, UK
- Cresset, Cambridgeshire, UK
| | - Julien Michel
- EaStCHEM School of Chemistry, David Brewster Road, Joseph Black Building, The King’s Buildings, Edinburgh, EH9 3FJ, UK
| | - David L. Mobley
- Departments of Pharmaceutical Sciences and Chemistry, University of California, Irvine, Irvine, USA
| | - Levi N. Naden
- Molecular Sciences Software Institute, Blacksburg VA, USA
| | | | - Andrea Rizzi
- Silicon Therapeutics, Boston, MA, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, USA
| | - Jenke Scheen
- EaStCHEM School of Chemistry, David Brewster Road, Joseph Black Building, The King’s Buildings, Edinburgh, EH9 3FJ, UK
| | | | - Gary Tresadern
- Computational Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse B-2340, Belgium
| | | |
Collapse
|
40
|
Affiliation(s)
| | - Chloe Luyet
- Department of Chemical Engineering and Materials Science, Wayne State University, Detroit, MI, USA
| | - Jeffrey J. Potoff
- Department of Chemical Engineering and Materials Science, Wayne State University, Detroit, MI, USA
| |
Collapse
|
41
|
Giese TJ, York DM. Development of a Robust Indirect Approach for MM → QM Free Energy Calculations That Combines Force-Matched Reference Potential and Bennett's Acceptance Ratio Methods. J Chem Theory Comput 2019; 15:5543-5562. [PMID: 31507179 DOI: 10.1021/acs.jctc.9b00401] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
We use the PBE0/6-31G* density functional method to perform ab initio quantum mechanical/molecular mechanical (QM/MM) molecular dynamics (MD) simulations under periodic boundary conditions with rigorous electrostatics using the ambient potential composite Ewald method in order to test the convergence of MM → QM/MM free energy corrections for the prediction of 17 small-molecule solvation free energies and eight ligand binding free energies to T4 lysozyme. The "indirect" thermodynamic cycle for calculating free energies is used to explore whether a series of reference potentials improve the statistical quality of the predictions. Specifically, we construct a series of reference potentials that optimize a molecular mechanical (MM) force field's parameters to reproduce the ab initio QM/MM forces from a QM/MM simulation. The optimizations form a systematic progression of successively expanded parameters that include bond, angle, dihedral, and charge parameters. For each reference potential, we calculate benchmark quality reference values for the MM → QM/MM correction by performing the mixed MM and QM/MM Hamiltonians at 11 intermediate states, each for 200 ps. We then compare forward and reverse application of Zwanzig's relation, thermodynamic integration (TI), and Bennett's acceptance ratio (BAR) methods as a function of reference potential, simulation time, and the number of simulated intermediate states. We find that Zwanzig's equation is inadequate unless a large number of intermediate states are explicitly simulated. The TI and BAR mean signed errors are very small even when only the end-state simulations are considered, and the standard deviations of the TI and BAR errors are decreased by choosing a reference potential that optimizes the bond and angle parameters. We find a robust approach for the data sets of fairly rigid molecules considered here is to use bond + angle reference potential together with the end-state-only BAR analysis. This requires QM/MM simulations to be performed in order to generate reference data to parametrize the bond + angle reference potential, and then this same simulation serves a dual purpose as the full QM/MM end state. The convergence of the results with respect to time suggests that computational resources may be used more efficiently by running multiple simulations for no more than 50 ps, rather than running one long simulation.
Collapse
Affiliation(s)
- Timothy J Giese
- Laboratory for Biomolecular Simulation Research, Center for Integrative Proteomics Research and Department of Chemistry and Chemical Biology , Rutgers University , Piscataway , New Jersey 08854-8087 , United States
| | - Darrin M York
- Laboratory for Biomolecular Simulation Research, Center for Integrative Proteomics Research and Department of Chemistry and Chemical Biology , Rutgers University , Piscataway , New Jersey 08854-8087 , United States
| |
Collapse
|
42
|
Hudson PS, Woodcock HL, Boresch S. Use of Interaction Energies in QM/MM Free Energy Simulations. J Chem Theory Comput 2019; 15:4632-4645. [PMID: 31142113 DOI: 10.1021/acs.jctc.9b00084] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The use of the most accurate (i.e., QM or QM/MM) levels of theory for free energy simulations (FES) is typically not possible. Primarily, this is because the computational cost associated with the extensive configurational sampling needed for converging FES is prohibitive. To ensure the feasibility of QM-based FES, the "indirect" approach is generally taken, necessitating a free energy calculation between the MM and QM/MM potential energy surfaces. Ideally, this step is performed with standard free energy perturbation (Zwanzig's equation) as it only requires simulations be carried out at the low level of theory; however, work from several groups over the past few years has conclusively shown that Zwanzig's equation is ill-suited to this task. As such, many approximations have arisen to mitigate difficulties with Zwanzig's equation. One particularly popular notion is that the convergence of Zwanzig's equation can be improved by using interaction energy differences instead of total energy differences. Although problematic numerical fluctuations (a major problem when using Zwanzig's equation) are indeed reduced, our results and analysis demonstrate that this "interaction energy approximation" (IEA) is theoretically incorrect, and the implicit approximation invoked is spurious at best. Herein, we demonstrate this via solvation free energy calculations using IEA from two different low levels of theory to the same target high level. Results from this proof-of-concept consistently yield the wrong results, deviating by ∼1.5 kcal/mol from the rigorously obtained value.
Collapse
Affiliation(s)
- Phillip S Hudson
- Department of Chemistry , University of South Florida , 4202 East Fowler Avenue, CHE205 , Tampa , Florida 33620-5250 , United States.,Laboratory of Computational Biology , National Institutes of Health, National Heart, Lung and Blood Institute , 12 South Drive, Rm 3053 , Bethesda , Maryland 20892-5690 , United States
| | - H Lee Woodcock
- Department of Chemistry , University of South Florida , 4202 East Fowler Avenue, CHE205 , Tampa , Florida 33620-5250 , United States
| | - Stefan Boresch
- Faculty of Chemistry, Department of Computational Biological Chemistry , University of Vienna , Währingerstraße 17 , Vienna A-1090 , Austria
| |
Collapse
|
43
|
Wang M, Mei Y, Ryde U. Host-Guest Relative Binding Affinities at Density-Functional Theory Level from Semiempirical Molecular Dynamics Simulations. J Chem Theory Comput 2019; 15:2659-2671. [PMID: 30811192 DOI: 10.1021/acs.jctc.8b01280] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Relative free energies for the binding of nine cyclic carboxylate ligands to the octa-acid deep-cavity host were calculated at the combined density-functional theory and molecular mechanics (DFT/MM) level of theory. The DFT calculations employed the BLYP functional and the 6-31G* basis set for the ligand. We employed free-energy perturbations (FEP) with the reference-potential approach and used molecular dynamics (MD) simulations with the semiempirical quantum mechanical (SQM) PM6-DH+ method for the ligand as an intermediate level between MM and DFT/MM to improve the convergence. Thus, the relative binding free energy of two ligands was first calculated at the MM level by an alchemical transformation from one ligand to another in both the bound and unbound states. Then, for each ligand the free-energy correction for going from the MM to the SQM/MM potentials was calculated using explicit SQM/MM MD simulations. Finally, the free-energy correction for going from the SQM/MM to the DFT/MM potentials was estimated with FEP without running any DFT/MM simulations. Instead, the free energy was calculated by single-step exponential averaging (ssEA) or employing the cumulant approximation to the second order (CA). The results show that CA converges much better than ssEA, and with 500-4500 DFT/MM single-point energy calculations, converged free energies with a precision of 0.3 kJ/mol can be obtained. These free energies reproduce the experimental binding free energy differences with a mean absolute deviation of 3.4 kJ/mol, a correlation ( R2) of 0.97, and correct signs for all of the eight free-energy differences. This is appreciably better than the results obtained at the SQM/MM level of theory and also slightly better than those obtained with MM. We show that the convergence of the SQM/MM → DFT/MM perturbations can be monitored by the use of Wu and Kofke's bias metric Π and by the standard deviation of the difference between the SQM/MM and DFT/MM energies. Finally, we show that the use of the intermediate SQM/MM MD simulations improves the convergence of the free energies by a factor of at least two, compared to doing direct MM → DFT/MM perturbations.
Collapse
Affiliation(s)
- Meiting Wang
- State Key Laboratory of Precision Spectroscopy, School of Physics and Materials Science , East China Normal University , Shanghai 200062 , China.,Department of Theoretical Chemistry , Lund University, Chemical Centre , P.O. Box 124, SE-221 00 Lund , Sweden
| | - Ye Mei
- State Key Laboratory of Precision Spectroscopy, School of Physics and Materials Science , East China Normal University , Shanghai 200062 , China.,NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062 , China.,Collaborative Innovation Center of Extreme Optics , Shanxi University , Taiyuan , Shanxi 030006 , China
| | - Ulf Ryde
- Department of Theoretical Chemistry , Lund University, Chemical Centre , P.O. Box 124, SE-221 00 Lund , Sweden
| |
Collapse
|
44
|
Boubeta FM, Contestín García RM, Lorenzo EN, Boechi L, Estrin D, Sued M, Arrar M. Lessons learned about steered molecular dynamics simulations and free energy calculations. Chem Biol Drug Des 2019; 93:1129-1138. [PMID: 30793836 DOI: 10.1111/cbdd.13485] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 12/18/2018] [Accepted: 12/22/2018] [Indexed: 01/30/2023]
Abstract
The calculation of free energy profiles is central in understanding differential enzymatic activity, for instance, involving chemical reactions that require QM-MM tools, ligand migration, and conformational rearrangements that can be modeled using classical potentials. The use of steered molecular dynamics (sMD) together with the Jarzynski equality is a popular approach in calculating free energy profiles. Here, we first briefly review the application of the Jarzynski equality to sMD simulations, then revisit the so-called stiff-spring approximation and the consequent expectation of Gaussian work distributions and, finally, reiterate the practical utility of the second-order cumulant expansion, as it coincides with the parametric maximum-likelihood estimator in this scenario. We illustrate this procedure using simulations of CO, both in aqueous solution and in a carbon nanotube as a model system for biologically relevant nanoheterogeneous environments. We conclude the use of the second-order cumulant expansion permits the use of faster pulling velocities in sMD simulations, without introducing bias due to large dispersion in the non-equilibrium work distribution.
Collapse
Affiliation(s)
- Fernando Martín Boubeta
- CONICET-Facultad de Ciencias Exactas y Naturales, Instituto de Química-Física de los Materiales, Medio Ambiente y Energía, Universidad de Buenos Aires, Buenos Aires, Argentina.,Departamento de Química Inorgánica, Analítica y Química Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Rocío María Contestín García
- CONICET-Facultad de Ciencias Exactas y Naturales, Instituto de Química-Física de los Materiales, Medio Ambiente y Energía, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Ezequiel Norberto Lorenzo
- CONICET-Facultad de Ciencias Exactas y Naturales, Instituto de Química-Física de los Materiales, Medio Ambiente y Energía, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Leonardo Boechi
- CONICET-Facultad de Ciencias Exactas y Naturales, Instituto de Cálculo, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Dario Estrin
- CONICET-Facultad de Ciencias Exactas y Naturales, Instituto de Química-Física de los Materiales, Medio Ambiente y Energía, Universidad de Buenos Aires, Buenos Aires, Argentina.,Departamento de Química Inorgánica, Analítica y Química Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Mariela Sued
- CONICET-Facultad de Ciencias Exactas y Naturales, Instituto de Cálculo, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Mehrnoosh Arrar
- CONICET-Facultad de Ciencias Exactas y Naturales, Instituto de Química-Física de los Materiales, Medio Ambiente y Energía, Universidad de Buenos Aires, Buenos Aires, Argentina.,Departamento de Química Inorgánica, Analítica y Química Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
45
|
Kearns FL, Warrensford L, Boresch S, Woodcock HL. The Good, the Bad, and the Ugly: "HiPen", a New Dataset for Validating (S)QM/MM Free Energy Simulations. Molecules 2019; 24:E681. [PMID: 30769826 PMCID: PMC6413162 DOI: 10.3390/molecules24040681] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Revised: 01/28/2019] [Accepted: 01/29/2019] [Indexed: 11/25/2022] Open
Abstract
Indirect (S)QM/MM free energy simulations (FES) are vital to efficiently incorporating sufficient sampling and accurate (QM) energetic evaluations when estimating free energies of practical/experimental interest. Connecting between levels of theory, i.e., calculating Δ A l o w → h i g h , remains to be the most challenging step within an indirect FES protocol. To improve calculations of Δ A l o w → h i g h , we must: (1) compare the performance of all FES methods currently available; and (2) compile and maintain datasets of Δ A l o w → h i g h calculated for a wide-variety of molecules so that future practitioners may replicate or improve upon the current state-of-the-art. Towards these two aims, we introduce a new dataset, "HiPen", which tabulates Δ A g a s M M → 3 o b (the free energy associated with switching from an M M to an S C C - D F T B molecular description using the 3ob parameter set in gas phase), calculated for 22 drug-like small molecules. We compare the calculation of this value using free energy perturbation, Bennett's acceptance ratio, Jarzynski's equation, and Crooks' equation. We also predict the reliability of each calculated Δ A g a s M M → 3 o b by evaluating several convergence criteria including sample size hysteresis, overlap statistics, and bias metric ( Π ). Within the total dataset, three distinct categories of molecules emerge: the "good" molecules, for which we can obtain converged Δ A g a s M M → 3 o b using Jarzynski's equation; "bad" molecules which require Crooks' equation to obtain a converged Δ A g a s M M → 3 o b ; and "ugly" molecules for which we cannot obtain reliably converged Δ A g a s M M → 3 o b with either Jarzynski's or Crooks' equations. We discuss, in depth, results from several example molecules in each of these categories and describe how dihedral discrepancies between levels of theory cause convergence failures even for these gas phase free energy simulations.
Collapse
Affiliation(s)
- Fiona L Kearns
- Department of Chemistry, University of South Florida, 4202 E. Fowler Avenue, Tampa, FL 33620, USA.
| | - Luke Warrensford
- Department of Chemistry, University of South Florida, 4202 E. Fowler Avenue, Tampa, FL 33620, USA.
| | - Stefan Boresch
- Department of Computational Biological Chemistry, Faculty of Chemistry, University of Vienna, Waehringerstrasse 17, A-1090 Vienna, Austria.
| | - H Lee Woodcock
- Department of Chemistry, University of South Florida, 4202 E. Fowler Avenue, Tampa, FL 33620, USA.
| |
Collapse
|
46
|
Duarte Ramos Matos G, Mobley DL. Challenges in the use of atomistic simulations to predict solubilities of drug-like molecules. F1000Res 2019; 7:686. [PMID: 30109026 PMCID: PMC6069752 DOI: 10.12688/f1000research.14960.2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 12/06/2018] [Indexed: 12/19/2022] Open
Abstract
Background: Solubility is a physical property of high importance to the pharmaceutical industry, the prediction of which for potential drugs has so far been a hard task. We attempted to predict the solubility of acetylsalicylic acid (ASA) by estimating the absolute chemical potentials of its most stable polymorph and of solutions with different concentrations of the drug molecule. Methods: Chemical potentials were estimated from all-atom molecular dynamics simulations. We used the Einstein molecule method (EMM) to predict the absolute chemical potential of the solid and solvation free energy calculations to predict the excess chemical potentials of the liquid-phase systems. Results: Reliable estimations of the chemical potentials for the solid and for a single ASA molecule using the EMM required an extremely large number of intermediate states for the free energy calculations, meaning that the calculations were extremely demanding computationally. Despite the computational cost, however, the computed value did not agree well with the experimental value, potentially due to limitations with the underlying energy model. Perhaps better values could be obtained with a better energy model; however, it seems likely computational cost may remain a limiting factor for use of this particular approach to solubility estimation. Conclusions: Solubility prediction of drug-like solids remains computationally challenging, and it appears that both the underlying energy model and the computational approach applied may need improvement before the approach is suitable for routine use.
Collapse
Affiliation(s)
| | - David L Mobley
- Department of Chemistry, University of California, Irvine, Irvine, California, USA.,Departments of Pharmaceutical Sciences and Chemistry, University of California, Irvine, Irvine, California, USA
| |
Collapse
|
47
|
Arrar M, Boubeta FM, Szretter ME, Sued M, Boechi L, Rodriguez D. On the accurate estimation of free energies using the jarzynski equality. J Comput Chem 2018; 40:688-696. [DOI: 10.1002/jcc.25754] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 06/18/2018] [Accepted: 09/03/2018] [Indexed: 11/09/2022]
Affiliation(s)
- Mehrnoosh Arrar
- Instituto de Química-Física de los Materiales, Medio Ambiente y Energía, CONICET-Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina, Departamento de Química Inorgánica, Analítica y Química Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires; Buenos Aires Argentina
| | - Fernando Martín Boubeta
- Instituto de Química-Física de los Materiales, Medio Ambiente y Energía, CONICET-Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina, Departamento de Química Inorgánica, Analítica y Química Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires; Buenos Aires Argentina
| | - Maria Eugenia Szretter
- Departamento de Matemática, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina, Instituto de Cálculo, CONICET-Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires; Buenos Aires Argentina
| | - Mariela Sued
- Instituto de Cálculo, CONICET-Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires; Buenos Aires Argentina
| | - Leonardo Boechi
- Instituto de Cálculo, CONICET-Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires; Buenos Aires Argentina
| | - Daniela Rodriguez
- Instituto de Cálculo, CONICET-Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires; Buenos Aires Argentina
| |
Collapse
|
48
|
Hudson PS, Boresch S, Rogers DM, Woodcock HL. Accelerating QM/MM Free Energy Computations via Intramolecular Force Matching. J Chem Theory Comput 2018; 14:6327-6335. [PMID: 30300543 PMCID: PMC6314469 DOI: 10.1021/acs.jctc.8b00517] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The calculation of free energy differences between levels of theory has numerous potential pitfalls. Chief among them is the lack of overlap, i.e., ensembles generated at one level of theory (e.g., "low") not being good approximations of ensembles at the other (e.g., "high"). Numerous strategies have been devised to mitigate this issue. However, the most straightforward approach is to ensure that the "low" level ensemble more closely resembles that of the "high". Ideally, this is done without increasing computational cost. Herein, we demonstrate that by reparametrizing classical intramolecular potentials to reproduce high level forces (i.e., force matching) configurational overlap between a "low" (i.e., classical) and "high" (i.e., quantum) level can be significantly improved. This procedure is validated on two test cases and results in vastly improved convergence of free energy simulations.
Collapse
Affiliation(s)
- Phillip S Hudson
- Department of Chemistry , University of South Florida , 4202 East Fowler Avenue, CHE205 , Tampa , Florida 33620-5250 , United States
- Laboratory of Computational Biology , National Institutes of Health, National Heart, Lung and Blood Institute , 12 South Drive Rm 3053 , Bethesda , Maryland 20892-5690 , United States
| | - Stefan Boresch
- Faculty of Chemistry, Department of Computational Biological Chemistry , University of Vienna , Währingerstraße 17 , A-1090 Vienna , Austria
| | - David M Rogers
- Department of Chemistry , University of South Florida , 4202 East Fowler Avenue, CHE205 , Tampa , Florida 33620-5250 , United States
| | - H Lee Woodcock
- Department of Chemistry , University of South Florida , 4202 East Fowler Avenue, CHE205 , Tampa , Florida 33620-5250 , United States
| |
Collapse
|
49
|
Hudson PS, Han K, Woodcock HL, Brooks BR. Force matching as a stepping stone to QM/MM CB[8] host/guest binding free energies: a SAMPL6 cautionary tale. J Comput Aided Mol Des 2018; 32:983-999. [PMID: 30276502 PMCID: PMC6867086 DOI: 10.1007/s10822-018-0165-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Accepted: 09/14/2018] [Indexed: 10/28/2022]
Abstract
Use of quantum mechanical/molecular mechanical (QM/MM) methods in binding free energy calculations, particularly in the SAMPL challenge, often fail to achieve improvement over standard additive (MM) force fields. Frequently, the implementation is through use of reference potentials, or the so-called "indirect approach", and inherently relies on sufficient overlap existing between MM and QM/MM configurational spaces. This overlap is generally poor, particularly for the use of free energy perturbation to perform the MM to QM/MM free energy correction at the end states of interest (e.g., bound and unbound states). However, by utilizing MM parameters that best reproduce forces obtained at the desired QM level of theory, it is possible to lessen the configurational disparity between MM and QM/MM. To this end, we sought to use force matching to generate MM parameters for the SAMPL6 CB[8] host-guest binding challenge, classically compute binding free energies, and apply energetic end state corrections to obtain QM/MM binding free energy differences. For the standard set of 11 molecules and the bonus set (including three additional challenge molecules), error statistics, such as the root mean square deviation (RMSE) were moderately poor (5.5 and 5.4 kcal/mol). Correlation statistics, however, were in the top two for both standard and bonus set submissions ([Formula: see text] of 0.42 and 0.26, [Formula: see text] of 0.64 and 0.47 respectively). High RMSE and moderate correlation strongly indicated the presence of systematic error. Identifiable issues were ameliorated for two of the guest molecules, resulting in a reduction of error and pointing to strong prospects for the future use of this methodology.
Collapse
Affiliation(s)
- Phillip S Hudson
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20852, USA.
- Department of Chemistry, University of South Florida, Tampa, Florida, 33620, USA.
| | - Kyungreem Han
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, 20852, USA
| | - H Lee Woodcock
- Department of Chemistry, University of South Florida, Tampa, Florida, 33620, USA
| | - Bernard R Brooks
- Department of Chemistry, University of South Florida, Tampa, Florida, 33620, USA
| |
Collapse
|
50
|
Caldararu O, Olsson MA, Misini Ignjatović M, Wang M, Ryde U. Binding free energies in the SAMPL6 octa-acid host-guest challenge calculated with MM and QM methods. J Comput Aided Mol Des 2018; 32:1027-1046. [PMID: 30203229 DOI: 10.1007/s10822-018-0158-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Accepted: 08/31/2018] [Indexed: 01/15/2023]
Abstract
We have estimated free energies for the binding of eight carboxylate ligands to two variants of the octa-acid deep-cavity host in the SAMPL6 blind-test challenge (with or without endo methyl groups on the four upper-rim benzoate groups, OAM and OAH, respectively). We employed free-energy perturbation (FEP) for relative binding energies at the molecular mechanics (MM) and the combined quantum mechanical (QM) and MM (QM/MM) levels, the latter obtained with the reference-potential approach with QM/MM sampling for the MM → QM/MM FEP. The semiempirical QM method PM6-DH+ was employed for the ligand in the latter calculations. Moreover, binding free energies were also estimated from QM/MM optimised structures, combined with COSMO-RS estimates of the solvation energy and thermostatistical corrections from MM frequencies. They were performed at the PM6-DH+ level of theory with the full host and guest molecule in the QM system (and also four water molecules in the geometry optimisations) for 10-20 snapshots from molecular dynamics simulations of the complex. Finally, the structure with the lowest free energy was recalculated using the dispersion-corrected density-functional theory method TPSS-D3, for both the structure and the energy. The two FEP approaches gave similar results (PM6-DH+/MM slightly better for OAM), which were among the five submissions with the best performance in the challenge and gave the best results without any fit to data from the SAMPL5 challenge, with mean absolute deviations (MAD) of 2.4-5.2 kJ/mol and a correlation coefficient (R2) of 0.77-0.93. This is the first time QM/MM approaches give binding free energies that are competitive to those obtained with MM for the octa-acid host. The QM/MM-optimised structures gave somewhat worse performance (MAD = 3-8 kJ/mol and R2 = 0.1-0.9), but the results were improved compared to previous studies of this system with similar methods.
Collapse
Affiliation(s)
- Octav Caldararu
- Department of Theoretical Chemistry, Lund University, Chemical Centre, P. O. Box 124, 221 00, Lund, Sweden
| | - Martin A Olsson
- Department of Theoretical Chemistry, Lund University, Chemical Centre, P. O. Box 124, 221 00, Lund, Sweden
| | - Majda Misini Ignjatović
- Department of Theoretical Chemistry, Lund University, Chemical Centre, P. O. Box 124, 221 00, Lund, Sweden
| | - Meiting Wang
- Department of Theoretical Chemistry, Lund University, Chemical Centre, P. O. Box 124, 221 00, Lund, Sweden
| | - Ulf Ryde
- Department of Theoretical Chemistry, Lund University, Chemical Centre, P. O. Box 124, 221 00, Lund, Sweden.
| |
Collapse
|