1
|
John R, Herron L, Tiwary P. A comparison of probabilistic generative frameworks for molecular simulations. J Chem Phys 2025; 162:114121. [PMID: 40116311 DOI: 10.1063/5.0249683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2024] [Accepted: 02/24/2025] [Indexed: 03/23/2025] Open
Abstract
Generative artificial intelligence is now a widely used tool in molecular science. Despite the popularity of probabilistic generative models, numerical experiments benchmarking their performance on molecular data are lacking. In this work, we introduce and explain several classes of generative models, broadly sorted into two categories: flow-based models and diffusion models. We select three representative models: neural spline flows, conditional flow matching, and denoising diffusion probabilistic models, and examine their accuracy, computational cost, and generation speed across datasets with tunable dimensionality, complexity, and modal asymmetry. Our findings are varied, with no one framework being the best for all purposes. In a nutshell, (i) neural spline flows do best at capturing mode asymmetry present in low-dimensional data, (ii) conditional flow matching outperforms other models for high-dimensional data with low complexity, and (iii) denoising diffusion probabilistic models appear the best for low-dimensional data with high complexity. Our datasets include a Gaussian mixture model and the dihedral torsion angle distribution of the Aib9 peptide, generated via a molecular dynamics simulation. We hope our taxonomy of probabilistic generative frameworks and numerical results may guide model selection for a wide range of molecular tasks.
Collapse
Affiliation(s)
- Richard John
- Department of Physics and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
| | - Lukas Herron
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
- University of Maryland Institute for Health Computing, Bethesda, Maryland 20852, USA
| | - Pratyush Tiwary
- University of Maryland Institute for Health Computing, Bethesda, Maryland 20852, USA
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
| |
Collapse
|
2
|
Olehnovics E, Liu YM, Mehio N, Sheikh AY, Shirts MR, Salvalaglio M. Accurate Lattice Free Energies of Packing Polymorphs from Probabilistic Generative Models. J Chem Theory Comput 2025; 21:2244-2255. [PMID: 39982864 PMCID: PMC11912200 DOI: 10.1021/acs.jctc.4c01612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2024] [Revised: 01/31/2025] [Accepted: 02/07/2025] [Indexed: 02/23/2025]
Abstract
Finite-temperature lattice free energy differences between polymorphs of molecular crystals are fundamental to understanding and predicting the relative stability relationships underpinning polymorphism, yet are computationally expensive to obtain. Here, we implement and critically assess machine-learning-enabled targeted free energy calculations derived from flow-based generative models to compute the free energy difference between two ice crystal polymorphs (Ice XI and Ic), modeled with a fully flexible empirical classical force field. We demonstrate that even when remapping from an analytical reference distribution, such methods enable a cost-effective and accurate calculation of free energy differences between disconnected metastable ensembles when trained on locally ergodic data sampled exclusively from the ensembles of interest. Unlike classical free energy perturbation methods, such as the Einstein crystal method, the targeted approach analyzed in this work requires no additional sampling of intermediate perturbed Hamiltonians, offering significant computational savings. To systematically assess the accuracy of the method, we monitored the convergence of free energy estimates during training by implementing an overfitting-aware weighted averaging strategy. By comparing our results with ground-truth free energy differences computed with the Einstein crystal method, we assess the accuracy and efficiency of two different model architectures, employing two different representations of the supercell degrees of freedom (Cartesian vs quaternion-based). We conduct our assessment by comparing free energy differences between crystal supercells of different sizes and temperatures and assessing the accuracy in extrapolating lattice free energies to the thermodynamic limit. While at low temperatures and in small system sizes, the models perform with similar accuracy. We note that for larger systems and high temperatures, the choice of representation is key to obtaining generalizable results of quality comparable to that obtained from the Einstein crystal method. We believe this work to be a stepping stone toward efficient free energy calculations in larger, more complex molecular crystals.
Collapse
Affiliation(s)
- Edgar Olehnovics
- Thomas
Young Centre and Department of Chemical Engineering, University College London, London WC1E 7JE, United Kingdom
| | - Yifei Michelle Liu
- Molecular
Profiling and Drug Delivery, Research & Development, AbbVie Bioresearch Center, Worcester, Massachusetts 01605, United States
| | - Nada Mehio
- Molecular
Profiling and Drug Delivery, Research & Development, AbbVie Inc., North Chicago, Illinois 60064, United States
| | - Ahmad Y. Sheikh
- Molecular
Profiling and Drug Delivery, Research & Development, AbbVie Inc., North Chicago, Illinois 60064, United States
| | - Michael R. Shirts
- University
of Colorado Boulder, Boulder, Colorado 80309, United States
| | - Matteo Salvalaglio
- Thomas
Young Centre and Department of Chemical Engineering, University College London, London WC1E 7JE, United Kingdom
| |
Collapse
|
3
|
Olehnovics E, Liu YM, Mehio N, Sheikh AY, Shirts MR, Salvalaglio M. Assessing the Accuracy and Efficiency of Free Energy Differences Obtained from Reweighted Flow-Based Probabilistic Generative Models. J Chem Theory Comput 2024; 20:5913-5922. [PMID: 38984825 PMCID: PMC11270817 DOI: 10.1021/acs.jctc.4c00520] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 06/21/2024] [Accepted: 06/21/2024] [Indexed: 07/11/2024]
Abstract
Computing free energy differences between metastable states characterized by nonoverlapping Boltzmann distributions is often a computationally intensive endeavor, usually requiring chains of intermediate states to connect them. Targeted free energy perturbation (TFEP) can significantly lower the computational cost of FEP calculations by choosing a set of invertible maps used to directly connect the distributions of interest, achieving the necessary statistically significant overlaps without sampling any intermediate states. Probabilistic generative models (PGMs) based on normalizing flow architectures can make it much easier via machine learning to train invertible maps needed for TFEP. However, the accuracy and applicability of approaches based on empirically learned maps depend crucially on the choice of reweighting method adopted to estimate the free energy differences. In this work, we assess the accuracy, rate of convergence, and data efficiency of different free energy estimators, including exponential averaging, Bennett acceptance ratio (BAR), and multistate Bennett acceptance ratio (MBAR), in reweighting PGMs trained by maximum likelihood on limited amounts of molecular dynamics data sampled only from end-states of interest. We carry out the comparisons on a set of simple but representative case studies, including conformational ensembles of alanine dipeptide and ibuprofen. Our results indicate that BAR and MBAR are both data efficient and robust, even in the presence of significant model overfitting in the generation of invertible maps. This analysis can serve as a stepping stone for the deployment of efficient and quantitatively accurate ML-based free energy calculation methods in complex systems.
Collapse
Affiliation(s)
- Edgar Olehnovics
- Thomas
Young Centre and Department of Chemical Engineering, University College London, London WC1E 7JE, U.K.
| | - Yifei Michelle Liu
- Molecular
Profiling and Drug Delivery, Research & Development, AbbVie Bioresearch Center, Worcester, Massachusetts 01605, United States
| | - Nada Mehio
- Molecular
Profiling and Drug Delivery, Research & Development, AbbVie Inc, North
Chicago, Illinois 60064, United States
| | - Ahmad Y. Sheikh
- Molecular
Profiling and Drug Delivery, Research & Development, AbbVie Inc, North
Chicago, Illinois 60064, United States
| | - Michael R. Shirts
- University
of Colorado Boulder, Boulder, Colorado 80309, United States
| | - Matteo Salvalaglio
- Thomas
Young Centre and Department of Chemical Engineering, University College London, London WC1E 7JE, U.K.
| |
Collapse
|
4
|
Rizzi A, Carloni P, Parrinello M. Free energies at QM accuracy from force fields via multimap targeted estimation. Proc Natl Acad Sci U S A 2023; 120:e2304308120. [PMID: 37931103 PMCID: PMC10655219 DOI: 10.1073/pnas.2304308120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Accepted: 09/25/2023] [Indexed: 11/08/2023] Open
Abstract
Accurate predictions of ligand binding affinities would greatly accelerate the first stages of drug discovery campaigns. However, using highly accurate interatomic potentials based on quantum mechanics (QM) in free energy methods has been so far largely unfeasible due to their prohibitive computational cost. Here, we present an efficient method to compute QM free energies from simulations using cheap reference potentials, such as force fields (FFs). This task has traditionally been out of reach due to the slow convergence of computing the correction from the FF to the QM potential. To overcome this bottleneck, we generalize targeted free energy methods to employ multiple maps-implemented with normalizing flow neural networks (NNs)-that maximize the overlap between the distributions. Critically, the method requires neither a separate expensive training phase for the NNs nor samples from the QM potential. We further propose a one-epoch learning policy to efficiently avoid overfitting, and we combine our approach with enhanced sampling strategies to overcome the pervasive problem of poor convergence due to slow degrees of freedom. On the drug-like molecules in the HiPen dataset, the method accelerates the calculation of the free energy difference of switching from an FF to a DFTB3 potential by three orders of magnitude compared to standard free energy perturbation and by a factor of eight compared to previously published nonequilibrium calculations. Our results suggest that our method, in combination with efficient QM/MM calculations, may be used in lead optimization campaigns in drug discovery and to study protein-ligand molecular recognition processes.
Collapse
Affiliation(s)
- Andrea Rizzi
- Computational Biomedicine, Institute of Advanced Simulations IAS-5/Institute for Neuroscience and Medicine INM-9, Forschungszentrum Jülich GmbH, Jülich52428, Germany
- Atomistic Simulations, Italian Institute of Technology, Genova16163, Italy
| | - Paolo Carloni
- Computational Biomedicine, Institute of Advanced Simulations IAS-5/Institute for Neuroscience and Medicine INM-9, Forschungszentrum Jülich GmbH, Jülich52428, Germany
- Department of Physics and Universitätsklinikum, RWTH Aachen University, Aachen52074, Germany
| | - Michele Parrinello
- Atomistic Simulations, Italian Institute of Technology, Genova16163, Italy
| |
Collapse
|
5
|
Willow SY, Kang L, Minh DDL. Learned mappings for targeted free energy perturbation between peptide conformations. J Chem Phys 2023; 159:124104. [PMID: 38127367 PMCID: PMC10517865 DOI: 10.1063/5.0164662] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 09/04/2023] [Indexed: 12/23/2023] Open
Abstract
Targeted free energy perturbation uses an invertible mapping to promote configuration space overlap and the convergence of free energy estimates. However, developing suitable mappings can be challenging. Wirnsberger et al. [J. Chem. Phys. 153, 144112 (2020)] demonstrated the use of machine learning to train deep neural networks that map between Boltzmann distributions for different thermodynamic states. Here, we adapt their approach to the free energy differences of a flexible bonded molecule, deca-alanine, with harmonic biases and different spring centers. When the neural network is trained until "early stopping"-when the loss value of the test set increases-we calculate accurate free energy differences between thermodynamic states with spring centers separated by 1 Å and sometimes 2 Å. For more distant thermodynamic states, the mapping does not produce structures representative of the target state, and the method does not reproduce reference calculations.
Collapse
Affiliation(s)
- Soohaeng Yoo Willow
- Department of Chemistry, Illinois Institute of Technology, Chicago, Illinois 60616, USA
| | - Lulu Kang
- Department of Applied Mathematics, Illinois Institute of Technology, Chicago, Illinois 60616, USA
| | - David D. L. Minh
- Department of Chemistry, Department of Biology, and Center for Interdisciplinary Scientific Computation, Illinois Institute of Technology, Chicago, Illinois 60616, USA
| |
Collapse
|
6
|
Wang JN, Xue Y, Li P, Pan X, Wang M, Shao Y, Mo Y, Mei Y. Perspective: Reference-Potential Methods for the Study of Thermodynamic Properties in Chemical Processes: Theory, Applications, and Pitfalls. J Phys Chem Lett 2023; 14:4866-4875. [PMID: 37196031 PMCID: PMC10840091 DOI: 10.1021/acs.jpclett.3c00671] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
In silico investigations of enzymatic reactions and chemical reactions in condensed phases often suffer from formidable computational costs due to a large number of degrees of freedom and enormous important volume in phase space. Usually, accuracy must be compromised to trade for efficiency by lowering the reliability of the Hamiltonians employed or reducing the sampling time. Reference-potential methods (RPMs) offer an alternative approach to reaching high accuracy of simulation without much loss of efficiency. In this Perspective, we summarize the idea of RPMs and showcase some recent applications. Most importantly, the pitfalls of these methods are also discussed, and remedies to these pitfalls are presented.
Collapse
Affiliation(s)
- Jia-Ning Wang
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China
| | - Yuanfei Xue
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China
| | - Pengfei Li
- Single Particle, LLC, San Diego 92127, California, United States
| | - Xiaoliang Pan
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman 73019, Oklahoma, United States
| | - Meiting Wang
- School of Medical Engineering, Xinxiang Medical University, Xinxiang 453003, Henan, China
| | - Yihan Shao
- Department of Chemistry and Biochemistry, University of Oklahoma, Norman 73019, Oklahoma, United States
| | - Yan Mo
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan 030006, Shanxi, China
| | - Ye Mei
- State Key Laboratory of Precision Spectroscopy, School of Physics and Electronic Science, East China Normal University, Shanghai 200241, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan 030006, Shanxi, China
| |
Collapse
|
7
|
Wirnsberger P, Papamakarios G, Ibarz B, Racanière S, Ballard AJ, Pritzel A, Blundell C. Normalizing flows for atomic solids. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022. [DOI: 10.1088/2632-2153/ac6b16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Abstract
We present a machine-learning approach, based on normalizing flows, for modelling atomic solids. Our model transforms an analytically tractable base distribution into the target solid without requiring ground-truth samples for training. We report Helmholtz free energy estimates for cubic and hexagonal ice modelled as monatomic water as well as for a truncated and shifted Lennard-Jones system, and find them to be in excellent agreement with literature values and with estimates from established baseline methods. We further investigate structural properties and show that the model samples are nearly indistinguishable from the ones obtained with molecular dynamics. Our results thus demonstrate that normalizing flows can provide high-quality samples and free energy estimates without the need for multi-staging.
Collapse
|
8
|
Rizzi A, Carloni P, Parrinello M. Targeted Free Energy Perturbation Revisited: Accurate Free Energies from Mapped Reference Potentials. J Phys Chem Lett 2021; 12:9449-9454. [PMID: 34555284 DOI: 10.1021/acs.jpclett.1c02135] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
We present an approach that extends the theory of targeted free energy perturbation (TFEP) to calculate free energy differences and free energy surfaces at an accurate quantum mechanical level of theory from a cheaper reference potential. The convergence is accelerated by a mapping function that increases the overlap between the target and the reference distributions. Building on recent work, we show that this map can be learned with a normalizing flow neural network, without requiring simulations with the expensive target potential but only a small number of single-point calculations, and, crucially, avoiding the systematic error that was found previously. We validate the method by numerically evaluating the free energy difference in a system with a double-well potential and by describing the free energy landscape of a simple chemical reaction in the gas phase.
Collapse
Affiliation(s)
- Andrea Rizzi
- Computational Biomedicine, Institute of Advanced Simulations IAS-5/Institute for Neuroscience and Medicine INM-9, Forschungszentrum Jülich GmbH, Jülich 52428, Germany
- Atomistic Simulations, Italian Institute of Technology, Via Morego 30, Genova 16163, Italy
| | - Paolo Carloni
- Computational Biomedicine, Institute of Advanced Simulations IAS-5/Institute for Neuroscience and Medicine INM-9, Forschungszentrum Jülich GmbH, Jülich 52428, Germany
- Molecular Neuroscience and Neuroimaging (INM-11), Forschungszentrum Jülich GmbH, Jülich 52428, Germany
- Department of Physics and Universitätsklinikum, RWTH Aachen University, Aachen 52074, Germany
| | - Michele Parrinello
- Atomistic Simulations, Italian Institute of Technology, Via Morego 30, Genova 16163, Italy
| |
Collapse
|
9
|
Abstract
Fast and accurate evaluation of free energy has broad applications from drug design to material engineering. Computing the absolute free energy is of particular interest since it allows the assessment of the relative stability between states without intermediates. Here, we introduce a general framework for calculating the absolute free energy of a state. A key step of the calculation is the definition of a reference state with tractable deep generative models using locally sampled configurations. The absolute free energy of this reference state is zero by design. The free energy for the state of interest can then be determined as the difference from the reference. We applied this approach to both discrete and continuous systems and demonstrated its effectiveness. It was found that the Bennett acceptance ratio method provides more accurate and efficient free energy estimations than approximate expressions based on work. We anticipate the method presented here to be a valuable strategy for computing free energy differences.
Collapse
Affiliation(s)
- Xinqiang Ding
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
10
|
Wirnsberger P, Ballard AJ, Papamakarios G, Abercrombie S, Racanière S, Pritzel A, Jimenez Rezende D, Blundell C. Targeted free energy estimation via learned mappings. J Chem Phys 2020; 153:144112. [PMID: 33086827 DOI: 10.1063/5.0018903] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Free energy perturbation (FEP) was proposed by Zwanzig [J. Chem. Phys. 22, 1420 (1954)] more than six decades ago as a method to estimate free energy differences and has since inspired a huge body of related methods that use it as an integral building block. Being an importance sampling based estimator, however, FEP suffers from a severe limitation: the requirement of sufficient overlap between distributions. One strategy to mitigate this problem, called Targeted FEP, uses a high-dimensional mapping in configuration space to increase the overlap of the underlying distributions. Despite its potential, this method has attracted only limited attention due to the formidable challenge of formulating a tractable mapping. Here, we cast Targeted FEP as a machine learning problem in which the mapping is parameterized as a neural network that is optimized so as to increase the overlap. We develop a new model architecture that respects permutational and periodic symmetries often encountered in atomistic simulations and test our method on a fully periodic solvation system. We demonstrate that our method leads to a substantial variance reduction in free energy estimates when compared against baselines, without requiring any additional data.
Collapse
|
11
|
Sun Z. SAMPL7 TrimerTrip host-guest binding poses and binding affinities from spherical-coordinates-biased simulations. J Comput Aided Mol Des 2020; 35:105-115. [PMID: 32776199 DOI: 10.1007/s10822-020-00335-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Accepted: 08/04/2020] [Indexed: 12/21/2022]
Abstract
Host-guest binding remains a major challenge in modern computational modelling. The newest 7th statistical assessment of the modeling of proteins and ligands (SAMPL) challenge contains a new series of host-guest systems. The TrimerTrip host binds to 16 structurally diverse guests. Previously, we have successfully employed the spherical coordinates as the collective variables coupled with the enhanced sampling technique metadynamics to enhance the sampling of the binding/unbinding event, search for possible binding poses and calculate the binding affinities in all three host-guest binding cases of the 6th SAMPL challenge. In this work, we report a retrospective study on the TrimerTrip host-guest systems by employing the same protocol to investigate the TrimerTrip host in the SAMPL7 challenge. As no binding pose is provided by the SAMPL7 host, our simulations initiate from randomly selected configurations and are proceeded long enough to obtain converged free energy estimates and search for possible binding poses. The calculated binding affinities are in good agreement with the experimental reference, and the obtained binding poses serve as a nice starting point for end-point or alchemical free energy calculations. Note that as the work is performed after the close of the SAMPL7 challenge, we do not participate in the challenge and the results are not formally submitted to the SAMPL7 challenge.
Collapse
Affiliation(s)
- Zhaoxi Sun
- State Key Laboratory of Precision Spectroscopy, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200062, China.
| |
Collapse
|
12
|
Wang X, Tu X, Deng B, Zhang JZH, Sun Z. BAR-based optimum adaptive steered MD for configurational sampling. J Comput Chem 2019; 40:1270-1289. [PMID: 30762879 DOI: 10.1002/jcc.25784] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Revised: 11/05/2018] [Accepted: 01/06/2019] [Indexed: 11/08/2022]
Abstract
The equilibrium and nonequilibrium adaptive alchemical free energy simulation methods optimum Bennett's acceptance ratio and optimum crooks' equation (OCE), based on the statistically optimal bidirectional reweighting estimator named Bennett's Acceptance Ratio or Crooks' equation, perform initial sampling in the staging alchemical transformation and then determine the importance rank of different states via the time-derivative of the variance. The method is proven to give speedups compared with the equal time rule. In the current work, we extend the time derivative of variance guided adaptive sampling method to the configurational space, falling in the term of steered MD (SMD). The SMD approach biasing physically meaningful collective variable (CV) such as one dihedral or one distance to pulling the system from one conformational state to another. By minimizing the variance of the free energy differences along the pathway in an optimized way, a new type of adaptive SMD (ASMD) is introduced. As exhibits in the alchemical case, this adaptive sampling method outperforms the traditional equal-time SMD in nonequilibrium stratification. Also, the method gives much more efficient calculation of potential of mean force than the selection criterion-based ASMD scheme, which is proven to be more efficient than traditional SMD. The OCE workflow is periodicity-of-CV dependent while ASMD is not. The performance is demonstrated in a dihedral flipping case and two distance pulling cases, accounting for periodic and nonperiodic CVs, respectively. © 2019 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Xiaohui Wang
- State Key Laboratory of Precision Spectroscopy, East China Normal University, Shanghai 200062, China.,Institute of Computational Science, Università della Svizzera italiana (USI), CH-6900, Lugano, Ticino, Switzerland
| | - Xingzhao Tu
- Institute of Organic Chemistry, University of Leipzig, Leipzig 04103, Germany
| | - Boming Deng
- Laboratory of Oil Analysis, Beijing Hangfengkewei Equipment Technology Co., Ltd., Beijing 100141, China
| | - John Z H Zhang
- State Key Laboratory of Precision Spectroscopy, East China Normal University, Shanghai 200062, China.,NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China.,Department of Chemistry, New York University, New York, New York, 10003
| | - Zhaoxi Sun
- State Key Laboratory of Precision Spectroscopy, East China Normal University, Shanghai 200062, China.,Computational Biomedicine (IAS-5/INM-9), Forschungszentrum Juelich, Jülich 52425, Germany
| |
Collapse
|
13
|
Wang X, Tu X, Zhang JZH, Sun Z. BAR-based optimum adaptive sampling regime for variance minimization in alchemical transformation: the nonequilibrium stratification. Phys Chem Chem Phys 2018; 20:2009-2021. [PMID: 29299568 DOI: 10.1039/c7cp07573a] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Following the previously proposed equilibrate-state sampling based adaptive sampling regime Optimum Bennett Acceptance Ratio (OBAR), we introduce its nonequilibrium extension, the Optimum Crooks' Equation (OCE) in the current work. The efficiency of the NonEquilibrium Work (NEW) stratification is improved by adaptively manipulating the significance of each nonequilibrium realization followed by importance sampling. As is exhibited in the equilibrium case, the nonequilibrium extension outperforms the simple equal time rule used in nonequilibrium stratification in the sense of minimizing the total variance of the free energy estimate. The speedup of this non-equal time rule is more than 1-fold. The Time Derivative of total Variance (TDV) proposed for the OBAR criterion is extended to determine the importance of each nonequilibrium transformation, which is linearly dependent on the variance. The TDV in the nonequilibrium case gives a totally different importance rank from the standard errors of the free energy differences and OBAR TDV due to the duration of nonequilibrium pulling being added into the OCE equation. The performance of the OCE workflow is demonstrated in the solvation of several small molecules with a series of lambda increments and relaxation times between successive perturbations. To the best of our knowledge, such a nonequilibrium adaptive sampling regime in alchemical transformation is unprecedented.
Collapse
Affiliation(s)
- Xiaohui Wang
- State Key Laboratory of Precision Spectroscopy, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China.
| | | | | | | |
Collapse
|
14
|
Sun ZX, Wang XH, Zhang JZH. BAR-based optimum adaptive sampling regime for variance minimization in alchemical transformation. Phys Chem Chem Phys 2017; 19:15005-15020. [DOI: 10.1039/c7cp01561e] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The efficiency of alchemical free energy simulations with a staging strategy is improved by adaptively manipulating the significance of each ensemble followed by importance sampling.
Collapse
Affiliation(s)
- Zhao X. Sun
- State Key Laboratory of Precision Spectroscopy
- Institute of Theoretical and Computational Science
- East China Normal University
- Shanghai 200062
- China
| | - Xiao H. Wang
- State Key Laboratory of Precision Spectroscopy
- Institute of Theoretical and Computational Science
- East China Normal University
- Shanghai 200062
- China
| | - John Z. H. Zhang
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai
- Shanghai 200062
- China
- School of Chemistry and Molecular Engineering
- East China Normal University
| |
Collapse
|
15
|
Seifert U. Stochastic thermodynamics, fluctuation theorems and molecular machines. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2012; 75:126001. [PMID: 23168354 DOI: 10.1088/0034-4885/75/12/126001] [Citation(s) in RCA: 1280] [Impact Index Per Article: 98.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Stochastic thermodynamics as reviewed here systematically provides a framework for extending the notions of classical thermodynamics such as work, heat and entropy production to the level of individual trajectories of well-defined non-equilibrium ensembles. It applies whenever a non-equilibrium process is still coupled to one (or several) heat bath(s) of constant temperature. Paradigmatic systems are single colloidal particles in time-dependent laser traps, polymers in external flow, enzymes and molecular motors in single molecule assays, small biochemical networks and thermoelectric devices involving single electron transport. For such systems, a first-law like energy balance can be identified along fluctuating trajectories. For a basic Markovian dynamics implemented either on the continuum level with Langevin equations or on a discrete set of states as a master equation, thermodynamic consistency imposes a local-detailed balance constraint on noise and rates, respectively. Various integral and detailed fluctuation theorems, which are derived here in a unifying approach from one master theorem, constrain the probability distributions for work, heat and entropy production depending on the nature of the system and the choice of non-equilibrium conditions. For non-equilibrium steady states, particularly strong results hold like a generalized fluctuation-dissipation theorem involving entropy production. Ramifications and applications of these concepts include optimal driving between specified states in finite time, the role of measurement-based feedback processes and the relation between dissipation and irreversibility. Efficiency and, in particular, efficiency at maximum power can be discussed systematically beyond the linear response regime for two classes of molecular machines, isothermal ones such as molecular motors, and heat engines such as thermoelectric devices, using a common framework based on a cycle decomposition of entropy production.
Collapse
Affiliation(s)
- Udo Seifert
- II. Institut für Theoretische Physik, Universität Stuttgart, 70550 Stuttgart, Germany
| |
Collapse
|
16
|
Shevkunov SV. Collective interactions in the mechanism of adhesion of condensed phase nuclei to a crystal surface. 2. Thermodynamic stability. COLLOID JOURNAL 2012. [DOI: 10.1134/s1061933x12050122] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
17
|
|
18
|
Hahn AM, Then H. Measuring the convergence of Monte Carlo free-energy calculations. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:041117. [PMID: 20481687 DOI: 10.1103/physreve.81.041117] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2009] [Revised: 02/24/2010] [Indexed: 05/29/2023]
Abstract
The nonequilibrium work fluctuation theorem provides the way for calculations of (equilibrium) free-energy based on work measurements of nonequilibrium, finite-time processes, and their reversed counterparts by applying Bennett's acceptance ratio method. A nice property of this method is that each free-energy estimate readily yields an estimate of the asymptotic mean square error. Assuming convergence, it is easy to specify the uncertainty of the results. However, sample sizes have often to be balanced with respect to experimental or computational limitations and the question arises whether available samples of work values are sufficiently large in order to ensure convergence. Here, we propose a convergence measure for the two-sided free-energy estimator and characterize some of its properties, explain how it works, and test its statistical behavior. In total, we derive a convergence criterion for Bennett's acceptance ratio method.
Collapse
Affiliation(s)
- Aljoscha M Hahn
- Institut für Physik, Carl von Ossietzky Universität, 26111 Oldenburg, Germany
| | | |
Collapse
|
19
|
Geiger P, Dellago C. Optimum protocol for fast-switching free-energy calculations. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 81:021127. [PMID: 20365550 DOI: 10.1103/physreve.81.021127] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2009] [Indexed: 05/29/2023]
Abstract
Free-energy differences computed from fast-switching simulations or measurements according to the Jarzynski equation are independent of the particular protocol specifying how the control parameter is changed in time. In contrast, the average work carried out on the system as well the accuracy of the resulting free energy strongly depend on the protocol. Recently, Schmiedl and Seifert [Phys. Rev. Lett. 98, 108301 (2007)] found that protocols that minimize the average work for a given duration of the switching process have discrete steps at the beginning and the end. Here we determine numerically the protocols that minimize the statistical error in the free energy estimate and find that such minimum error protocols have similar discrete jumps. Our analysis shows that the reduction in computational effort achieved by the use of steplike protocols can be considerable. Such large savings of computing time, however, typically occur for parameter ranges in which an application of the Jarzynski equation is impractical due to large statistical errors arising from the exponential work average.
Collapse
Affiliation(s)
- Philipp Geiger
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090 Vienna, Austria
| | | |
Collapse
|
20
|
Hahn AM, Then H. Characteristic of Bennett's acceptance ratio method. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 80:031111. [PMID: 19905066 DOI: 10.1103/physreve.80.031111] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2009] [Indexed: 05/28/2023]
Abstract
A powerful and well-established tool for free-energy estimation is Bennett's acceptance ratio method. Central properties of this estimator, which employs samples of work values of a forward and its time-reversed process, are known: for given sets of measured work values, it results in the best estimate of the free-energy difference in the large sample limit. Here we state and prove a further characteristic of the acceptance ratio method: the convexity of its mean-square error. As a two-sided estimator, it depends on the ratio of the numbers of forward and reverse work values used. Convexity of its mean-square error immediately implies that there exists a unique optimal ratio for which the error becomes minimal. Further, it yields insight into the relation of the acceptance ratio method and estimators based on the Jarzynski equation. As an application, we study the performance of a dynamic strategy of sampling forward and reverse work values.
Collapse
Affiliation(s)
- Aljoscha M Hahn
- Institut für Physik, Carl von Ossietzky Universität, 26111 Oldenburg, Germany
| | | |
Collapse
|
21
|
Engel A. Asymptotics of work distributions in nonequilibrium systems. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 80:021120. [PMID: 19792090 DOI: 10.1103/physreve.80.021120] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2008] [Revised: 06/24/2009] [Indexed: 05/28/2023]
Abstract
The asymptotic behavior of the work distribution in driven nonequilibrium systems is determined using the method of optimal fluctuations. For systems described by Langevin dynamics the corresponding Euler-Lagrange equation together with the appropriate boundary conditions and an equation for the leading pre-exponential factor are derived. The method is applied to three representative examples and the results are used to improve the accuracy of free-energy estimates based on the application of the Jarzynski equation.
Collapse
Affiliation(s)
- A Engel
- Institut für Physik, Universität Oldenburg, 26111 Oldenburg, Germany
| |
Collapse
|