1
|
Shi Z, Lele AD, Jasper AW, Klippenstein SJ, Ju Y. Quasi-Classical Trajectory Calculation of Rate Constants Using an Ab Initio Trained Machine Learning Model (aML-MD) with Multifidelity Data. J Phys Chem A 2024; 128:3449-3457. [PMID: 38642065 DOI: 10.1021/acs.jpca.4c00750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/22/2024]
Abstract
Machine learning (ML) provides a great opportunity for the construction of models with improved accuracy in classical molecular dynamics (MD). However, the accuracy of a ML trained model is limited by the quality and quantity of the training data. Generating large sets of accurate ab initio training data can require significant computational resources. Furthermore, inconsistent or incompatible data with different accuracies obtained using different methods may lead to biased or unreliable ML models that do not accurately represent the underlying physics. Recently, transfer learning showed its potential for avoiding these problems as well as for improving the accuracy, efficiency, and generalization of ML models using multifidelity data. In this work, ab initio trained ML-based MD (aML-MD) models are developed through transfer learning using DFT and multireference data from multiple sources with varying accuracy within the Deep Potential MD framework. The accuracy of the force field is demonstrated by calculating rate constants for the H + HO2 → H2 + 3O2 reaction using quasi-classical trajectories. We show that the aML-MD model with transfer learning can accurately predict the rate constants while reducing the computational cost by more than five times compared to the use of more expensive quantum chemistry training data sets. Hence, the aML-MD model with transfer learning shows great potential in using multifidelity data to reduce the computational cost involved in generating the training set for these potentials.
Collapse
Affiliation(s)
- Zhiyu Shi
- Department of Mechanical and Aerospace Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Aditya Dilip Lele
- Department of Mechanical and Aerospace Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Ahren W Jasper
- Chemical Sciences and Engineering Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Stephen J Klippenstein
- Chemical Sciences and Engineering Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Yiguang Ju
- Department of Mechanical and Aerospace Engineering, Princeton University, Princeton, New Jersey 08544, United States
| |
Collapse
|
2
|
Li F, Liu X, Ma H, Bian W. A diabatization method based upon integrating the diabatic potential gradient difference. Phys Chem Chem Phys 2024. [PMID: 38656815 DOI: 10.1039/d4cp00375f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
In this work we develop a new scheme to construct a diabatic potential energy matrix (DPEM). We propose a diabatization method which is based on integrating the diabatic potential gradient difference to diabatize adiabatic ab initio energies. This method is capable of performing high-precision adiabatic-to-diabatic transformations, with a unique advantage in effectively handling the significant fluctuations in derivative-couplings caused by conical intersection (CI) seams. The above scheme is applied to the DPEM construction of the Na(3p) + H2 → NaH + H reaction. The fitting data including adiabatic energies, energy gradients and derivative-couplings obtained from a previous benchmark DPEM are diabatized and fitted using a general neural network fitting procedure to generate the DPEM. The produced DPEM can effectively describe nonadiabatic processes involving different electronic states. We further perform quantum dynamical calculations on the new DPEM and the previous benchmark DPEM, and the obtained results demonstrate the effectiveness of our scheme.
Collapse
Affiliation(s)
- Fengyi Li
- Beijing National Laboratory for Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Xiaoxi Liu
- Beijing National Laboratory for Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Haitao Ma
- Beijing National Laboratory for Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China.
| | - Wensheng Bian
- Beijing National Laboratory for Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China
| |
Collapse
|
3
|
Houston PL, Qu C, Yu Q, Pandey P, Conte R, Nandi A, Bowman JM, Kukolich SG. Formic Acid-Ammonia Heterodimer: A New Δ-Machine Learning CCSD(T)-Level Potential Energy Surface Allows Investigation of the Double Proton Transfer. J Chem Theory Comput 2024; 20:1821-1828. [PMID: 38382541 DOI: 10.1021/acs.jctc.3c01273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
The formic acid-ammonia dimer is an important example of a hydrogen-bonded complex in which a double proton transfer can occur. Its microwave spectrum has recently been reported and rotational constants and quadrupole coupling constants were determined. Calculated estimates of the double-well barrier and the internal barriers to rotation were also reported. Here, we report a full-dimensional potential energy surface (PES) for this complex, using two closely related Δ-machine learning methods to bring it to the CCSD(T) level of accuracy. The PES dissociates smoothly and accurately. Using a 2d quantum model the ground vibrational-state tunneling splitting is estimated to be less than 10-4 cm-1. The dipole moment along the intrinsic reaction coordinate is calculated along with a Mullikan charge analysis and supports the mildly ionic character of the minimum and strongly ionic character at the double-well barrier.
Collapse
Affiliation(s)
- Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, U.S.A. and Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Chen Qu
- Independent Researcher, Toronto, Ontario M9B0E3, Canada
| | - Qi Yu
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Priyanka Pandey
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, Via Golgi 19, Milano 20133, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
- Department of Physics and Materials Science, University of Luxembourg, Luxembourg City L-1511, Luxembourg
| | - Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Stephen G Kukolich
- Department of Chemistry and Biochemistry, University of Arizona, 1306 E. University Avenue, Tucson, Arizona 85721, United States
| |
Collapse
|
4
|
Liu Y, Guo H. A Gaussian Process Based Δ-Machine Learning Approach to Reactive Potential Energy Surfaces. J Phys Chem A 2023; 127:8765-8772. [PMID: 37815868 DOI: 10.1021/acs.jpca.3c05318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/12/2023]
Abstract
The Gaussian process (GP) is an efficient nonparametric machine learning (ML) method. A distinct advantage of the GP is its ability to provide an estimate of statistical uncertainties. This is particularly useful in constructing high-dimensional potential energy surfaces (PESs) from ab initio data as it offers an optimal way to add new geometries to reduce the overall error. In this work, GP is employed in the context of Δ-machine learning (Δ-ML), in which a correction PES to an inaccurate low-level PES is constructed using a small number of high-level ab initio calculations. This new method is tested in three prototypical reactive systems, namely, the H + H2 → H + H2, OH + H2 → H2O + H, and H + CH4 → H2 + CH3 reactions. The results show that the GP-based Δ-ML approach is more efficient than its direct application in constructing high-level PESs. We also compare the new method to a previously proposed neural-network-based Δ-ML approach [Liu and Li J. Phys. Chem. Lett. 2022, 13, 4729-4738]. The results indicate that the two Δ-ML methods have comparable efficiencies in constructing accurate PESs.
Collapse
Affiliation(s)
- Yang Liu
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Hua Guo
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| |
Collapse
|
5
|
Yu Q, Qu C, Houston PL, Nandi A, Pandey P, Conte R, Bowman JM. A Status Report on "Gold Standard" Machine-Learned Potentials for Water. J Phys Chem Lett 2023; 14:8077-8087. [PMID: 37656898 PMCID: PMC10510435 DOI: 10.1021/acs.jpclett.3c01791] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 08/28/2023] [Indexed: 09/03/2023]
Abstract
Owing to the central importance of water to life as well as its unusual properties, potentials for water have been the subject of extensive research over the past 50 years. Recently, five potentials based on different machine learning approaches have been reported that are at or near the "gold standard" CCSD(T) level of theory. The development of such high-level potentials enables efficient and accurate simulations of water systems using classical and quantum dynamical approaches. This Perspective serves as a status report of these potentials, focusing on their methodology and applications to water systems across different phases. Their performances on the energies of gas phase water clusters, as well as condensed phase structural and dynamical properties, are discussed.
Collapse
Affiliation(s)
- Qi Yu
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Chen Qu
- Independent
Researcher, Toronto, Ontario M9B 0E3, Canada
| | - Paul L. Houston
- Department
of Chemistry and Chemical Biology, Cornell
University, Ithaca, New York 14853, United States
- Department of Chemistry
and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Apurba Nandi
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Priyanka Pandey
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Riccardo Conte
- Dipartimento
di Chimica, Università degli Studi
di Milano, via Golgi 19, 20133 Milano, Italy
| | - Joel M. Bowman
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| |
Collapse
|
6
|
Tao C, Yang J, Hong Q, Sun Q, Li J. Global and Full-Dimensional Potential Energy Surfaces of the N 2 + O 2 Reaction for Hyperthermal Collisions. J Phys Chem A 2023; 127:4027-4042. [PMID: 37128765 DOI: 10.1021/acs.jpca.3c01065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The energy transfer, dissociations, and chemical reactions between O2 and N2 play an important role in the re-entry process of aircraft and many atmospheric, combustion, and plasma processes. Recently, Varga et al. (J. Chem. Phys., 2016, 144, 024310) developed a full-dimensional high-precision potential energy surface (PES) of the ground triplet electronic state for the O2 and N2 system based on ca. 55,000 data points, whose energies were calculated by multi-state complete-active-space second-order perturbation theory/minimally augmented correlation-consistent polarized valence triple-zeta electronic structure calculations plus dynamically scaled external correlation. The fitting function adopted the many-body expansion form with the four-body interactions fitted by the permutationally invariant polynomial in terms of bond-order functions of the six interatomic distances (MB-PIP). In this work, we refit the PES of the N2O2 system by two methods based on the same data set that was used by Varga et al. The first uses the permutation invariant polynomial-neural network (PIP-NN) method to fit the entire energy of the 55,000 data points. In the second approach, the PIP-NN method is used to fit only the four-body interaction component, a similar treatment in the MB-PIP method, and the resulting PES is named MB-PIP-NN. Then, the performances of these new PESs and the MB-PIP PES are comprehensively and systematically compared, such as comparisons of various scans, properties of stationary points, and dynamics simulations. Possible improvements for the PES of N2O2 are suggested. A more reliable PES of the system can be constructed in terms of data sampling range, electronic structure calculation level, and fitting method for high-temperature calculation and simulation in the future.
Collapse
Affiliation(s)
- Chun Tao
- School of Chemistry and Chemical Engineering and Chongqing Key Laboratory of Theoretical and Computational Chemistry, Chongqing University, Chongqing 401331, China
| | - Jiawei Yang
- School of Chemistry and Chemical Engineering and Chongqing Key Laboratory of Theoretical and Computational Chemistry, Chongqing University, Chongqing 401331, China
| | - Qizhen Hong
- State Key Laboratory of High Temperature Gas Dynamics, Institute of Mechanics, Chinese Academy of Sciences, Beijing 100190, China
| | - Quanhua Sun
- State Key Laboratory of High Temperature Gas Dynamics, Institute of Mechanics, Chinese Academy of Sciences, Beijing 100190, China
| | - Jun Li
- School of Chemistry and Chemical Engineering and Chongqing Key Laboratory of Theoretical and Computational Chemistry, Chongqing University, Chongqing 401331, China
| |
Collapse
|
7
|
Käser S, Vazquez-Salazar LI, Meuwly M, Töpfer K. Neural network potentials for chemistry: concepts, applications and prospects. DIGITAL DISCOVERY 2023; 2:28-58. [PMID: 36798879 PMCID: PMC9923808 DOI: 10.1039/d2dd00102k] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 12/20/2022] [Indexed: 12/24/2022]
Abstract
Artificial Neural Networks (NN) are already heavily involved in methods and applications for frequent tasks in the field of computational chemistry such as representation of potential energy surfaces (PES) and spectroscopic predictions. This perspective provides an overview of the foundations of neural network-based full-dimensional potential energy surfaces, their architectures, underlying concepts, their representation and applications to chemical systems. Methods for data generation and training procedures for PES construction are discussed and means for error assessment and refinement through transfer learning are presented. A selection of recent results illustrates the latest improvements regarding accuracy of PES representations and system size limitations in dynamics simulations, but also NN application enabling direct prediction of physical results without dynamics simulations. The aim is to provide an overview for the current state-of-the-art NN approaches in computational chemistry and also to point out the current challenges in enhancing reliability and applicability of NN methods on a larger scale.
Collapse
Affiliation(s)
- Silvan Käser
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | | | - Markus Meuwly
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | - Kai Töpfer
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| |
Collapse
|
8
|
Bowman JM, Qu C, Conte R, Nandi A, Houston PL, Yu Q. Δ-Machine Learned Potential Energy Surfaces and Force Fields. J Chem Theory Comput 2023; 19:1-17. [PMID: 36527383 DOI: 10.1021/acs.jctc.2c01034] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
There has been great progress in developing machine-learned potential energy surfaces (PESs) for molecules and clusters with more than 10 atoms. Unfortunately, this number of atoms generally limits the level of electronic structure theory to less than the "gold standard" CCSD(T) level. Indeed, for the well-known MD17 dataset for molecules with 9-20 atoms, all of the energies and forces were obtained with DFT calculations (PBE). This Perspective is focused on a Δ-machine learning method that we recently proposed and applied to bring DFT-based PESs to close to CCSD(T) accuracy. This is demonstrated for hydronium, N-methylacetamide, acetyl acetone, and ethanol. For 15-atom tropolone, it appears that special approaches (e.g., molecular tailoring, local CCSD(T)) are needed to obtain the CCSD(T) energies. A new aspect of this approach is the extension of Δ-machine learning to force fields. The approach is based on many-body corrections to polarizable force field potentials. This is examined in detail using the TTM2.1 water potential. The corrections make use of our recent CCSD(T) datasets for 2-b, 3-b, and 4-b interactions for water. These datasets were used to develop a new fully ab initio potential for water, termed q-AQUA.
Collapse
Affiliation(s)
- Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Chen Qu
- Independent Researcher, Toronto, Canada 66777
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, United States.,Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Qi Yu
- Department of Chemistry, Yale University, New Haven, Connecticut 06520, United States
| |
Collapse
|
9
|
Giese TJ, Zeng J, York DM. Multireference Generalization of the Weighted Thermodynamic Perturbation Method. J Phys Chem A 2022; 126:8519-8533. [PMID: 36301936 PMCID: PMC9771595 DOI: 10.1021/acs.jpca.2c06201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We describe the generalized weighted thermodynamic perturbation (gwTP) method for estimating the free energy surface of an expensive "high-level" potential energy function from the umbrella sampling performed with multiple inexpensive "low-level" reference potentials. The gwTP method is a generalization of the weighted thermodynamic perturbation (wTP) method developed by Li and co-workers [J. Chem. Theory Comput. 2018, 14, 5583-5596] that uses a single "low-level" reference potential. The gwTP method offers new possibilities in model design whereby the sampling generated from several low-level potentials may be combined (e.g., specific reaction parameter models that might have variable accuracy at different stages of a multistep reaction). The gwTP method is especially well suited for use with machine learning potentials (MLPs) that are trained against computationally expensive ab initio quantum mechanical/molecular mechanical (QM/MM) energies and forces using active learning procedures that naturally produce multiple distinct neural network potentials. Simulations can be performed with greater sampling using the fast MLPs and then corrected to the ab initio level using gwTP. The capabilities of the gwTP method are demonstrated by creating reference potentials based on the MNDO/d and DFTB2/MIO semiempirical models supplemented with the "range-corrected deep potential" (DPRc). The DPRc parameters are trained to ab initio QM/MM data, and the potentials are used to calculate the free energy surface of stepwise mechanisms for nonenzymatic RNA 2'-O-transesterification model reactions. The extended sampling made possible by the reference potentials allows one to identify unequilibrated portions of the simulations that are not always evident from the short time scale commonly used with ab initio QM/MM potentials. We show that the reference potential approach can yield more accurate ab initio free energy predictions than the wTP method or what can be reasonably afforded from explicit ab initio QM/MM sampling.
Collapse
Affiliation(s)
- Timothy J. Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Jinzhe Zeng
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|