1
|
Derbel N, Alijah A, Robertson SH, Lauvaux T, Joly L. First-Generation Products of Trans-2-Hexenal Ozonolyis: A New Look at the Mechanism. J Phys Chem A 2025; 129:3272-3279. [PMID: 40171757 PMCID: PMC11995380 DOI: 10.1021/acs.jpca.4c07608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2024] [Revised: 03/18/2025] [Accepted: 03/19/2025] [Indexed: 04/04/2025]
Abstract
The ozonolysis reaction of trans-2-hexenal was studied theoretically on the basis of highly accurate CCSD(T)-F12b/AVTZ energy values obtained in M06-2X/AVTZ preoptimized nuclear configurations. The kinetics was modeled with the help of the master equation solver MESMER. Apart from the expected stable oxidation products 1-butanal (17%) and glyoxal (35%), a secondary ozonide is formed on the glyoxal channel, which is the principal first-generation product (49%). It is further shown that glyoxal is created on two competing pathways, one of which leads to simultaneous production of the ester propylformate (18%). The inclusion of all of these mechanisms explains the experimental findings and identifies for the first time the origin of the experimental carbon deficit.
Collapse
Affiliation(s)
- Najoua Derbel
- Laboratoire
de Spectroscopie Atomique, Moléculaire et Applications, Department
of Physics, Faculty of Sciences, University
Tunis - El Manar, 1060 Tunis, Tunisia
- Faculty
of Sciences of Bizerte, University of Carthage, Jarzouna, 7021 Bizerte, Tunisia
- GSMA,
Groupe de Spectrométrie Moléculaire et Atmosphérique,
UMR CNRS 7331, University of Reims Champagne-Ardenne, 51100 Reims, France
| | - Alexander Alijah
- GSMA,
Groupe de Spectrométrie Moléculaire et Atmosphérique,
UMR CNRS 7331, University of Reims Champagne-Ardenne, 51100 Reims, France
| | | | - Thomas Lauvaux
- GSMA,
Groupe de Spectrométrie Moléculaire et Atmosphérique,
UMR CNRS 7331, University of Reims Champagne-Ardenne, 51100 Reims, France
| | - Lilian Joly
- GSMA,
Groupe de Spectrométrie Moléculaire et Atmosphérique,
UMR CNRS 7331, University of Reims Champagne-Ardenne, 51100 Reims, France
| |
Collapse
|
2
|
Drehwald MS, Jamali A, Vargas-Hernández RA. MOLPIPx: An end-to-end differentiable package for permutationally invariant polynomials in Python and Rust. J Chem Phys 2025; 162:084115. [PMID: 40019201 DOI: 10.1063/5.0250837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Accepted: 01/31/2025] [Indexed: 03/01/2025] Open
Abstract
In this work, we present MOLPIPx, a versatile library designed to seamlessly integrate permutationally invariant polynomials with modern machine learning frameworks, enabling the efficient development of linear models, neural networks, and Gaussian process models. These methodologies are widely employed for parameterizing potential energy surfaces across diverse molecular systems. MOLPIPx leverages two powerful automatic differentiation engines-JAX and EnzymeAD-Rust-to facilitate the efficient computation of energy gradients and higher-order derivatives, which are essential for tasks such as force field development and dynamic simulations. MOLPIPx is available at https://github.com/ChemAI-Lab/molpipx.
Collapse
Affiliation(s)
- Manuel S Drehwald
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, Ontario L8S 4L8, Canada
| | - Asma Jamali
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, Ontario L8S 4L8, Canada
- School of Computational Science and Engineering, McMaster University, Hamilton, Ontario L8S 4K1, Canada
| | - Rodrigo A Vargas-Hernández
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, Ontario L8S 4L8, Canada
- School of Computational Science and Engineering, McMaster University, Hamilton, Ontario L8S 4K1, Canada
- Brockhouse Institute for Materials Research, McMaster University, Hamilton, Ontario L8S 4M1, Canada
| |
Collapse
|
3
|
Huo J, Dong H. Δ-EGNN Method Accelerates the Construction of Machine Learning Potential. J Phys Chem Lett 2025; 16:2080-2088. [PMID: 39973338 DOI: 10.1021/acs.jpclett.4c03474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Recent advancements in molecular simulations highlight the substantial computational demands of generating high-precision quantum mechanical labels for training neural network potentials. These challenges emphasize the need for improvements in delta-machine learning techniques. The Equivariant Graph Neural Network (EGNN) framework, grounded in a message-passing mechanism that preserves structural equivariance, enables refined atomic representations through interaction-driven updates. We introduce the Δ-EGNN model, which achieves high prediction accuracy for both molecular and condensed-phase systems. For example, in periodic water box systems, a mean absolute error of 1.722 meV/atom for energy (global property) and 0.0027 e for partial charge (local property) were achieved with training on just 800 labels. Δ-EGNN is computationally efficient, achieving speedups of 1-2 orders of magnitude compared to conventional methods at the MP2 level. In contrast to models directly trained on total energies, such as NequIP, MACE, and Allegro, the Δ-EGNN model employs delta-machine learning to predict the difference between energies derived from low- and high-level electronic structure methods, providing a significant advantage in reducing computational costs while preserving high accuracy. In summary, Δ-EGNN opens a new avenue for exploring energy landscapes and constructing machine learning potentials with afforable computational overhead, facilitating routine quantum mechanical calculations for complex molecular systems.
Collapse
Affiliation(s)
- Jun Huo
- Kuang Yaming Honors School, Nanjing University, Nanjing 210023, China
| | - Hao Dong
- Kuang Yaming Honors School, Nanjing University, Nanjing 210023, China
- State Key Laboratory of Analytical Chemistry for Life Science, Chemistry and Biomedicine Innovation Centre (ChemBIC), ChemBioMed Interdisciplinary Research Centre at Nanjing University, and Institute for Brain Sciences, Nanjing University, Nanjing 210023, China
| |
Collapse
|
4
|
Hao Y, Lu X, Fu B, Zhang DH. New Algorithms to Generate Permutationally Invariant Polynomials and Fundamental Invariants for Potential Energy Surface Fitting. J Chem Theory Comput 2025; 21:1046-1053. [PMID: 39841118 DOI: 10.1021/acs.jctc.4c01447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2025]
Abstract
Symmetric functions, such as Permutationally Invariant Polynomials (PIPs) and Fundamental Invariants (FIs), are effective and concise descriptors for incorporating permutation symmetry into neural network (NN) potential energy surface (PES) fitting. The traditional algorithm for generating such symmetric polynomials has a factorial time complexity of N!, where N is the number of identical atoms, posing a significant challenge to applying symmetric polynomials as descriptors of NN PESs for larger systems, particularly with more than 10 atoms. Herein, we report a new algorithm which has only linear time complexity for identical atoms. It can tremendously accelerate generation process of symmetric polynomials for molecular systems. The proposed algorithm is based on graph connectivity analysis following the action of the generation set of molecular permutational group. For instance, in the case of calculating the invariant polynomials for a 15-atom molecule, such as tropolone, our algorithm is approximately 2 million times faster than the previous method. The efficiency of the new algorithm can be further enhanced with increasing molecular size and number of identical atoms, making the FI-NN approach feasible for systems with over 10 atoms and high symmetry demands.
Collapse
Affiliation(s)
- Yiping Hao
- State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, People's Republic of China
- School of Chemical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xiaoxiao Lu
- State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, People's Republic of China
| | - Bina Fu
- State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, People's Republic of China
- School of Chemical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Hefei National Laboratory, Hefei 230088, China
| | - Dong H Zhang
- State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, People's Republic of China
- School of Chemical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Hefei National Laboratory, Hefei 230088, China
| |
Collapse
|
5
|
Li J, Vindel-Zandbergen P, Li J, Felker PM, Bačić Z. HF Trimer: A New Full-Dimensional Potential Energy Surface and Rigorous 12D Quantum Calculations of Vibrational States. J Phys Chem A 2024; 128:9707-9720. [PMID: 39484697 DOI: 10.1021/acs.jpca.4c03771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
HF trimer, as the smallest and the lightest cyclic hydrogen-bonded (HB) cluster, has long been a favorite prototype system for spectroscopic and theoretical investigations of the structure, energetics, spectroscopy, and dynamics of hydrogen-bond networks. Recently, rigorous quantum 12D calculations of the coupled intra- and intermolecular vibrations of this fundamental HB trimer (J. Chem. Phys. 2023, 158, 234109) were performed, employing an older ab initio-based many-body potential energy surface (PES). While the theoretical results were found to be in reasonably good agreement with the available spectroscopic data, it was also evident that it is highly desirable to develop a more accurate 12D PES of HF trimer. Motivated by this, here we report a new, and the first fully ab initio 12D PES of this paradigmatic system. Approximately 42,540 geometries were sampled and calculated at the level of CCSD(T)-F12a/AVTZ. The permutationally invariant polynomial-neural network based Δ-machine learning approach (J. Phys. Chem. Lett. 2022, 13, 4729) was employed to perform cost-efficient calculations of the basis-set-superposition error (BSSE) correction. By strategically selecting data points, this approach facilitated the construction of a high-precision PES with BSSE correction, while requiring only a minimal number of BSSE value computations. The fitting error of the final PES is only 0.035 kcal/mol. To assess its performance, the 12D fully coupled quantum calculations of excited intra- and intermolecular vibrational states of HF trimer are carried out using the rigorous methodology developed by us earlier. The results are found to be in a significantly better agreement with the available spectroscopic data than those obtained with the previously existing semiempirical 12D PES.
Collapse
Affiliation(s)
- Jia Li
- School of Chemistry and Chemical Engineering & Chongqing Key Laboratory of Chemical Theory and Mechanism, Chongqing University, Chongqing 401331, China
| | - Patricia Vindel-Zandbergen
- Department of Chemistry, New York University, New York, New York 10003, United States
- Simons Center for Computational Physical Chemistry, New York University, New York, New York 10003, United States
| | - Jun Li
- School of Chemistry and Chemical Engineering & Chongqing Key Laboratory of Chemical Theory and Mechanism, Chongqing University, Chongqing 401331, China
| | - Peter M Felker
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California 90095-1569, United States
| | - Zlatko Bačić
- Department of Chemistry, New York University, New York, New York 10003, United States
- Simons Center for Computational Physical Chemistry, New York University, New York, New York 10003, United States
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, 3663 Zhongshan Road North, Shanghai 200062, China
| |
Collapse
|
6
|
Nandi A, Pandey P, Houston PL, Qu C, Yu Q, Conte R, Tkatchenko A, Bowman JM. Δ-Machine Learning to Elevate DFT-Based Potentials and a Force Field to the CCSD( T) Level Illustrated for Ethanol. J Chem Theory Comput 2024; 20:8807-8819. [PMID: 39361051 PMCID: PMC11500277 DOI: 10.1021/acs.jctc.4c00977] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2024] [Revised: 09/17/2024] [Accepted: 09/18/2024] [Indexed: 10/23/2024]
Abstract
Progress in machine learning has facilitated the development of potentials that offer both the accuracy of first-principles techniques and vast increases in the speed of evaluation. Recently, Δ-machine learning has been used to elevate the quality of a potential energy surface (PES) based on low-level, e.g., density functional theory (DFT) energies and gradients to close to the gold-standard coupled cluster level of accuracy. We have demonstrated the success of this approach for molecules, ranging in size from H3O+ to 15-atom acetyl-acetone and tropolone. These were all done using the B3LYP functional. Here, we investigate the generality of this approach for the PBE, M06, M06-2X, and PBE0 + MBD functionals, using ethanol as the example molecule. Linear regression with permutationally invariant polynomials is used to fit both low-level and correction PESs. These PESs are employed for standard RMSE analysis for training and test data sets, and then general fidelity tests such as energetics of stationary points, normal-mode frequencies, and torsional potentials are examined. We achieve similar improvements in all cases. Interestingly, we obtained significant improvement over DFT gradients where coupled cluster gradients were not used to correct the low-level PES. Finally, we present some results for correcting a recent molecular mechanics force field for ethanol and comment on the possible generality of this approach.
Collapse
Affiliation(s)
- Apurba Nandi
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Priyanka Pandey
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Paul L. Houston
- Department
of Chemistry and Chemical Biology, Cornell
University, Ithaca, New York 14853, United States
- Department
of Chemistry and Biochemistry, Georgia Institute
of Technology, Atlanta, Georgia 30332, United States
| | - Chen Qu
- Independent
Researcher, Toronto, Ontario M9B0E3, Canada
| | - Qi Yu
- Department
of Chemistry, Fudan University, Shanghai 200438, P. R. China
| | - Riccardo Conte
- Dipartimento
di Chimica, Università degli Studi
di Milano, via Golgi 19, 20133 Milano, Italy
| | - Alexandre Tkatchenko
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Joel M. Bowman
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| |
Collapse
|
7
|
Song K, Li J. Fundamental Invariant Neural Network (FI-NN) Potential Energy Surface for the OH + CH 3OH Reaction with Analytical Forces. J Phys Chem A 2024; 128:6636-6647. [PMID: 39096277 DOI: 10.1021/acs.jpca.4c02432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/05/2024]
Abstract
The hydrogen abstraction reaction of OH + CH3OH plays a great role in combustion and atmospheric and interstellar chemistry and has been extensively studied theoretically and experimentally. Theoretically, the numerical gradients with respect to the Cartesian coordinates of atoms in molecular simulations on our recent potential energy surface (PES) for the title reaction trained using the permutationally invariant polynomial neural network (PIP-NN) approach hinder the extensive calculation because of the unaffordable computation cost. To address this issue, we in this work report a new full-dimensional accurate analytical PES for the title reaction using the fundamental invariant neural network (FI-NN) approach based on 140,192 points of the quality UCCSD(T)-F12a/AVTZ. Besides, the spin-orbit (SO) corrections of OH in the entrance channel were determined at the level of complete active space self-consistent field with the AVTZ basis set. As a compromise between computational cost and efficiency, the Δ-machine learning approach was employed to construct the SO-corrected PES. Based on this new FI-NN PES with analytical forces, thermal rate coefficients and various dynamic properties, including the integral cross sections, the differential cross sections, and the product energy partitioning, were determined by running a total of 5.5 million trajectories. The use of analytical gradients of the FI-NN PES accelerated simulations and about 99% of computation cost was saved, compared to that for the PIP-NN PES with numerical gradients. Such a significant acceleration is achieved mainly by replacing PIPs with FIs.
Collapse
Affiliation(s)
- Kaisheng Song
- School of Chemistry and Chemical Engineering & Chongqing Key Laboratory of Chemical Theory and Mechanism, Chongqing University, Chongqing 401331, P.R. China
| | - Jun Li
- School of Chemistry and Chemical Engineering & Chongqing Key Laboratory of Chemical Theory and Mechanism, Chongqing University, Chongqing 401331, P.R. China
| |
Collapse
|
8
|
Giese TJ, Zeng J, Lerew L, McCarthy E, Tao Y, Ekesan Ş, York DM. Software Infrastructure for Next-Generation QM/MM-ΔMLP Force Fields. J Phys Chem B 2024; 128:6257-6271. [PMID: 38905451 PMCID: PMC11414325 DOI: 10.1021/acs.jpcb.4c01466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/23/2024]
Abstract
We present software infrastructure for the design and testing of new quantum mechanical/molecular mechanical and machine-learning potential (QM/MM-ΔMLP) force fields for a wide range of applications. The software integrates Amber's molecular dynamics simulation capabilities with fast, approximate quantum models in the xtb package and machine-learning potential corrections in DeePMD-kit. The xtb package implements the recently developed density-functional tight-binding QM models with multipolar electrostatics and density-dependent dispersion (GFN2-xTB), and the interface with Amber enables their use in periodic boundary QM/MM simulations with linear-scaling QM/MM particle-mesh Ewald electrostatics. The accuracy of the semiempirical models is enhanced by including machine-learning correction potentials (ΔMLPs) enabled through an interface with the DeePMD-kit software. The goal of this paper is to present and validate the implementation of this software infrastructure in molecular dynamics and free energy simulations. The utility of the new infrastructure is demonstrated in proof-of-concept example applications. The software elements presented here are open source and freely available. Their interface provides a powerful enabling technology for the design of new QM/MM-ΔMLP models for studying a wide range of problems, including biomolecular reactivity and protein-ligand binding.
Collapse
Affiliation(s)
- Timothy J. Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Jinzhe Zeng
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Lauren Lerew
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Erika McCarthy
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Yujun Tao
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Şölen Ekesan
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
9
|
Tao Y, Giese TJ, Ekesan Ş, Zeng J, Aradi B, Hourahine B, Aktulga HM, Götz AW, Merz KM, York DM. Amber free energy tools: Interoperable software for free energy simulations using generalized quantum mechanical/molecular mechanical and machine learning potentials. J Chem Phys 2024; 160:224104. [PMID: 38856060 DOI: 10.1063/5.0211276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 05/15/2024] [Indexed: 06/11/2024] Open
Abstract
We report the development and testing of new integrated cyberinfrastructure for performing free energy simulations with generalized hybrid quantum mechanical/molecular mechanical (QM/MM) and machine learning potentials (MLPs) in Amber. The Sander molecular dynamics program has been extended to leverage fast, density-functional tight-binding models implemented in the DFTB+ and xTB packages, and an interface to the DeePMD-kit software enables the use of MLPs. The software is integrated through application program interfaces that circumvent the need to perform "system calls" and enable the incorporation of long-range Ewald electrostatics into the external software's self-consistent field procedure. The infrastructure provides access to QM/MM models that may serve as the foundation for QM/MM-ΔMLP potentials, which supplement the semiempirical QM/MM model with a MLP correction trained to reproduce ab initio QM/MM energies and forces. Efficient optimization of minimum free energy pathways is enabled through a new surface-accelerated finite-temperature string method implemented in the FE-ToolKit package. Furthermore, we interfaced Sander with the i-PI software by implementing the socket communication protocol used in the i-PI client-server model. The new interface with i-PI allows for the treatment of nuclear quantum effects with semiempirical QM/MM-ΔMLP models. The modular interoperable software is demonstrated on proton transfer reactions in guanine-thymine mispairs in a B-form deoxyribonucleic acid helix. The current work represents a considerable advance in the development of modular software for performing free energy simulations of chemical reactions that are important in a wide range of applications.
Collapse
Affiliation(s)
- Yujun Tao
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Timothy J Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Şölen Ekesan
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Jinzhe Zeng
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Bálint Aradi
- Bremen Center for Computational Materials Science, University of Bremen, D-28334 Bremen, Germany
| | - Ben Hourahine
- SUPA, Department of Physics, University of Strathclyde, Glasgow G4 0NG, United Kingdom
| | - Hasan Metin Aktulga
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA
| | - Andreas W Götz
- San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, USA
| | - Kenneth M Merz
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA
| | - Darrin M York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| |
Collapse
|
10
|
Li F, Liu X, Ma H, Bian W. A diabatization method based upon integrating the diabatic potential gradient difference. Phys Chem Chem Phys 2024; 26:16477-16487. [PMID: 38656815 DOI: 10.1039/d4cp00375f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
In this work we develop a new scheme to construct a diabatic potential energy matrix (DPEM). We propose a diabatization method which is based on integrating the diabatic potential gradient difference to diabatize adiabatic ab initio energies. This method is capable of performing high-precision adiabatic-to-diabatic transformations, with a unique advantage in effectively handling the significant fluctuations in derivative-couplings caused by conical intersection (CI) seams. The above scheme is applied to the DPEM construction of the Na(3p) + H2 → NaH + H reaction. The fitting data including adiabatic energies, energy gradients and derivative-couplings obtained from a previous benchmark DPEM are diabatized and fitted using a general neural network fitting procedure to generate the DPEM. The produced DPEM can effectively describe nonadiabatic processes involving different electronic states. We further perform quantum dynamical calculations on the new DPEM and the previous benchmark DPEM, and the obtained results demonstrate the effectiveness of our scheme.
Collapse
Affiliation(s)
- Fengyi Li
- Beijing National Laboratory for Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Xiaoxi Liu
- Beijing National Laboratory for Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Haitao Ma
- Beijing National Laboratory for Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China.
| | - Wensheng Bian
- Beijing National Laboratory for Molecular Sciences, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China
| |
Collapse
|
11
|
Shi Z, Lele AD, Jasper AW, Klippenstein SJ, Ju Y. Quasi-Classical Trajectory Calculation of Rate Constants Using an Ab Initio Trained Machine Learning Model (aML-MD) with Multifidelity Data. J Phys Chem A 2024; 128:3449-3457. [PMID: 38642065 DOI: 10.1021/acs.jpca.4c00750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/22/2024]
Abstract
Machine learning (ML) provides a great opportunity for the construction of models with improved accuracy in classical molecular dynamics (MD). However, the accuracy of a ML trained model is limited by the quality and quantity of the training data. Generating large sets of accurate ab initio training data can require significant computational resources. Furthermore, inconsistent or incompatible data with different accuracies obtained using different methods may lead to biased or unreliable ML models that do not accurately represent the underlying physics. Recently, transfer learning showed its potential for avoiding these problems as well as for improving the accuracy, efficiency, and generalization of ML models using multifidelity data. In this work, ab initio trained ML-based MD (aML-MD) models are developed through transfer learning using DFT and multireference data from multiple sources with varying accuracy within the Deep Potential MD framework. The accuracy of the force field is demonstrated by calculating rate constants for the H + HO2 → H2 + 3O2 reaction using quasi-classical trajectories. We show that the aML-MD model with transfer learning can accurately predict the rate constants while reducing the computational cost by more than five times compared to the use of more expensive quantum chemistry training data sets. Hence, the aML-MD model with transfer learning shows great potential in using multifidelity data to reduce the computational cost involved in generating the training set for these potentials.
Collapse
Affiliation(s)
- Zhiyu Shi
- Department of Mechanical and Aerospace Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Aditya Dilip Lele
- Department of Mechanical and Aerospace Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Ahren W Jasper
- Chemical Sciences and Engineering Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Stephen J Klippenstein
- Chemical Sciences and Engineering Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Yiguang Ju
- Department of Mechanical and Aerospace Engineering, Princeton University, Princeton, New Jersey 08544, United States
| |
Collapse
|
12
|
Houston PL, Qu C, Yu Q, Pandey P, Conte R, Nandi A, Bowman JM, Kukolich SG. Formic Acid-Ammonia Heterodimer: A New Δ-Machine Learning CCSD(T)-Level Potential Energy Surface Allows Investigation of the Double Proton Transfer. J Chem Theory Comput 2024; 20:1821-1828. [PMID: 38382541 DOI: 10.1021/acs.jctc.3c01273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
The formic acid-ammonia dimer is an important example of a hydrogen-bonded complex in which a double proton transfer can occur. Its microwave spectrum has recently been reported and rotational constants and quadrupole coupling constants were determined. Calculated estimates of the double-well barrier and the internal barriers to rotation were also reported. Here, we report a full-dimensional potential energy surface (PES) for this complex, using two closely related Δ-machine learning methods to bring it to the CCSD(T) level of accuracy. The PES dissociates smoothly and accurately. Using a 2d quantum model the ground vibrational-state tunneling splitting is estimated to be less than 10-4 cm-1. The dipole moment along the intrinsic reaction coordinate is calculated along with a Mullikan charge analysis and supports the mildly ionic character of the minimum and strongly ionic character at the double-well barrier.
Collapse
Affiliation(s)
- Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, U.S.A. and Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Chen Qu
- Independent Researcher, Toronto, Ontario M9B0E3, Canada
| | - Qi Yu
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Priyanka Pandey
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, Via Golgi 19, Milano 20133, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
- Department of Physics and Materials Science, University of Luxembourg, Luxembourg City L-1511, Luxembourg
| | - Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Stephen G Kukolich
- Department of Chemistry and Biochemistry, University of Arizona, 1306 E. University Avenue, Tucson, Arizona 85721, United States
| |
Collapse
|
13
|
Liu Y, Guo H. A Gaussian Process Based Δ-Machine Learning Approach to Reactive Potential Energy Surfaces. J Phys Chem A 2023; 127:8765-8772. [PMID: 37815868 DOI: 10.1021/acs.jpca.3c05318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/12/2023]
Abstract
The Gaussian process (GP) is an efficient nonparametric machine learning (ML) method. A distinct advantage of the GP is its ability to provide an estimate of statistical uncertainties. This is particularly useful in constructing high-dimensional potential energy surfaces (PESs) from ab initio data as it offers an optimal way to add new geometries to reduce the overall error. In this work, GP is employed in the context of Δ-machine learning (Δ-ML), in which a correction PES to an inaccurate low-level PES is constructed using a small number of high-level ab initio calculations. This new method is tested in three prototypical reactive systems, namely, the H + H2 → H + H2, OH + H2 → H2O + H, and H + CH4 → H2 + CH3 reactions. The results show that the GP-based Δ-ML approach is more efficient than its direct application in constructing high-level PESs. We also compare the new method to a previously proposed neural-network-based Δ-ML approach [Liu and Li J. Phys. Chem. Lett. 2022, 13, 4729-4738]. The results indicate that the two Δ-ML methods have comparable efficiencies in constructing accurate PESs.
Collapse
Affiliation(s)
- Yang Liu
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Hua Guo
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| |
Collapse
|
14
|
Yu Q, Qu C, Houston PL, Nandi A, Pandey P, Conte R, Bowman JM. A Status Report on "Gold Standard" Machine-Learned Potentials for Water. J Phys Chem Lett 2023; 14:8077-8087. [PMID: 37656898 PMCID: PMC10510435 DOI: 10.1021/acs.jpclett.3c01791] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 08/28/2023] [Indexed: 09/03/2023]
Abstract
Owing to the central importance of water to life as well as its unusual properties, potentials for water have been the subject of extensive research over the past 50 years. Recently, five potentials based on different machine learning approaches have been reported that are at or near the "gold standard" CCSD(T) level of theory. The development of such high-level potentials enables efficient and accurate simulations of water systems using classical and quantum dynamical approaches. This Perspective serves as a status report of these potentials, focusing on their methodology and applications to water systems across different phases. Their performances on the energies of gas phase water clusters, as well as condensed phase structural and dynamical properties, are discussed.
Collapse
Affiliation(s)
- Qi Yu
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Chen Qu
- Independent
Researcher, Toronto, Ontario M9B 0E3, Canada
| | - Paul L. Houston
- Department
of Chemistry and Chemical Biology, Cornell
University, Ithaca, New York 14853, United States
- Department of Chemistry
and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Apurba Nandi
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Priyanka Pandey
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Riccardo Conte
- Dipartimento
di Chimica, Università degli Studi
di Milano, via Golgi 19, 20133 Milano, Italy
| | - Joel M. Bowman
- Department
of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| |
Collapse
|
15
|
Tao C, Yang J, Hong Q, Sun Q, Li J. Global and Full-Dimensional Potential Energy Surfaces of the N 2 + O 2 Reaction for Hyperthermal Collisions. J Phys Chem A 2023; 127:4027-4042. [PMID: 37128765 DOI: 10.1021/acs.jpca.3c01065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The energy transfer, dissociations, and chemical reactions between O2 and N2 play an important role in the re-entry process of aircraft and many atmospheric, combustion, and plasma processes. Recently, Varga et al. (J. Chem. Phys., 2016, 144, 024310) developed a full-dimensional high-precision potential energy surface (PES) of the ground triplet electronic state for the O2 and N2 system based on ca. 55,000 data points, whose energies were calculated by multi-state complete-active-space second-order perturbation theory/minimally augmented correlation-consistent polarized valence triple-zeta electronic structure calculations plus dynamically scaled external correlation. The fitting function adopted the many-body expansion form with the four-body interactions fitted by the permutationally invariant polynomial in terms of bond-order functions of the six interatomic distances (MB-PIP). In this work, we refit the PES of the N2O2 system by two methods based on the same data set that was used by Varga et al. The first uses the permutation invariant polynomial-neural network (PIP-NN) method to fit the entire energy of the 55,000 data points. In the second approach, the PIP-NN method is used to fit only the four-body interaction component, a similar treatment in the MB-PIP method, and the resulting PES is named MB-PIP-NN. Then, the performances of these new PESs and the MB-PIP PES are comprehensively and systematically compared, such as comparisons of various scans, properties of stationary points, and dynamics simulations. Possible improvements for the PES of N2O2 are suggested. A more reliable PES of the system can be constructed in terms of data sampling range, electronic structure calculation level, and fitting method for high-temperature calculation and simulation in the future.
Collapse
Affiliation(s)
- Chun Tao
- School of Chemistry and Chemical Engineering and Chongqing Key Laboratory of Theoretical and Computational Chemistry, Chongqing University, Chongqing 401331, China
| | - Jiawei Yang
- School of Chemistry and Chemical Engineering and Chongqing Key Laboratory of Theoretical and Computational Chemistry, Chongqing University, Chongqing 401331, China
| | - Qizhen Hong
- State Key Laboratory of High Temperature Gas Dynamics, Institute of Mechanics, Chinese Academy of Sciences, Beijing 100190, China
| | - Quanhua Sun
- State Key Laboratory of High Temperature Gas Dynamics, Institute of Mechanics, Chinese Academy of Sciences, Beijing 100190, China
| | - Jun Li
- School of Chemistry and Chemical Engineering and Chongqing Key Laboratory of Theoretical and Computational Chemistry, Chongqing University, Chongqing 401331, China
| |
Collapse
|
16
|
Käser S, Vazquez-Salazar LI, Meuwly M, Töpfer K. Neural network potentials for chemistry: concepts, applications and prospects. DIGITAL DISCOVERY 2023; 2:28-58. [PMID: 36798879 PMCID: PMC9923808 DOI: 10.1039/d2dd00102k] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 12/20/2022] [Indexed: 12/24/2022]
Abstract
Artificial Neural Networks (NN) are already heavily involved in methods and applications for frequent tasks in the field of computational chemistry such as representation of potential energy surfaces (PES) and spectroscopic predictions. This perspective provides an overview of the foundations of neural network-based full-dimensional potential energy surfaces, their architectures, underlying concepts, their representation and applications to chemical systems. Methods for data generation and training procedures for PES construction are discussed and means for error assessment and refinement through transfer learning are presented. A selection of recent results illustrates the latest improvements regarding accuracy of PES representations and system size limitations in dynamics simulations, but also NN application enabling direct prediction of physical results without dynamics simulations. The aim is to provide an overview for the current state-of-the-art NN approaches in computational chemistry and also to point out the current challenges in enhancing reliability and applicability of NN methods on a larger scale.
Collapse
Affiliation(s)
- Silvan Käser
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | | | - Markus Meuwly
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| | - Kai Töpfer
- Department of Chemistry, University of Basel Klingelbergstrasse 80 CH-4056 Basel Switzerland
| |
Collapse
|
17
|
Bowman JM, Qu C, Conte R, Nandi A, Houston PL, Yu Q. Δ-Machine Learned Potential Energy Surfaces and Force Fields. J Chem Theory Comput 2023; 19:1-17. [PMID: 36527383 DOI: 10.1021/acs.jctc.2c01034] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
There has been great progress in developing machine-learned potential energy surfaces (PESs) for molecules and clusters with more than 10 atoms. Unfortunately, this number of atoms generally limits the level of electronic structure theory to less than the "gold standard" CCSD(T) level. Indeed, for the well-known MD17 dataset for molecules with 9-20 atoms, all of the energies and forces were obtained with DFT calculations (PBE). This Perspective is focused on a Δ-machine learning method that we recently proposed and applied to bring DFT-based PESs to close to CCSD(T) accuracy. This is demonstrated for hydronium, N-methylacetamide, acetyl acetone, and ethanol. For 15-atom tropolone, it appears that special approaches (e.g., molecular tailoring, local CCSD(T)) are needed to obtain the CCSD(T) energies. A new aspect of this approach is the extension of Δ-machine learning to force fields. The approach is based on many-body corrections to polarizable force field potentials. This is examined in detail using the TTM2.1 water potential. The corrections make use of our recent CCSD(T) datasets for 2-b, 3-b, and 4-b interactions for water. These datasets were used to develop a new fully ab initio potential for water, termed q-AQUA.
Collapse
Affiliation(s)
- Joel M Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Chen Qu
- Independent Researcher, Toronto, Canada 66777
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
| | - Paul L Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, United States.,Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Qi Yu
- Department of Chemistry, Yale University, New Haven, Connecticut 06520, United States
| |
Collapse
|
18
|
Giese TJ, Zeng J, York DM. Multireference Generalization of the Weighted Thermodynamic Perturbation Method. J Phys Chem A 2022; 126:8519-8533. [PMID: 36301936 PMCID: PMC9771595 DOI: 10.1021/acs.jpca.2c06201] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We describe the generalized weighted thermodynamic perturbation (gwTP) method for estimating the free energy surface of an expensive "high-level" potential energy function from the umbrella sampling performed with multiple inexpensive "low-level" reference potentials. The gwTP method is a generalization of the weighted thermodynamic perturbation (wTP) method developed by Li and co-workers [J. Chem. Theory Comput. 2018, 14, 5583-5596] that uses a single "low-level" reference potential. The gwTP method offers new possibilities in model design whereby the sampling generated from several low-level potentials may be combined (e.g., specific reaction parameter models that might have variable accuracy at different stages of a multistep reaction). The gwTP method is especially well suited for use with machine learning potentials (MLPs) that are trained against computationally expensive ab initio quantum mechanical/molecular mechanical (QM/MM) energies and forces using active learning procedures that naturally produce multiple distinct neural network potentials. Simulations can be performed with greater sampling using the fast MLPs and then corrected to the ab initio level using gwTP. The capabilities of the gwTP method are demonstrated by creating reference potentials based on the MNDO/d and DFTB2/MIO semiempirical models supplemented with the "range-corrected deep potential" (DPRc). The DPRc parameters are trained to ab initio QM/MM data, and the potentials are used to calculate the free energy surface of stepwise mechanisms for nonenzymatic RNA 2'-O-transesterification model reactions. The extended sampling made possible by the reference potentials allows one to identify unequilibrated portions of the simulations that are not always evident from the short time scale commonly used with ab initio QM/MM potentials. We show that the reference potential approach can yield more accurate ab initio free energy predictions than the wTP method or what can be reasonably afforded from explicit ab initio QM/MM sampling.
Collapse
Affiliation(s)
- Timothy J. Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Jinzhe Zeng
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|