1
|
Gupta S, Bull-Vulpe EF, Agnew H, Iyer S, Zhu X, Zhou R, Knight C, Paesani F. MBX V1.2: Accelerating Data-Driven Many-Body Molecular Dynamics Simulations. J Chem Theory Comput 2025; 21:1838-1849. [PMID: 39951328 DOI: 10.1021/acs.jctc.4c01333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/26/2025]
Abstract
The MBX software provides an advanced platform for molecular dynamics simulations, leveraging state-of-the-art MB-pol and MB-nrg data-driven many-body potential energy functions. Developed over the past decade, these potential energy functions integrate physics-based and machine-learned many-body terms trained on electronic structure data calculated at the "gold standard" coupled-cluster level of theory. Recent advancements in MBX have focused on optimizing its performance, resulting in the release of MBX v1.2. While the inherently many-body nature of MB-pol and MB-nrg ensures high accuracy, it poses computational challenges. MBX v1.2 addresses these challenges with significant performance improvements, including enhanced parallelism that fully harnesses the power of modern multicore CPUs. These advancements enable simulations on nanosecond time scales for condensed-phase systems, significantly expanding the scope of high-accuracy, predictive simulations of complex molecular systems powered by data-driven many-body potential energy functions.
Collapse
Affiliation(s)
- Shreya Gupta
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Ethan F Bull-Vulpe
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Henry Agnew
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Shishir Iyer
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Xuanyu Zhu
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Ruihan Zhou
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Christopher Knight
- University of Chicago, Chicago, Illinois 60637, United States
- Computational Science Division, Argonne National Laboratory, Lemont, Illinois 60439, United States
| | - Francesco Paesani
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
- Materials Science and Engineering, University of California San Diego, La Jolla, California 92093, United States
- Halicioğlu Data Science Institute, University of California San Diego, La Jolla, California 92093, United States
- San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, United States
| |
Collapse
|
2
|
Palos E, Bull-Vulpe EF, Zhu X, Agnew H, Gupta S, Saha S, Paesani F. Current Status of the MB-pol Data-Driven Many-Body Potential for Predictive Simulations of Water Across Different Phases. J Chem Theory Comput 2024; 20:9269-9289. [PMID: 39401055 DOI: 10.1021/acs.jctc.4c01005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2024]
Abstract
Developing a molecular-level understanding of the properties of water is central to numerous scientific and technological applications. However, accurately modeling water through computer simulations has been a significant challenge due to the complex nature of the hydrogen-bonding network that water molecules form under different thermodynamic conditions. This complexity has led to over five decades of research and many modeling attempts. The introduction of the MB-pol data-driven many-body potential energy function marked a significant advancement toward a universal molecular model capable of predicting the structural, thermodynamic, dynamical, and spectroscopic properties of water across all phases. By integrating physics-based and data-driven (i.e., machine-learned) components, which correctly capture the delicate balance among different many-body interactions, MB-pol achieves chemical and spectroscopic accuracy, enabling realistic molecular simulations of water, from gas-phase clusters to liquid water and ice. In this review, we present a comprehensive overview of the data-driven many-body formalism adopted by MB-pol, highlight the main results and predictions made from computer simulations with MB-pol to date, and discuss the prospects for future extensions to data-driven many-body potentials of generic and reactive molecular systems.
Collapse
Affiliation(s)
- Etienne Palos
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Ethan F Bull-Vulpe
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Xuanyu Zhu
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Henry Agnew
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Shreya Gupta
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Suman Saha
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Francesco Paesani
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
- Materials Science and Engineering, University of California San Diego, La Jolla, California 92093, United States
- Halicioǧlu Data Science Institute, University of California San Diego, La Jolla, California 92093, United States
- San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, United States
| |
Collapse
|
3
|
Riera M, Knight C, Bull-Vulpe EF, Zhu X, Agnew H, Smith DGA, Simmonett AC, Paesani F. MBX: A many-body energy and force calculator for data-driven many-body simulations. J Chem Phys 2023; 159:054802. [PMID: 37526156 PMCID: PMC10550339 DOI: 10.1063/5.0156036] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 07/11/2023] [Indexed: 08/02/2023] Open
Abstract
Many-Body eXpansion (MBX) is a C++ library that implements many-body potential energy functions (PEFs) within the "many-body energy" (MB-nrg) formalism. MB-nrg PEFs integrate an underlying polarizable model with explicit machine-learned representations of many-body interactions to achieve chemical accuracy from the gas to the condensed phases. MBX can be employed either as a stand-alone package or as an energy/force engine that can be integrated with generic software for molecular dynamics and Monte Carlo simulations. MBX is parallelized internally using Open Multi-Processing and can utilize Message Passing Interface when available in interfaced molecular simulation software. MBX enables classical and quantum molecular simulations with MB-nrg PEFs, as well as hybrid simulations that combine conventional force fields and MB-nrg PEFs, for diverse systems ranging from small gas-phase clusters to aqueous solutions and molecular fluids to biomolecular systems and metal-organic frameworks.
Collapse
Affiliation(s)
- Marc Riera
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Christopher Knight
- Argonne National Laboratory, Computational Science Division, Lemont, Illinois 60439, USA
| | - Ethan F. Bull-Vulpe
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Xuanyu Zhu
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | - Henry Agnew
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, USA
| | | | - Andrew C. Simmonett
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | |
Collapse
|
4
|
Li XL, Li CM, Zhu JY, Zhou Z, Hao Q, Wang CS. A scheme for rapid evaluation of the intermolecular three-body polarization effect in water clusters. J Comput Chem 2023; 44:677-686. [PMID: 36408852 DOI: 10.1002/jcc.27032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 09/22/2022] [Accepted: 10/24/2022] [Indexed: 11/22/2022]
Abstract
The ability to accurately and rapidly evaluate the intermolecular many-body polarization effect of the water system is very important for computer simulations of biomolecule in aqueous. In this paper, a scheme is proposed based on the polarizable dipole-dipole interaction model and used to rapidly estimate the intermolecular many-body polarization effect in water clusters. We use a bond-dipole-based polarization function to evaluate the polarization energy. We regard two OH bonds of a water molecule as two bond-dipoles and set the permanent OH bond-dipole moment of a water molecule to be 1.51 Debye. We estimate the induced OH bond-dipole moment via a simple formula in which only one correction factor is needed. This scheme is then applied to tens of water clusters to calculate the three- and four-body interaction energies. The three-body interaction energies of 93 water clusters produced by our scheme are compared with those produced by the counterpoise-corrected CCSD(T)/aug-cc-pVDZ, MP2/aug-cc-pVDZ, M06-2X/jul-cc-pVTZ methods, by the AMOEBApro13, iAMOEBA, AMOEBA+, AMOEBA+(CF) methods, and by the MB-pol method. The four-body interaction energies of 47 water clusters yielded by our scheme are compared with those yielded by the counterpoise-corrected MP2/aug-cc-pVDZ and M06-2X/ jul-cc-pVTZ methods, by the AMOEBApro13, AMOEBA+, AMOEBA+(CF) methods, and by the MB-pol method. The comparison results show that the scheme proposed in this paper can reproduce the counterpoise-corrected CCSD(T)/aug-cc-pVDZ three-body interaction energies and reproduce the counterpoise-corrected MP2/aug-cc-pVDZ four-body interaction energies both accurately and efficiently. We anticipate the scheme proposed here can be useful for computer simulations of liquid water and aqueous solutions.
Collapse
Affiliation(s)
- Xiao-Lei Li
- School of Chemistry and Chemical Engineering, Liaoning Normal University, Dalian, People's Republic of China
| | - Chao-Ming Li
- School of Chemistry and Chemical Engineering, Liaoning Normal University, Dalian, People's Republic of China
| | - Jia-Yi Zhu
- School of Chemistry and Chemical Engineering, Liaoning Normal University, Dalian, People's Republic of China
| | - Zhan Zhou
- School of Chemistry and Chemical Engineering, Liaoning Normal University, Dalian, People's Republic of China
| | - Qiang Hao
- School of Chemistry and Chemical Engineering, Liaoning Normal University, Dalian, People's Republic of China
| | - Chang-Sheng Wang
- School of Chemistry and Chemical Engineering, Liaoning Normal University, Dalian, People's Republic of China
| |
Collapse
|
5
|
Hao H, Ruiz Pestana L, Qian J, Liu M, Xu Q, Head‐Gordon T. Chemical transformations and transport phenomena at interfaces. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Hongxia Hao
- Kenneth S. Pitzer Theory Center and Department of Chemistry University of California Berkeley California USA
- Chemical Sciences Division Lawrence Berkeley National Laboratory Berkeley California USA
| | - Luis Ruiz Pestana
- Department of Civil and Architectural Engineering University of Miami Coral Gables Florida USA
| | - Jin Qian
- Chemical Sciences Division Lawrence Berkeley National Laboratory Berkeley California USA
| | - Meili Liu
- Department of Civil and Architectural Engineering University of Miami Coral Gables Florida USA
| | - Qiang Xu
- Chemical Sciences Division Lawrence Berkeley National Laboratory Berkeley California USA
| | - Teresa Head‐Gordon
- Kenneth S. Pitzer Theory Center and Department of Chemistry University of California Berkeley California USA
- Chemical Sciences Division Lawrence Berkeley National Laboratory Berkeley California USA
- Department of Bioengineering and Chemical and Biomolecular Engineering University of California Berkeley California USA
| |
Collapse
|
6
|
Bull-Vulpe EF, Riera M, Bore SL, Paesani F. Data-Driven Many-Body Potential Energy Functions for Generic Molecules: Linear Alkanes as a Proof-of-Concept Application. J Chem Theory Comput 2022. [PMID: 36113028 DOI: 10.1021/acs.jctc.2c00645] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We present a generalization of the many-body energy (MB-nrg) theoretical/computational framework that enables the development of data-driven potential energy functions (PEFs) for generic covalently bonded molecules, with arbitrary quantum mechanical accuracy. The "nearsightedness of electronic matter" is exploited to define monomers as "natural building blocks" on the basis of their distinct chemical identity. The energy of generic molecules is then expressed as a sum of individual many-body energies of incrementally larger subsystems. The MB-nrg PEFs represent the low-order n-body energies, with n = 1-4, using permutationally invariant polynomials derived from electronic structure data carried out at an arbitrary quantum mechanical level of theory, while all higher-order n-body terms (n > 4) are represented by a classical many-body polarization term. As a proof-of-concept application of the general MB-nrg framework, we present MB-nrg PEFs for linear alkanes. The MB-nrg PEFs are shown to accurately reproduce reference energies, harmonic frequencies, and potential energy scans of alkanes, independently of their length. Since, by construction, the MB-nrg framework introduced here can be applied to generic covalently bonded molecules, we envision future computer simulations of complex molecular systems using data-driven MB-nrg PEFs, with arbitrary quantum mechanical accuracy.
Collapse
Affiliation(s)
- Ethan F. Bull-Vulpe
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Marc Riera
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Sigbjørn L. Bore
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
| | - Francesco Paesani
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, California 92093, United States
- Materials Science and Engineering, University of California San Diego, La Jolla, California 92093, United States
- San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, United States
| |
Collapse
|