1
|
Lapsien M, Bonus M, Gahan L, Raguin A, Gohlke H. PyPE_RESP: A Tool to Facilitate and Standardize Derivation of RESP Charges. J Chem Inf Model 2025; 65:4251-4256. [PMID: 40285710 DOI: 10.1021/acs.jcim.5c00041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2025]
Abstract
We introduce PyPE_RESP, a tool to facilitate and standardize partial atomic charge derivation using the Restrained Electrostatic Potential (RESP) approach. PyPE_RESP builds upon the open-source Python package RDKit for chemoinformatics and the AMBER suite for molecular simulations. PyPE_RESP provides an easy setup of multiconformer and multimolecule RESP fitting while allowing a comprehensive definition of charge constraint groups through multiple methods. As a command line tool, PyPE_RESP can be integrated into batch processes. The software enables the derivation of partial atomic charges for additive and polarizable force fields. It outputs constraint group and nonconstraint group charges to give an immediate overview of the fit result. PyPE_RESP will be distributed with AmberTools and compatible with most computing platforms.
Collapse
Affiliation(s)
- Marco Lapsien
- Institute for Pharmaceutical and Medicinal Chemistry & Bioeconomy Science Center (BioSC), Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Michele Bonus
- Institute for Pharmaceutical and Medicinal Chemistry & Bioeconomy Science Center (BioSC), Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Lianne Gahan
- Institute for Computational Cell Biology & Bioeconomy Science Center (BioSC), Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Adélaïde Raguin
- Institute for Computational Cell Biology & Bioeconomy Science Center (BioSC), Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| | - Holger Gohlke
- Institute for Pharmaceutical and Medicinal Chemistry & Bioeconomy Science Center (BioSC), Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
- Institute of Bio- and Geosciences (IBG-4: Bioinformatics), Forschungszentrum Jülich, 52425 Jülich, Germany
| |
Collapse
|
2
|
Huang Z, Wu X, Luo R. Isotropic Periodic Sum for Polarizable Gaussian Multipole Model. J Chem Theory Comput 2025; 21:4040-4050. [PMID: 40194962 DOI: 10.1021/acs.jctc.5c00123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
The isotropic periodic sum (IPS) method provides an efficient approach for computing long-range interactions by approximating distant molecular structures through isotropic periodic images of a local region. Here, we present a novel integration of IPS with the polarizable Gaussian multipole (pGM) model, extending its applicability to systems with Gaussian-distributed charges and dipoles. By developing and implementing the IPS multipole tensor theorem within the Gaussian multipole framework, we derive analytical expressions for IPS potentials that efficiently handle both permanent and induced multipole interactions. Our comprehensive validation includes energy conservation tests in the NVE ensemble, potential energy distributions in the NVT ensemble, structural analysis through radial distribution functions, diffusion coefficients, induced dipole calculations across various molecular systems, and ionic charging free energies. The results demonstrate that the pGM-IPS approach successfully reproduces energetic, structural, and dynamic properties of molecular systems with accuracy comparable to the traditional particle mesh Ewald method. Our work establishes pGM-IPS as a promising method for simulations of polarizable molecular systems, achieving a balance between computational efficiency and accuracy.
Collapse
Affiliation(s)
- Zhen Huang
- Chemical and Materials Physics Graduate Program, Departments of Chemistry, Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California Irvine. Irvine, California 92697, United States
| | - Xiongwu Wu
- Laboratory of Computational Biology, NHLBI, NIH, Bethesda, Maryland 20892, United States
| | - Ray Luo
- Chemical and Materials Physics Graduate Program, Departments of Chemistry, Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California Irvine. Irvine, California 92697, United States
| |
Collapse
|
3
|
Wu Y, Zhu Q, Huang Z, Cieplak P, Duan Y, Luo R. Automated Refinement of Property-Specific Polarizable Gaussian Multipole Water Models Using Bayesian Black-Box Optimization. J Chem Theory Comput 2025; 21:3563-3575. [PMID: 40108759 DOI: 10.1021/acs.jctc.5c00039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/22/2025]
Abstract
The critical importance of water in sustaining life highlights the need for accurate water models in computer simulations, aiming to mimic biochemical processes experimentally. The polarizable Gaussian multipole (pGM) model, recently introduced for biomolecular simulations, improves the handling of complex biomolecular interactions. As an integral part of our initial exploration, we examined a minimalist fixed geometry three-center pGM water model using ab initio quantum mechanical calculations of water oligomers. However, our final model development was based on liquid-phase water properties, leveraging automated machine learning (AutoML) techniques for optimization. This allows the development of a framework to refine both van der Waals and electrostatic parameters of the pGM model, aiming to accurately reproduce specific properties such as the oxygen-oxygen radial distribution function, density, and dipole moment, all at 298 K and 1.0 bar pressure. The efficacy of the optimized three-center pGM water model, pGM3P-25, was assessed through simulations of a water box of 512 water molecules, showcasing marked enhancements in both accuracy and practical utility. Notably, the model accurately reproduces thermodynamic properties not explicitly included in training while significantly reducing the time and human effort required for optimization. It was found that pGM3P-25 can reproduce temperature-dependent properties such as density, self-diffusion constants, heat capacity, second virial coefficient, and dielectric constant, which are important in biomolecular simulations. This study underscores the potential of AutoML-driven frameworks to streamline parameter refinement for molecular dynamics simulations, paving the way for broader applications in computational chemistry and beyond.
Collapse
Affiliation(s)
- Yongxian Wu
- Departments of Chemical and Biomolecular Engineering, Molecular Biology and Biochemistry, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Qiang Zhu
- Departments of Chemical and Biomolecular Engineering, Molecular Biology and Biochemistry, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Zhen Huang
- Departments of Chemical and Biomolecular Engineering, Molecular Biology and Biochemistry, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Piotr Cieplak
- SBP Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Yong Duan
- UC Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, One Shields Avenue, Davis, California 95616, United States
| | - Ray Luo
- Departments of Chemical and Biomolecular Engineering, Molecular Biology and Biochemistry, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| |
Collapse
|
4
|
Duan Y, Wang J, Cieplak P, Luo R. Refinement of Atomic Polarizabilities for a Polarizable Gaussian Multipole Force Field with Simultaneous Considerations of Both Molecular Polarizability Tensors and In-Solution Electrostatic Potentials. J Chem Inf Model 2025; 65:1428-1440. [PMID: 39865620 PMCID: PMC11815842 DOI: 10.1021/acs.jcim.4c02175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Revised: 01/11/2025] [Accepted: 01/14/2025] [Indexed: 01/28/2025]
Abstract
Atomic polarizabilities are considered to be fundamental parameters in polarizable molecular mechanical force fields that play pivotal roles in determining model transferability across different electrostatic environments. In an earlier work, the atomic polarizabilities were obtained by fitting them to the B3LYP/aug-cc-pvtz molecular polarizability tensors of mainly small molecules. Taking advantage of the recent PCMRESPPOL method, we refine the atomic polarizabilities for condensed-phase simulations using a polarizable Gaussian Multipole (pGM) force field. Departing from earlier works, in this work, we incorporated polarizability tensors of a large number of dimers and electrostatic potentials (ESPs) in multiple solvents. We calculated 1565 × 4 ESPs of small molecule monomers and dimers of noble gas and small molecules and 4742 × 4 ESPs of small molecule dimers in four solvents (diethyl ether, ε = 4.24, dichloroethane, ε = 10.13, acetone, ε = 20.49, and water, ε = 78.36). For the gas-phase polarizability tensors, we supplemented the molecule set that was used in our earlier work by adding both the 4252 monomer and dimer sets studied by Shaw and co-workers and the 7211 small molecule monomers listed in the QM7b database to a combined total of 13,523 molecular polarizability tensors of monomers and dimers. The QM7b polarizability set was obtained from quantum-machine.org and was calculated at the LR-CCSD/d-aug-cc-pVDZ level of theory. All other polarizability tensors and all ESPs were calculated at the ωB97X-D/aug-cc-pVTZ level of theory. The atomic polarizabilities were developed using all polarizability tensors and the 1565 × 4 ESPs of small molecule monomers and were then assessed by comparing them to the 4742 × 4 ab initio ESPs of small molecule dimers. The predicted dimer ESPs had an average relative root-mean-square error (RRMSE) of 9.30%, which was only slightly larger than the average fitting RRMSE of 9.15% of the monomer ESPs. The transferability of the polarizability set was further evaluated by comparing the ESPs calculated using parameters developed in another dielectric environment for both tetrapeptide and DES monomer data sets. It was observed that the polarizabilities of this work retained or slightly improved the transferability over the one discussed in earlier work even though the number of parameters in the present set is about half of that in the earlier set. Excluding the gas-phase data, for the DES monomer set, the average transfer RRMSEs were 16.25% and 10.83% for pGM-ind and pGM-perm methods, respectively, comparable to the average fitting RRMSEs of 16.03% and 10.54%; for tetrapeptides, the average transfer RRMSEs were 5.62% and 3.95% for pGM-ind and pGM-perm methods, respectively, slightly larger than 5.41% and 3.61% of the fitting RRMSEs. Therefore, we conclude that the pGM methods with updated polarizabilities achieved remarkable transferability from monomer to dimer and from one solvent to another.
Collapse
Affiliation(s)
- Yong Duan
- UC
Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, One Shields Avenue, Davis, California 95616, United States
| | - Junmei Wang
- Department
of Pharmaceutical Sciences, University of
Pittsburgh, 3501 Terrace Street, Pittsburgh, Pennsylvania 15261, United States
| | - Piotr Cieplak
- SBP
Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Ray Luo
- Departments
of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering,
Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| |
Collapse
|
5
|
Huang Z, Wu Y, Duan Y, Luo R. Performance Tuning of Polarizable Gaussian Multipole Model in Molecular Dynamics Simulations. J Chem Theory Comput 2025; 21:847-858. [PMID: 39772516 PMCID: PMC11854381 DOI: 10.1021/acs.jctc.4c01368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2025]
Abstract
Molecular dynamics (MD) simulations are essential for understanding molecular phenomena at the atomic level, with their accuracy largely dependent on both the employed force field and sampling. Polarizable force fields, which incorporate atomic polarization effects, represent a significant advancement in simulation technology. The polarizable Gaussian multipole (pGM) model has been noted for its accurate reproduction of ab initio electrostatic interactions. In this study, we document our effort to enhance the computational efficiency and scalability of the pGM simulations within the AMBER framework using MPI (message passing interface). Performance evaluations reveal that our MPI-based pGM model significantly reduces runtime and scales effectively while maintaining computational accuracy. Additionally, we investigated the stability and reliability of the MPI implementation under the NVE simulation ensemble. Optimal Ewald and induction parameters for the pGM model are also explored, and its statistical properties are assessed under various simulation ensembles. Our findings demonstrate that the MPI-implementation maintains enhanced stability and robustness during extended simulation times. We further evaluated the model performance under both NVT (constant number, volume, and temperature) and NPT (constant number, pressure, and temperature) ensembles and assessed the effects of varying timesteps and convergence tolerance on induced dipole calculations. The lessons learned from these exercises are expected to help the users to make informed decisions on simulation setup. The improved performance under these ensembles enables the study of larger molecular systems, thereby expanding the applicability of the pGM model in detailed MD simulations.
Collapse
Affiliation(s)
- Zhen Huang
- Chemical and Materials Physics Graduate Program, Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Yongxian Wu
- Chemical and Materials Physics Graduate Program, Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Yong Duan
- UC Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, One Shields Avenue, Davis, California 95616, United States
| | - Ray Luo
- Chemical and Materials Physics Graduate Program, Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| |
Collapse
|
6
|
Duan Y, Niu T, Wang J, Cieplak P, Luo R. PCMRESP: A Method for Polarizable Force Field Parameter Development and Transferability of the Polarizable Gaussian Multipole Models Across Multiple Solvents. J Chem Theory Comput 2024; 20:2820-2829. [PMID: 38502776 PMCID: PMC11008095 DOI: 10.1021/acs.jctc.4c00064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 02/27/2024] [Accepted: 03/01/2024] [Indexed: 03/21/2024]
Abstract
The transferability of force field parameters is a crucial aspect of high-quality force fields. Previous investigations have affirmed the transferability of electrostatic parameters derived from polarizable Gaussian multipole models (pGMs) when applied to water oligomer clusters, polypeptides across various conformations, and different sequences. In this study, we introduce PCMRESP, a novel method for electrostatic parametrization in solution, intended for the development of polarizable force fields. We utilized this method to assess the transferability of three models: a fixed charge model and two variants of pGM models. Our analysis involved testing these models on 377 small molecules and 100 tetra-peptides in five representative dielectric environments: gas, diethyl ether, dichloroethane, acetone, and water. Our findings reveal that the inclusion of atomic polarization significantly enhances transferability and the incorporation of permanent atomic dipoles, in the form of covalent bond dipoles, leads to further improvements. Moreover, our tests on dual-solvent strategies demonstrate consistent transferability for all three models, underscoring the robustness of the dual-solvent approach. In contrast, an evaluation of the traditional HF/6-31G* method indicates poor transferability for the pGM-ind and pGM-perm models, suggesting the limitations of this conventional approach.
Collapse
Affiliation(s)
- Yong Duan
- UC
Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, One Shields Avenue, Davis, California 95616, United States
| | - Taoyu Niu
- Department
of Pharmaceutical Sciences and Computational Chemical Genomics Screening
Center, School of Pharmacy, University of
Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Junmei Wang
- Department
of Pharmaceutical Sciences and Computational Chemical Genomics Screening
Center, School of Pharmacy, University of
Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Piotr Cieplak
- SBP
Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Ray Luo
- Departments
of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering,
Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine. Irvine, California 92697, United States
| |
Collapse
|
7
|
Zhao S, Cieplak P, Duan Y, Luo R. Assessment of Amino Acid Electrostatic Parametrizations of the Polarizable Gaussian Multipole Model. J Chem Theory Comput 2024; 20:2098-2110. [PMID: 38394331 PMCID: PMC11060985 DOI: 10.1021/acs.jctc.3c01347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2024]
Abstract
Accurate parametrization of amino acids is pivotal for the development of reliable force fields for molecular modeling of biomolecules such as proteins. This study aims to assess amino acid electrostatic parametrizations with the polarizable Gaussian Multipole (pGM) model by evaluating the performance of the pGM-perm (with atomic permanent dipoles) and pGM-ind (without atomic permanent dipoles) variants compared to the traditional RESP model. The 100-conf-combterm fitting strategy on tetrapeptides was adopted, in which (1) all peptide bond atoms (-CO-NH-) share identical set of parameters and (2) the total charges of the two terminal N-acetyl (ACE) and N-methylamide (NME) groups were set to neutral. The accuracy and transferability of electrostatic parameters across peptides with varying lengths and real-world examples were examined. The results demonstrate the enhanced performance of the pGM-perm model in accurately representing the electrostatic properties of amino acids. This insight underscores the potential of the pGM-perm model and the 100-conf-combterm strategy for the future development of the pGM force field.
Collapse
Affiliation(s)
- Shiji Zhao
- Nurix Therapeutics, Inc., 1700 Owens St. Suite 205, San Francisco, CA 94158, USA
| | - Piotr Cieplak
- SBP Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Yong Duan
- UC Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, One Shields Avenue, Davis, California 95616, United States
| | - Ray Luo
- Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine. Irvine, California 92697, United States
| |
Collapse
|
8
|
Case D, Aktulga HM, Belfon K, Cerutti DS, Cisneros GA, Cruzeiro VD, Forouzesh N, Giese TJ, Götz AW, Gohlke H, Izadi S, Kasavajhala K, Kaymak MC, King E, Kurtzman T, Lee TS, Li P, Liu J, Luchko T, Luo R, Manathunga M, Machado MR, Nguyen HM, O’Hearn KA, Onufriev AV, Pan F, Pantano S, Qi R, Rahnamoun A, Risheh A, Schott-Verdugo S, Shajan A, Swails J, Wang J, Wei H, Wu X, Wu Y, Zhang S, Zhao S, Zhu Q, Cheatham TE, Roe DR, Roitberg A, Simmerling C, York DM, Nagan MC, Merz KM. AmberTools. J Chem Inf Model 2023; 63:6183-6191. [PMID: 37805934 PMCID: PMC10598796 DOI: 10.1021/acs.jcim.3c01153] [Citation(s) in RCA: 573] [Impact Index Per Article: 286.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Indexed: 10/10/2023]
Abstract
AmberTools is a free and open-source collection of programs used to set up, run, and analyze molecular simulations. The newer features contained within AmberTools23 are briefly described in this Application note.
Collapse
Affiliation(s)
- David
A. Case
- Department
of Chemistry and Chemical Biology, Rutgers
University, Piscataway 08854, New Jersey, United States
| | - Hasan Metin Aktulga
- Department
of Computer Science and Engineering, Michigan
State University, East Lansing 48824-1322, Michigan, United States
| | - Kellon Belfon
- FOG
Pharmaceuticals Inc., Cambridge 02140, Massachusetts, United States
| | - David S. Cerutti
- Psivant, 451 D Street, Suite 205, Boston 02210, Massachusetts, United States
| | - G. Andrés Cisneros
- Department
of Physics, Department of Chemistry and Biochemistry, University of Texas at Dallas, Richardson 75801, Texas, United States
| | - Vinícius
Wilian D. Cruzeiro
- Department
of Chemistry and The PULSE Institute, Stanford
University, Stanford 94305, California, United States
| | - Negin Forouzesh
- Department
of Computer Science, California State University, Los Angeles 90032, California, United States
| | - Timothy J. Giese
- Laboratory
for Biomolecular Simulation Research, Institute for Quantitative Biomedicine
and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway 08854, New Jersey, United States
| | - Andreas W. Götz
- San
Diego Supercomputer Center, University of
California San Diego, La Jolla 92093-0505, California, United States
| | - Holger Gohlke
- Institute
for Pharmaceutical and Medicinal Chemistry, Heinrich Heine University Düsseldorf, Düsseldorf 40225, Germany
- Institute
of Bio- and Geosciences (IBG-4: Bioinformatics), Forschungszentrum Jülich GmbH, Jülich 52425, Germany
| | - Saeed Izadi
- Pharmaceutical
Development, Genentech, Inc., South San Francisco 94080, California, United
States
| | - Koushik Kasavajhala
- Laufer
Center for Physical and Quantitative Biology, Department of Chemistry, Stony Brook University, Stony Brook 11794, New York, United States
| | - Mehmet C. Kaymak
- Department
of Computer Science and Engineering, Michigan
State University, East Lansing 48824-1322, Michigan, United States
| | - Edward King
- Departments
of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering,
Materials Science and Engineering, and Biomedical Engineering, Graduate
Program in Chemical and Materials Physics, University of California, Irvine 92697, California, United States
| | - Tom Kurtzman
- Ph.D.
Programs in Chemistry, Biochemistry, and Biology, The Graduate Center of the City University of New York, 365 Fifth Avenue, New York 10016, New York, United States
- Department
of Chemistry, Lehman College, 250 Bedford Park Blvd West, Bronx 10468, New York, United States
| | - Tai-Sung Lee
- Laboratory
for Biomolecular Simulation Research, Institute for Quantitative Biomedicine
and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway 08854, New Jersey, United States
| | - Pengfei Li
- Department
of Chemistry and Biochemistry, Loyola University
Chicago, Chicago 60660, Illinois, United States
| | - Jian Liu
- Beijing
National Laboratory for Molecular Sciences, Institute of Theoretical
and Computational Chemistry, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Tyler Luchko
- Department
of Physics and Astronomy, California State
University, Northridge, Northridge 91330, California, United States
| | - Ray Luo
- Departments
of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering,
Materials Science and Engineering, and Biomedical Engineering, Graduate
Program in Chemical and Materials Physics, University of California, Irvine 92697, California, United States
| | - Madushanka Manathunga
- Department
of Chemistry and Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing 48824-1322, Michigan, United States
| | | | - Hai Minh Nguyen
- Department
of Chemistry and Chemical Biology, Rutgers
University, Piscataway 08854, New Jersey, United States
| | - Kurt A. O’Hearn
- Department
of Computer Science and Engineering, Michigan
State University, East Lansing 48824-1322, Michigan, United States
| | - Alexey V. Onufriev
- Departments
of Computer Science and Physics, Virginia
Tech, Blacksburg 24061, Virginia, United
States
| | - Feng Pan
- Department
of Statistics, Florida State University, Tallahassee 32304, Florida, United States
| | - Sergio Pantano
- Institut Pasteur de Montevideo, Montevideo 11400, Uruguay
| | - Ruxi Qi
- Cryo-EM
Center, Southern University of Science and
Technology, Shenzhen 518055, China
| | - Ali Rahnamoun
- Department
of Computer Science and Engineering, Michigan
State University, East Lansing 48824-1322, Michigan, United States
| | - Ali Risheh
- Department
of Computer Science, California State University, Los Angeles 90032, California, United States
| | - Stephan Schott-Verdugo
- Institute
of Bio- and Geosciences (IBG-4: Bioinformatics), Forschungszentrum Jülich GmbH, Jülich 52425, Germany
| | - Akhil Shajan
- Department
of Chemistry and Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing 48824-1322, Michigan, United States
| | - Jason Swails
- Entos, 4470 W Sunset
Blvd, Suite 107, Los Angeles 90027, California, United States
| | - Junmei Wang
- Department
of Pharmaceutical Sciences, School of Pharmacy, University of Pittsburgh, Pittsburgh 15261, Pennsylvania, United States
| | - Haixin Wei
- Departments
of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering,
Materials Science and Engineering, and Biomedical Engineering, Graduate
Program in Chemical and Materials Physics, University of California, Irvine 92697, California, United States
| | - Xiongwu Wu
- Laboratory
of Computational Biology, NHLBI, NIH, Bethesda 20892, Maryland, United States
| | - Yongxian Wu
- Departments
of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering,
Materials Science and Engineering, and Biomedical Engineering, Graduate
Program in Chemical and Materials Physics, University of California, Irvine 92697, California, United States
| | - Shi Zhang
- Laboratory
for Biomolecular Simulation Research, Institute for Quantitative Biomedicine
and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway 08854, New Jersey, United States
| | - Shiji Zhao
- Departments
of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering,
Materials Science and Engineering, and Biomedical Engineering, Graduate
Program in Chemical and Materials Physics, University of California, Irvine 92697, California, United States
- Nurix Therapeutics, Inc., San Francisco 94158, California, United States
| | - Qiang Zhu
- Departments
of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering,
Materials Science and Engineering, and Biomedical Engineering, Graduate
Program in Chemical and Materials Physics, University of California, Irvine 92697, California, United States
| | - Thomas E. Cheatham
- Department
of Medicinal Chemistry, The University of
Utah, 30 South 2000 East, Salt Lake City 84112, Utah, United
States
| | - Daniel R. Roe
- Laboratory
of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda 20892, Maryland, United States
| | - Adrian Roitberg
- Department
of Chemistry, The University of Florida, 440 Leigh Hall, Gainesville 32611-7200, Florida, United States
| | - Carlos Simmerling
- Laufer
Center for Physical and Quantitative Biology, Department of Chemistry, Stony Brook University, Stony Brook 11794, New York, United States
| | - Darrin M. York
- Laboratory
for Biomolecular Simulation Research, Institute for Quantitative Biomedicine
and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway 08854, New Jersey, United States
| | - Maria C. Nagan
- Department
of Chemistry, Stony Brook University, Stony Brook 11794, New York, United States
| | - Kenneth M. Merz
- Department
of Chemistry and Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing 48824-1322, Michigan, United States
| |
Collapse
|
9
|
Zhu Q, Wu Y, Zhao S, Cieplak P, Duan Y, Luo R. Streamlining and Optimizing Strategies of Electrostatic Parameterization. J Chem Theory Comput 2023; 19:6353-6365. [PMID: 37676646 PMCID: PMC10530599 DOI: 10.1021/acs.jctc.3c00659] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
Accurate characterization of electrostatic interactions is crucial in molecular simulation. Various methods and programs have been developed to obtain electrostatic parameters for additive or polarizable models to replicate electrostatic properties obtained from experimental measurements or theoretical calculations. Electrostatic potentials (ESPs), a set of physically well-defined observables from quantum mechanical (QM) calculations, are well suited for optimization efforts due to the ease of collecting a large amount of conformation-dependent data. However, a reliable set of QM ESP computed at an appropriate level of theory and atomic basis set is necessary. In addition, despite the recent development of the PyRESP program for electrostatic parameterizations of induced dipole-polarizable models, the time-consuming and error-prone input file preparation process has limited the widespread use of these protocols. This work aims to comprehensively evaluate the quality of QM ESPs derived by eight methods, including wave function methods such as Hartree-Fock (HF), second-order Møller-Plesset (MP2), and coupled cluster-singles and doubles (CCSD), as well as five hybrid density functional theory (DFT) methods, used in conjunction with 13 different basis sets. The highest theory levels CCSD/aug-cc-pV5Z (a5z) and MP2/aug-cc-pV5Z (a5z) were selected as benchmark data over two homemade data sets. The results show that the hybrid DFT method, ωB97X-D, combined with the aug-cc-pVTZ (a3z) basis set, performs well in reproducing ESPs while taking both accuracy and efficiency into consideration. Moreover, a flexible and user-friendly program called PyRESP_GEN was developed to streamline input file preparation. The restraining strengths, along with strategies for polarizable Gaussian multipole (pGM) model parameterizations, were also optimized. These findings and the program presented in this work facilitate the development and application of induced dipole-polarizable models, such as pGM models, for molecular simulations of both chemical and biological significance.
Collapse
Affiliation(s)
- Qiang Zhu
- Department of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Yongxian Wu
- Department of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Shiji Zhao
- Nurix Therapeutics, Inc., 1700 Owens St, Suite 205, San Francisco, California 94158, United States
| | - Piotr Cieplak
- SBP Medical Discovery Institute, La Jolla, California 92037, United States
| | - Yong Duan
- UC Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, Davis, California 95616, United States
| | - Ray Luo
- Department of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| |
Collapse
|
10
|
Wang X, Wang Y, Guo M, Wang X, Li Y, Zhang JZH. Assessment of an Electrostatic Energy-Based Charge Model for Modeling the Electrostatic Interactions in Water Solvent. J Chem Theory Comput 2023; 19:6294-6312. [PMID: 37656610 DOI: 10.1021/acs.jctc.3c00467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/03/2023]
Abstract
The protein force field based on the restrained electrostatic potential (RESP) charges has limitations in accurately describing hydrogen bonding interactions in proteins. To address this issue, we propose an alternative approach called the electrostatic energy-based charges (EEC) model, which shows improved performance in describing electrostatic interactions (EIs) of hydrogen bonds in proteins. In this study, we further investigate the performance of the EEC model in modeling EIs in water solvent. Our findings demonstrate that the fixed EEC model can effectively reproduce the quantum mechanics/molecular mechanics (QM/MM)-calculated EIs between a water molecule and various water solvent environments. However, to achieve the same level of computational accuracy, the electrostatic potential (ESP) charge model needs to fluctuate according to the electrostatic environment. Our analysis indicates that the requirement for charge adjustments depends on the specific mathematical and physical representation of EIs as a function of the environment for deriving charges. By comparing with widely used empirical water models calibrated to reproduce experimental properties, we confirm that the performance of the EEC model in reproducing QM/MM EIs is similar to that of general purpose TIP4P-like water models such as TIP4P-Ew and TIP4P/2005. When comparing the computed 10,000 distinct EI values within the range of -40 to 0 kcal/mol with the QM/MM results calculated at the MP2/aug-cc-pVQZ/TIP3P level, we noticed that the mean unsigned error (MUE) for the EEC model is merely 0.487 kcal/mol, which is remarkably similar to the MUE values of the TIP4P-Ew (0.63 kcal/mol) and TIP4P/2005 (0.579 kcal/mol) models. However, both the RESP method and the TIP3P model exhibit a tendency to overestimate the EIs, as evidenced by their higher MUE values of 1.761 and 1.293 kcal/mol, respectively. EEC-based molecular dynamics simulations have demonstrated that, when combined with appropriate van der Waals parameters, the EEC model can closely reproduce oxygen-oxygen radial distribution function and density of water, showing a remarkable similarity to the well-established TIP4P-like empirical water models. Our results demonstrate that the EEC model has the potential to build force fields with comparable accuracy to more sophisticated empirical TIP4P-like water models.
Collapse
Affiliation(s)
- Xianwei Wang
- College of Science, Zhejiang University of Technology, Hangzhou, Zhejiang 310023, China
| | - Yiying Wang
- College of Science, Zhejiang University of Technology, Hangzhou, Zhejiang 310023, China
| | - Man Guo
- College of Science, Zhejiang University of Technology, Hangzhou, Zhejiang 310023, China
| | - Xuechao Wang
- College of Science, Zhejiang University of Technology, Hangzhou, Zhejiang 310023, China
| | - Yang Li
- College of Information Science and Engineering, Shandong Agricultural University, Tai'an, Shandong 271018, China
| | - John Z H Zhang
- Shenzhen Institute of Synthetic Biology, Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, China
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi 030006, China
| |
Collapse
|
11
|
Huang Z, Zhao S, Cieplak P, Duan Y, Luo R, Wei H. Optimal Scheme to Achieve Energy Conservation in Induced Dipole Models. J Chem Theory Comput 2023; 19:5047-5057. [PMID: 37441805 PMCID: PMC10434752 DOI: 10.1021/acs.jctc.3c00226] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/15/2023]
Abstract
Induced dipole models have proven to be effective tools for simulating electronic polarization effects in biochemical processes, yet their potential has been constrained by energy conservation issue, particularly when historical data is utilized for dipole prediction. This study identifies error outliers as the primary factor causing this failure of energy conservation and proposes a comprehensive scheme to overcome this limitation. Leveraging maximum relative errors as a convergence metric, our data demonstrates that energy conservation can be upheld even when using historical information for dipole predictions. Our study introduces the multi-order extrapolation method to quicken induction iteration and optimize the use of historical data, while also developing the preconditioned conjugate gradient with local iterations to refine the iteration process and effectively remove error outliers. This scheme further incorporates a "peek" step via Jacobi under-relaxation for optimal performance. Simulation evidence suggests that our proposed scheme can achieve energy convergence akin to that of point-charge models within a limited number of iterations, thus promising significant improvements in efficiency and accuracy.
Collapse
Affiliation(s)
- Zhen Huang
- Chemical and Materials Physics Graduate Program, University of California, Irvine. Irvine, California 92697, United States
- Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine. Irvine, California 92697, United States
| | - Shiji Zhao
- Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine. Irvine, California 92697, United States
- Nurix Therapeutics, Inc., 1700 Owens St. Suite 205, San Francisco, CA 94158, United States
| | - Piotr Cieplak
- SBP Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Yong Duan
- UC Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, One Shields Avenue, Davis, California 95616, United States
| | - Ray Luo
- Chemical and Materials Physics Graduate Program, University of California, Irvine. Irvine, California 92697, United States
- Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine. Irvine, California 92697, United States
| | - Haixin Wei
- Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine. Irvine, California 92697, United States
- Department of Chemistry & Biochemistry, University of California, San Diego. 9500 Gilman Drive, La Jolla, California 92093, United States
| |
Collapse
|
12
|
Zhao S, Cieplak P, Duan Y, Luo R. Transferability of the Electrostatic Parameters of the Polarizable Gaussian Multipole Model. J Chem Theory Comput 2023; 19:924-941. [PMID: 36696564 PMCID: PMC10152989 DOI: 10.1021/acs.jctc.2c01048] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Accuracy and transferability are the two highly desirable properties of molecular mechanical force fields. Compared with the extensively used point-charge additive force fields that apply fixed atom-centered point partial charges to model electrostatic interactions, polarizable force fields are thought to have the advantage of modeling the atomic polarization effects. Previous works have demonstrated the accuracy of the recently developed polarizable Gaussian multipole (pGM) models. In this work, we assessed the transferability of the electrostatic parameters of the pGM models with (pGM-perm) and without (pGM-ind) atomic permanent dipoles in terms of reproducing the electrostatic potentials surrounding molecules/oligomers absent from electrostatic parameterizations. Encouragingly, both the pGM-perm and pGM-ind models show significantly improved transferability than the additive model in the tests (1) from water monomer to water oligomer clusters; (2) across different conformations of amino acid dipeptides and tetrapeptides; (3) from amino acid tetrapeptides to longer polypeptides; and (4) from nucleobase monomers to Watson-Crick base pair dimers and tetramers. Furthermore, we demonstrated that the double-conformation fittings using amino acid tetrapeptides in the αR and β conformations can result in good transferability not only across different tetrapeptide conformations but also from tetrapeptides to polypeptides with lengths ranging from 1 to 20 repetitive residues for both the pGM-ind and pGM-perm models. In addition, the observation that the pGM-ind model has significantly better accuracy and transferability than the point-charge additive model, even though they have an identical number of parameters, strongly suggest the importance of intramolecular polarization effects. In summary, this and previous works together show that the pGM models possess both accuracy and transferability, which are expected to serve as foundations for the development of next-generation polarizable force fields for modeling various polarization-sensitive biological systems and processes.
Collapse
Affiliation(s)
- Shiji Zhao
- Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| | - Piotr Cieplak
- SBP Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Yong Duan
- UC Davis Genome Center and Department of Biomedical Engineering, University of California, Davis, One Shields Avenue, Davis, California 95616, United States
| | - Ray Luo
- Departments of Molecular Biology and Biochemistry, Chemical and Biomolecular Engineering, Materials Science and Engineering, and Biomedical Engineering, University of California, Irvine, Irvine, California 92697, United States
| |
Collapse
|