1
Bowling PE, Dasgupta S, Herbert JM. Eliminating Imaginary Vibrational Frequencies in Quantum-Chemical Cluster Models of Enzymatic Active Sites. J Chem Inf Model 2024; 64:3912-3922. [PMID: 38648614 DOI: 10.1021/acs.jcim.4c00221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/25/2024]
Abstract
In constructing finite models of enzyme active sites for quantum-chemical calculations, atoms at the periphery of the model must be constrained to prevent unphysical rearrangements during geometry relaxation. A simple fixed-atom or "coordinate-lock" approach is commonly employed but leads to undesirable artifacts in the form of small imaginary frequencies. These preclude evaluation of finite-temperature free-energy corrections, limiting thermochemical calculations to enthalpies only. Full-dimensional vibrational frequency calculations are possible by replacing the fixed-atom constraints with harmonic confining potentials. Here, we compare that approach to an alternative strategy in which fixed-atom contributions to the Hessian are simply omitted. While the latter strategy does eliminate imaginary frequencies, it tends to underestimate both the zero-point energy and the vibrational entropy while introducing artificial rigidity. Harmonic confining potentials eliminate imaginary frequencies and provide a flexible means to construct active-site models that can be used in unconstrained geometry relaxations, affording better convergence of reaction energies and barrier heights with respect to the model size, as compared to models with fixed-atom constraints.
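As a rough illustration of the harmonic confining potentials mentioned in this abstract, the sketch below restrains each peripheral atom to a reference position with an isotropic spring. The functional form, the force constant `k`, and the function names are assumptions made for illustration; the abstract does not specify the authors' actual implementation.

```python
def restraint_energy(coords, anchors, k):
    """Harmonic confining energy E = (k/2) * sum_i |r_i - r_i0|^2.

    coords and anchors are lists of [x, y, z] for the restrained atoms;
    k is a hypothetical force constant (any consistent unit system).
    """
    return 0.5 * k * sum(
        (c - a) ** 2
        for atom, ref in zip(coords, anchors)
        for c, a in zip(atom, ref)
    )


def restraint_gradient(coords, anchors, k):
    """Gradient dE/dr_i = k * (r_i - r_i0), one [gx, gy, gz] per atom."""
    return [
        [k * (c - a) for c, a in zip(atom, ref)]
        for atom, ref in zip(coords, anchors)
    ]


# Hypothetical usage: two peripheral atoms, one displaced from its anchor.
coords = [[0.1, 0.0, 0.0], [0.0, 0.0, 0.0]]
anchors = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
print(restraint_energy(coords, anchors, k=2.0))
print(restraint_gradient(coords, anchors, k=2.0))
```

With this form, every restrained Cartesian coordinate gains a constant positive second derivative `k`, and the restrained atoms remain in the full-dimensional Hessian rather than being frozen out of it.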
Affiliation(s)
- Paige E Bowling
  - Biophysics Graduate Program, The Ohio State University, Columbus, Ohio 43210, United States
  - Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
- Saswata Dasgupta
  - Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
  - Department of Chemistry and Biochemistry, University of California-San Diego, La Jolla, California 92093, United States
- John M Herbert
  - Biophysics Graduate Program, The Ohio State University, Columbus, Ohio 43210, United States
  - Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
2
Agbaglo DA, Summers TJ, Cheng Q, DeYonker NJ. The influence of model building schemes and molecular dynamics sampling on QM-cluster models: the chorismate mutase case study. Phys Chem Chem Phys 2024; 26:12467-12482. [PMID: 38618904 PMCID: PMC11090134 DOI: 10.1039/d3cp06100k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/16/2024]
Abstract
Most QM-cluster models of enzymes are constructed based on X-ray crystal structures, which limits comparison to in vivo structure and mechanism. The active site of chorismate mutase from Bacillus subtilis and the enzymatic transformation of chorismate to prephenate are used as a case study to guide construction of QM-cluster models built first from the X-ray crystal structure, then from molecular dynamics (MD) simulation snapshots. The Residue Interaction Network ResidUe Selector (RINRUS) software toolkit, developed by our group to simplify and automate the construction of QM-cluster models, is expanded to handle MD to QM-cluster model workflows. Several options, some employing novel topological clustering from residue interaction network (RIN) information, are evaluated for generating conformational clustering from MD simulation. RINRUS then generates a statistical thermodynamic framework for QM-cluster modeling of the chorismate mutase mechanism via refining 250 MD frames with density functional theory (DFT). The 250 QM-cluster models sampled provide a mean ΔG‡ of 10.3 ± 2.6 kcal mol⁻¹ compared to the experimental value of 15.4 kcal mol⁻¹ at 25 °C. While the difference between theory and experiment is consequential, the level of theory used is modest and therefore "chemical" accuracy is not expected. More important are the comparisons made between QM-cluster models designed from the X-ray crystal structure versus those from MD frames. The large variations in kinetic and thermodynamic properties arise from geometric changes in the ensemble of QM-cluster models, rather than from the composition of the QM-cluster models or from the active site-solvent interface. The findings open the way for further quantitative and reproducible calibration in the field of computational enzymology using the model construction framework afforded by the RINRUS software toolkit.
Affiliation(s)
- Donatus A Agbaglo
  - Department of Chemistry, University of Memphis, Memphis, TN 38152, USA.
- Thomas J Summers
  - Department of Chemistry, University of Memphis, Memphis, TN 38152, USA.
- Qianyi Cheng
  - Department of Chemistry, University of Memphis, Memphis, TN 38152, USA.
- Nathan J DeYonker
  - Department of Chemistry, University of Memphis, Memphis, TN 38152, USA.
3
Wappett DA, Goerigk L. Exploring CPS-Extrapolated DLPNO-CCSD(T1) Reference Values for Benchmarking DFT Methods on Enzymatically Catalyzed Reactions. J Phys Chem A 2024; 128:62-72. [PMID: 38124376 DOI: 10.1021/acs.jpca.3c05086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/23/2023]
Abstract
Domain-based local pair natural orbital coupled-cluster singles doubles with perturbative triples [DLPNO-CCSD(T)] is regularly used to calculate reliable benchmark reference values at a computational cost significantly lower than that of canonical CCSD(T). Recent work has shown that even greater accuracy can be obtained at only a small additional cost through extrapolation to the complete PNO space (CPS) limit. Herein, we test two levels of CPS extrapolation, CPS(5,6), which approximates the accuracy of standard TightPNO, and CPS(6,7), which surpasses it, as benchmark values to test density functional approximations (DFAs) on a small set of organic and transition-metal-dependent enzyme active site models. Between the different reference levels of theory, there are changes in the magnitudes of the absolute deviations for all functionals, but these are small and there is minimal impact on the relative rankings of the tested DFAs. The differences are more significant for the metalloenzymes than the organic enzymes, so we repeat the tests on our entire ENZYMES22 set of organic enzyme active site models [Wappett, D. A.; Goerigk, L. J. Phys. Chem. A 2019, 123, 7057-7074] to confirm that using the CPS extrapolations for the reference values has negligible impact on the benchmarking outcomes. This means that we can particularly recommend CPS(5,6) as an alternative to standard TightPNO settings for calculating reference values, increasing the applicability of DLPNO-CCSD(T) in benchmarking reaction energies and barrier heights of larger models of organic enzymes. DLPNO-CCSD(T1)/CPS(6,7) energies for ENZYMES22 are finally presented as updated reference values for the set, reflecting the recent improvements in the method.
Affiliation(s)
- Dominique A Wappett
  - School of Chemistry, The University of Melbourne, Parkville, Victoria 3010, Australia
- Lars Goerigk
  - School of Chemistry, The University of Melbourne, Parkville, Victoria 3010, Australia
4
Stylianakis I, Zervos N, Lii JH, Pantazis DA, Kolocouris A. Conformational energies of reference organic molecules: benchmarking of common efficient computational methods against coupled cluster theory. J Comput Aided Mol Des 2023; 37:607-656. [PMID: 37597063 PMCID: PMC10618395 DOI: 10.1007/s10822-023-00513-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/09/2023] [Accepted: 06/03/2023] [Indexed: 08/21/2023]
Abstract
We selected 145 reference organic molecules that include model fragments used in computer-aided drug design. We calculated 158 conformational energies and barriers using force fields that are widely available in commercial and free software and extensively applied to the conformational energies of organic molecules (the UFF and DREIDING force fields; Allinger's force fields MM3-96, MM3-00, and MM4-8; the MM2-91 clones MMX and MM+; the MMFF94 force field; and MM4), ab initio Hartree-Fock (HF) theory with different basis sets, the standard density functional theory B3LYP, the second-order post-HF MP2 theory, and the Domain-based Local Pair Natural Orbital Coupled Cluster DLPNO-CCSD(T) theory, with the latter used for accurate reference values. The data set of organic molecules includes hydrocarbons, haloalkanes, conjugated compounds, and oxygen-, nitrogen-, phosphorus- and sulphur-containing compounds. We reviewed in detail the conformational aspects of these model organic molecules, providing the current understanding of the steric and electronic factors that determine the stability of low-energy conformers, together with the literature, including previous experimental observations and calculated findings. While progress in computer hardware allows the calculation of thousands of conformations for later use in drug design projects, this study is an update of previous classical studies that used, as reference values, experimental ones obtained with a variety of methods and in different environments. The lowest mean error against the DLPNO-CCSD(T) reference was calculated for MP2 (0.35 kcal mol⁻¹), followed by B3LYP (0.69 kcal mol⁻¹) and the HF theories (0.81-1.0 kcal mol⁻¹). As regards the force fields, the lowest errors were observed for Allinger's force fields MM3-00 (1.28 kcal mol⁻¹) and MM3-96 (1.40 kcal mol⁻¹) and Halgren's MMFF94 force field (1.30 kcal mol⁻¹), followed by the MM2-91 clones MMX (1.77 kcal mol⁻¹) and MM+ (2.01 kcal mol⁻¹) and by MM4 (2.05 kcal mol⁻¹). The DREIDING (3.63 kcal mol⁻¹) and UFF (3.77 kcal mol⁻¹) force fields perform worst. The model organic molecules we used are often present as fragments in drug-like molecules. The values calculated using DLPNO-CCSD(T) make up a valuable data set for further comparisons and for improved force field parameterization.
Affiliation(s)
- Ioannis Stylianakis
  - Department of Medicinal Chemistry, Faculty of Pharmacy, National and Kapodistrian University of Athens, Panepistimioupolis Zografou, 15771, Athens, Greece
- Nikolaos Zervos
  - Department of Medicinal Chemistry, Faculty of Pharmacy, National and Kapodistrian University of Athens, Panepistimioupolis Zografou, 15771, Athens, Greece
- Jenn-Huei Lii
  - Department of Chemistry, National Changhua University of Education, Changhua City, Taiwan
- Dimitrios A Pantazis
  - Max-Planck-Institut für Kohlenforschung, Kaiser-Wilhelm-Platz 1, 45470, Mülheim an der Ruhr, Germany
- Antonios Kolocouris
  - Department of Medicinal Chemistry, Faculty of Pharmacy, National and Kapodistrian University of Athens, Panepistimioupolis Zografou, 15771, Athens, Greece.
  - Laboratory of Medicinal Chemistry, Section of Pharmaceutical Chemistry, Department of Pharmacy, National and Kapodistrian University of Athens, Panepistimiopolis-Zografou, 15771, Athens, Greece.
5
Wappett D, Goerigk L. Benchmarking Density Functional Theory Methods for Metalloenzyme Reactions: The Introduction of the MME55 Set. J Chem Theory Comput 2023; 19:8365-8383. [PMID: 37943578 PMCID: PMC10688432 DOI: 10.1021/acs.jctc.3c00558] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 05/25/2023] [Revised: 10/13/2023] [Accepted: 10/16/2023] [Indexed: 11/10/2023]
Abstract
We present a new benchmark set of metalloenzyme model reaction energies and barrier heights that we call MME55. The set contains 10 different enzymes, representing eight transition metals, both open and closed shell systems, and system sizes of up to 116 atoms. We use four DLPNO-CCSD(T)-based approaches to calculate reference values against which we then benchmark the performance of a range of density functional approximations with and without dispersion corrections. Dispersion corrections improve the results across the board, and triple-ζ basis sets provide the best balance of efficiency and accuracy. Jacob's ladder is reproduced for the whole set based on averaged mean absolute (percent) deviations, with the double hybrids SOS0-PBE0-2-D3(BJ) and revDOD-PBEP86-D4 standing out as the most accurate methods for the MME55 set. The range-separated hybrids ωB97M-V and ωB97X-V also perform well here and can be recommended as a reliable compromise between accuracy and efficiency; they have already been shown to be robust across many other types of chemical problems, as well. Despite the popularity of B3LYP in computational enzymology, it is not a strong performer on our benchmark set, and we discourage its use for enzyme energetics.
Affiliation(s)
- Dominique A. Wappett
  - School of Chemistry, The University of Melbourne, Melbourne, Victoria 3010, Australia
- Lars Goerigk
  - School of Chemistry, The University of Melbourne, Melbourne, Victoria 3010, Australia
6
Cheng Q, DeYonker NJ. The Glycine N-Methyltransferase Case Study: Another Challenge for QM-Cluster Models? J Phys Chem B 2023; 127:9282-9294. [PMID: 37870315 PMCID: PMC11018112 DOI: 10.1021/acs.jpcb.3c04138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/24/2023]
Abstract
The methyl transfer reaction between SAM and glycine catalyzed by glycine N-methyltransferase (GNMT) was examined using QM-cluster models generated by Residue Interaction Network ResidUe Selector (RINRUS). RINRUS is a Python-based tool that can build QM-cluster models with rules-based processing of the active site residue interaction network. This way of enzyme model-building allows quantitative analysis of residue and fragment contributions to kinetic and thermodynamic properties of the enzyme. Many residue fragments are important for the GNMT catalytic reaction, such as Gly137, Asn138, and Arg175, which interact with the glycine substrate, and Trp30, Asp85, and Tyr242, which interact with the SAM cofactor. Our study shows that active site fragments that interact with the glycine substrate and the SAM cofactor must both be included in the QM-cluster models. Even though the proposed mechanism is a simple one-step reaction, GNMT may be a rather challenging case study for QM-cluster models because convergence in energetics requires models with >350 atoms. "Maximal" QM-cluster models built with either qualitative contact count ranking or quantitative interaction energies from functional group symmetry adapted perturbation theory provide acceptable results. Hence, important residue fragments that contribute to the energetics of the methyl-transfer reaction in GNMT are correctly identified in the RIN. Observations from this work suggest new directions to better establish an effective approach for constructing atomic-level enzyme models.
Affiliation(s)
- Qianyi Cheng
  - Department of Chemistry, University of Memphis, Memphis, TN 38152, U.S.A
- Nathan J. DeYonker
  - Department of Chemistry, University of Memphis, Memphis, TN 38152, U.S.A
7
Falcioni F, Popelier PLA. How to Compute Atomistic Insight in DFT Clusters: The REG-IQA Approach. J Chem Inf Model 2023. [PMID: 37428724 PMCID: PMC10369488 DOI: 10.1021/acs.jcim.3c00404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/12/2023]
Abstract
The relative energy gradient (REG) method is paired with the topological energy partitioning method interacting quantum atoms (IQA), as REG-IQA, to provide detailed and unbiased knowledge on the intra- and interatomic interactions. REG operates on a sequence of geometries representing a dynamical change of a system. Its recent application to peptide hydrolysis of the human immunodeficiency virus-1 (HIV-1) protease (PDB code: 4HVP) has demonstrated its full potential in recovering reaction mechanisms and through-space electrostatic and exchange-correlation effects, making it a compelling tool for analyzing enzymatic reactions. In this study, the computational efficiency of the REG-IQA method for the 133-atom HIV-1 protease quantum mechanical system is analyzed in every detail and substantially improved by means of three different approaches. The first approach of smaller integration grids for IQA integrations reduces the computational overhead by about a factor of 3. The second approach uses the line-simplification Ramer-Douglas-Peucker (RDP) algorithm, which outputs the minimal number of geometries necessary for the REG-IQA analysis for a predetermined root mean squared error (RMSE) tolerance. This cuts the computational time of the whole REG analysis by a factor of 2 if an RMSE of 0.5 kJ/mol is considered. The third approach consists of a "biased" or "unbiased" selection of a specific subset of atoms of the whole initial quantum mechanical model wave-function, which results in more than a 10-fold speed-up per geometry for the IQA calculation, without deterioration of the outcome of the REG-IQA analysis. Finally, to show the capability of these approaches, the findings gathered from the HIV-1 protease system are also applied to a different system named haloalcohol dehalogenase (HheC). In summary, this study takes the REG-IQA method to a computationally feasible and highly accurate level, making it viable for the analysis of a multitude of enzymatic systems.
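To make the line-simplification step in this abstract more concrete, here is a minimal sketch of the classic Ramer-Douglas-Peucker algorithm applied to a list of 2D points, for example (reaction coordinate, energy) pairs. It uses the textbook maximum perpendicular distance criterion with a tolerance `epsilon`. The RMSE-based tolerance described in the abstract is a different stopping rule and is not reproduced here. The function names are illustrative only.

```python
import math


def perpendicular_distance(point, start, end):
    """Distance from point to the infinite line through start and end."""
    (x, y), (x1, y1), (x2, y2) = point, start, end
    dx, dy = x2 - x1, y2 - y1
    length = math.hypot(dx, dy)
    if length == 0.0:
        # Degenerate chord: fall back to point-to-point distance.
        return math.hypot(x - x1, y - y1)
    return abs(dy * (x - x1) - dx * (y - y1)) / length


def rdp(points, epsilon):
    """Return the points kept by Ramer-Douglas-Peucker simplification."""
    if len(points) < 3:
        return list(points)
    start, end = points[0], points[-1]
    # Locate the interior point farthest from the chord start-end.
    index, dmax = 0, 0.0
    for i in range(1, len(points) - 1):
        d = perpendicular_distance(points[i], start, end)
        if d > dmax:
            index, dmax = i, d
    if dmax <= epsilon:
        # Every interior point lies within tolerance: keep only the ends.
        return [start, end]
    # Otherwise split at the farthest point and simplify both halves.
    left = rdp(points[: index + 1], epsilon)
    right = rdp(points[index:], epsilon)
    return left[:-1] + right  # drop the duplicated split point


# Hypothetical energy profile along a reaction coordinate.
profile = [(0, 0.0), (1, 1.0), (2, 0.0)]
print(rdp(profile, epsilon=0.5))
```

A larger `epsilon` keeps fewer geometries, which is the trade-off between cost and fidelity that the abstract describes.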
Affiliation(s)
- Fabio Falcioni
  - Department of Chemistry, University of Manchester, Oxford Road, Manchester M13 9PL, Great Britain
- Paul L A Popelier
  - Department of Chemistry, University of Manchester, Oxford Road, Manchester M13 9PL, Great Britain
8
Santra G, Calinsky R, Martin JML. Benefits of Range-Separated Hybrid and Double-Hybrid Functionals for a Large and Diverse Data Set of Reaction Energies and Barrier Heights. J Phys Chem A 2022; 126:5492-5505. [PMID: 35930677 PMCID: PMC9393870 DOI: 10.1021/acs.jpca.2c03922] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 06/06/2022] [Revised: 07/28/2022] [Indexed: 11/28/2022]
Abstract
To better understand the thermochemical kinetics and mechanism of a specific chemical reaction, an accurate estimation of barrier heights (forward and reverse) and reaction energies is vital. Because of the large size of reactants and transition state structures involved in real-life mechanistic studies (e.g., enzymatically catalyzed reactions), density functional theory remains the workhorse for such calculations. In this paper, we have assessed the performance of 91 density functionals for modeling the reaction energies and barrier heights on a large and chemically diverse data set (BH9) composed of 449 organic chemistry reactions. We have shown that range-separated hybrid functionals perform better than the global hybrids for BH9 barrier heights and reaction energies. Except for the PBE-based range-separated nonempirical double hybrids, range separation of the exchange term helps improve the performance for barrier heights and reaction energies. The 16-parameter Berkeley double hybrid, ωB97M(2), performs remarkably well for both properties. However, our minimally empirical range-separated double hybrid functionals offer marginally better accuracy than ωB97M(2) for BH9 barrier heights and reaction energies.
Affiliation(s)
- Golokesh Santra
  - Department of Molecular Chemistry and Materials Science, Weizmann Institute of Science, 7610001 Reḥovot, Israel
- Rivka Calinsky
  - Department of Molecular Chemistry and Materials Science, Weizmann Institute of Science, 7610001 Reḥovot, Israel
- Jan M. L. Martin
  - Department of Molecular Chemistry and Materials Science, Weizmann Institute of Science, 7610001 Reḥovot, Israel
9
Prasad VK, Pei Z, Edelmann S, Otero-de-la-Roza A, DiLabio GA. BH9, a New Comprehensive Benchmark Data Set for Barrier Heights and Reaction Energies: Assessment of Density Functional Approximations and Basis Set Incompleteness Potentials. J Chem Theory Comput 2021; 18:151-166. [PMID: 34911294 DOI: 10.1021/acs.jctc.1c00694] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Indexed: 01/13/2023]
Abstract
The calculation of accurate reaction energies and barrier heights is essential in computational studies of reaction mechanisms and thermochemistry. To assess methods regarding their ability to predict these two properties, high-quality benchmark sets are required that comprise a reasonably large and diverse set of organic reactions. Due to the time-consuming nature of both locating transition states and computing accurate reference energies for reactions involving large molecules, previous benchmark sets have been limited in scope, the number of reactions considered, and the size of the reactant and product molecules. Recent advances in coupled-cluster theory, in particular local correlation methods like DLPNO-CCSD(T), now allow the calculation of reaction energies and barrier heights for relatively large systems. In this work, we present a comprehensive and diverse benchmark set of barrier heights and reaction energies based on DLPNO-CCSD(T)/CBS called BH9. BH9 comprises 449 chemical reactions belonging to nine types common in organic chemistry and biochemistry. We examine the accuracy of DLPNO-CCSD(T) vis-a-vis canonical CCSD(T) for a subset of BH9 and conclude that, although there is a penalty in using the DLPNO approximation, the reference data are accurate enough to serve as a benchmark for density functional theory (DFT) methods. We then present two applications of the BH9 set. First, we examine the performance of several density functional approximations commonly used in thermochemical and mechanistic studies. Second, we assess our basis set incompleteness potentials regarding their ability to mitigate basis set incompleteness errors. The number of data points, the diversity of the reactions considered, and the relatively large size of the reactant molecules make BH9 the most comprehensive thermochemical benchmark set to date and a useful tool for the development and assessment of computational methods.
Affiliation(s)
- Viki Kumar Prasad
  - Department of Chemistry, University of British Columbia, 3247 University Way, Kelowna, British Columbia, Canada V1V 1V7
- Zhipeng Pei
  - Department of Chemistry, University of British Columbia, 3247 University Way, Kelowna, British Columbia, Canada V1V 1V7
- Simon Edelmann
  - Department of Chemistry, University of British Columbia, 3247 University Way, Kelowna, British Columbia, Canada V1V 1V7
- Alberto Otero-de-la-Roza
  - Departamento de Química Física y Analítica and MALTA Consolider Team, Facultad de Química, Universidad de Oviedo, 33006 Oviedo, Spain
- Gino A DiLabio
  - Department of Chemistry, University of British Columbia, 3247 University Way, Kelowna, British Columbia, Canada V1V 1V7
10
Pinto AV, Ferreira P, Neves RPP, Fernandes PA, Ramos MJ, Magalhães AL. Reaction Mechanism of MHETase, a PET Degrading Enzyme. ACS Catal 2021. [DOI: 10.1021/acscatal.1c02444] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Indexed: 01/05/2023]
Affiliation(s)
- Alexandre V. Pinto
  - LAQV/Requimte, Departamento de Química e Bioquímica, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, s/n, 4169-007 Porto, Portugal
- Pedro Ferreira
  - LAQV/Requimte, Departamento de Química e Bioquímica, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, s/n, 4169-007 Porto, Portugal
- Rui P. P. Neves
  - LAQV/Requimte, Departamento de Química e Bioquímica, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, s/n, 4169-007 Porto, Portugal
- Pedro A. Fernandes
  - LAQV/Requimte, Departamento de Química e Bioquímica, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, s/n, 4169-007 Porto, Portugal
- Maria J. Ramos
  - LAQV/Requimte, Departamento de Química e Bioquímica, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, s/n, 4169-007 Porto, Portugal
- Alexandre L. Magalhães
  - LAQV/Requimte, Departamento de Química e Bioquímica, Faculdade de Ciências da Universidade do Porto, Rua do Campo Alegre, s/n, 4169-007 Porto, Portugal
11
Wappett DA, Goerigk L. A guide to benchmarking enzymatically catalysed reactions: the importance of accurate reference energies and the chemical environment. Theor Chem Acc 2021. [DOI: 10.1007/s00214-021-02770-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 01/22/2023]
12
Mehta N, Fellowes T, White JM, Goerigk L. CHAL336 Benchmark Set: How Well Do Quantum-Chemical Methods Describe Chalcogen-Bonding Interactions? J Chem Theory Comput 2021; 17:2783-2806. [PMID: 33881869 DOI: 10.1021/acs.jctc.1c00006] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Indexed: 02/06/2023]
Abstract
We present the CHAL336 benchmark set, the most comprehensive database for the assessment of chalcogen-bonding (CB) interactions. After careful selection of suitable systems and identification of three high-level reference methods, the set comprises 336 dimers, each consisting of up to 49 atoms, and covers both σ- and π-hole interactions across four categories: chalcogen-chalcogen, chalcogen-π, chalcogen-halogen, and chalcogen-nitrogen interactions. In a subsequent study of DFT methods, we re-emphasize the need for using proper London dispersion corrections when treating noncovalent interactions. We also point out that the deterioration of results and systematic overestimation of interaction energies for some dispersion-corrected DFT methods does not hint at problems with the chosen dispersion correction but is a consequence of large density-driven errors. We conclude this work by performing the most detailed DFT benchmark study for CB interactions to date. We assess 109 variations of dispersion-corrected and dispersion-uncorrected DFT methods and carry out a detailed analysis of 80 of them. Double-hybrid functionals are the most reliable approaches for CB interactions, and they should be used whenever computationally feasible. The best three double hybrids are SOS0-PBE0-2-D3(BJ), revDSD-PBEP86-D3(BJ), and B2NCPLYP-D3(BJ). The best hybrids in this study are ωB97M-V, PW6B95-D3(0), and PW6B95-D3(BJ). We do not recommend either the popular B3LYP functional or the MP2 approach, both of which have been frequently used to describe CB interactions in the past. We hope to inspire a change in computational protocols surrounding CB interactions that leads away from the commonly used, popular methods to the more robust and accurate ones recommended herein. We would also like to encourage method developers to use our set for the investigation and reduction of density-driven errors in new density functional approximations.
Affiliation(s)
- Nisha Mehta
  - School of Chemistry, The University of Melbourne, Victoria 3010, Australia
- Thomas Fellowes
  - School of Chemistry, The University of Melbourne, Victoria 3010, Australia
  - Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Victoria 3010, Australia
- Jonathan M White
  - School of Chemistry, The University of Melbourne, Victoria 3010, Australia
  - Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Victoria 3010, Australia
- Lars Goerigk
  - School of Chemistry, The University of Melbourne, Victoria 3010, Australia
13
Sandler I, Chen J, Taylor M, Sharma S, Ho J. Accuracy of DLPNO-CCSD(T): Effect of Basis Set and System Size. J Phys Chem A 2021; 125:1553-1563. [PMID: 33560853 DOI: 10.1021/acs.jpca.0c11270] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Indexed: 12/14/2022]
Abstract
The DLPNO-CCSD(T) method is designed to study large molecular systems at significantly reduced cost relative to its canonical counterpart. However, the error in this approach is also size-extensive, and the approach relies on cancellation of errors for the calculation of relative energies. This work provides a direct comparison of canonical CCSD(T) and TightPNO DLPNO-CCSD(T) calculations of reaction energies and barriers of a broad range of chemical reactions. The dataset includes acidities, anion binding affinities, enolization, Diels-Alder, nucleophilic substitution, and atom transfer reactions and complements existing theoretical datasets in terms of system size as well as new reaction types (e.g., anion binding affinities and chlorine atom transfer reactions). The performance of DLPNO-CCSD(T) was further examined with respect to systematic variation of the basis set, the system size, and the amount of nonbonded interaction present in the system. The errors in DLPNO-CCSD(T) were found to be relatively insensitive to the choice of basis set for small systems but increase monotonically with system size. Additionally, calculations of barriers appear to be more challenging than reaction energies, with errors exceeding 5 kJ mol⁻¹ for many Diels-Alder reactions. Further tests on three realistic organic reactions reveal the impact of the DLPNO approximation in calculating absolute and relative barriers that are important for predictions such as stereoselectivity.
Affiliation(s)
- Isolde Sandler
  - School of Chemistry, University of New South Wales, Sydney, NSW 2052, Australia
- Junbo Chen
  - School of Chemistry, University of New South Wales, Sydney, NSW 2052, Australia
- Mackenzie Taylor
  - School of Chemistry, University of New South Wales, Sydney, NSW 2052, Australia
- Shaleen Sharma
  - School of Chemistry, University of New South Wales, Sydney, NSW 2052, Australia
- Junming Ho
  - School of Chemistry, University of New South Wales, Sydney, NSW 2052, Australia
14
Gyevi-Nagy L, Kállay M, Nagy PR. Accurate Reduced-Cost CCSD(T) Energies: Parallel Implementation, Benchmarks, and Large-Scale Applications. J Chem Theory Comput 2021; 17:860-878. [PMID: 33400527 PMCID: PMC7884001 DOI: 10.1021/acs.jctc.0c01077] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Received: 10/13/2020] [Indexed: 11/28/2022]
Abstract
The accurate and systematically improvable frozen natural orbital (FNO) and natural auxiliary function (NAF) cost-reducing approaches are combined with our recent coupled-cluster singles, doubles, and perturbative triples [CCSD(T)] implementations. Both of the closed- and open-shell FNO-CCSD(T) codes benefit from OpenMP parallelism, completely or partially integral-direct density-fitting algorithms, checkpointing, and hand-optimized, memory- and operation count effective implementations exploiting all permutational symmetries. The closed-shell CCSD(T) code requires negligible disk I/O and network bandwidth, is MPI/OpenMP parallel, and exhibits outstanding peak performance utilization of 50-70% up to hundreds of cores. Conservative FNO and NAF truncation thresholds benchmarked for challenging reaction, atomization, and ionization energies of both closed- and open-shell species are shown to maintain 1 kJ/mol accuracy against canonical CCSD(T) for systems of 31-43 atoms even with large basis sets. The cost reduction of up to an order of magnitude achieved extends the reach of FNO-CCSD(T) to systems of 50-75 atoms (up to 2124 atomic orbitals) with triple- and quadruple-ζ basis sets, which is unprecedented without local approximations. Consequently, a considerably larger portion of the chemical compound space can now be covered by the practically "gold standard" quality FNO-CCSD(T) method using affordable resources and about a week of wall time. Large-scale applications are presented for organocatalytic and transition-metal reactions as well as noncovalent interactions. Possible applications for benchmarking local CCSD(T) methods, as well as for the accuracy assessment or parametrization of less complete models, for example, density functional approximations or machine learning potentials, are also outlined.
Affiliation(s)
- László Gyevi-Nagy
- Department of Physical Chemistry and Materials Science, Budapest University of Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
- Mihály Kállay
- Department of Physical Chemistry and Materials Science, Budapest University of Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
- Péter R. Nagy
- Department of Physical Chemistry and Materials Science, Budapest University of Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
15
|
Borosky GL. Alkaline Phosphatases: in Silico Study on the Catalytic Effect of Conserved Active Site Residues Using Human Placental Alkaline Phosphatase (PLAP) As a Model Protein. J Chem Inf Model 2020; 60:6228-6241. [PMID: 33306371 DOI: 10.1021/acs.jcim.0c00860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/28/2022]
Abstract
The metalloenzymes from the alkaline phosphatase (AP) superfamily catalyze the hydrolysis and transphosphorylation of phosphate monoesters. The role of several amino acids highly conserved in the active site of this family of enzymes was examined, using human placental AP (PLAP) as a model protein. By employing an active-site model based on the X-ray crystal structure of PLAP, mutations of several key residues were modeled by quantum mechanical methods in order to determine their impact on the catalytic activity. Kinetic and thermodynamic estimates were obtained for each reaction step of the catalytic mechanism by characterization of the intermediates and transition states on the reaction pathway, and the effects of mutations on the activation barriers were analyzed. Good agreement was observed between the present computational results and experimental measurements reported in the literature.
Affiliation(s)
- Gabriela L Borosky
- INFIQC, CONICET and Departamento de Química Teórica y Computacional, Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Ciudad Universitaria, Córdoba 5000, Argentina
16
|
Paiva P, Ramos MJ, Fernandes PA. Assessing the validity of DLPNO-CCSD(T) in the calculation of activation and reaction energies of ubiquitous enzymatic reactions. J Comput Chem 2020; 41:2459-2468. [PMID: 32882060 DOI: 10.1002/jcc.26401] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 05/19/2020] [Revised: 07/27/2020] [Accepted: 07/27/2020] [Indexed: 12/31/2022]
Abstract
The domain-based local pair natural orbital coupled-cluster with single, double, and perturbative triples excitation (DLPNO-CCSD(T)) method was employed to compute the activation and reaction energies of four ubiquitous enzymatic reactions, and its performance was compared with CCSD(T)/complete basis set (CBS) to assess its accuracy and robustness in this specific field. The DLPNO-CCSD(T) results were also compared with those of a set of density functionals (DFs) to understand the benefit of implementing this technique in enzymatic quantum mechanics/molecular mechanics (QM/MM) calculations as a second QM component, which is often treated with DF theory (DFT). On average, the DLPNO-CCSD(T)/aug-cc-pVTZ results were 0.51 kcal·mol⁻¹ apart from the canonical CCSD(T)/CBS, without noticeable biases toward any of the reactions under study. All DFs fell short of DLPNO-CCSD(T), both in terms of accuracy and robustness, which suggests that this method is advantageous to characterize enzymatic reactions and that its use in QM/MM calculations, either alone or in conjunction with DFT, in a two-region QM layer (DLPNO-CCSD(T):DFT), should enhance the quality and faithfulness of the results.
Affiliation(s)
- Pedro Paiva
- LAQV@REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
- Maria J Ramos
- LAQV@REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
- Pedro A Fernandes
- LAQV@REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
17
|
Najibi A, Goerigk L. DFT-D4 counterparts of leading meta-generalized-gradient approximation and hybrid density functionals for energetics and geometries. J Comput Chem 2020; 41:2562-2572. [DOI: 10.1002/jcc.26411] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Received: 07/01/2020] [Revised: 08/10/2020] [Accepted: 08/12/2020] [Indexed: 12/25/2022]
Affiliation(s)
- Asim Najibi
- School of Chemistry, The University of Melbourne, Parkville, Australia
- Lars Goerigk
- School of Chemistry, The University of Melbourne, Parkville, Australia
18
|
Förster A, Visscher L. Double hybrid DFT calculations with Slater type orbitals. J Comput Chem 2020; 41:1660-1684. [PMID: 32297682 PMCID: PMC7317772 DOI: 10.1002/jcc.26209] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 02/17/2020] [Revised: 03/31/2020] [Accepted: 04/01/2020] [Indexed: 12/14/2022]
Abstract
On a comprehensive database with 1,644 datapoints, covering several aspects of main-group as well as of transition metal chemistry, we assess the performance of 60 density functional approximations (DFA), among them 36 double hybrids (DH). All calculations are performed using a Slater type orbital (STO) basis set of triple-ζ (TZ) quality and the highly efficient pair atomic resolution of the identity approach for the exchange- and Coulomb-term of the KS matrix (PARI-K and PARI-J, respectively) and for the evaluation of the MP2 energy correction (PARI-MP2). Employing the quadratic scaling SOS-AO-PARI-MP2 algorithm, DHs based on the spin-opposite-scaled (SOS) MP2 approximation are benchmarked against a database of large molecules. We evaluate the accuracy of STO/PARI calculations for B3LYP as well as for the DH B2GP-PLYP and show that the combined basis set and PARI-error is comparable to the one obtained using the well-known def2-TZVPP Gaussian-type basis set in conjunction with global density fitting. While quadruple-ζ (QZ) calculations are currently not feasible for PARI-MP2 due to numerical issues, we show that, on the TZ level, Jacob's ladder for classifying DFAs is reproduced. However, while the best DHs are more accurate than the best hybrids, the improvements are less pronounced than the ones commonly found on the QZ level. For conformers of organic molecules and noncovalent interactions where very high accuracy is required for qualitatively correct results, DHs provide only small improvements over hybrids, while they still excel in thermochemistry, kinetics, transition metal chemistry and the description of strained organic systems.
Affiliation(s)
- Arno Förster
- Theoretical Chemistry, Vrije Universiteit, Amsterdam, The Netherlands
- Lucas Visscher
- Theoretical Chemistry, Vrije Universiteit, Amsterdam, The Netherlands
19
|
Sirirak J, Lawan N, Van der Kamp MW, Harvey JN, Mulholland AJ. Benchmarking quantum mechanical methods for calculating reaction energies of reactions catalyzed by enzymes. PeerJ Physical Chemistry 2020. [DOI: 10.7717/peerj-pchem.8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Indexed: 01/01/2023]
Abstract
To assess the accuracy of different quantum mechanical methods for biochemical modeling, the reaction energies of 20 small model reactions (chosen to represent chemical steps catalyzed by commonly studied enzymes) were calculated. The methods tested included several popular Density Functional Theory (DFT) functionals, second-order Møller–Plesset perturbation theory (MP2) and its spin-component scaled variant (SCS-MP2), and coupled cluster singles and doubles and perturbative triples (CCSD(T)). Different basis sets were tested. CCSD(T)/aug-cc-pVTZ results for all 20 reactions were used to benchmark the other methods. It was found that MP2 and SCS-MP2 reaction energy calculation results are similar in quality to CCSD(T) (mean absolute error (MAE) of 1.2 and 1.3 kcal mol⁻¹, respectively). MP2 calculations gave a large error in one case, and are more subject to basis set effects, so in general SCS-MP2 calculations are a good choice when CCSD(T) calculations are not feasible. Results with different DFT functionals were of reasonably good quality (MAEs of 2.5–5.1 kcal mol⁻¹), whereas popular semi-empirical methods (AM1, PM3, SCC-DFTB) gave much larger errors (MAEs of 11.6–14.6 kcal mol⁻¹). These results should be useful in guiding methodological choices and assessing the accuracy of QM/MM calculations on enzyme-catalyzed reactions.
Affiliation(s)
- Jitnapa Sirirak
- Department of Chemistry, Faculty of Science, Silpakorn University, Nakhon Pathom, Thailand
- Narin Lawan
- Department of Chemistry, Faculty of Science, Chiang Mai University, Chiang Mai, Thailand
- Adrian J. Mulholland
- Centre for Computational Chemistry, School of Chemistry, University of Bristol, Bristol, United Kingdom
20
|
Magalhães RP, Fernandes HS, Sousa SF. Modelling Enzymatic Mechanisms with QM/MM Approaches: Current Status and Future Challenges. Isr J Chem 2020. [DOI: 10.1002/ijch.202000014] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Indexed: 12/14/2022]
Affiliation(s)
- Rita P. Magalhães
- UCIBIO@REQUIMTE, BioSIM, Departamento de Biomedicina, Faculdade de Medicina da Universidade do Porto, Alameda Professor Hernâni Monteiro, 4200-319 Porto, Portugal
- Henriques S. Fernandes
- UCIBIO@REQUIMTE, BioSIM, Departamento de Biomedicina, Faculdade de Medicina da Universidade do Porto, Alameda Professor Hernâni Monteiro, 4200-319 Porto, Portugal
- Sérgio F. Sousa
- UCIBIO@REQUIMTE, BioSIM, Departamento de Biomedicina, Faculdade de Medicina da Universidade do Porto, Alameda Professor Hernâni Monteiro, 4200-319 Porto, Portugal