51
|
Kříž K, Nováček M, Řezáč J. Non-Covalent Interactions Atlas Benchmark Data Sets 3: Repulsive Contacts. J Chem Theory Comput 2021; 17:1548-1561. [PMID: 33620192 DOI: 10.1021/acs.jctc.0c01341] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The new R739×5 data set from the Non-Covalent Interactions Atlas series (www.nciatlas.org) focuses on repulsive contacts in molecular complexes, covering organic molecules, sulfur, phosphorus, halogens, and noble gases. Information on the repulsive parts of the potential energy surface is crucial for the development of robust empirically parametrized computational methods. We use the new data set of highly accurate CCSD(T)/CBS interaction energies to test selected density functional theory (DFT) and semiempirical quantum-mechanical methods. The double-hybrid functionals were the best performing, with the revDSD-PBEP86-D3 being the most accurate DFT method, followed by the range-separated ωB97X functionals. Out of semiempirical methods, GFN2-xTB yielded the best results. On the example of the PM6 method, we analyze the source of error and its relation to the difficulties in the description of conformational energies, and we also devise an immediately applicable correction that fixes the most serious uncorrected issues previously encountered in practical calculations.
Collapse
Affiliation(s)
- Kristian Kříž
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, 166 10 Prague, Czech Republic.,Department of Physical and Macromolecular Chemistry, Faculty of Science, Charles University, Hlavova 8, 128 40 Prague, Czech Republic
| | - Martin Nováček
- Gymnázium Třebíč, Masarykovo náměstí 9, 674 01 Třebíč, Czech Republic
| | - Jan Řezáč
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, 166 10 Prague, Czech Republic
| |
Collapse
|
52
|
Grimme S, Hansen A, Ehlert S, Mewes JM. r 2SCAN-3c: A "Swiss army knife" composite electronic-structure method. J Chem Phys 2021; 154:064103. [PMID: 33588555 DOI: 10.1063/5.0040021] [Citation(s) in RCA: 373] [Impact Index Per Article: 93.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The recently proposed r2SCAN meta-generalized-gradient approximation (mGGA) of Furness and co-workers is used to construct an efficient composite electronic-structure method termed r2SCAN-3c. To this end, the unaltered r2SCAN functional is combined with a tailor-made triple-ζ Gaussian atomic orbital basis set as well as with refitted D4 and geometrical counter-poise corrections for London-dispersion and basis set superposition error. The performance of the new method is evaluated for the GMTKN55 database covering large parts of chemical space with about 1500 data points, as well as additional benchmarks for non-covalent interactions, organometallic reactions, and lattice energies of organic molecules and ices, as well as for the adsorption on polar salt and non-polar coinage-metal surfaces. These comprehensive tests reveal a spectacular performance and robustness of r2SCAN-3c: It by far surpasses its predecessor B97-3c at only twice the cost and provides one of the best results of all semi-local density-functional theory (DFT)/QZ methods ever tested for the GMTKN55 database at one-tenth of the cost. Specifically, for reaction and conformational energies as well as non-covalent interactions, it outperforms prominent hybrid-DFT/QZ approaches at two to three orders of magnitude lower cost. Perhaps, the most relevant remaining issue of r2SCAN-3c is self-interaction error (SIE), owing to its mGGA nature. However, SIE is slightly reduced compared to other (m)GGAs, as is demonstrated in two examples. After all, this remarkably efficient and robust method is chosen as our new group default, replacing previous composite DFT and partially even expensive high-level methods in most standard applications for systems with up to several hundreds of atoms.
Collapse
Affiliation(s)
- Stefan Grimme
- Mulliken Center for Theoretical Chemistry, Institut für Physikalische und Theoretische Chemie, Rheinische Friedrich-Wilhelms Universität Bonn, Beringstraße 4, 53115 Bonn, Germany
| | - Andreas Hansen
- Mulliken Center for Theoretical Chemistry, Institut für Physikalische und Theoretische Chemie, Rheinische Friedrich-Wilhelms Universität Bonn, Beringstraße 4, 53115 Bonn, Germany
| | - Sebastian Ehlert
- Mulliken Center for Theoretical Chemistry, Institut für Physikalische und Theoretische Chemie, Rheinische Friedrich-Wilhelms Universität Bonn, Beringstraße 4, 53115 Bonn, Germany
| | - Jan-Michael Mewes
- Mulliken Center for Theoretical Chemistry, Institut für Physikalische und Theoretische Chemie, Rheinische Friedrich-Wilhelms Universität Bonn, Beringstraße 4, 53115 Bonn, Germany
| |
Collapse
|
53
|
Gyevi-Nagy L, Kállay M, Nagy PR. Accurate Reduced-Cost CCSD(T) Energies: Parallel Implementation, Benchmarks, and Large-Scale Applications. J Chem Theory Comput 2021; 17:860-878. [PMID: 33400527 PMCID: PMC7884001 DOI: 10.1021/acs.jctc.0c01077] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Indexed: 11/28/2022]
Abstract
The accurate and systematically improvable frozen natural orbital (FNO) and natural auxiliary function (NAF) cost-reducing approaches are combined with our recent coupled-cluster singles, doubles, and perturbative triples [CCSD(T)] implementations. Both of the closed- and open-shell FNO-CCSD(T) codes benefit from OpenMP parallelism, completely or partially integral-direct density-fitting algorithms, checkpointing, and hand-optimized, memory- and operation count effective implementations exploiting all permutational symmetries. The closed-shell CCSD(T) code requires negligible disk I/O and network bandwidth, is MPI/OpenMP parallel, and exhibits outstanding peak performance utilization of 50-70% up to hundreds of cores. Conservative FNO and NAF truncation thresholds benchmarked for challenging reaction, atomization, and ionization energies of both closed- and open-shell species are shown to maintain 1 kJ/mol accuracy against canonical CCSD(T) for systems of 31-43 atoms even with large basis sets. The cost reduction of up to an order of magnitude achieved extends the reach of FNO-CCSD(T) to systems of 50-75 atoms (up to 2124 atomic orbitals) with triple- and quadruple-ζ basis sets, which is unprecedented without local approximations. Consequently, a considerably larger portion of the chemical compound space can now be covered by the practically "gold standard" quality FNO-CCSD(T) method using affordable resources and about a week of wall time. Large-scale applications are presented for organocatalytic and transition-metal reactions as well as noncovalent interactions. Possible applications for benchmarking local CCSD(T) methods, as well as for the accuracy assessment or parametrization of less complete models, for example, density functional approximations or machine learning potentials, are also outlined.
Collapse
Affiliation(s)
- László Gyevi-Nagy
- Department of Physical Chemistry and
Materials Science, Budapest University of
Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
| | - Mihály Kállay
- Department of Physical Chemistry and
Materials Science, Budapest University of
Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
| | - Péter R. Nagy
- Department of Physical Chemistry and
Materials Science, Budapest University of
Technology and Economics, P.O. Box 91, H-1521 Budapest, Hungary
| |
Collapse
|
54
|
Culka M, Kalvoda T, Gutten O, Rulíšek L. Mapping Conformational Space of All 8000 Tripeptides by Quantum Chemical Methods: What Strain Is Affordable within Folded Protein Chains? J Phys Chem B 2021; 125:58-69. [PMID: 33393778 DOI: 10.1021/acs.jpcb.0c09251] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
To gain more insight into the physicochemical aspects of a protein structure from the first principles, conformational space of all 8000 "capped" tripeptides (i.e., N-Ac-X1X2X3-NH-CH3, where Xi is one of the 20 natural amino acids) was investigated computationally. An enormous dataset (denoted P-CONF_1.6M and containing close to 1 600 000 conformers in total) has been obtained by employing a composite protocol combining density functional theory, semiempirical quantum mechanics (SQM), and state-of-the-art solvation methods with 1000 K molecular dynamics (MD) used to generate initial structures (200 snapshots for each tripeptide). This allowed us to present the first rigorous QM-based glimpse at the vast conformational space spanned by small protein fragments. The same computational procedure was repeated for tripeptide fragments taken from the SCOPe database of three-dimensional protein folds, by restraining them to their geometry in a protein. Such complementary data allowed us to compare the distribution of conformational strain energies of unrestrained tripeptidic fragments "in solvent" with those in existing protein chains. Besides providing a rigorous (ab initio) proof of a few well-known concepts and hypotheses concerning protein structures, such as the distribution of (φ, ψ) angles in Ramachandran plots, we have made several observations that came as a certain surprise: (1) distribution of conformational energies does not significantly differ between the "unbiased/unrestrained" conformers obtained from MD sampling in solvent and the biased conformers, i.e., those of a given tripeptide obtained from protein structures; (2) conformational (strain) energy window up to ∼20 to 25 kcal·mol-1 is readily available to tripeptide fragments within the context of a protein chain; (3) overpopulation in certain regions of Ramachandran plot was observed for the unbiased conformers. Last but not least, the massive dataset of accurate (DFT-D3//COSMO-RS) conformational (free) energies of ∼1.6 M peptide conformers, P-CONF_1.6M, obtained throughout this work may serve as excellent dataset for calibrating and benchmarking of popular force fields.
Collapse
Affiliation(s)
- Martin Culka
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Tadeáš Kalvoda
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Ondrej Gutten
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Lubomír Rulíšek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| |
Collapse
|
55
|
Bursch M, Hansen A, Pracht P, Kohn JT, Grimme S. Theoretical study on conformational energies of transition metal complexes. Phys Chem Chem Phys 2021; 23:287-299. [PMID: 33336657 DOI: 10.1039/d0cp04696e] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Conformational energies are an important chemical property for which a performance assessment of theoretical methods is mandatory. Existing benchmark sets are often limited to biochemical or main group element containing molecules, while organometallic systems are generally less studied. A key problem herein is to routinely generate conformers for these molecules due to their complexity and manifold of possible coordination patterns. In this study we used our recently published CREST protocol [Pracht et al., Phys. Chem. Chem. Phys., 2020, 22, 7169-7192] to generate conformer ensembles for a variety of 40 challenging transition metal containing molecules, which were then used to form a comprehensive conformational energy benchmark set termed TMCONF40. Several low-cost semiempirical, density functional theory (DFT) and force-field methods were compared to high level DLPNO-CCSD(T1) and double-hybrid DFT reference values. Close attention was paid to the energetic ordering of the conformers in the statistical evaluation. With respect to the double-hybrid references, both tested low-cost composite DFT methods produce high Pearson correlation coefficients of rp,mean,B97-3c//B97-3c = 0.922 and rp,mean,PBEh-3c//B97-3c = 0.890, with mean absolute deviations close to or below 1 kcal mol-1. This good performance also holds for a comparison to DLPNO-CCSD(T1) reference energies for a smaller subset termed TMCONF5. Based on DFT geometries, the GFNn-xTB methods yield reasonable Pearson correlation coefficients of rp,mean,GFN1-xTB//B97-3c = 0.617 (MADmean = 2.15 kcal mol-1) and rp,mean,GFN2-xTB//B97-3c = 0.567 (MADmean = 2.68 kcal mol-1), outperforming the widely used PMx methods on the TMCONF40 test set. Employing the low-cost composite DFT method B97-3c on GFN2-xTB geometries yields an slightly improved correlation of rp,mean,B97-3c//GFN2-xTB = 0.632. Furthermore, for 68% of the investigated complexes at least one low-energy conformer was found that is more stable than the respective crystal structure conformation, which signals the importance of conformational studies. General recommendations for the application of the CREST protocol and DFT methods for transition metal conformational energies are given.
Collapse
Affiliation(s)
- Markus Bursch
- Mulliken Center for Theoretical Chemistry, Universität Bonn, Beringstr. 4, 53115 Bonn, Germany.
| | | | | | | | | |
Collapse
|
56
|
Lim VT, Hahn DF, Tresadern G, Bayly CI, Mobley DL. Benchmark assessment of molecular geometries and energies from small molecule force fields. F1000Res 2020; 9:Chem Inf Sci-1390. [PMID: 33604023 PMCID: PMC7863993 DOI: 10.12688/f1000research.27141.1] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/18/2020] [Indexed: 12/22/2022] Open
Abstract
Background: Force fields are used in a wide variety of contexts for classical molecular simulation, including studies on protein-ligand binding, membrane permeation, and thermophysical property prediction. The quality of these studies relies on the quality of the force fields used to represent the systems. Methods: Focusing on small molecules of fewer than 50 heavy atoms, our aim in this work is to compare nine force fields: GAFF, GAFF2, MMFF94, MMFF94S, OPLS3e, SMIRNOFF99Frosst, and the Open Force Field Parsley, versions 1.0, 1.1, and 1.2. On a dataset comprising 22,675 molecular structures of 3,271 molecules, we analyzed force field-optimized geometries and conformer energies compared to reference quantum mechanical (QM) data. Results: We show that while OPLS3e performs best, the latest Open Force Field Parsley release is approaching a comparable level of accuracy in reproducing QM geometries and energetics for this set of molecules. Meanwhile, the performance of established force fields such as MMFF94S and GAFF2 is generally somewhat worse. We also find that the series of recent Open Force Field versions provide significant increases in accuracy. Conclusions: This study provides an extensive test of the performance of different molecular mechanics force fields on a diverse molecule set, and highlights two (OPLS3e and OpenFF 1.2) that perform better than the others tested on the present comparison. Our molecule set and results are available for other researchers to use in testing.
Collapse
Affiliation(s)
- Victoria T. Lim
- Department of Chemistry, University of California, Irvine, CA, 92697, USA
| | - David F. Hahn
- Computational Chemistry, Janssen Research & Development, Beerse, B-2340, Belgium
| | - Gary Tresadern
- Computational Chemistry, Janssen Research & Development, Beerse, B-2340, Belgium
| | | | - David L. Mobley
- Department of Chemistry, University of California, Irvine, CA, 92697, USA
- Department of Pharmaceutical Sciences, University of California, Irvine, CA, 92697, USA
| |
Collapse
|
57
|
Li W, Battistel MD, Reeves H, Oh L, Yu H, Chen X, Wang LP, Freedberg DI. A combined NMR, MD and DFT conformational analysis of 9-O-acetyl sialic acid-containing GM3 ganglioside glycan and its 9-N-acetyl mimic. Glycobiology 2020; 30:787-801. [PMID: 32350512 PMCID: PMC8179627 DOI: 10.1093/glycob/cwaa040] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Revised: 04/22/2020] [Accepted: 04/22/2020] [Indexed: 01/30/2023] Open
Abstract
O-Acetylation of carbohydrates such as sialic acids is common in nature, but its role is not clearly understood due to the lability of O-acetyl groups. We demonstrated previously that 9-acetamido-9-deoxy-N-acetylneuraminic acid (Neu5Ac9NAc) is a chemically and biologically stable mimic of the 9-O-acetyl-N-acetylneuraminic acid (Neu5,9Ac2) of the corresponding sialoglycans. Here, a systematic nuclear magnetic resonance (NMR) spectroscopic and molecular dynamics (MD) simulation study was undertaken for Neu5,9Ac2-containing GM3 ganglioside glycan (GM3-glycan) and its Neu5Ac9NAc analog. GM3-glycan with Neu5Ac as the non-O-acetyl form of Neu5,9Ac2 was used as a control. Complete 1H and 13C NMR chemical shift assignments, three-bond 1H-13C trans-glycosidic coupling constants (3JCH), accurate 1H-1H coupling constants (3JHH), nuclear Overhauser effects and hydrogen bonding detection were carried out. Results show that structural modification (O- or N-acetylation) on the C-9 of Neu5Ac in GM3 glycan does not cause significant conformational changes on either its glycosidic dihedral angles or its secondary structure. All structural differences are confined to the Neu5Ac glycerol chain, and minor temperature-dependent changes are seen in the aglycone portion. We also used Density Functional Theory (DFT) quantum mechanical calculations to improve currently used 3JHH Karplus relations. Furthermore, OH chemical shifts were assigned at -10°C and no evidence of an intramolecular hydrogen bond was observed. The results provide additional evidence regarding structural similarities between sialosides containing 9-N-acetylated and 9-O-acetylated Neu5Ac and support the opportunity of using 9-N-acetylated Neu5Ac as a stable mimic to study the biochemical role of 9-O-acetylated Neu5Ac.
Collapse
Affiliation(s)
- Wanqing Li
- Department of Chemistry, University of California-Davis, One Shields Avenue, Davis, California 95616, USA
| | - Marcos D Battistel
- Laboratory of Bacterial Polysaccharides, Food and Drug Administration (FDA), Silver Spring, MD 20993, USA
| | - Hannah Reeves
- Department of Chemistry, University of California-Davis, One Shields Avenue, Davis, California 95616, USA
| | - Lisa Oh
- Department of Chemistry, University of California-Davis, One Shields Avenue, Davis, California 95616, USA
| | - Hai Yu
- Department of Chemistry, University of California-Davis, One Shields Avenue, Davis, California 95616, USA
| | - Xi Chen
- Department of Chemistry, University of California-Davis, One Shields Avenue, Davis, California 95616, USA
| | - Lee-Ping Wang
- Department of Chemistry, University of California-Davis, One Shields Avenue, Davis, California 95616, USA
| | - Darón I Freedberg
- Laboratory of Bacterial Polysaccharides, Food and Drug Administration (FDA), Silver Spring, MD 20993, USA
| |
Collapse
|
58
|
Toward accurate prediction of amino acid derivatives structure and energetics from DFT: glycine conformers and their interconversions. J Mol Model 2020; 26:129. [DOI: 10.1007/s00894-020-4342-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 03/04/2020] [Indexed: 12/30/2022]
|
59
|
Culka M, Rulíšek L. Interplay between Conformational Strain and Intramolecular Interaction in Protein Structures: Which of Them Is Evolutionarily Conserved? J Phys Chem B 2020; 124:3252-3260. [PMID: 32237747 DOI: 10.1021/acs.jpcb.9b11784] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
By computing strain energies of peptide fragments within protein structures and their intramolecular interaction energies, we attempt to reveal general biophysical trends behind the secondary structure formation in the context of protein evolution. Our "protein basis set" consisted of 1143 representatives of different folds obtained from curated SCOPe database, and for each member of the set, the strain and intramolecular energy was calculated on the "rolling tripeptide" basis, employing the DFT-D3/COSMO-RS method for the former and the QM-calibrated force field method (MM) for the latter. The calculated data, strain and interactions, were correlated with the conservation of amino acid residues in secondary structure elements and also with the level of the residue burial within the protein three-dimensional structure. It allowed us to formulate several observations concerning fundamental differences between two main secondary structure motifs: α-helices and β-strands. We have shown that a strong interaction is one of the determining characteristics of the β-sheet formation, at least at the level of tripeptides (and likely penta- or heptapeptides, too), and that the β-strand is a prevailing secondary structure in the strongly-interacting regions of the protein folds conserved by evolution. On the other hand, low strain was neither proven to be an important physicochemical property conserved by evolution nor does it correlate with the propensity for the α-helix and β-strand. Finally, it has been demonstrated that the strong interaction has a certain level of connection with residue burial; however, we demonstrate that these two characteristics should be rather regarded as two complementary factors. These findings represent an important contribution to understanding protein folding from first principles, which is a complementary approach to ongoing efforts to solve the protein folding problem by knowledge-based approaches and machine-learning.
Collapse
Affiliation(s)
- Martin Culka
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Lubomír Rulíšek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| |
Collapse
|
60
|
Pracht P, Bohle F, Grimme S. Automated exploration of the low-energy chemical space with fast quantum chemical methods. Phys Chem Chem Phys 2020; 22:7169-7192. [PMID: 32073075 DOI: 10.1039/c9cp06869d] [Citation(s) in RCA: 1073] [Impact Index Per Article: 214.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
We propose and discuss an efficient scheme for the in silico sampling for parts of the molecular chemical space by semiempirical tight-binding methods combined with a meta-dynamics driven search algorithm. The focus of this work is set on the generation of proper thermodynamic ensembles at a quantum chemical level for conformers, but similar procedures for protonation states, tautomerism and non-covalent complex geometries are also discussed. The conformational ensembles consisting of all significantly populated minimum energy structures normally form the basis of further, mostly DFT computational work, such as the calculation of spectra or macroscopic properties. By using basic quantum chemical methods, electronic effects or possible bond breaking/formation are accounted for and a very reasonable initial energetic ranking of the candidate structures is obtained. Due to the huge computational speedup gained by the fast low-cost quantum chemical methods, overall short computation times even for systems with hundreds of atoms (typically drug-sized molecules) are achieved. Furthermore, specialized applications, such as sampling with implicit solvation models or constrained conformational sampling for transition-states, metal-, surface-, or noncovalently bound complexes are discussed, opening many possible applications in modern computational chemistry and drug discovery. The procedures have been implemented in a freely available computer code called CREST, that makes use of the fast and reliable GFNn-xTB methods.
Collapse
Affiliation(s)
- Philipp Pracht
- Mulliken Center for Theoretical Chemistry, Universität Bonn, Beringstr. 4, 53115 Bonn, Germany.
| | | | | |
Collapse
|
61
|
Kříž K, Řezáč J. Benchmarking of Semiempirical Quantum-Mechanical Methods on Systems Relevant to Computer-Aided Drug Design. J Chem Inf Model 2020; 60:1453-1460. [PMID: 32062970 DOI: 10.1021/acs.jcim.9b01171] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
The semiempirical quantum mechanical (SQM) methods used in drug design are commonly parametrized and tested on data sets of systems that may not be representative models for drug-biomolecule interactions in terms of both size and chemical composition. This is addressed here with a new benchmark data set, PLF547, derived from protein-ligand complexes, consisting of complexes of ligands with protein fragments (such as amino-acid side chains), with interaction energies based on MP2-F12 and DLPNO-CCSD(T) calculations. From these, composite benchmark interaction energies are also built for complexes of the ligand with the complete active site of the protein (PLA15 data set). These data sets are used to test multiple SQM methods with corrections for noncovalent interactions; the role of the solvation model in the calculations is tested as well.
Collapse
Affiliation(s)
- Kristian Kříž
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, 166 10 Prague, Czech Republic.,Department of Physical and Macromolecular Chemistry, Faculty of Science, Charles University, Hlavova 8, 128 40 Praha 2, Czech Republic
| | - Jan Řezáč
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, 166 10 Prague, Czech Republic
| |
Collapse
|
62
|
Chan B. Aqueous-Phase Conformations of Lactose, Maltose, and Sucrose and the Assessment of Low-Cost DFT Methods with the DSCONF Set of Conformers for the Three Disaccharides. J Phys Chem A 2020; 124:582-590. [PMID: 31927999 DOI: 10.1021/acs.jpca.9b10932] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
In the present study, we have examined a range of quantum chemistry methods for the calculation of conformers for lactose, maltose, and sucrose. We find that the DSD-PBE-P86/aug'-cc-pVTZ//B3-LYP-D3BJ/6-311+G(2d,p) protocol yields good relative energies in comparison with reference CCSD(T)/CBS//B3-LYP-D3BJ/maug-cc-pVTZ values. We have surveyed a total of ∼550 conformers for the three disaccharides with the chosen DSD-PBE-P86 method in conjunction with continuum aqueous solvation. In each case, the lowest free energy conformer is characterized by hydrogen bond(s) between the two rings. Another finding is that the major contributors to the overall variations in aqueous free energies are the electronic energies and the solvation energies. To facilitate investigations of larger systems, we have compiled the DSCONF set of conformers for the three disaccharides, and we have assessed lower cost methods with this set. We find MS1-D3/6-31+G(2d,p) to be cost-effective and accurate for both geometry optimization and the calculation of relative energies for disaccharides. In addition, we note that MS1-D3 has previously been found to yield good relative energies for the WATER27 set of water clusters. We thus deem this method to be appropriate for the study of saccharide conformations in both gas phase and aqueous solution.
Collapse
Affiliation(s)
- Bun Chan
- Graduate School of Engineering , Nagasaki University , Bunkyo 1-14 , Nagasaki 852-8521 , Japan
| |
Collapse
|
63
|
Kim M, Gould T, Rocca D, Lebègue S. Establishing the accuracy of density functional approaches for the description of noncovalent interactions in biomolecules. Phys Chem Chem Phys 2020; 22:21685-21695. [DOI: 10.1039/d0cp04137h] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Biomolecules have complex structures, and noncovalent interactions are crucial to determine their conformations and functionalities.
Collapse
Affiliation(s)
- Minho Kim
- Université de Lorraine and CNRS
- LPCT
- UMR 7019
- Vandoeuvre-lès-Nancy 54506
- France
| | - Tim Gould
- Queensland Micro- and Nanotechnology Centre
- Griffith University
- Nathan
- Australia
| | - Dario Rocca
- Université de Lorraine and CNRS
- LPCT
- UMR 7019
- Vandoeuvre-lès-Nancy 54506
- France
| | - Sébastien Lebègue
- Université de Lorraine and CNRS
- LPCT
- UMR 7019
- Vandoeuvre-lès-Nancy 54506
- France
| |
Collapse
|
64
|
Abstract
Estimating the range of three-dimensional structures (conformations) that are available to a molecule is a key component of computer-aided drug design. Quantum mechanical simulation offers improved accuracy over forcefield methods, but at a high computational cost. The question is whether this increased cost can be justified in a context in which high-throughput analysis of large numbers of molecules is often key. This chapter discusses the application of quantum mechanics to conformational searching, with a focus on three key challenges: (1) the generation of ensembles that include a good approximation to a molecule's bioactive conformation at as prominent a ranking as possible; (2) rational analysis and modification of a pre-established bioactive conformation in terms of its energetics; and (3) approximation of real solution-phase conformational ensembles in tandem with NMR data. The impact of QM on the high-throughput application (1) is debatable, meaning that for the moment its primary application is still lower-throughput applications such as (2) and (3). The optimal choice of QM method is also discussed. Rigorous benchmarking suggests that DFT methods are only acceptable when used with large basis sets, but a trickle of papers continue to obtain useful results with relatively low-cost methods, leading to a dilemma that the literature has yet to fully resolve.
Collapse
|
65
|
Lu X, Duchimaza-Heredia J, Cui Q. Analysis of Density Functional Tight Binding with Natural Bonding Orbitals. J Phys Chem A 2019; 123:7439-7453. [PMID: 31373822 DOI: 10.1021/acs.jpca.9b05072] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The description of chemical bonding by the density functional tight binding (DFTB) model is analyzed using natural bonding orbitals (NBOs) and compared to results from density functional theory (B3LYP/aug-cc-pVTZ) calculations. Several molecular systems have been chosen to represent fairly diverse bonding scenarios that include standard covalent bonds, hypervalent interactions, multicenter bonds, metal-ligand interactions (with and without the pseudo-Jahn-Teller effect), and through-space donor-acceptor interactions. Overall, the results suggest that DFTB3/3OB provides physically sound descriptions for the different bonding scenarios analyzed here, as reflected by the general agreement between DFTB3 and B3LYP NBO properties, such as the nature of the NBOs, the magnitudes of natural charges and bond orders, and the dominant donor-acceptor interactions. The degree of ligand-to-metal charge transfer and the ionic nature of pentavalent phosphate are overestimated, likely reflecting the minimal-basis nature of DFTB3/3OB. Moreover, certain orbital interactions, such as geminal interactions, are observed to be grossly overestimated by DFTB3 for hypervalent phosphate and several transition metal compounds that involve copper and nickel. The study indicates that results from NBO analysis can be instructive for identifying electronic structure descriptions at the approximate quantum-mechanical level that require improvement and thus for guiding the systematic improvement of these methods.
Collapse
Affiliation(s)
- Xiya Lu
- Department of Chemistry and Theoretical Chemistry Institute , University of Wisconsin-Madison , 1101 University Avenue , Madison , Wisconsin 53706 , United States
| | | | | |
Collapse
|
66
|
Zubatyuk R, Smith JS, Leszczynski J, Isayev O. Accurate and transferable multitask prediction of chemical properties with an atoms-in-molecules neural network. SCIENCE ADVANCES 2019; 5:eaav6490. [PMID: 31448325 PMCID: PMC6688864 DOI: 10.1126/sciadv.aav6490] [Citation(s) in RCA: 136] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2018] [Accepted: 06/27/2019] [Indexed: 05/06/2023]
Abstract
Atomic and molecular properties could be evaluated from the fundamental Schrodinger's equation and therefore represent different modalities of the same quantum phenomena. Here, we present AIMNet, a modular and chemically inspired deep neural network potential. We used AIMNet with multitarget training to learn multiple modalities of the state of the atom in a molecular system. The resulting model shows on several benchmark datasets state-of-the-art accuracy, comparable to the results of orders of magnitude more expensive DFT methods. It can simultaneously predict several atomic and molecular properties without an increase in the computational cost. With AIMNet, we show a new dimension of transferability: the ability to learn new targets using multimodal information from previous training. The model can learn implicit solvation energy (SMD method) using only a fraction of the original training data and an archive median absolute deviation error of 1.1 kcal/mol compared to experimental solvation free energies in the MNSol database.
Collapse
Affiliation(s)
- Roman Zubatyuk
- Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
- Interdisciplinary Nanotoxicity Center, Department of Chemistry, Physics and Atmospheric Sciences, Jackson State University, Jackson, MS 39217, USA
| | - Justin S. Smith
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
- Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - Jerzy Leszczynski
- Interdisciplinary Nanotoxicity Center, Department of Chemistry, Physics and Atmospheric Sciences, Jackson State University, Jackson, MS 39217, USA
| | - Olexandr Isayev
- Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
67
|
Culka M, Rulíšek L. Factors Stabilizing β-Sheets in Protein Structures from a Quantum-Chemical Perspective. J Phys Chem B 2019; 123:6453-6461. [PMID: 31287693 DOI: 10.1021/acs.jpcb.9b04866] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Protein folds are determined by the interplay between various (de)stabilizing forces, which can be broadly divided into a local strain of the protein chain and intramolecular interactions. In contrast to the α-helix, the β-sheet secondary protein structure is significantly stabilized by long-range interactions between the individual β-strands. It has been observed that quite diverse amino acid sequences can form a very similar small β-sheet fold, such as in the three-β-strand WW domain. Employing "calibrated" quantum-chemical methods, we show herein on two sequentially diverse examples of the WW domain that the internal strain energy is higher in the β-strands and lower in the loops, while the interaction energy has an opposite trend. Low strain energy computed for peptide sequences in the loop 1 correlates with its postulated early formation in the folding process. The relatively high strain energy within the β-strands (up to 8 kcal mol-1 per amino acid residue) is compensated by even higher intramolecular interaction energy (up to 15 kcal mol-1 per residue). It is shown in a quantitative way that the most conserved residues across the structural family of WW domains have the highest contributions to the intramolecular interaction energy. On the other hand, the residues in the regions with the lowest strain are not conserved. We conclude that the internal interaction energy is the physical quantity tuned by evolution to define the β-sheet protein fold.
Collapse
Affiliation(s)
- Martin Culka
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Flemingovo náměstí 2 , 166 10 Praha 6 , Czech Republic
| | - Lubomír Rulíšek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Flemingovo náměstí 2 , 166 10 Praha 6 , Czech Republic
| |
Collapse
|
68
|
Wappett DA, Goerigk L. Toward a Quantum-Chemical Benchmark Set for Enzymatically Catalyzed Reactions: Important Steps and Insights. J Phys Chem A 2019; 123:7057-7074. [DOI: 10.1021/acs.jpca.9b05088] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
| | - Lars Goerigk
- School of Chemistry, The University of Melbourne, Victoria 3010, Australia
| |
Collapse
|
69
|
Vuong VQ, Nishimoto Y, Fedorov DG, Sumpter BG, Niehaus TA, Irle S. The Fragment Molecular Orbital Method Based on Long-Range Corrected Density-Functional Tight-Binding. J Chem Theory Comput 2019; 15:3008-3020. [PMID: 30998360 DOI: 10.1021/acs.jctc.9b00108] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The presently available linear scaling approaches to density-functional tight-binding (DFTB) based on the fragment molecular orbital (FMO) method are severely impacted by the problem of artificial charge transfer due to the self-interaction error (SIE), which hampers the simulation of zwitterionic systems such as biopolymers or ionic liquids. Here we report an extension of FMO-DFTB where we included a long-range corrected (LC) functional designed to mitigate the DFTB SIE, called the FMO-LC-DFTB method, resulting in a robust method which succeeds in simulating zwitterionic systems. Both energy and analytic gradient are developed for the gas phase and the polarizable continuum model of solvation. The scaling of FMO-LC-DFTB with system size N is shown to be almost linear, O( N1.13-1.28), and its numerical accuracy is established for a variety of representative systems including neutral and charged polypeptides. It is shown that pair interaction energies between fragments for two mini-proteins are in excellent agreement with results from long-range corrected density functional theory. The new method was employed in long time scale (1 ns) molecular dynamics simulations of the tryptophan cage protein (PDB: 1L2Y ) in the gas phase for four different protonation states and in stochastic global minimum structure searches for 1-ethyl-3-methylimidazolium nitrate ionic liquid clusters containing up to 2300 atoms.
Collapse
Affiliation(s)
- Van Quan Vuong
- Bredesen Center for Interdisciplinary Research and Graduate Education , University of Tennessee , Knoxville , Tennessee 37996 , United States
| | - Yoshio Nishimoto
- Fukui Institute for Fundamental Chemistry , Kyoto University , Kyoto 606-8501 , Japan
| | - Dmitri G Fedorov
- Research Center for Computational Design of Advanced Functional Materials (CD-FMat) , National Institute of Advanced Industrial Science and Technology (AIST) , Tsukuba 305-8568 , Japan
| | - Bobby G Sumpter
- Center for Nanophase Materials Sciences and Computational Sciences and Engineering Division , Oak Ridge National Laboratory , Oak Ridge , Tennessee 37831 , United States
| | - Thomas A Niehaus
- Univ Lyon, Université Claude Bernard Lyon 1 , CNRS, Institut Lumière Matière , F-69622 Villeurbanne , France
| | - Stephan Irle
- Bredesen Center for Interdisciplinary Research and Graduate Education , University of Tennessee , Knoxville , Tennessee 37996 , United States.,Center for Nanophase Materials Sciences and Computational Sciences and Engineering Division , Oak Ridge National Laboratory , Oak Ridge , Tennessee 37831 , United States.,Chemical Sciences Division , Oak Ridge National Laboratory , Oak Ridge , Tennessee 37831 , United States
| |
Collapse
|
70
|
Culka M, Galgonek J, Vymětal J, Vondrášek J, Rulíšek L. Toward Ab Initio Protein Folding: Inherent Secondary Structure Propensity of Short Peptides from the Bioinformatics and Quantum-Chemical Perspective. J Phys Chem B 2019; 123:1215-1227. [PMID: 30645123 DOI: 10.1021/acs.jpcb.8b09245] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
By combining bioinformatics with quantum-chemical calculations, we attempt to address quantitatively some of the physical principles underlying protein folding. The former allowed us to identify tripeptide sequences in existing protein three-dimensional structures with a strong preference for either helical or extended structure. The selected representatives of pro-helical and pro-extended sequences were converted into "isolated" tripeptides-capped at N- and C-termini-and these were subjected to an extensive conformational sampling and geometry optimization (typically thousands to tens of thousands of conformers for each tripeptide). For each conformer, the QM(DFT-D3)/COSMO-RS free-energy value was then calculated, Gconf(solv). The Δ Gconf(solv) is expected to provide an objective, unbiased, and quantitatively accurate measure of the conformational preference of the particular tripeptide sequence. It has been shown that irrespective of the helical vs extended preferences of the selected tripeptide sequences in context of the protein, most of the low-energy conformers of isolated tripeptides prefer the R-helical structure. Nevertheless, pro-helical tripeptides show slightly stronger helix preference than their pro-extended counterparts. Furthermore, when the sampling is repeated in the presence of a partner tripeptide to mimic the situation in a β-sheet, pro-extended tripeptides (exemplified by the VIV) show a larger free-energy benefit than pro-helical tripeptides (exemplified by the EAM). This effect is even more pronounced in a hydrophobic solvent, which mimics the less polar parts of a protein. This is in line with our bioinformatic results showing that the majority of pro-extended tripeptides are hydrophobic. The preference for a specific secondary structure by the studied tripeptides is thus governed by the plasticity to adopt to its environment. In addition, we show that most of the "naturally occurring" conformations of tripeptide sequences, i.e., those found in existing three-dimensional protein structures, are within ∼10 kcal·mol-1 from their global minima. In summary, our "ab initio" data suggest that complex protein structures may start to emerge already at the level of their small oligopeptidic units, which is in line with a hierarchical nature of protein folding.
Collapse
Affiliation(s)
- Martin Culka
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Flemingovo náměstí 2 , 166 10 , Praha 6 , Czech Republic
| | - Jakub Galgonek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Flemingovo náměstí 2 , 166 10 , Praha 6 , Czech Republic
| | - Jiří Vymětal
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Flemingovo náměstí 2 , 166 10 , Praha 6 , Czech Republic
| | - Jiří Vondrášek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Flemingovo náměstí 2 , 166 10 , Praha 6 , Czech Republic
| | - Lubomír Rulíšek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Flemingovo náměstí 2 , 166 10 , Praha 6 , Czech Republic
| |
Collapse
|
71
|
PEPCONF, a diverse data set of peptide conformational energies. Sci Data 2019; 6:180310. [PMID: 30667382 PMCID: PMC6343515 DOI: 10.1038/sdata.2018.310] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Accepted: 11/30/2018] [Indexed: 12/13/2022] Open
Abstract
We present an extensive and diverse database of peptide conformational energies. Our database contains five different classes of model geometries: dipeptides, tripeptides, and disulfide-bridged, bioactive, and cyclic peptides. In total, the database consists of 3775 conformational energy data points and 4530 conformer geometries. All the reference energies have been calculated at the LC-ωPBE-XDM/aug-cc-pVTZ level of theory, which is shown to yield conformational energies with an accuracy in the order of tenths of a kcal/mol when compared to complete-basis-set coupled-cluster reference data. The peptide conformational data set (PEPCONF) is presented as a high-quality reference set for the development and benchmarking of molecular-mechanics and semi-empirical electronic structure methods, which are the most commonly used techniques in the modeling of medium to large proteins.
Collapse
|
72
|
Huang SR, Liu Y, Tureček F. Non-covalent complexes of the peptide fragment Gly-Asn-Asn-Gln-Gln-Asn-Tyr in the gas-phase. Photodissociative cross-linking, Born–Oppenheimer molecular dynamics, and ab initio computational binding study. Phys Chem Chem Phys 2019; 21:2046-2056. [DOI: 10.1039/c8cp06893c] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Photodissociative cross-linking in the Gly-Asn-Asn-Gln-Gln-Asn-Tyr peptide ion complex revealed non-bonding interactions between the peptide units.
Collapse
Affiliation(s)
- Shu R. Huang
- Department of Chemistry
- University of Washington
- Seattle
- USA
| | - Yang Liu
- Department of Chemistry
- University of Washington
- Seattle
- USA
| | | |
Collapse
|
73
|
Sharapa DI, Genaev A, Cavallo L, Minenkov Y. A Robust and Cost‐Efficient Scheme for Accurate Conformational Energies of Organic Molecules. Chemphyschem 2018; 20:92-102. [DOI: 10.1002/cphc.201801063] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Indexed: 11/09/2022]
Affiliation(s)
- Dmitry I. Sharapa
- Institute of Catalysis Research and TechnologyKarlsruhe Institute of Technology (KIT) Hermann-von-Helmholtz Platz 1 Eggenstein-Leopoldshafen D-76344 Germany
| | - Alexander Genaev
- Vorozhtsov Novosibirsk Institute of Organic Chemistry Academician Lavrent'ev Ave., 9 Novosibirsk 630090 Russian Federation
| | - Luigi Cavallo
- KAUST Catalysis Center (KCC)King Abdullah University of Science and Technology Thuwal- 23955-6900 Saudi Arabia
| | - Yury Minenkov
- Moscow Institute of Physics and Technology Institutskiy Pereulok 9, Dolgoprudny Moscow Region 141700 Russia
| |
Collapse
|
74
|
Loboda OA, Dolgonos GA, Boese AD. Towards hybrid density functional calculations of molecular crystals via fragment-based methods. J Chem Phys 2018; 149:124104. [DOI: 10.1063/1.5046908] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Oleksandr A. Loboda
- Institute of Chemistry, University of Graz, Heinrichstrasse 28/IV, A-8010 Graz, Austria
| | - Grygoriy A. Dolgonos
- Institute of Chemistry, University of Graz, Heinrichstrasse 28/IV, A-8010 Graz, Austria
| | - A. Daniel Boese
- Institute of Chemistry, University of Graz, Heinrichstrasse 28/IV, A-8010 Graz, Austria
| |
Collapse
|
75
|
Řezáč J, Greenwell C, Beran GJO. Accurate Noncovalent Interactions via Dispersion-Corrected Second-Order Møller-Plesset Perturbation Theory. J Chem Theory Comput 2018; 14:4711-4721. [PMID: 30086225 DOI: 10.1021/acs.jctc.8b00548] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Noncovalent interactions govern many important areas of chemistry, ranging from biomolecules to molecular crystals. Here, an accurate and computationally inexpensive dispersion-corrected second-order Møller-Plesset perturbation theory model (MP2D) is presented. MP2D recasts the highly successful dispersion-corrected MP2C model in a framework based on Grimme's D3 dispersion correction, combining Grimme's D3 dispersion coefficients with new analogous uncoupled Hartree-Fock ones and five global empirical parameters. MP2D is faster than MP2C, and unlike MP2C, it is suitable for geometry optimizations and can describe both intra- and intermolecular noncovalent interactions with high accuracy. MP2D approaches the accuracy of higher-level ab initio wave function techniques and out-performs a widely used hybrid dispersion-corrected density functional on a range of intermolecular, intramolecular, and thermochemical benchmarks.
Collapse
Affiliation(s)
- Jan Řezáč
- Institute of Organic Chemistry and Biochemistry , Czech Academy of Sciences , 166 10 Prague , Czech Republic
| | - Chandler Greenwell
- Department of Chemistry , University of California-Riverside , Riverside , California 92521 United States
| | - Gregory J O Beran
- Department of Chemistry , University of California-Riverside , Riverside , California 92521 United States
| |
Collapse
|
76
|
Gutten O, Bím D, Řezáč J, Rulíšek L. Macrocycle Conformational Sampling by DFT-D3/COSMO-RS Methodology. J Chem Inf Model 2017; 58:48-60. [PMID: 29182321 DOI: 10.1021/acs.jcim.7b00453] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
To find and calibrate a robust and reliable computational protocol for mapping conformational space of medium-sized molecules, exhaustive conformational sampling has been carried out for a series of seven macrocyclic compounds of varying ring size and one acyclic analogue. While five of them were taken from the MD/LLMOD/force field study by Shelley and co-workers ( Watts , K. S. ; Dalal , P. ; Tebben , A. J. ; Cheney , D. L. ; Shelley , J. C. Macrocycle Conformational Sampling with MacroModel . J. Chem. Inf. MODEL 2014 , 54 , 2680 - 2696 ), three represent potential macrocyclic inhibitors of human cyclophilin A. The free energy values (GDFT/COSMO-RS) for all of the conformers of each compound were obtained by a composite protocol based on in vacuo quantum mechanics (DFT-D3 method in a large basis set), standard gas-phase thermodynamics, and the COSMO-RS solvation model. The GDFT/COSMO-RS values were used as the reference for evaluating the performance of conformational sampling algorithms: standard and extended MD/LLMOD search (simulated-annealing molecular dynamics with low-lying eigenvector following algorithms, employing the OPLS2005 force field plus GBSA solvation) available in MacroModel and plain molecular dynamics (MD) sampling at high temperature (1000 K) using the semiempirical quantum mechanics (SQM) potential SQM(PM6-D3H4/COSMO) followed by energy minimization of the snapshots. It has been shown that the former protocol (MD/LLMOD) may provide a more complete set of initial structures that ultimately leads to the identification of a greater number of low-energy conformers (as assessed by GDFT/COSMO-RS) than the latter (i.e., plain SQM MD). The CPU time needed to fully evaluate one medium-sized compound (∼100 atoms, typically resulting in several hundred or a few thousand conformers generated and treated quantum-mechanically) is approximately 1,000-100,000 CPU hours on today's computers, which transforms to 1-7 days on a small-sized computer cluster with a few hundred CPUs. Finally, our data sets based on the rigorous quantum-chemical approach allow us to formulate a composite conformational sampling protocol with multiple checkpoints and truncation of redundant structural data that offers superior insights at affordable computational cost.
Collapse
Affiliation(s)
- Ondrej Gutten
- The Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Gilead Sciences Research Center & IOCB, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Daniel Bím
- The Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Gilead Sciences Research Center & IOCB, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Jan Řezáč
- The Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Gilead Sciences Research Center & IOCB, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Lubomír Rulíšek
- The Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Gilead Sciences Research Center & IOCB, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| |
Collapse
|