1
|
Forouzesh N, Ghafouri F, Tolokh IS, Onufriev AV. Optimal Dielectric Boundary for Binding Free Energy Estimates in the Implicit Solvent. J Chem Inf Model 2024; 64:9433-9448. [PMID: 39656550 DOI: 10.1021/acs.jcim.4c01190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2024]
Abstract
Accuracy of binding free energy calculations utilizing implicit solvent models is critically affected by parameters of the underlying dielectric boundary, specifically, the atomic and water probe radii. Here, a multidimensional optimization pipeline is used to find optimal atomic radii, specifically for binding calculations in the implicit solvent. To reduce overfitting, the optimization target includes separate, weighted contributions from both binding and hydration free energies. The resulting five-parameter radii set, OPT_BIND5D, is evaluated against experiment for binding free energies of 20 host-guest (H-G) systems, unrelated to the types of structures used in the training. The resulting accuracy for this H-G test set (root mean square error of 2.03 kcal/mol, mean signed error of -0.13 kcal/mol, mean absolute error of 1.68 kcal/mol, and Pearson's correlation of r = 0.79 with the experimental values) is on par with what can be expected from the fixed charge explicit solvent models. Best agreement with the experiment is achieved when the implicit salt concentration is set equal or close to the experimental conditions.
Collapse
Affiliation(s)
- Negin Forouzesh
- Department of Computer Science, California State University, Los Angeles, California 90032, United States
| | - Fatemeh Ghafouri
- Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| | - Igor S Tolokh
- Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| | - Alexey V Onufriev
- Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Department of Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Center for Soft Matter and Biological Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| |
Collapse
|
2
|
Kolesnikov ES, Xiong Y, Onufriev AV. Implicit Solvent with Explicit Ions Generalized Born Model in Molecular Dynamics: Application to DNA. J Chem Theory Comput 2024; 20:8724-8739. [PMID: 39283928 PMCID: PMC11465471 DOI: 10.1021/acs.jctc.4c00833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 08/27/2024] [Accepted: 08/28/2024] [Indexed: 09/21/2024]
Abstract
The ion atmosphere surrounding highly charged biomolecules, such as nucleic acids, is crucial for their dynamics, structure, and interactions. Here, we develop an approach for the explicit treatment of ions within an implicit solvent framework suitable for atomistic simulations of biomolecules. The proposed implicit solvent/explicit ions model, GBION, is based on a modified generalized Born (GB) model; it includes separate, modified GB terms for solute-ion and ion-ion interactions. The model is implemented in the AMBER package (version 24), and its performance is thoroughly investigated in atomistic molecular dynamics (MD) simulations of double-stranded DNA on a microsecond time scale. The aggregate characteristics of monovalent (Na+ and K+) and trivalent (Cobalt Hexammine, CoHex3+) counterion distributions around double-stranded DNA predicted by the model are in reasonable agreement with the experiment (where available), all-atom explicit water MD simulations, and the expectation from the Manning condensation theory. The radial distributions of monovalent cations around DNA are reasonably close to the ones obtained using the explicit water model: expressed in units of energy, the maximum deviations of local ion concentrations from the explicit solvent reference are within 1 kBT, comparable to the corresponding deviations expected between different established explicit water models. The proposed GBION model is able to simulate DNA fragments in a large volume of solvent with explicit ions with little additional computational overhead compared with the fully implicit GB treatment of ions. Ions simulated using the developed model explore conformational space at least 2 orders of magnitude faster than in the explicit solvent. These advantages allowed us to observe and explore an unexpected "stacking" mode of DNA condensation in the presence of trivalent counterions (CoHex3+) that was revealed by recent experiments.
Collapse
Affiliation(s)
- Egor S. Kolesnikov
- Department
of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Yeyue Xiong
- Department
of Biomedical Engineering and Mechanics, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Alexey V. Onufriev
- Departments
of Computer Science and Physics, Center for Soft Matter and Biological
Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
3
|
Tolokh IS, Folescu DE, Onufriev AV. Inclusion of Water Multipoles into the Implicit Solvation Framework Leads to Accuracy Gains. J Phys Chem B 2024; 128:5855-5873. [PMID: 38860842 PMCID: PMC11194828 DOI: 10.1021/acs.jpcb.4c00254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 05/28/2024] [Accepted: 05/29/2024] [Indexed: 06/12/2024]
Abstract
The current practical "workhorses" of the atomistic implicit solvation─the Poisson-Boltzmann (PB) and generalized Born (GB) models─face fundamental accuracy limitations. Here, we propose a computationally efficient implicit solvation framework, the Implicit Water Multipole GB (IWM-GB) model, that systematically incorporates the effects of multipole moments of water molecules in the first hydration shell of a solute, beyond the dipole water polarization already present at the PB/GB level. The framework explicitly accounts for coupling between polar and nonpolar contributions to the total solvation energy, which is missing from many implicit solvation models. An implementation of the framework, utilizing the GAFF force field and AM1-BCC atomic partial charges model, is parametrized and tested against the experimental hydration free energies of small molecules from the FreeSolv database. The resulting accuracy on the test set (RMSE ∼ 0.9 kcal/mol) is 12% better than that of the explicit solvation (TIP3P) treatment, which is orders of magnitude slower. We also find that the coupling between polar and nonpolar parts of the solvation free energy is essential to ensuring that several features of the IWM-GB model are physically meaningful, including the sign of the nonpolar contributions.
Collapse
Affiliation(s)
- Igor S. Tolokh
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Dan E. Folescu
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department
of Mathematics, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Alexey V. Onufriev
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department
of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
- Center
for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
4
|
Bass L, Elder LH, Folescu DE, Forouzesh N, Tolokh IS, Karpatne A, Onufriev AV. Improving the Accuracy of Physics-Based Hydration-Free Energy Predictions by Machine Learning the Remaining Error Relative to the Experiment. J Chem Theory Comput 2024; 20:396-410. [PMID: 38149593 PMCID: PMC10950260 DOI: 10.1021/acs.jctc.3c00981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
The accuracy of computational models of water is key to atomistic simulations of biomolecules. We propose a computationally efficient way to improve the accuracy of the prediction of hydration-free energies (HFEs) of small molecules: the remaining errors of the physics-based models relative to the experiment are predicted and mitigated by machine learning (ML) as a postprocessing step. Specifically, the trained graph convolutional neural network attempts to identify the "blind spots" in the physics-based model predictions, where the complex physics of aqueous solvation is poorly accounted for, and partially corrects for them. The strategy is explored for five classical solvent models representing various accuracy/speed trade-offs, from the fast analytical generalized Born (GB) to the popular TIP3P explicit solvent model; experimental HFEs of small neutral molecules from the FreeSolv set are used for the training and testing. For all of the models, the ML correction reduces the resulting root-mean-square error relative to the experiment for HFEs of small molecules, without significant overfitting and with negligible computational overhead. For example, on the test set, the relative accuracy improvement is 47% for the fast analytical GB, making it, after the ML correction, almost as accurate as uncorrected TIP3P. For the TIP3P model, the accuracy improvement is about 39%, bringing the ML-corrected model's accuracy below the 1 kcal/mol threshold. In general, the relative benefit of the ML corrections is smaller for more accurate physics-based models, reaching the lower limit of about 20% relative accuracy gain compared with that of the physics-based treatment alone. The proposed strategy of using ML to learn the remaining error of physics-based models offers a distinct advantage over training ML alone directly on reference HFEs: it preserves the correct overall trend, even well outside of the training set.
Collapse
Affiliation(s)
- Lewis Bass
- Department of Computer Engineering, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Luke H Elder
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Dan E Folescu
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department of Mathematics, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Negin Forouzesh
- Department of Computer Science, California State University, Los Angeles, California 90032, United States
| | - Igor S Tolokh
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Anuj Karpatne
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Alexey V Onufriev
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
- Center for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
5
|
Setiadi J, Boothroyd S, Slochower DR, Dotson DL, Thompson MW, Wagner JR, Wang LP, Gilson MK. Tuning Potential Functions to Host-Guest Binding Data. J Chem Theory Comput 2024; 20:239-252. [PMID: 38147689 PMCID: PMC10838530 DOI: 10.1021/acs.jctc.3c01050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
Software to more rapidly and accurately predict protein-ligand binding affinities is of high interest for early-stage drug discovery, and physics-based methods are among the most widely used technologies for this purpose. The accuracy of these methods depends critically on the accuracy of the potential functions that they use. Potential functions are typically trained against a combination of quantum chemical and experimental data. However, although binding affinities are among the most important quantities to predict, experimental binding affinities have not to date been integrated into the experimental data set used to train potential functions. In recent years, the use of host-guest complexes as simple and tractable models of binding thermodynamics has gained popularity due to their small size and simplicity, relative to protein-ligand systems. Host-guest complexes can also avoid ambiguities that arise in protein-ligand systems such as uncertain protonation states. Thus, experimental host-guest binding data are an appealing additional data type to integrate into the experimental data set used to optimize potential functions. Here, we report the extension of the Open Force Field Evaluator framework to enable the systematic calculation of host-guest binding free energies and their gradients with respect to force field parameters, coupled with the curation of 126 host-guest complexes with available experimental binding free energies. As an initial application of this novel infrastructure, we optimized generalized Born (GB) cavity radii for the OBC2 GB implicit solvent model against experimental data for 36 host-guest systems. This refitting led to a dramatic improvement in accuracy for both the training set and a separate test set with 90 additional host-guest systems. The optimized radii also showed encouraging transferability from host-guest systems to 59 protein-ligand systems. However, the new radii are significantly smaller than the baseline radii and lead to excessively favorable hydration free energies (HFEs). Thus, users of the OBC2 GB model currently may choose between GB cavity radii that yield more accurate binding affinities and GB cavity radii that yield more accurate HFEs. We suspect that achieving good accuracy on both will require more far-reaching adjustments to the GB model. We note that binding free-energy calculations using the OBC2 model in OpenMM gain about a 10× speedup relative to corresponding explicit solvent calculations, suggesting a future role for implicit solvent absolute binding free-energy (ABFE) calculations in virtual compound screening. This study proves the principle of using host-guest systems to train potential functions that are transferrable to protein-ligand systems and provides an infrastructure that enables a range of applications.
Collapse
Affiliation(s)
- Jeffry Setiadi
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9255 Pharmacy Lane, La Jolla, California 92093, United States
| | - Simon Boothroyd
- Boothroyd Scientific Consulting Ltd., London WC2H 9JQ, U.K
- Psivant Therapeutics, Boston, Massachusetts 02210, United States
| | | | - David L Dotson
- Datryllic LLC, Phoenix, Arizona 85003, United States
- The Open Force Field Consortium, Open Molecular Software Foundation, Davis, California 95616, United States
| | - Matthew W Thompson
- The Open Force Field Consortium, Open Molecular Software Foundation, Davis, California 95616, United States
| | - Jeffrey R Wagner
- The Open Force Field Consortium, Open Molecular Software Foundation, Davis, California 95616, United States
| | - Lee-Ping Wang
- Chemistry Department, University of California Davis, Davis, California 95616, United States
| | - Michael K Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9255 Pharmacy Lane, La Jolla, California 92093, United States
| |
Collapse
|
6
|
Corrigan RA, Thiel AC, Lynn JR, Casavant TL, Ren P, Ponder JW, Schnieders MJ. A generalized Kirkwood implicit solvent for the polarizable AMOEBA protein model. J Chem Phys 2023; 159:054102. [PMID: 37526158 PMCID: PMC10396400 DOI: 10.1063/5.0158914] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 07/17/2023] [Indexed: 08/02/2023] Open
Abstract
Computational simulation of biomolecules can provide important insights into protein design, protein-ligand binding interactions, and ab initio biomolecular folding, among other applications. Accurate treatment of the solvent environment is essential in such applications, but the use of explicit solvents can add considerable cost. Implicit treatment of solvent effects using a dielectric continuum model is an attractive alternative to explicit solvation since it is able to describe solvation effects without the inclusion of solvent degrees of freedom. Previously, we described the development and parameterization of implicit solvent models for small molecules. Here, we extend the parameterization of the generalized Kirkwood (GK) implicit solvent model for use with biomolecules described by the AMOEBA force field via the addition of corrections to the calculation of effective radii that account for interstitial spaces that arise within biomolecules. These include element-specific pairwise descreening scale factors, a short-range neck contribution to describe the solvent-excluded space between pairs of nearby atoms, and finally tanh-based rescaling of the overall descreening integral. We then apply the AMOEBA/GK implicit solvent to a set of ten proteins and achieve an average coordinate root mean square deviation for the experimental structures of 2.0 Å across 500 ns simulations. Overall, the continued development of implicit solvent models will help facilitate the simulation of biomolecules on mechanistically relevant timescales.
Collapse
Affiliation(s)
- Rae A. Corrigan
- Roy J. Carver Department of Biomedical Engineering, The University of Iowa, Iowa City, Iowa 52242, USA
| | - Andrew C. Thiel
- Roy J. Carver Department of Biomedical Engineering, The University of Iowa, Iowa City, Iowa 52242, USA
| | - Jack R. Lynn
- Roy J. Carver Department of Biomedical Engineering, The University of Iowa, Iowa City, Iowa 52242, USA
| | - Thomas L. Casavant
- Department of Electrical and Computer Engineering, The University of Iowa, Iowa City, Iowa 52242, USA
| | - Pengyu Ren
- Department of Biomedical Engineering, The University of Texas in Austin, Austin, Texas 78712, USA
| | - Jay W. Ponder
- Department of Chemistry, Washington University in St. Louis, St. Louis, Missouri 63130, USA
| | | |
Collapse
|
7
|
Parkin D, Takano M. The Intrinsic Radius as a Key Parameter in the Generalized Born Model to Adjust Protein-Protein Electrostatic Interaction. Int J Mol Sci 2023; 24:ijms24054700. [PMID: 36902130 PMCID: PMC10003186 DOI: 10.3390/ijms24054700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 01/30/2023] [Accepted: 02/19/2023] [Indexed: 03/05/2023] Open
Abstract
The generalized Born (GB) model is an extension of the continuum dielectric theory of Born solvation energy and is a powerful method for accelerating the molecular dynamic (MD) simulations of charged biological molecules in water. While the effective dielectric constant of water that varies as a function of the separation distance between solute molecules is incorporated into the GB model, adjustment of the parameters is indispensable for accurate calculation of the Coulomb (electrostatic) energy. One of the key parameters is the lower limit of the spatial integral of the energy density of the electric field around a charged atom, known as the intrinsic radius ρ. Although ad hoc adjustment of ρ has been conducted to improve the Coulombic (ionic) bond stability, the physical mechanism by which ρ affects the Coulomb energy remains unclear. Via energetic analysis of three differently sized systems, here, we clarify that the Coulomb bond stability increases with increasing ρ and that the increased stability is caused by the interaction energy term, not by the self-energy (desolvation energy) term, as was supposed previously. Our results suggest that the use of larger values for the intrinsic radii of hydrogen and oxygen atoms, together with the use of a relatively small value for the spatial integration cutoff in the GB model, can better reproduce the Coulombic attraction between protein molecules.
Collapse
Affiliation(s)
- Dan Parkin
- Research Institute for Science and Engineering, Waseda University, Okubo 3-4-1, Sinjuku-ku, Tokyo 169-8555, Japan
| | - Mitsunori Takano
- Research Institute for Science and Engineering, Waseda University, Okubo 3-4-1, Sinjuku-ku, Tokyo 169-8555, Japan
- Department of Pure and Applied Physics, Waseda University, Okubo 3-4-1, Sinjuku-ku, Tokyo 169-8555, Japan
- Correspondence: ; Tel.: +81-3-5286-3512
| |
Collapse
|
8
|
Qing R, Hao S, Smorodina E, Jin D, Zalevsky A, Zhang S. Protein Design: From the Aspect of Water Solubility and Stability. Chem Rev 2022; 122:14085-14179. [PMID: 35921495 PMCID: PMC9523718 DOI: 10.1021/acs.chemrev.1c00757] [Citation(s) in RCA: 102] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Indexed: 12/13/2022]
Abstract
Water solubility and structural stability are key merits for proteins defined by the primary sequence and 3D-conformation. Their manipulation represents important aspects of the protein design field that relies on the accurate placement of amino acids and molecular interactions, guided by underlying physiochemical principles. Emulated designer proteins with well-defined properties both fuel the knowledge-base for more precise computational design models and are used in various biomedical and nanotechnological applications. The continuous developments in protein science, increasing computing power, new algorithms, and characterization techniques provide sophisticated toolkits for solubility design beyond guess work. In this review, we summarize recent advances in the protein design field with respect to water solubility and structural stability. After introducing fundamental design rules, we discuss the transmembrane protein solubilization and de novo transmembrane protein design. Traditional strategies to enhance protein solubility and structural stability are introduced. The designs of stable protein complexes and high-order assemblies are covered. Computational methodologies behind these endeavors, including structure prediction programs, machine learning algorithms, and specialty software dedicated to the evaluation of protein solubility and aggregation, are discussed. The findings and opportunities for Cryo-EM are presented. This review provides an overview of significant progress and prospects in accurate protein design for solubility and stability.
Collapse
Affiliation(s)
- Rui Qing
- State
Key Laboratory of Microbial Metabolism, School of Life Sciences and
Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- Media
Lab, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- The
David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Shilei Hao
- Media
Lab, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- Key
Laboratory of Biorheological Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing 400030, China
| | - Eva Smorodina
- Department
of Immunology, University of Oslo and Oslo
University Hospital, Oslo 0424, Norway
| | - David Jin
- Avalon GloboCare
Corp., Freehold, New Jersey 07728, United States
| | - Arthur Zalevsky
- Laboratory
of Bioinformatics Approaches in Combinatorial Chemistry and Biology, Shemyakin−Ovchinnikov Institute of Bioorganic
Chemistry RAS, Moscow 117997, Russia
| | - Shuguang Zhang
- Media
Lab, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
9
|
Conti S, Ovchinnikov V, Karplus M. ppdx: Automated modeling of protein-protein interaction descriptors for use with machine learning. J Comput Chem 2022; 43:1747-1757. [PMID: 35930347 DOI: 10.1002/jcc.26974] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 07/01/2022] [Accepted: 07/13/2022] [Indexed: 11/07/2022]
Abstract
This paper describes ppdx, a python workflow tool that combines protein sequence alignment, homology modeling, and structural refinement, to compute a broad array of descriptors for characterizing protein-protein interactions. The descriptors can be used to predict various properties of interest, such as protein-protein binding affinities, or inhibitory concentrations (IC50 ), using approaches that range from simple regression to more complex machine learning models. The software is highly modular. It supports different protocols for generating structures, and 95 descriptors can be currently computed. More protocols and descriptors can be easily added. The implementation is highly parallel and can fully exploit the available cores in a single workstation, or multiple nodes on a supercomputer, allowing many systems to be analyzed simultaneously. As an illustrative application, ppdx is used to parametrize a model that predicts the IC50 of a set of antigens and a class of antibodies directed to the influenza hemagglutinin stalk.
Collapse
Affiliation(s)
- Simone Conti
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Victor Ovchinnikov
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts, USA
| | - Martin Karplus
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts, USA.,Laboratoire de Chimie Biophysique, Institut de Science et d'Ingénierie Supramoléculaires, Université de Strasbourg, Strasbourg, France
| |
Collapse
|
10
|
Folescu DE, Onufriev AV. A Closed-Form, Analytical Approximation for Apparent Surface Charge and Electric Field of Molecules. ACS OMEGA 2022; 7:26123-26136. [PMID: 35936397 PMCID: PMC9352323 DOI: 10.1021/acsomega.2c01484] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Accepted: 06/10/2022] [Indexed: 06/15/2023]
Abstract
Closed-form, analytical approximations for electrostatic properties of molecules are of unique value as these can provide computational speed, versatility, and physical insight. Here, we have derived a simple, closed-form formula for the apparent surface charge (ASC) as well as for the electric field generated by a molecular charge distribution in aqueous solution. The approximation, with no fitted parameters, was tested against numerical solutions of the Poisson equation, where it has produced a significant speed-up. For neutral small molecules, the hydration free energies estimated from the closed-form ASC formula are within 0.8 kcal/mol RMSD from the numerical Poisson reference; the electric field at the surface is in quantitative agreement with the reference. Performance of the approximation was also tested on larger structures, including a protein, a DNA fragment, and a viral receptor-target complex. For all structures tested, a near-quantitative agreement with the numerical Poisson reference was achieved, except in regions of high negative curvature, where the new approximation is still qualitatively correct. A unique efficiency feature of the proposed "source-based″ closed-form approximation is that the ASC and electric field can be estimated individually at any point or surface patch, without the need to obtain the full global solution. An open-source software implementation of the method is available: https://people.cs.vt.edu/~onufriev/CODES/aasc.zip.
Collapse
Affiliation(s)
- Dan E. Folescu
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Alexey V. Onufriev
- Department
of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States
- Department
of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
- Center
for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
11
|
Sun Q, Biswas A, Vijayan RSK, Craveur P, Forli S, Olson AJ, Castaner AE, Kirby KA, Sarafianos SG, Deng N, Levy R. Structure-based virtual screening workflow to identify antivirals targeting HIV-1 capsid. J Comput Aided Mol Des 2022; 36:193-203. [PMID: 35262811 PMCID: PMC8904208 DOI: 10.1007/s10822-022-00446-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 02/24/2022] [Indexed: 02/07/2023]
Abstract
We have identified novel HIV-1 capsid inhibitors targeting the PF74 binding site. Acting as the building block of the HIV-1 capsid core, the HIV-1 capsid protein plays an important role in the viral life cycle and is an attractive target for antiviral development. A structure-based virtual screening workflow for hit identification was employed, which includes docking 1.6 million commercially-available drug-like compounds from the ZINC database to the capsid dimer, followed by applying two absolute binding free energy (ABFE) filters on the 500 top-ranked molecules from docking. The first employs the Binding Energy Distribution Analysis Method (BEDAM) in implicit solvent. The top-ranked compounds are then refined using the Double Decoupling method in explicit solvent. Both docking and BEDAM refinement were carried out on the IBM World Community Grid as part of the FightAIDS@Home project. Using this virtual screening workflow, we identified 24 molecules with calculated binding free energies between − 6 and − 12 kcal/mol. We performed thermal shift assays on these molecules to examine their potential effects on the stability of HIV-1 capsid hexamer and found that two compounds, ZINC520357473 and ZINC4119064 increased the melting point of the latter by 14.8 °C and 33 °C, respectively. These results support the conclusion that the two ZINC compounds are primary hits targeting the capsid dimer interface. Our simulations also suggest that the two hit molecules may bind at the capsid dimer interface by occupying a new sub-pocket that has not been exploited by existing CA inhibitors. The possible causes for why other top-scored compounds suggested by ABFE filters failed to show measurable activity are discussed.
Collapse
Affiliation(s)
- Qinfang Sun
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, PA, 19122, USA
| | - Avik Biswas
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, PA, 19122, USA
| | - R S K Vijayan
- Institute for Applied Cancer Science, MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Pierrick Craveur
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
| | - Stefano Forli
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
| | - Arthur J Olson
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
| | - Andres Emanuelli Castaner
- Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, Atlanta, GA, 30322, USA.,Children's Healthcare of Atlanta, Atlanta, GA, 30322, USA
| | - Karen A Kirby
- Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, Atlanta, GA, 30322, USA.,Children's Healthcare of Atlanta, Atlanta, GA, 30322, USA
| | - Stefan G Sarafianos
- Laboratory of Biochemical Pharmacology, Department of Pediatrics, Emory University School of Medicine, Atlanta, GA, 30322, USA.,Children's Healthcare of Atlanta, Atlanta, GA, 30322, USA
| | - Nanjie Deng
- Department of Chemistry and Physical Sciences, Pace University, New York, NY, 10038, USA.
| | - Ronald Levy
- Center for Biophysics and Computational Biology and Department of Chemistry, Temple University, Philadelphia, PA, 19122, USA
| |
Collapse
|
12
|
Gallicchio E. Free Energy-Based Computational Methods for the Study of Protein-Peptide Binding Equilibria. Methods Mol Biol 2022; 2405:303-334. [PMID: 35298820 DOI: 10.1007/978-1-0716-1855-4_15] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
This chapter discusses the theory and application of physics-based free energy methods to estimate protein-peptide binding free energies. It presents a statistical mechanics formulation of molecular binding, which is then specialized in three methodologies: (1) alchemical absolute binding free energy estimation with implicit solvation, (2) alchemical relative binding free energy estimation with explicit solvation, and (3) potential of mean force binding free energy estimation. Case studies of protein-peptide binding application taken from the recent literature are discussed for each method.
Collapse
Affiliation(s)
- Emilio Gallicchio
- Department of Chemistry, Ph.D. Program in Biochemistry and Ph.D. Program in Chemistry at The Graduate Center of the City University of New York, Brooklyn College of the City University of New York, New York, NY, USA.
| |
Collapse
|
13
|
Gong X, Zhang Y, Chen J. Advanced Sampling Methods for Multiscale Simulation of Disordered Proteins and Dynamic Interactions. Biomolecules 2021; 11:1416. [PMID: 34680048 PMCID: PMC8533332 DOI: 10.3390/biom11101416] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 09/22/2021] [Accepted: 09/24/2021] [Indexed: 11/16/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) are highly prevalent and play important roles in biology and human diseases. It is now also recognized that many IDPs remain dynamic even in specific complexes and functional assemblies. Computer simulations are essential for deriving a molecular description of the disordered protein ensembles and dynamic interactions for a mechanistic understanding of IDPs in biology, diseases, and therapeutics. Here, we provide an in-depth review of recent advances in the multi-scale simulation of disordered protein states, with a particular emphasis on the development and application of advanced sampling techniques for studying IDPs. These techniques are critical for adequate sampling of the manifold functionally relevant conformational spaces of IDPs. Together with dramatically improved protein force fields, these advanced simulation approaches have achieved substantial success and demonstrated significant promise towards the quantitative and predictive modeling of IDPs and their dynamic interactions. We will also discuss important challenges remaining in the atomistic simulation of larger systems and how various coarse-grained approaches may help to bridge the remaining gaps in the accessible time- and length-scales of IDP simulations.
Collapse
Affiliation(s)
- Xiping Gong
- Department of Chemistry, University of Massachusetts Amherst, Amherst, MA 01003, USA; (X.G.); (Y.Z.)
| | - Yumeng Zhang
- Department of Chemistry, University of Massachusetts Amherst, Amherst, MA 01003, USA; (X.G.); (Y.Z.)
| | - Jianhan Chen
- Department of Chemistry, University of Massachusetts Amherst, Amherst, MA 01003, USA; (X.G.); (Y.Z.)
- Department of Biochemistry and Molecular Biology, University of Massachusetts Amherst, Amherst, MA 01003, USA
| |
Collapse
|
14
|
Melse O, Hecht S, Antes I. DynaBiS: A hierarchical sampling algorithm to identify flexible binding sites for large ligands and peptides. Proteins 2021; 90:18-32. [PMID: 34288078 DOI: 10.1002/prot.26182] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 06/24/2021] [Accepted: 07/11/2021] [Indexed: 11/11/2022]
Abstract
Knowing the ligand or peptide binding site in proteins is highly important to guide drug discovery, but experimental elucidation of the binding site is difficult. Therefore, various computational approaches have been developed to identify potential binding sites in protein structures. However, protein and ligand flexibility are often neglected in these methods due to efficiency considerations despite the recognition that protein-ligand interactions can be strongly affected by mutual structural adaptations. This is particularly true if the binding site is unknown, as the screening will typically be performed based on an unbound protein structure. Herein we present DynaBiS, a hierarchical sampling algorithm to identify flexible binding sites for a target ligand with explicit consideration of protein and ligand flexibility, inspired by our previously presented flexible docking algorithm DynaDock. DynaBiS applies soft-core potentials between the ligand and the protein, thereby allowing a certain protein-ligand overlap resulting in efficient sampling of conformational adaptation effects. We evaluated DynaBiS and other commonly used binding site identification algorithms against a diverse evaluation set consisting of 26 proteins featuring peptide as well as small ligand binding sites. We show that DynaBiS outperforms the other evaluated methods for the identification of protein binding sites for large and highly flexible ligands such as peptides, both with a holo or apo structure used as input.
Collapse
Affiliation(s)
- Okke Melse
- TUM Center for Functional Protein Assemblies and TUM School of Life Sciences, Technische Universität München, Freising, Germany
| | - Sabrina Hecht
- TUM Center for Functional Protein Assemblies and TUM School of Life Sciences, Technische Universität München, Freising, Germany.,Quattro Research, Planegg, Germany
| | - Iris Antes
- TUM Center for Functional Protein Assemblies and TUM School of Life Sciences, Technische Universität München, Freising, Germany
| |
Collapse
|
15
|
Arsiccio A, Shea JE. Protein Cold Denaturation in Implicit Solvent Simulations: A Transfer Free Energy Approach. J Phys Chem B 2021; 125:5222-5232. [DOI: 10.1021/acs.jpcb.1c01694] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- Andrea Arsiccio
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, California 93106, United States
| | - Joan-Emma Shea
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, California 93106, United States
- Department of Physics, University of California, Santa Barbara, California 93106, United States
| |
Collapse
|
16
|
Forouzesh N, Mishra N. An Effective MM/GBSA Protocol for Absolute Binding Free Energy Calculations: A Case Study on SARS-CoV-2 Spike Protein and the Human ACE2 Receptor. Molecules 2021; 26:2383. [PMID: 33923909 PMCID: PMC8074138 DOI: 10.3390/molecules26082383] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 04/09/2021] [Accepted: 04/13/2021] [Indexed: 12/23/2022] Open
Abstract
The binding free energy calculation of protein-ligand complexes is necessary for research into virus-host interactions and the relevant applications in drug discovery. However, many current computational methods of such calculations are either inefficient or inaccurate in practice. Utilizing implicit solvent models in the molecular mechanics generalized Born surface area (MM/GBSA) framework allows for efficient calculations without significant loss of accuracy. Here, GBNSR6, a new flavor of the generalized Born model, is employed in the MM/GBSA framework for measuring the binding affinity between SARS-CoV-2 spike protein and the human ACE2 receptor. A computational protocol is developed based on the widely studied Ras-Raf complex, which has similar binding free energy to SARS-CoV-2/ACE2. Two options for representing the dielectric boundary of the complexes are evaluated: one based on the standard Bondi radii and the other based on a newly developed set of atomic radii (OPT1), optimized specifically for protein-ligand binding. Predictions based on the two radii sets provide upper and lower bounds on the experimental references: -14.7(ΔGbindBondi)<-10.6(ΔGbindExp.)<-4.1(ΔGbindOPT1) kcal/mol. The consensus estimates of the two bounds show quantitative agreement with the experiment values. This work also presents a novel truncation method and computational strategies for efficient entropy calculations with normal mode analysis. Interestingly, it is observed that a significant decrease in the number of snapshots does not affect the accuracy of entropy calculation, while it does lower computation time appreciably. The proposed MM/GBSA protocol can be used to study the binding mechanism of new variants of SARS-CoV-2, as well as other relevant structures.
Collapse
Affiliation(s)
- Negin Forouzesh
- Department of Computer Science, California State University, Los Angeles, CA 90032, USA
| | - Nikita Mishra
- Department of Chemistry and Biochemistry, California State University, Los Angeles, CA 90032, USA;
| |
Collapse
|
17
|
Corrigan RA, Qi G, Thiel AC, Lynn JR, Walker BD, Casavant TL, Lagardere L, Piquemal JP, Ponder JW, Ren P, Schnieders MJ. Implicit Solvents for the Polarizable Atomic Multipole AMOEBA Force Field. J Chem Theory Comput 2021; 17:2323-2341. [PMID: 33769814 DOI: 10.1021/acs.jctc.0c01286] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Computational protein design, ab initio protein/RNA folding, and protein-ligand screening can be too computationally demanding for explicit treatment of solvent. For these applications, implicit solvent offers a compelling alternative, which we describe here for the polarizable atomic multipole AMOEBA force field based on three treatments of continuum electrostatics: numerical solutions to the nonlinear and linearized versions of the Poisson-Boltzmann equation (PBE), the domain-decomposition conductor-like screening model (ddCOSMO) approximation to the PBE, and the analytic generalized Kirkwood (GK) approximation. The continuum electrostatics models are combined with a nonpolar estimator based on novel cavitation and dispersion terms. Electrostatic model parameters are numerically optimized using a least-squares style target function based on a library of 103 small-molecule solvation free energy differences. Mean signed errors for the adaptive Poisson-Boltzmann solver (APBS), ddCOSMO, and GK models are 0.05, 0.00, and 0.00 kcal/mol, respectively, while the mean unsigned errors are 0.70, 0.63, and 0.58 kcal/mol, respectively. Validation of the electrostatic response of the resulting implicit solvents, which are available in the Tinker (or Tinker-HP), OpenMM, and Force Field X software packages, is based on comparisons to explicit solvent simulations for a series of proteins and nucleic acids. Overall, the emergence of performative implicit solvent models for polarizable force fields opens the door to their use for folding and design applications.
Collapse
Affiliation(s)
- Rae A Corrigan
- Roy J Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa 52242, United States
| | - Guowei Qi
- Department of Biochemistry, University of Iowa, Iowa City, Iowa 52242, United States
| | - Andrew C Thiel
- Roy J Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa 52242, United States
| | - Jack R Lynn
- Roy J Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa 52242, United States
| | - Brandon D Walker
- Department of Biomedical Engineering, University of Texas in Austin, Austin, Texas 78712, United States
| | - Thomas L Casavant
- Roy J Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa 52242, United States
| | - Louis Lagardere
- Department of Chemistry, Sorbonne Université, F-75005 Paris, France
| | | | - Jay W Ponder
- Department of Chemistry, Washington University in St. Louis, St. Louis, Missouri 63130, United States
| | - Pengyu Ren
- Department of Biomedical Engineering, University of Texas in Austin, Austin, Texas 78712, United States
| | - Michael J Schnieders
- Roy J Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa 52242, United States.,Department of Biochemistry, University of Iowa, Iowa City, Iowa 52242, United States
| |
Collapse
|
18
|
Herbert JM. Dielectric continuum methods for quantum chemistry. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1519] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- John M. Herbert
- Department of Chemistry and Biochemistry The Ohio State University Columbus Ohio USA
| |
Collapse
|
19
|
Patra J, Singh D, Jain S, Mahindroo N. Application of Docking for Lead Optimization. MOLECULAR DOCKING FOR COMPUTER-AIDED DRUG DESIGN 2021:271-294. [DOI: 10.1016/b978-0-12-822312-3.00012-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2025]
|
20
|
Forouzesh N, Onufriev AV. MMGB/SA Consensus Estimate of the Binding Free Energy Between the Novel Coronavirus Spike Protein to the Human ACE2 Receptor. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.08.25.267625. [PMID: 32869029 PMCID: PMC7457614 DOI: 10.1101/2020.08.25.267625] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The ability to estimate protein-protein binding free energy in a computationally efficient via a physics-based approach is beneficial to research focused on the mechanism of viruses binding to their target proteins. Implicit solvation methodology may be particularly useful in the early stages of such research, as it can offer valuable insights into the binding process, quickly. Here we evaluate the potential of the related molecular mechanics generalized Born surface area (MMGB/SA) approach to estimate the binding free energy ΔGbind between the SARS-CoV-2 spike receptor-binding domain and the human ACE2 receptor. The calculations are based on a recent flavor of the generalized Born model, GBNSR6. Two estimates of ΔGbind are performed: one based on standard bondi radii, and the other based on a newly developed set of atomic radii (OPT1), optimized specifically for protein-ligand binding. We take the average of the resulting two ΔGbind values as the consensus estimate. For the well-studied Ras-Raf protein-protein complex, which has similar binding free energy to that of the SARS-CoV-2/ACE2 complex, the consensus ΔGbind = -11.8 ± 1 kcal/mol, vs. experimental -9.7 ± 0.2 kcal/mol. The consensus estimates for the SARS-CoV-2/ACE2 complex is ΔGbind = -9.4 ± 1.5 kcal/mol, which is in near quantitative agreement with experiment (-10.6 kcal/mol). The availability of a conceptually simple MMGB/SA-based protocol for analysis of the SARS-CoV-2 /ACE2 binding may be beneficial in light of the need to move forward fast.
Collapse
Affiliation(s)
- Negin Forouzesh
- Department of Computer Science, California State University, Los Angeles, Los Angeles, CA 90032, USA
| | - Alexey V Onufriev
- Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, VA 24061, USA
- Department of Physics, Virginia Polytechnic Institute & State University, Blacksburg, VA 24061, USA
- Center for Soft Matter and Biological Physics, Virginia Polytechnic Institute & State University, Blacksburg, VA 24061, USA
| |
Collapse
|
21
|
Forouzesh N, Mukhopadhyay A, Watson LT, Onufriev AV. Multidimensional Global Optimization and Robustness Analysis in the Context of Protein-Ligand Binding. J Chem Theory Comput 2020; 16:4669-4684. [PMID: 32450041 PMCID: PMC8594251 DOI: 10.1021/acs.jctc.0c00142] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Accuracy of protein-ligand binding free energy calculations utilizing implicit solvent models is critically affected by parameters of the underlying dielectric boundary, specifically, the atomic and water probe radii. Here, a global multidimensional optimization pipeline is developed to find optimal atomic radii specifically for protein-ligand binding calculations in implicit solvent. The computational pipeline has these three key components: (1) a massively parallel implementation of a deterministic global optimization algorithm (VTDIRECT95), (2) an accurate yet reasonably fast generalized Born implicit solvent model (GBNSR6), and (3) a novel robustness metric that helps distinguish between nearly degenerate local minima via a postprocessing step of the optimization. A graph-based "kT-connectivity" approach to explore and visualize the multidimensional energy landscape is proposed: local minima that can be reached from the global minimum without exceeding a given energy threshold (kT) are considered to be connected. As an illustration of the capabilities of the optimization pipeline, we apply it to find a global optimum in the space of just five radii: four atomic (O, H, N, and C) radii and water probe radius. The optimized radii, ρW = 1.37 Å, ρC = 1.40 Å, ρH = 1.55 Å, ρN = 2.35 Å, and ρO = 1.28 Å, lead to a closer agreement of electrostatic binding free energies with the explicit solvent reference than two commonly used sets of radii previously optimized for small molecules. At the same time, the ability of the optimizer to find the global optimum reveals fundamental limits of the common two-dielectric implicit solvation model: the computed electrostatic binding free energies are still almost 4 kcal/mol away from the explicit solvent reference. The proposed computational approach opens the possibility to further improve the accuracy of practical computational protocols for binding free energy calculations.
Collapse
Affiliation(s)
- Negin Forouzesh
- Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| | - Abhishek Mukhopadhyay
- Department of Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| | - Layne T Watson
- Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Department of Mathematics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Department of Aerospace and Ocean Engineering, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Center for Soft Matter and Biological Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| | - Alexey V Onufriev
- Department of Computer Science, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Department of Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
- Center for Soft Matter and Biological Physics, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States
| |
Collapse
|
22
|
Poier PP, Jensen F. Polarizable charges in a generalized Born reaction potential. J Chem Phys 2020; 153:024111. [DOI: 10.1063/5.0012022] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Affiliation(s)
- Pier Paolo Poier
- Department of Chemistry, Aarhus University, Langelandsgade 140, DK-8000 Aarhus, Denmark
| | - Frank Jensen
- Department of Chemistry, Aarhus University, Langelandsgade 140, DK-8000 Aarhus, Denmark
| |
Collapse
|
23
|
Horvath D, Marcou G, Varnek A. "Big Data" Fast Chemoinformatics Model to Predict Generalized Born Radius and Solvent Accessibility as a Function of Geometry. J Chem Inf Model 2020; 60:2951-2965. [PMID: 32374171 DOI: 10.1021/acs.jcim.9b01172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The Generalized Born (GB) solvent model is offering the best accuracy/computing effort ratio yet requires drastic simplifications to estimate of the Effective Born Radii (EBR) in bypassing a too expensive volume integration step. EBRs are a measure of the degree of burial of an atom and not very sensitive to small changes of geometry: in molecular dynamics, the costly EBR update procedure is not mandatory at every step. This work however aims at implementing a GB model into the Sampler for Multiple Protein-Ligand Entities (S4MPLE) evolutionary algorithm with mandatory EBR updates at each step triggering arbitrarily large geometric changes. Therefore, a quantitative structure-property relationship has been developed in order to express the EBRs as a linear function of both the topological neighborhood and geometric occupancy of the space around atoms. A training set of 810 molecular systems, starting from fragment-like to drug-like compounds, proteins, host-guest systems, and ligand-protein complexes, has been compiled. For each species, S4MPLE generated several hundreds of random conformers. For each atom in each geometry of each species, its "standard" EBR was calculated by numeric integration and associated to topological and geometric descriptors of the atom neighborhood. This training set (EBR, atom descriptors) involving >5 M entries was subjected to a boot-strapping multilinear regression process with descriptor selection. In parallel, the strategy was repurposed to also learn atomic solvent-accessible areas (SA) based on the same descriptors. Resulting linear equations were challenged to predict EBR and SA values for a similarly compiled external set of >2000 new molecular systems. Solvation energies calculated with estimated EBR and SA match "standard" energies within the typical error of a force-field-based approach (a few kilocalories per mole). Given the extreme diversity of molecular systems covered by the model, this simple EBR/SA estimator covers a vast applicability domain.
Collapse
Affiliation(s)
- Dragos Horvath
- Laboratory of Chemoinformatics, UMR 7140 University of Strasbourg/CNRS, 4 rue Blaise Pascal, 67000 Strasbourg, France
| | - Gilles Marcou
- Laboratory of Chemoinformatics, UMR 7140 University of Strasbourg/CNRS, 4 rue Blaise Pascal, 67000 Strasbourg, France
| | - Alexandre Varnek
- Laboratory of Chemoinformatics, UMR 7140 University of Strasbourg/CNRS, 4 rue Blaise Pascal, 67000 Strasbourg, France
| |
Collapse
|
24
|
Sakae Y, Zhang BW, Levy RM, Deng N. Absolute Protein Binding Free Energy Simulations for Ligands with Multiple Poses, a Thermodynamic Path That Avoids Exhaustive Enumeration of the Poses. J Comput Chem 2020; 41:56-68. [PMID: 31621932 PMCID: PMC7140983 DOI: 10.1002/jcc.26078] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2019] [Revised: 08/07/2019] [Accepted: 08/31/2019] [Indexed: 12/28/2022]
Abstract
We propose a free energy calculation method for receptor-ligand binding, which have multiple binding poses that avoids exhaustive enumeration of the poses. For systems with multiple binding poses, the standard procedure is to enumerate orientations of the binding poses, restrain the ligand to each orientation, and then, calculate the binding free energies for each binding pose. In this study, we modify a part of the thermodynamic cycle in order to sample a broader conformational space of the ligand in the binding site. This modification leads to more accurate free energy calculation without performing separate free energy simulations for each binding pose. We applied our modification to simple model host-guest systems as a test, which have only two binding poses, by using a single decoupling method (SDM) in implicit solvent. The results showed that the binding free energies obtained from our method without knowing the two binding poses were in good agreement with the benchmark results obtained by explicit enumeration of the binding poses. Our method is applicable to other alchemical binding free energy calculation methods such as the double decoupling method (DDM) in explicit solvent. We performed a calculation for a protein-ligand system with explicit solvent using our modified thermodynamic path. The results of the free energy simulation along our modified path were in good agreement with the results of conventional DDM, which requires a separate binding free energy calculation for each of the binding poses of the example of phenol binding to T4 lysozyme in explicit solvent. © 2019 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Yoshitake Sakae
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania, 19122
| | - Bin W Zhang
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania, 19122
| | - Ronald M Levy
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania, 19122
- Department of Chemistry, Institute for Computational Molecular Science, Temple University, Philadelphia, Pennsylvania, 19122
| | - Nanjie Deng
- Department of Chemistry and Physical Sciences, Pace University, New York, New York, 10038
| |
Collapse
|
25
|
Wei H, Luo A, Qiu T, Luo R, Qi R. Improved Poisson-Boltzmann Methods for High-Performance Computing. J Chem Theory Comput 2019; 15:6190-6202. [PMID: 31525962 DOI: 10.1021/acs.jctc.9b00602] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Implicit solvent models based on the Poisson-Boltzmann equation (PBE) have been widely used to study electrostatic interactions in biophysical processes. These models often treat the solvent and solute regions as high and low dielectric continua, leading to a large jump in dielectrics across the molecular surface which is difficult to handle. Higher order interface schemes are often needed to seek higher accuracy for PBE applications. However, these methods are usually very liberal in the use of grid points nearby the molecular surface, making them difficult to use on high-performance computing platforms. Alternatively, the harmonic average (HA) method has been used to approximate dielectric interface conditions near the molecular surface with surprisingly good convergence and is well suited for high-performance computing. By adopting a 7-point stencil, the HA method is advantageous in generating simple 7-banded coefficient matrices, which greatly facilitate linear system solution with dense data parallelism, on high-performance computing platforms such as a graphics processing unit (GPU). However, the HA method is limited due to its lower accuracy. Therefore, it would be of great interest for high-performance applications to develop more accurate methods while retaining the simplicity and effectiveness of the 7-point stencil discretization scheme. In this study, we have developed two new algorithms based on the spirit of the HA method by introducing more physical interface relations and imposing the discretized Poisson's equation to the second order, respectively. Our testing shows that, for typical biomolecules, the new methods significantly improve the numerical accuracy to that comparable to the second-order solvers and with ∼65% overall efficiency gain on widely available high-performance GPU platforms.
Collapse
|
26
|
He P, Sarkar S, Gallicchio E, Kurtzman T, Wickstrom L. Role of Displacing Confined Solvent in the Conformational Equilibrium of β-Cyclodextrin. J Phys Chem B 2019; 123:8378-8386. [PMID: 31509409 DOI: 10.1021/acs.jpcb.9b07028] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
This study investigates the role of hydration and its relationship to the conformational equilibrium of the host molecule β-cyclodextrin. Molecular dynamics simulations indicate that the unbound β-cyclodextrin exhibits two state behavior in explicit solvent due to the opening and closing of its cavity. In implicit solvent, these transitions are not observed, and there is one dominant conformation of β-cyclodextrin with an open cavity. Based on these observations, we investigate the hypothesis that the expulsion of thermodynamically unfavorable water molecules into the bulk plays an important role in controlling the accessibility of the closed macrostate at room temperature. We compare the results of the molecular mechanics analytical generalized Born plus nonpolar solvation approach to those obtained through grid inhomogeneous solvation theory analysis with explicit solvation to elucidate the thermodynamic forces at play. The work illustrates the use of continuum solvent models to tease out solvation effects related to the inhomogeneity and the molecular nature of water and demonstrates the key role of the thermodynamics of enclosed hydration in driving the conformational equilibrium of molecules in solution.
Collapse
Affiliation(s)
- Peng He
- Center for Biophysics & Computational Biology/ICMS, Department of Chemistry , Temple University , Philadelphia , Pennsylvania 19122 , United States
| | - Sheila Sarkar
- Department of Science , Borough of Manhattan Community College, The City University of New York , New York , New York 10007 , United States
| | - Emilio Gallicchio
- Department of Chemistry , Brooklyn College, The City University of New York , Brooklyn , New York 11210 , United States.,Ph.D. Programs in Chemistry & Biochemistry , The Graduate Center of the City University of New York , 365 Fifth Avenue , New York , New York 10016 , United States
| | - Tom Kurtzman
- Department of Chemistry , Lehman College, The City University of New York , Bronx , New York 10468 , United States.,Ph.D. Programs in Chemistry & Biochemistry , The Graduate Center of the City University of New York , 365 Fifth Avenue , New York , New York 10016 , United States
| | - Lauren Wickstrom
- Department of Science , Borough of Manhattan Community College, The City University of New York , New York , New York 10007 , United States
| |
Collapse
|
27
|
Pal RK, Gallicchio E. Perturbation potentials to overcome order/disorder transitions in alchemical binding free energy calculations. J Chem Phys 2019; 151:124116. [DOI: 10.1063/1.5123154] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Affiliation(s)
- Rajat K. Pal
- Department of Chemistry, Brooklyn College of the City University of New York, New York, New York 11210, USA
| | - Emilio Gallicchio
- Department of Chemistry, Brooklyn College of the City University of New York, New York, New York 11210, USA
| |
Collapse
|
28
|
Lange AW, Herbert JM, Albrecht BJ, You ZQ. Intrinsically smooth discretisation of Connolly's solvent-excluded molecular surface. Mol Phys 2019. [DOI: 10.1080/00268976.2019.1644384] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Affiliation(s)
- Adrian W. Lange
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| | - John M. Herbert
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| | - Benjamin J. Albrecht
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| | - Zhi-Qiang You
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
29
|
Abstract
It would often be useful in computer simulations to use an implicit description of solvation effects, instead of explicitly representing the individual solvent molecules. Continuum dielectric models often work well in describing the thermodynamic aspects of aqueous solvation and can be very efficient compared to the explicit treatment of the solvent. Here, we review a particular class of so-called fast implicit solvent models, generalized Born (GB) models, which are widely used for molecular dynamics (MD) simulations of proteins and nucleic acids. These approaches model hydration effects and provide solvent-dependent forces with efficiencies comparable to molecular-mechanics calculations on the solute alone; as such, they can be incorporated into MD or other conformational searching strategies in a straightforward manner. The foundations of the GB model are reviewed, followed by examples of newer, emerging models and examples of important applications. We discuss their strengths and weaknesses, both for fidelity to the underlying continuum model and for the ability to replace explicit consideration of solvent molecules in macromolecular simulations.
Collapse
Affiliation(s)
- Alexey V Onufriev
- Departments of Computer Science and Physics, Center for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, Virginia 24060, USA;
| | - David A Case
- Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA;
| |
Collapse
|
30
|
Chakravorty A, Gallicchio E, Alexov E. A grid-based algorithm in conjunction with a gaussian-based model of atoms for describing molecular geometry. J Comput Chem 2019; 40:1290-1304. [PMID: 30698861 PMCID: PMC6506848 DOI: 10.1002/jcc.25786] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2018] [Revised: 12/12/2018] [Accepted: 01/06/2019] [Indexed: 11/06/2022]
Abstract
A novel grid-based method is presented, which in conjunction with a smooth Gaussian-based model of atoms, is used to compute molecular volume (MV) and surface area (MSA). The MV and MSA are essential for computing nonpolar component of free energies. The objective of our grid-based approach is to identify solute atom pairs that share overlapping volumes in space. Once completed, this information is used to construct a rooted tree using depth-first method to yield the final volume and SA by using the formulations of the Gaussian model described by Grant and Pickup (J. Phys Chem, 1995, 99, 3503). The method is designed to function uninterruptedly with the grid-based finite-difference method implemented in Delphi, a popular and open-source package used for solving the Poisson-Boltzmann equation (PBE). We demonstrate the time efficacy of the method while also validating its performance in terms of the effect of grid-resolution, positioning of the solute within the grid-map and accuracy in identification of overlapping atom pairs. We also explore and discuss different aspects of the Gaussian model with key emphasis on its physical meaningfulness. This development and its future release with the Delphi package are intended to provide a physically meaningful, fast, robust and comprehensive tool for MM/PBSA based free energy calculations. © 2019 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Arghya Chakravorty
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634
| | | | - Emil Alexov
- Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634
| |
Collapse
|
31
|
Xia J, Flynn W, Gallicchio E, Uplinger K, Armstrong JD, Forli S, Olson AJ, Levy RM. Massive-Scale Binding Free Energy Simulations of HIV Integrase Complexes Using Asynchronous Replica Exchange Framework Implemented on the IBM WCG Distributed Network. J Chem Inf Model 2019; 59:1382-1397. [PMID: 30758197 PMCID: PMC6496938 DOI: 10.1021/acs.jcim.8b00817] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
To perform massive-scale replica exchange molecular dynamics (REMD) simulations for calculating binding free energies of protein-ligand complexes, we implemented the asynchronous replica exchange (AsyncRE) framework of the binding energy distribution analysis method (BEDAM) in implicit solvent on the IBM World Community Grid (WCG) and optimized the simulation parameters to reduce the overhead and improve the prediction power of the WCG AsyncRE simulations. We also performed the first massive-scale binding free energy calculations using the WCG distributed computing grid and 301 ligands from the SAMPL4 challenge for large-scale binding free energy predictions of HIV-1 integrase complexes. In total there are ∼10000 simulated complexes, ∼1 million replicas, and ∼2000 μs of aggregated MD simulations. Running AsyncRE MD simulations on the WCG requires accepting a trade-off between the number of replicas that can be run (breadth) and the number of full RE cycles that can be completed per replica (depth). As compared with synchronous Replica Exchange (SyncRE) running on tightly coupled clusters like XSEDE, on the WCG many more replicas can be launched simultaneously on heterogeneous distributed hardware, but each full RE cycle requires more overhead. We compared the WCG results with that from AutoDock and more advanced RE simulations including the use of flattening potentials to accelerate sampling of selected degrees of freedom of ligands and/or receptors related to slow dynamics due to high energy barriers. We propose a suitable strategy of RE simulations to refine high throughput docking results which can be matched to corresponding computing resources: from HPC clusters, to small or medium-size distributed campus grids, and finally to massive-scale computing networks including millions of CPUs like the resources available on the WCG.
Collapse
Affiliation(s)
- Junchao Xia
- Center for Biophysics and Computational Biology and Department of Physics , Temple University , Philadelphia , Pennsylvania 19122 , United States
| | - William Flynn
- Center for Biophysics and Computational Biology and Department of Chemistry , Temple University , Philadelphia , Pennsylvania 19122 , United States
| | - Emilio Gallicchio
- Department of Chemistry , CUNY Brooklyn College , Brooklyn , New York 11210 , United States
| | - Keith Uplinger
- IBM WCG Team, 1177 South Belt Line Road , Coppell , Texas 75019 , United States
| | - Jonathan D Armstrong
- IBM WCG Team, 11400 Burnet Road , 0453B129, Austin , Texas 78758 , United States
| | - Stefano Forli
- Department of Integrative Structural and Computational Biology , The Scripps Research Institute , La Jolla , California 92037-1000 , United States
| | - Arthur J Olson
- Department of Integrative Structural and Computational Biology , The Scripps Research Institute , La Jolla , California 92037-1000 , United States
| | - Ronald M Levy
- Center for Biophysics and Computational Biology and Department of Chemistry , Temple University , Philadelphia , Pennsylvania 19122 , United States
| |
Collapse
|
32
|
Chan-Yao-Chong M, Durand D, Ha-Duong T. Molecular Dynamics Simulations Combined with Nuclear Magnetic Resonance and/or Small-Angle X-ray Scattering Data for Characterizing Intrinsically Disordered Protein Conformational Ensembles. J Chem Inf Model 2019; 59:1743-1758. [PMID: 30840442 DOI: 10.1021/acs.jcim.8b00928] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The concept of intrinsically disordered proteins (IDPs) has emerged relatively slowly, but over the past 20 years, it has become an intense research area in structural biology. Indeed, because of their considerable flexibility and structural heterogeneity, the determination of IDP conformational ensemble is particularly challenging and often requires a combination of experimental measurements and computational approaches. With the improved accuracy of all-atom force fields and the increasing computing performances, molecular dynamics (MD) simulations have become more and more reliable to generate realistic conformational ensembles. And the combination of MD simulations with experimental approaches, such as nuclear magnetic resonance (NMR) and/or small-angle X-ray scattering (SAXS) allows one to converge toward a more accurate and exhaustive description of IDP structures. In this Review, we discuss the state of the art of MD simulations of IDP conformational ensembles, with a special focus on studies that back-calculated and directly compared theoretical and experimental NMR or SAXS observables, such as chemical shifts (CS), 3J-couplings (3Jc), residual dipolar couplings (RDC), or SAXS intensities. We organize the review in three parts. In the first section, we discuss the studies which used NMR and/or SAXS data to test and validate the development of force fields or enhanced sampling techniques. In the second part, we explore different methods for the refinement of MD-derived structural ensembles, such as NMR or SAXS data-restrained MD simulations or ensemble reweighting to better fit experiments. Finally, we survey some recent studies combining MD simulations with NMR and/or SAXS measurements to investigate the relationship between IDP conformational ensemble and biological activity, as well as their implication in human diseases. From this review, we noticed that quite a few studies compared MD-generated conformational ensembles with both NMR and SAXS measurements to validate IDP structures at both local and global levels. Yet, beside the IDP propensity to form local secondary structures, their dynamic extension or compactness also appears important for their activity. Thus, we believe that a close synergy between MD simulations, NMR, and SAXS experiments would be greatly appropriate to address the challenges of characterizing the disordered structures of proteins and their complexes, relative to their biological functions.
Collapse
Affiliation(s)
- Maud Chan-Yao-Chong
- BioCIS, Université Paris-Sud, CNRS , Université Paris-Saclay , 92290 Châtenay-Malabry , France.,Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud , Université Paris-Saclay , 91198 , Gif-sur-Yvette cedex, France
| | - Dominique Durand
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud , Université Paris-Saclay , 91198 , Gif-sur-Yvette cedex, France
| | - Tâp Ha-Duong
- BioCIS, Université Paris-Sud, CNRS , Université Paris-Saclay , 92290 Châtenay-Malabry , France
| |
Collapse
|
33
|
Wei H, Luo R, Qi R. An efficient second-order poisson-boltzmann method. J Comput Chem 2019; 40:1257-1269. [PMID: 30776135 PMCID: PMC6422926 DOI: 10.1002/jcc.25783] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Revised: 11/29/2018] [Accepted: 01/06/2019] [Indexed: 11/08/2022]
Abstract
Immersed interface method (IIM) is a promising high-accuracy numerical scheme for the Poisson-Boltzmann model that has been widely used to study electrostatic interactions in biomolecules. However, the IIM suffers from instability and slow convergence for typical applications. In this study, we introduced both analytical interface and surface regulation into IIM to address these issues. The analytical interface setup leads to better accuracy and its convergence closely follows a quadratic manner as predicted by theory. The surface regulation further speeds up the convergence for nontrivial biomolecules. In addition, uncertainties of the numerical energies for tested systems are also reduced by about half. More interestingly, the analytical setup significantly improves the linear solver efficiency and stability by generating more precise and better-conditioned linear systems. Finally, we implemented the bottleneck linear system solver on GPUs to further improve the efficiency of the method, so it can be widely used for practical biomolecular applications. © 2019 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Haixin Wei
- Department of Chemical Engineering and Materials Science, University of California, Irvine, California, 92697
| | - Ray Luo
- Department of Chemical Engineering and Materials Science, University of California, Irvine, California, 92697.,Department of Molecular Biology and Biochemistry, University of California, Irvine, California, 92697.,Department of Biomedical Engineering, University of California, Irvine, California, 92697
| | - Ruxi Qi
- Department of Molecular Biology and Biochemistry, University of California, Irvine, California, 92697
| |
Collapse
|
34
|
Choi JM, Pappu RV. Improvements to the ABSINTH Force Field for Proteins Based on Experimentally Derived Amino Acid Specific Backbone Conformational Statistics. J Chem Theory Comput 2019; 15:1367-1382. [PMID: 30633502 PMCID: PMC10749164 DOI: 10.1021/acs.jctc.8b00573] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
We present an improved version of the ABSINTH implicit solvation model and force field paradigm (termed ABSINTH-C) by incorporating a grid-based term that bootstraps against experimentally derived and computationally optimized conformational statistics for blocked amino acids. These statistics provide high-resolution descriptions of the intrinsic backbone dihedral angle preferences for all 20 amino acids. The original ABSINTH model generates Ramachandran plots that are too shallow in terms of the basin structures and too permissive in terms of dihedral angle preferences. We bootstrap against the reference optimized landscapes and incorporate CMAP-like residue-specific terms that help us reproduce the intrinsic dihedral angle preferences of individual amino acids. These corrections that lead to ABSINTH-C are achieved by balancing the incorporation of the new residue-specific terms with the accuracies inherent to the original ABSINTH model. We demonstrate the robustness of ABSINTH-C through a series of examples to highlight the preservation of accuracies as well as examples that demonstrate the improvements. Our efforts show how the recent experimentally derived and computationally optimized coil-library landscapes can be used as a touchstone for quantifying errors and making improvements to molecular mechanics force fields.
Collapse
Affiliation(s)
- Jeong-Mo Choi
- Department of Biomedical Engineering and Center for Biological Systems Engineering, Washington University in St. Louis, One Brookings Drive, Campus Box 1097, St. Louis, Missouri 63130
| | - Rohit V. Pappu
- Department of Biomedical Engineering and Center for Biological Systems Engineering, Washington University in St. Louis, One Brookings Drive, Campus Box 1097, St. Louis, Missouri 63130
| |
Collapse
|
35
|
SAMPL6 host–guest blind predictions using a non equilibrium alchemical approach. J Comput Aided Mol Des 2018; 32:965-982. [DOI: 10.1007/s10822-018-0151-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Accepted: 08/10/2018] [Indexed: 02/07/2023]
|
36
|
Xia J, Flynn W, Levy RM. Improving Prediction Accuracy of Binding Free Energies and Poses of HIV Integrase Complexes Using the Binding Energy Distribution Analysis Method with Flattening Potentials. J Chem Inf Model 2018; 58:1356-1371. [PMID: 29927237 PMCID: PMC6287956 DOI: 10.1021/acs.jcim.8b00194] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
To accelerate conformation sampling of slow dynamics from receptor or ligand, we introduced flattening potentials on selected bonded and nonbonded intramolecular interactions to the binding energy distribution analysis method (BEDAM) for calculating absolute binding free energies of protein-ligand complexes using an implicit solvent model and implemented flattening BEDAM using the asynchronous replica exchange (AsyncRE) framework for performing large scale replica exchange molecular dynamics (REMD) simulations. The advantage of using the flattening feature to reduce high energy barriers was exhibited first by the p-xylene-T4 lysozyme complex, where the intramolecular interactions of a protein side chain on the binding site were flattened to accelerate the conformational transition of the side chain from the trans to the gauche state when the p-xylene ligand is present in the binding site. Much more extensive flattening BEDAM simulations were performed for 53 experimental binders and 248 nonbinders of HIV-1 integrase which formed the SAMPL4 challenge, with the total simulation time of 24.3 μs. We demonstrated that the flattening BEDAM simulations not only substantially increase the number of true positives (and reduce false negatives) but also improve the prediction accuracy of binding poses of experimental binders. Furthermore, the values of area under the curve (AUC) of receiver operating characteristic (ROC) and the enrichment factors at 20% cutoff calculated from the flattening BEDAM simulations were improved significantly in comparison with that of simulations without flattening as we previously reported for the whole SAMPL4 database. Detailed analysis found that the improved ability to discriminate the binding free energies between the binders and nonbinders is due to the fact that the flattening simulations reduce the reorganization free energy penalties of binders and decrease the overlap of binding free energy distributions of binders relative to that of nonbinders. This happens because the conformational ensemble distributions for both the ligand and protein in solution match those at the fully coupled (complex) state more closely when the systems are more fully sampled after the flattening potentials are applied to the intermediate states.
Collapse
Affiliation(s)
- Junchao Xia
- Center for Biophysics and Computational Biology and Department of Physics , Temple University , Philadelphia , Pennsylvania 19122 , United States
| | - William Flynn
- Center for Biophysics and Computational Biology and Department of Chemistry , Temple University , Philadelphia , Pennsylvania 19122 , United States
| | - Ronald M Levy
- Center for Biophysics and Computational Biology and Department of Chemistry , Temple University , Philadelphia , Pennsylvania 19122 , United States
| |
Collapse
|
37
|
Aleksandrov A, Lin FY, Roux B, MacKerell AD. Combining the polarizable Drude force field with a continuum electrostatic Poisson-Boltzmann implicit solvation model. J Comput Chem 2018; 39:1707-1719. [PMID: 29737546 DOI: 10.1002/jcc.25345] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2017] [Revised: 02/26/2018] [Accepted: 04/08/2018] [Indexed: 12/13/2022]
Abstract
In this work, we have combined the polarizable force field based on the classical Drude oscillator with a continuum Poisson-Boltzmann/solvent-accessible surface area (PB/SASA) model. In practice, the positions of the Drude particles experiencing the solvent reaction field arising from the fixed charges and induced polarization of the solute must be optimized in a self-consistent manner. Here, we parameterized the model to reproduce experimental solvation free energies of a set of small molecules. The model reproduces well-experimental solvation free energies of 70 molecules, yielding a root mean square difference of 0.8 kcal/mol versus 2.5 kcal/mol for the CHARMM36 additive force field. The polarization work associated with the solute transfer from the gas-phase to the polar solvent, a term neglected in the framework of additive force fields, was found to make a large contribution to the total solvation free energy, comparable to the polar solute-solvent solvation contribution. The Drude PB/SASA also reproduces well the electronic polarization from the explicit solvent simulations of a small protein, BPTI. Model validation was based on comparisons with the experimental relative binding free energies of 371 single alanine mutations. With the Drude PB/SASA model the root mean square deviation between the predicted and experimental relative binding free energies is 3.35 kcal/mol, lower than 5.11 kcal/mol computed with the CHARMM36 additive force field. Overall, the results indicate that the main limitation of the Drude PB/SASA model is the inability of the SASA term to accurately capture non-polar solvation effects. © 2018 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Alexey Aleksandrov
- Laboratoire d'Optique et Biosciences, CNRS, INSERM, Ecole Polytechnique, Palaiseau F-91128, France
| | - Fang-Yu Lin
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, 20 Penn Street, Baltimore, Maryland 21201
| | - Benoît Roux
- Department of Biochemistry and Molecular Biology, Gordon Center for Integrative Science, 929 E57th Street, University of Chicago, Chicago, Illinois 60637
| | - Alexander D MacKerell
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, 20 Penn Street, Baltimore, Maryland 21201
| |
Collapse
|
38
|
Chakravorty A, Jia Z, Peng Y, Tajielyato N, Wang L, Alexov E. Gaussian-Based Smooth Dielectric Function: A Surface-Free Approach for Modeling Macromolecular Binding in Solvents. Front Mol Biosci 2018; 5:25. [PMID: 29637074 PMCID: PMC5881404 DOI: 10.3389/fmolb.2018.00025] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2018] [Accepted: 03/05/2018] [Indexed: 12/04/2022] Open
Abstract
Conventional modeling techniques to model macromolecular solvation and its effect on binding in the framework of Poisson-Boltzmann based implicit solvent models make use of a geometrically defined surface to depict the separation of macromolecular interior (low dielectric constant) from the solvent phase (high dielectric constant). Though this simplification saves time and computational resources without significantly compromising the accuracy of free energy calculations, it bypasses some of the key physio-chemical properties of the solute-solvent interface, e.g., the altered flexibility of water molecules and that of side chains at the interface, which results in dielectric properties different from both bulk water and macromolecular interior, respectively. Here we present a Gaussian-based smooth dielectric model, an inhomogeneous dielectric distribution model that mimics the effect of macromolecular flexibility and captures the altered properties of surface bound water molecules. Thus, the model delivers a smooth transition of dielectric properties from the macromolecular interior to the solvent phase, eliminating any unphysical surface separating the two phases. Using various examples of macromolecular binding, we demonstrate its utility and illustrate the comparison with the conventional 2-dielectric model. We also showcase some additional abilities of this model, viz. to account for the effect of electrolytes in the solution and to render the distribution profile of water across a lipid membrane.
Collapse
Affiliation(s)
- Arghya Chakravorty
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, SC, United States
| | - Zhe Jia
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, SC, United States
| | - Yunhui Peng
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, SC, United States
| | - Nayere Tajielyato
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, SC, United States
| | - Lisi Wang
- Department of Chemistry, Clemson University, Clemson, SC, United States
| | - Emil Alexov
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, SC, United States
| |
Collapse
|
39
|
Kilburg D, Gallicchio E. Assessment of a Single Decoupling Alchemical Approach for the Calculation of the Absolute Binding Free Energies of Protein-Peptide Complexes. Front Mol Biosci 2018; 5:22. [PMID: 29568737 PMCID: PMC5852065 DOI: 10.3389/fmolb.2018.00022] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Accepted: 02/21/2018] [Indexed: 01/24/2023] Open
Abstract
The computational modeling of peptide inhibitors to target protein-protein binding interfaces is growing in interest as these are often too large, too shallow, and too feature-less for conventional small molecule compounds. Here, we present a rare successful application of an alchemical binding free energy method for the calculation of converged absolute binding free energies of a series of protein-peptide complexes. Specifically, we report the binding free energies of a series of cyclic peptides derived from the LEDGF/p75 protein to the integrase receptor of the HIV1 virus. The simulations recapitulate the effect of mutations relative to the wild-type binding motif of LEDGF/p75, providing structural, energetic and dynamical interpretations of the observed trends. The equilibration and convergence of the calculations are carefully analyzed. Convergence is aided by the adoption of a single-decoupling alchemical approach with implicit solvation, which circumvents the convergence difficulties of conventional double-decoupling protocols. We hereby present the single-decoupling methodology and critically evaluate its advantages and limitations. We also discuss some of the challenges and potential pitfalls of binding free energy calculations for complex molecular systems which have generally limited their applicability to the quantitative study of protein-peptide binding equilibria.
Collapse
Affiliation(s)
- Denise Kilburg
- Department of Chemistry, Brooklyn College, Brooklyn, NY, United States.,Ph.D. Program in Chemistry, The Graduate Center, City University of New York, New York, NY, United States
| | - Emilio Gallicchio
- Department of Chemistry, Brooklyn College, Brooklyn, NY, United States.,Ph.D. Program in Chemistry, The Graduate Center, City University of New York, New York, NY, United States.,Ph.D. Program in Biochemistry, The Graduate Center, City University of New York, New York, NY, United States
| |
Collapse
|
40
|
Izadi S, Harris RC, Fenley MO, Onufriev AV. Accuracy Comparison of Generalized Born Models in the Calculation of Electrostatic Binding Free Energies. J Chem Theory Comput 2018; 14:1656-1670. [PMID: 29378399 DOI: 10.1021/acs.jctc.7b00886] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The need for accurate yet efficient representation of the aqueous environment in biomolecular modeling has led to the development of a variety of generalized Born (GB) implicit solvent models. While many studies have focused on the accuracy of available GB models in predicting solvation free energies, a systematic assessment of the quality of these models in binding free energy calculations, crucial for rational drug design, has not been undertaken. Here, we evaluate the accuracies of eight common GB flavors (GB-HCT, GB-OBC, GB-neck2, GBNSR6, GBSW, GBMV1, GBMV2, and GBMV3), available in major molecular dynamics packages, in predicting the electrostatic binding free energies ( ΔΔ Gel) for a diverse set of 60 biomolecular complexes belonging to four main classes: protein-protein, protein-drug, RNA-peptide, and small complexes. The GB flavors are examined in terms of their ability to reproduce the results from the Poisson-Boltzmann (PB) model, commonly used as accuracy reference in this context. We show that the agreement with the PB of ΔΔ Gel estimates varies widely between different GB models and also across different types of biomolecular complexes, with R2 correlations ranging from 0.3772 to 0.9986. A surface-based "R6" GB model recently implemented in AMBER shows the closest overall agreement with reference PB ( R2 = 0.9949, RMSD = 8.75 kcal/mol). The RNA-peptide and protein-drug complex sets appear to be most challenging for all but one model, as indicated by the large deviations from the PB in ΔΔ Gel. Small neutral complexes present the least challenge for most of the GB models tested. The quantitative demonstration of the strengths and weaknesses of the GB models across the diverse complex types provided here can be used as a guide for practical computations and future development efforts.
Collapse
Affiliation(s)
- Saeed Izadi
- Early Stage Pharmaceutical Development , Genentech Inc. , 1 DNA Way , South San Francisco , California 94080 , United States
| | - Robert C Harris
- Department of Pharmaceutical Sciences , University of Maryland School of Pharmacy , Baltimore , Maryland 21201 , United States
| | - Marcia O Fenley
- Institute of Molecular Biophysics , Florida State University , Tallahassee , Florida 32306-3408 , United States
| | | |
Collapse
|
41
|
Brieg M, Setzler J, Albert S, Wenzel W. Generalized Born implicit solvent models for small molecule hydration free energies. Phys Chem Chem Phys 2018; 19:1677-1685. [PMID: 27995260 DOI: 10.1039/c6cp07347f] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Hydration free energy estimation of small molecules from all-atom simulations was widely investigated in recent years, as it provides an essential test of molecular force fields and our understanding of solvation effects. While explicit solvent representations result in highly accurate models, they also require extensive sampling due to the high number of solvent degrees of freedom. Implicit solvent models, such as those based on the generalized Born model for electrostatic solvation effects and a solvent accessible surface area term for nonpolar contributions (GBSA), significantly reduce the number of degrees of freedom and the computational cost to estimate hydration free energies. However, a recent survey revealed a gap in the accuracy between explicit TIP3P solvent estimates and those computed with many common GBSA models. Here we address this shortcoming by providing a thorough comparison of the performance of three implicit solvent models with different nonpolar contributions and a generalized Born term to estimate experimental hydration free energies. Starting with a minimal set of only ten atom types, we demonstrate that a nonpolar term with atom type dependent surface tension coefficients in combination with an accurate generalized Born term and fully optimized parameters performs best in estimating hydration free energies, even yielding comparable results to the explicit TIP3P water model. Analysis of our results provides evidence that the asymmetric behavior of water around oppositely charged atoms is one of the main sources of error for two of the three implicit solvent models. Explicitly accounting for this effect in the parameterization reduces the corresponding errors, suggesting this as a general strategy for improving implicit solvent models. The findings presented here will help to improve the existing generalized Born based implicit solvent models implemented in state-of-the-art molecular simulation packages.
Collapse
Affiliation(s)
- Martin Brieg
- Steinbuch Centre for Computing (SCC), Karlsruhe Institute of Technology (KIT), P.O. Box 3640, 76021 Karlsruhe, Germany and Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), P.O. Box 3640, 76021 Karlsruhe, Germany.
| | - Julia Setzler
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), P.O. Box 3640, 76021 Karlsruhe, Germany.
| | - Steffen Albert
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), P.O. Box 3640, 76021 Karlsruhe, Germany.
| | - Wolfgang Wenzel
- Institute of Nanotechnology (INT), Karlsruhe Institute of Technology (KIT), P.O. Box 3640, 76021 Karlsruhe, Germany.
| |
Collapse
|
42
|
Zhou HX, Pang X. Electrostatic Interactions in Protein Structure, Folding, Binding, and Condensation. Chem Rev 2018; 118:1691-1741. [PMID: 29319301 DOI: 10.1021/acs.chemrev.7b00305] [Citation(s) in RCA: 584] [Impact Index Per Article: 83.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Charged and polar groups, through forming ion pairs, hydrogen bonds, and other less specific electrostatic interactions, impart important properties to proteins. Modulation of the charges on the amino acids, e.g., by pH and by phosphorylation and dephosphorylation, have significant effects such as protein denaturation and switch-like response of signal transduction networks. This review aims to present a unifying theme among the various effects of protein charges and polar groups. Simple models will be used to illustrate basic ideas about electrostatic interactions in proteins, and these ideas in turn will be used to elucidate the roles of electrostatic interactions in protein structure, folding, binding, condensation, and related biological functions. In particular, we will examine how charged side chains are spatially distributed in various types of proteins and how electrostatic interactions affect thermodynamic and kinetic properties of proteins. Our hope is to capture both important historical developments and recent experimental and theoretical advances in quantifying electrostatic contributions of proteins.
Collapse
Affiliation(s)
- Huan-Xiang Zhou
- Department of Chemistry and Department of Physics, University of Illinois at Chicago , Chicago, Illinois 60607, United States.,Department of Physics and Institute of Molecular Biophysics, Florida State University , Tallahassee, Florida 32306, United States
| | - Xiaodong Pang
- Department of Physics and Institute of Molecular Biophysics, Florida State University , Tallahassee, Florida 32306, United States
| |
Collapse
|
43
|
Wang C, Greene D, Xiao L, Qi R, Luo R. Recent Developments and Applications of the MMPBSA Method. Front Mol Biosci 2018; 4:87. [PMID: 29367919 PMCID: PMC5768160 DOI: 10.3389/fmolb.2017.00087] [Citation(s) in RCA: 399] [Impact Index Per Article: 57.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Accepted: 11/30/2017] [Indexed: 12/23/2022] Open
Abstract
The Molecular Mechanics Poisson-Boltzmann Surface Area (MMPBSA) approach has been widely applied as an efficient and reliable free energy simulation method to model molecular recognition, such as for protein-ligand binding interactions. In this review, we focus on recent developments and applications of the MMPBSA method. The methodology review covers solvation terms, the entropy term, extensions to membrane proteins and high-speed screening, and new automation toolkits. Recent applications in various important biomedical and chemical fields are also reviewed. We conclude with a few future directions aimed at making MMPBSA a more robust and efficient method.
Collapse
Affiliation(s)
- Changhao Wang
- Chemical and Materials Physics Graduate Program, University of California, Irvine, Irvine, CA, United States
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
- Department of Physics and Astronomy, University of California, Irvine, Irvine, CA, United States
| | - D'Artagnan Greene
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
| | - Li Xiao
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA, United States
| | - Ruxi Qi
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
| | - Ray Luo
- Chemical and Materials Physics Graduate Program, University of California, Irvine, Irvine, CA, United States
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA, United States
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA, United States
- Department of Chemical Engineering and Materials Science, University of California, Irvine, Irvine, CA, United States
| |
Collapse
|
44
|
Wang C, Ren P, Luo R. Ionic Solution: What Goes Right and Wrong with Continuum Solvation Modeling. J Phys Chem B 2017; 121:11169-11179. [PMID: 29164898 DOI: 10.1021/acs.jpcb.7b09616] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Solvent-mediated electrostatic interactions were well recognized to be important in the structure and function of molecular systems. Ionic interaction is an important component in electrostatic interactions, especially in highly charged molecules, such as nucleic acids. Here, we focus on the quality of the widely used Poisson-Boltzmann surface area (PBSA) continuum models in modeling ionic interactions by comparing with both explicit solvent simulations and the experiment. In this work, the molality-dependent chemical potentials for sodium chloride (NaCl) electrolyte were first simulated in the SPC/E explicit solvent. Our high-quality simulation agrees well with both the previous study and the experiment. Given the free-energy simulations in SPC/E as the benchmark, we used the same sets of snapshots collected in the SPC/E solvent model for PBSA free-energy calculations in the hope to achieve the maximum consistency between the two solvent models. Our comparative analysis shows that the molality-dependent chemical potentials of NaCl were reproduced well with both linear PB and nonlinear PB methods, although nonlinear PB agrees better with SPC/E and the experiment. Our free-energy simulations also show that the presence of salt increases the hydrophobic effect in a nonlinear fashion, in qualitative agreement with previous theoretical studies of Onsager and Samaras. However, the lack of molality-dependency in the nonelectrostatics continuum models dramatically reduces the overall quality of PBSA methods in modeling salt-dependent energetics. These analyses point to further improvements needed for more robust modeling of solvent-mediated interactions by the continuum solvation frameworks.
Collapse
Affiliation(s)
| | - Pengyu Ren
- Department of Biomedical Engineering, University of Texas , Austin, Texas 78712, United States
| | | |
Collapse
|
45
|
Onufriev AV, Izadi S. Water models for biomolecular simulations. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2017. [DOI: 10.1002/wcms.1347] [Citation(s) in RCA: 94] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Alexey V. Onufriev
- Department of Physics; Virginia Tech; Blacksburg VA USA
- Department of Computer Science; Virginia Tech; Blacksburg VA USA
- Center for Soft Matter and Biological Physics; Virginia Tech; Blacksburg VA USA
| | - Saeed Izadi
- Early Stage Pharmaceutical Development; Genentech Inc.; South San Francisco, CA USA
| |
Collapse
|
46
|
Wang B, Wang C, Wu K, Wei G. Breaking the polar‐nonpolar division in solvation free energy prediction. J Comput Chem 2017; 39:217-233. [DOI: 10.1002/jcc.25107] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2017] [Revised: 09/02/2017] [Accepted: 10/22/2017] [Indexed: 11/12/2022]
Affiliation(s)
- Bao Wang
- Department of MathematicsMichigan State University Michigan48824
| | - Chengzhang Wang
- School of Statistics and MathematicsCentral University of Finance and EconomicsBeijing100081 China
| | - Kedi Wu
- Department of MathematicsMichigan State University Michigan48824
| | - Guo‐Wei Wei
- Department of MathematicsMichigan State University Michigan48824
- Department of Electrical and ComputerEngineering Michigan State University Michigan48824
- Department of Biochemistry and MolecularBiology Michigan State UniversityMichigan48824
| |
Collapse
|
47
|
Forouzesh N, Izadi S, Onufriev AV. Grid-Based Surface Generalized Born Model for Calculation of Electrostatic Binding Free Energies. J Chem Inf Model 2017; 57:2505-2513. [PMID: 28786669 PMCID: PMC12085165 DOI: 10.1021/acs.jcim.7b00192] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Fast and accurate calculation of solvation free energies is central to many applications, such as rational drug design. In this study, we present a grid-based molecular surface implementation of "R6" flavor of the generalized Born (GB) implicit solvent model, named GBNSR6. The speed, accuracy relative to numerical Poisson-Boltzmann treatment, and sensitivity to grid surface parameters are tested on a set of 15 small protein-ligand complexes and a set of biomolecules in the range of 268 to 25099 atoms. Our results demonstrate that the proposed model provides a relatively successful compromise between the speed and accuracy of computing polar components of the solvation free energies (ΔGpol) and binding free energies (ΔΔGpol). The model tolerates a relatively coarse grid size h = 0.5 Å, where the grid artifact error in computing ΔΔGpol remains in the range of kBT ∼ 0.6 kcal/mol. The estimated ΔΔGpols are well correlated (r2 = 0.97) with the numerical Poisson-Boltzmann reference, while showing virtually no systematic bias and RMSE = 1.43 kcal/mol. The grid-based GBNSR6 model is available in Amber (AmberTools) package of molecular simulation programs.
Collapse
Affiliation(s)
- Negin Forouzesh
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
| | - Saeed Izadi
- Early Stage Pharmaceutical Development, Genentech Inc., 1 DNA Way, South San Francisco, CA 94080, USA
| | - Alexey V. Onufriev
- Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
- Department of Physics, Virginia Tech, Blacksburg, VA 24061, USA
| |
Collapse
|
48
|
Brini E, Fennell CJ, Fernandez-Serra M, Hribar-Lee B, Lukšič M, Dill KA. How Water's Properties Are Encoded in Its Molecular Structure and Energies. Chem Rev 2017; 117:12385-12414. [PMID: 28949513 PMCID: PMC5639468 DOI: 10.1021/acs.chemrev.7b00259] [Citation(s) in RCA: 218] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Indexed: 11/29/2022]
Abstract
How are water's material properties encoded within the structure of the water molecule? This is pertinent to understanding Earth's living systems, its materials, its geochemistry and geophysics, and a broad spectrum of its industrial chemistry. Water has distinctive liquid and solid properties: It is highly cohesive. It has volumetric anomalies-water's solid (ice) floats on its liquid; pressure can melt the solid rather than freezing the liquid; heating can shrink the liquid. It has more solid phases than other materials. Its supercooled liquid has divergent thermodynamic response functions. Its glassy state is neither fragile nor strong. Its component ions-hydroxide and protons-diffuse much faster than other ions. Aqueous solvation of ions or oils entails large entropies and heat capacities. We review how these properties are encoded within water's molecular structure and energies, as understood from theories, simulations, and experiments. Like simpler liquids, water molecules are nearly spherical and interact with each other through van der Waals forces. Unlike simpler liquids, water's orientation-dependent hydrogen bonding leads to open tetrahedral cage-like structuring that contributes to its remarkable volumetric and thermal properties.
Collapse
Affiliation(s)
- Emiliano Brini
- Laufer
Center for Physical and Quantitative Biology, Department of Physics and Astronomy, and Department of
Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
| | - Christopher J. Fennell
- Department
of Chemistry, Oklahoma State University, Stillwater, Oklahoma 74078, United States
| | - Marivi Fernandez-Serra
- Laufer
Center for Physical and Quantitative Biology, Department of Physics and Astronomy, and Department of
Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
| | - Barbara Hribar-Lee
- Faculty
of Chemistry and Chemical Technology, University
of Ljubljana, Večna
pot 113, SI-1000 Ljubljana, Slovenia
| | - Miha Lukšič
- Faculty
of Chemistry and Chemical Technology, University
of Ljubljana, Večna
pot 113, SI-1000 Ljubljana, Slovenia
| | - Ken A. Dill
- Laufer
Center for Physical and Quantitative Biology, Department of Physics and Astronomy, and Department of
Chemistry, Stony Brook University, Stony Brook, New York 11794, United States
| |
Collapse
|
49
|
Zhang BW, Deng N, Tan Z, Levy RM. Stratified UWHAM and Its Stochastic Approximation for Multicanonical Simulations Which Are Far from Equilibrium. J Chem Theory Comput 2017; 13:4660-4674. [PMID: 28902500 PMCID: PMC5897113 DOI: 10.1021/acs.jctc.7b00651] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We describe a new analysis tool called Stratified unbinned Weighted Histogram Analysis Method (Stratified-UWHAM), which can be used to compute free energies and expectations from a multicanonical ensemble when a subset of the parallel simulations is far from being equilibrated because of barriers between free energy basins which are only rarely (or never) crossed at some states. The Stratified-UWHAM equations can be obtained in the form of UWHAM equations but with an expanded set of states. We also provide a stochastic solver, Stratified RE-SWHAM, for Stratified-UWHAM to remove its computational bottleneck. Stratified-UWHAM and Stratified RE-SWHAM are applied to study three test topics: the free energy landscape of alanine dipeptide, the binding affinity of a host-guest binding complex, and path sampling for a two-dimensional double well potential. The examples show that when some of the parallel simulations are only locally equilibrated, the estimates of free energies and equilibrium distributions provided by the conventional UWHAM (or MBAR) solutions exhibit considerable biases, but the estimates provided by Stratified-UWHAM and Stratified RE-SWHAM agree with the benchmark very well. Lastly, we discuss features of the Stratified-UWHAM approach which is based on coarse-graining in relation to two other maximum likelihood-based methods which were proposed recently, that also coarse-grain the multicanonical data.
Collapse
|
50
|
Izairi R, Kamberaj H. Comparison Study of Polar and Nonpolar Contributions to Solvation Free Energy. J Chem Inf Model 2017; 57:2539-2553. [PMID: 28880080 DOI: 10.1021/acs.jcim.7b00368] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In this study, we compared the contributions of polar and nonpolar interactions to the solvation free energy of a solute in solvent, which is decomposed into four different terms based on the nature of interactions: (i) electrostatic solvation free energy term counting for the work done to move solute charges from fixed points in some reference environment to their configuration positions in solvent; (ii) solute-solvent van der Waals dispersion interactions; (iii) change on solvent-solvent interactions and solvent entropy due to reorganization of solvent around solute cavity in solvent; and (iv) compensation of electrostatic forces acting on the dielectric surface boundary between solvent and solute. We compared these contributions to each other for a data set of 573 proteins, which were prepared using CHARMM22 and AMBER force fields. In addition, we compared the calculated with experimental hydration free energies for a data set of 642 small molecules, which were prepared using the general AMBER force field. Our results indicated the significance of each term to the total solvation free energy.
Collapse
Affiliation(s)
- Redona Izairi
- Department of Physics, Faculty of Mathematics and Natural Sciences, State University of Tetovo , Street Ilinden, 1200, Tetovo, Republic of Macedonia
| | - Hiqmet Kamberaj
- Department of Computer Engineering, Faculty of Engineering, International Balkan University , Tashko Karadza, 11A, 1000, Skopje, Republic of Macedonia
| |
Collapse
|