1
|
Hahn DF, Gapsys V, de Groot BL, Mobley DL, Tresadern G. Current State of Open Source Force Fields in Protein-Ligand Binding Affinity Predictions. J Chem Inf Model 2024; 64:5063-5076. [PMID: 38895959 PMCID: PMC11234369 DOI: 10.1021/acs.jcim.4c00417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Revised: 04/23/2024] [Accepted: 04/25/2024] [Indexed: 06/21/2024]
Abstract
In drug discovery, the in silico prediction of binding affinity is one of the major means to prioritize compounds for synthesis. Alchemical relative binding free energy (RBFE) calculations based on molecular dynamics (MD) simulations are nowadays a popular approach for the accurate affinity ranking of compounds. MD simulations rely on empirical force field parameters, which strongly influence the accuracy of the predicted affinities. Here, we evaluate the ability of six different small-molecule force fields to predict experimental protein-ligand binding affinities in RBFE calculations on a set of 598 ligands and 22 protein targets. The public force fields OpenFF Parsley and Sage, GAFF, and CGenFF show comparable accuracy, while OPLS3e is significantly more accurate. However, a consensus approach using Sage, GAFF, and CGenFF leads to accuracy comparable to OPLS3e. While Parsley and Sage are performing comparably based on aggregated statistics across the whole dataset, there are differences in terms of outliers. Analysis of the force field reveals that improved parameters lead to significant improvement in the accuracy of affinity predictions on subsets of the dataset involving those parameters. Lower accuracy can not only be attributed to the force field parameters but is also dependent on input preparation and sampling convergence of the calculations. Especially large perturbations and nonconverged simulations lead to less accurate predictions. The input structures, Gromacs force field files, as well as the analysis Python notebooks are available on GitHub.
Collapse
Affiliation(s)
- David F. Hahn
- Computational
Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse 2340, Belgium
| | - Vytautas Gapsys
- Computational
Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse 2340, Belgium
- Computational
Biomolecular Dynamics Group, Max Planck
Institute for Multidisciplinary Sciences, Am Fassberg 11, Göttingen 37077, Germany
| | - Bert L. de Groot
- Computational
Biomolecular Dynamics Group, Max Planck
Institute for Multidisciplinary Sciences, Am Fassberg 11, Göttingen 37077, Germany
| | - David L. Mobley
- Department
of Chemistry, University of California, Irvine, California 92697, United States
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
| | - Gary Tresadern
- Computational
Chemistry, Janssen Research & Development, Turnhoutseweg 30, Beerse 2340, Belgium
| |
Collapse
|
2
|
Deng J, Cui Q. Efficient Sampling of Cavity Hydration in Proteins with Nonequilibrium Grand Canonical Monte Carlo and Polarizable Force Fields. J Chem Theory Comput 2024; 20:1897-1911. [PMID: 38417108 PMCID: PMC11663258 DOI: 10.1021/acs.jctc.4c00013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2024]
Abstract
Prediction of the hydration levels of protein cavities and active sites is important to both mechanistic analysis and ligand design. Due to the unique microscopic environment of these buried water molecules, a polarizable model is expected to be crucial for an accurate treatment of protein internal hydration in simulations. Here we adapt a nonequilibrium candidate Monte Carlo approach for conducting grand canonical Monte Carlo simulations with the Drude polarizable force field. The GPU implementation enables the efficient sampling of internal cavity hydration levels in biomolecular systems. We also develop an enhanced sampling approach referred to as B-walking, which satisfies detailed balance and readily combines with grand canonical integration to efficiently calculate quantitative binding free energies of water to protein cavities. Applications of these developments are illustrated in a solvent box and the polar ligand binding site in trypsin. Our simulation results show that including electronic polarization leads to a modest but clear improvement in the description of water position and occupancy compared to the crystal structure. The B-walking approach enhances the range of water sampling in different chemical potential windows and thus improves the accuracy of water binding free energy calculations.
Collapse
Affiliation(s)
- Jiahua Deng
- Department of Chemistry, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States
| | - Qiang Cui
- Department of Chemistry, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States
- Department of Physics, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, United States
- Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, Massachusetts 02215, United States
| |
Collapse
|
3
|
Gracia Carmona O, Gillhofer M, Tomasiak L, De Ruiter A, Oostenbrink C. Accelerated Enveloping Distribution Sampling to Probe the Presence of Water Molecules. J Chem Theory Comput 2023. [PMID: 37167545 DOI: 10.1021/acs.jctc.3c00109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Determining the presence of water molecules at protein-ligand interfaces is still a challenging task in free-energy calculations. The inappropriate placement of water molecules results in the stabilization of wrong conformational orientations of the ligand. With classical alchemical perturbation methods, such as thermodynamic integration (TI), it is essential to know the amount of water molecules in the active site of the respective ligands. However, the resolution of the crystal structure and the correct assignment of the electron density do not always lead to a clear placement of water molecules. In this work, we apply the one-step perturbation method named accelerated enveloping distribution sampling (AEDS) to determine the presence of water molecules in the active site by probing them in a fast and straightforward way. Based on these results, we combined the AEDS method with standard TI to calculate accurate binding free energies in the presence of buried water molecules. The main idea is to perturb the water molecules with AEDS such that they are allowed to alternate between regular water molecules and non-interacting dummy particles while treating the ligand with TI over an alchemical pathway. We demonstrate the use of AEDS to probe the presence of water molecules for six different test systems. For one of these, previous calculations showed difficulties to reproduce the experimental binding free energies, and here, we use the combined TI-AEDS approach to tackle these issues.
Collapse
Affiliation(s)
- Oriol Gracia Carmona
- Institute for Molecular Modeling and Simulation, Department of Material Sciences and Process Engineering, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190 Vienna, Austria
| | - Michael Gillhofer
- Institute for Molecular Modeling and Simulation, Department of Material Sciences and Process Engineering, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190 Vienna, Austria
| | - Lisa Tomasiak
- Institute for Molecular Modeling and Simulation, Department of Material Sciences and Process Engineering, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190 Vienna, Austria
| | - Anita De Ruiter
- Institute for Molecular Modeling and Simulation, Department of Material Sciences and Process Engineering, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190 Vienna, Austria
| | - Chris Oostenbrink
- Institute for Molecular Modeling and Simulation, Department of Material Sciences and Process Engineering, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190 Vienna, Austria
- Christian Doppler Laboratory for Molecular Informatics in the Biosciences, University of Natural Resources and Life Sciences, Vienna, Muthgasse 18, 1190 Vienna, Austria
| |
Collapse
|
4
|
Eberhardt J, Forli S. WaterKit: Thermodynamic Profiling of Protein Hydration Sites. J Chem Theory Comput 2023; 19:2535-2556. [PMID: 37094087 PMCID: PMC10732097 DOI: 10.1021/acs.jctc.2c01087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2023]
Abstract
Water desolvation is one of the key components of the free energy of binding of small molecules to their receptors. Thus, understanding the energetic balance of solvation and desolvation resulting from individual water molecules can be crucial when estimating ligand binding, especially when evaluating different molecules and poses as done in High-Throughput Virtual Screening (HTVS). Over the most recent decades, several methods were developed to tackle this problem, ranging from fast approximate methods (usually empirical functions using either discrete atom-atom pairwise interactions or continuum solvent models) to more computationally expensive and accurate ones, mostly based on Molecular Dynamics (MD) simulations, such as Grid Inhomogeneous Solvation Theory (GIST) or Double Decoupling. On one hand, MD-based methods are prohibitive to use in HTVS to estimate the role of waters on the fly for each ligand. On the other hand, fast and approximate methods show an unsatisfactory level of accuracy, with low agreement with results obtained with the more expensive methods. Here we introduce WaterKit, a new grid-based sampling method with explicit water molecules to calculate thermodynamic properties using the GIST method. Our results show that the discrete placement of water molecules is successful in reproducing the position of crystallographic waters with very high accuracy, as well as providing thermodynamic estimates with accuracy comparable to more expensive MD simulations. Unlike these methods, WaterKit can be used to analyze specific regions on the protein surface, (such as the binding site of a receptor), without having to hydrate and simulate the whole receptor structure. The results show the feasibility of a general and fast method to compute thermodynamic properties of water molecules, making it well-suited to be integrated in high-throughput pipelines such as molecular docking.
Collapse
Affiliation(s)
- Jerome Eberhardt
- Department of Integrative Structural and Computational Biology, Scripps Research, La Jolla, California 92037, United States
| | - Stefano Forli
- Department of Integrative Structural and Computational Biology, Scripps Research, La Jolla, California 92037, United States
| |
Collapse
|
5
|
Melling O, Samways ML, Ge Y, Mobley DL, Essex JW. Enhanced Grand Canonical Sampling of Occluded Water Sites Using Nonequilibrium Candidate Monte Carlo. J Chem Theory Comput 2023; 19:1050-1062. [PMID: 36692215 PMCID: PMC9933432 DOI: 10.1021/acs.jctc.2c00823] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Indexed: 01/25/2023]
Abstract
Water molecules play a key role in many biomolecular systems, particularly when bound at protein-ligand interfaces. However, molecular simulation studies on such systems are hampered by the relatively long time scales over which water exchange between a protein and solvent takes place. Grand canonical Monte Carlo (GCMC) is a simulation technique that avoids this issue by attempting the insertion and deletion of water molecules within a given structure. The approach is constrained by low acceptance probabilities for insertions in congested systems, however. To address this issue, here, we combine GCMC with nonequilibium candidate Monte Carlo (NCMC) to yield a method that we refer to as grand canonical nonequilibrium candidate Monte Carlo (GCNCMC), in which the water insertions and deletions are carried out in a gradual, nonequilibrium fashion. We validate this new approach by comparing GCNCMC and GCMC simulations of bulk water and three protein binding sites. We find that not only is the efficiency of the water sampling improved by GCNCMC but that it also results in increased sampling of ligand conformations in a protein binding site, revealing new water-mediated ligand-binding geometries that are not observed using alternative enhanced sampling techniques.
Collapse
Affiliation(s)
- Oliver
J. Melling
- School
of Chemistry, University of Southampton, SouthamptonSO17 1BJ, U.K.
| | - Marley L. Samways
- School
of Chemistry, University of Southampton, SouthamptonSO17 1BJ, U.K.
| | - Yunhui Ge
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California92697, United States
| | - David L. Mobley
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California92697, United States
- Department
of Chemistry, University of California, Irvine, California92697, United States
| | - Jonathan W. Essex
- School
of Chemistry, University of Southampton, SouthamptonSO17 1BJ, U.K.
| |
Collapse
|
6
|
Xu H. The slow but steady rise of binding free energy calculations in drug discovery. J Comput Aided Mol Des 2023; 37:67-74. [PMID: 36469232 DOI: 10.1007/s10822-022-00494-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 11/28/2022] [Indexed: 12/12/2022]
Abstract
Binding free energy calculations are increasingly used in drug discovery research to predict protein-ligand binding affinities and to prioritize candidate drug molecules accordingly. It has taken decades of collective effort to transform this academic concept into a technology adopted by the pharmaceutical and biotech industry. Having personally witnessed and taken part in this transformation, here I recount the (incomplete) list of problems that had to be solved to make this computational tool practical and suggest areas of future development.
Collapse
Affiliation(s)
- Huafeng Xu
- Roivant Discovery, 151 West 42nd Street, New York, NY, 10036, USA.
| |
Collapse
|
7
|
Sun Z, Zheng L, Zhang ZY, Cong Y, Wang M, Wang X, Yang J, Liu Z, Huai Z. Molecular Modelling of Ionic Liquids: Situations When Charge Scaling Seems Insufficient. MOLECULES (BASEL, SWITZERLAND) 2023; 28:molecules28020800. [PMID: 36677859 PMCID: PMC9865557 DOI: 10.3390/molecules28020800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 01/10/2023] [Accepted: 01/11/2023] [Indexed: 01/15/2023]
Abstract
Charge scaling as an effective solution to the experiment-computation disagreement in molecular modelling of ionic liquids (ILs) could bring the computational results close to the experimental reference for various thermodynamic properties. According to the large-scale benchmark calculations of mass density, solvation, and water-ILs transfer-free energies in our series of papers, the charge-scaling factor of 0.8 serves as a near-optimal option generally applicable to most ILs, although a system-dependent parameter adjustment could be attempted for further improved performance. However, there are situations in which such a charge-scaling treatment would fail. Namely, charge scaling cannot really affect the simulation outcome, or minimally perturbs the results that are still far from the experimental value. In such situations, the vdW radius as an additional adjustable parameter is commonly tuned to minimize the experiment-calculation deviation. In the current work, considering two ILs from the quinuclidinium family, we investigate the impacts of this vdW-scaling treatment on the mass density and the solvation/partition thermodynamics in a fashion similar to our previous charge-scaling works, i.e., scanning the vdW-scaling factor and computing physical properties under these parameter sets. It is observed that the mass density exhibits a linear response to the vdW-scaling factor with slopes close to -1.8 g/mL. By further investigating a set of physiochemically relevant temperatures between 288 K and 348 K, we confirm the robustness of the vdW-scaling treatment in the estimation of bulk properties. The best vdW-scaling parameter for mass density would worsen the computation of solvation/partition thermodynamics, and a marginal decrease in the vdW-scaling factor is considered as an intermediate option balancing the reproductions of bulk properties and solvation thermodynamics. These observations could be understood in a way similar to the charge-scaling situation. i.e., overfitting some properties (e.g., mass density) would degrade the accuracy of the other properties (e.g., solvation free energies). Following this principle, the general guideline for applying this vdW-tuning protocol is by using values between the density-derived choice and the solvation/partition-derived solution. The charge and current vdW scaling treatments cover commonly encountered ILs, completing the protocol for accurate modelling of ILs with fixed-charge force fields.
Collapse
Affiliation(s)
- Zhaoxi Sun
- College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Correspondence: (Z.S.); (X.W.); (Z.H.)
| | - Lei Zheng
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Department of Chemistry, New York University, New York, NY 10003, USA
| | - Zuo-Yuan Zhang
- College of Physical Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Yalong Cong
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
| | - Mao Wang
- NCS Testing Technology Co., Ltd., No. 13, Gaoliangqiao Xiejie, Beijing 100081, China
| | - Xiaohui Wang
- College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
- Beijing Leto Laboratories Co., Ltd., Beijing 100083, China
- Correspondence: (Z.S.); (X.W.); (Z.H.)
| | - Jingjing Yang
- School of Environmental Science and Engineering, Suzhou University of Science and Technology, Suzhou 215009, China
| | - Zhirong Liu
- College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Zhe Huai
- XtalPi-AI Research Center, 7F, Tower A, Dongsheng Building, No.8, Zhongguancun East Road, Beijing 100083, China
- Correspondence: (Z.S.); (X.W.); (Z.H.)
| |
Collapse
|
8
|
Samways M, Bruce Macdonald HE, Taylor RD, Essex JW. Water Networks in Complexes between Proteins and FDA-Approved Drugs. J Chem Inf Model 2023; 63:387-396. [PMID: 36469670 PMCID: PMC9832485 DOI: 10.1021/acs.jcim.2c01225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Water molecules at protein-ligand interfaces are often of significant pharmaceutical interest, owing in part to the entropy which can be released upon the displacement of an ordered water by a therapeutic compound. Protein structures may not, however, completely resolve all critical bound water molecules, or there may be no experimental data available. As such, predicting the location of water molecules in the absence of a crystal structure is important in the context of rational drug design. Grand canonical Monte Carlo (GCMC) is a computational technique that is gaining popularity for the simulation of buried water sites. In this work, we assess the ability of GCMC to accurately predict water binding locations, using a dataset that we have curated, containing 108 unique structures of complexes between proteins and Food and Drug Administration (FDA)-approved small-molecule drugs. We show that GCMC correctly predicts 81.4% of nonbulk crystallographic water sites to within 1.4 Å. However, our analysis demonstrates that the reported performance of water prediction methods is highly sensitive to the way in which the performance is measured. We also find that crystallographic water sites with more protein/ligand hydrogen bonds and stronger electron density are more reliably predicted by GCMC. An analysis of water networks revealed that more than half of the structures contain at least one ligand-contacting water network. In these cases, displacement of a water site by a ligand modification might yield unexpected results if the larger network is destabilized. Cooperative effects between waters should therefore be explicitly considered in structure-based drug design.
Collapse
Affiliation(s)
- Marley
L. Samways
- School
of Chemistry, University of Southampton, Southampton SO17 1BJ, U.K.
| | - Hannah E. Bruce Macdonald
- Computational
and Systems Biology Program, Memorial Sloan
Kettering Cancer Center, New York, New York 10065, United States
| | | | - Jonathan W. Essex
- School
of Chemistry, University of Southampton, Southampton SO17 1BJ, U.K.,
| |
Collapse
|
9
|
Molecular modelling of ionic liquids: Physical properties of species with extremely long aliphatic chains from a near-optimal regime. J Mol Liq 2022. [DOI: 10.1016/j.molliq.2022.120492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
10
|
Ge Y, Baumann HM, Mobley DL. Absolute Binding Free Energy Calculations for Buried Water Molecules. J Chem Theory Comput 2022; 18:6482-6499. [PMID: 36197451 PMCID: PMC9873352 DOI: 10.1021/acs.jctc.2c00658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Water often plays a key role in mediating protein-ligand interactions. Understanding contributions from active-site water molecules to binding thermodynamics of a ligand is important in predicting binding free energies for ligand optimization. In this work, we tested a non-equilibrium switching method for absolute binding free energy calculations on water molecules in binding sites of 13 systems. We discuss the lessons we learned about identified issues that affected our calculations and ways to address them. This work fits with our larger focus on how to do accurate ligand binding free energy calculations when water rearrangements are very slow, such as rearrangements due to ligand modification (as in relative free energy calculations) or ligand binding (as in absolute free energy calculations). The method studied in this work can potentially be used to account for limited water sampling via providing endpoint corrections to free energy calculations using our calculated binding free energy of water.
Collapse
Affiliation(s)
- Yunhui Ge
- Department of Pharmaceutical Sciences, University of California, Irvine, California92697, United States
| | - Hannah M Baumann
- Department of Pharmaceutical Sciences, University of California, Irvine, California92697, United States
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California, Irvine, California92697, United States
- Department of Chemistry, University of California, Irvine, California92697, United States
| |
Collapse
|
11
|
Ge Y, Melling OJ, Dong W, Essex JW, Mobley DL. Enhancing sampling of water rehydration upon ligand binding using variants of grand canonical Monte Carlo. J Comput Aided Mol Des 2022; 36:767-779. [PMID: 36198874 PMCID: PMC9869699 DOI: 10.1007/s10822-022-00479-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 09/15/2022] [Indexed: 01/26/2023]
Abstract
Water plays an important role in mediating protein-ligand interactions. Water rearrangement upon a ligand binding or modification can be very slow and beyond typical timescales used in molecular dynamics (MD) simulations. Thus, inadequate sampling of slow water motions in MD simulations often impairs the accuracy of the accuracy of ligand binding free energy calculations. Previous studies suggest grand canonical Monte Carlo (GCMC) outperforms normal MD simulations for water sampling, thus GCMC has been applied to help improve the accuracy of ligand binding free energy calculations. However, in prior work we observed protein and/or ligand motions impaired how well GCMC performs at water rehydration, suggesting more work is needed to improve this method to handle water sampling. In this work, we applied GCMC in 21 protein-ligand systems to assess the performance of GCMC for rehydrating buried water sites. While our results show that GCMC can rapidly rehydrate all selected water sites for most systems, it fails in five systems. In most failed systems, we observe protein/ligand motions, which occur in the absence of water, combine to close water sites and block instantaneous GCMC water insertion moves. For these five failed systems, we both extended our GCMC simulations and tested a new technique named grand canonical nonequilibrium candidate Monte Carlo (GCNCMC). GCNCMC combines GCMC with the nonequilibrium candidate Monte Carlo (NCMC) sampling technique to improve the probability of a successful water insertion/deletion. Our results show that GCNCMC and extended GCMC can rehydrate all target water sites for three of the five problematic systems and GCNCMC is more efficient than GCMC in two out of the three systems. In one system, only GCNCMC can rehydrate all target water sites, while GCMC fails. Both GCNCMC and GCMC fail in one system. This work suggests this new GCNCMC method is promising for water rehydration especially when protein/ligand motions may block water insertion/removal.
Collapse
Affiliation(s)
- Yunhui Ge
- Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, CA, 92697, USA
| | - Oliver J Melling
- School of Chemistry, University of Southampton, Southampton, SO17 1BJ, UK
| | - Weiming Dong
- Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, CA, 92697, USA
| | - Jonathan W Essex
- School of Chemistry, University of Southampton, Southampton, SO17 1BJ, UK
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, CA, 92697, USA.
- Department of Chemistry, University of California,Irvine, Irvine, CA, 92697, USA.
| |
Collapse
|
12
|
Stachowski TR, Fischer M. Large-Scale Ligand Perturbations of the Protein Conformational Landscape Reveal State-Specific Interaction Hotspots. J Med Chem 2022; 65:13692-13704. [PMID: 35970514 PMCID: PMC9619398 DOI: 10.1021/acs.jmedchem.2c00708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
Protein flexibility is important for ligand binding but
often ignored
in drug design. Considering proteins as ensembles rather than static
snapshots creates opportunities to target dynamic proteins that lack
FDA-approved drugs, such as the human chaperone, heat shock protein
90 (Hsp90). Hsp90α accommodates ligands with a dynamic lid domain,
yet no comprehensive analysis relating lid conformations to ligand
properties is available. To date, ∼300 ligand-bound Hsp90α
crystal structures are deposited in the Protein Data Bank, which enables
us to consider ligand binding as a perturbation of the protein conformational
landscape. By estimating binding site volumes, we classified structures
into distinct major and minor lid conformations. Supported by retrospective
docking, each conformation creates unique hotspots that bind chemically
distinguishable ligands. Clustering revealed insightful exceptions
and the impact of crystal packing. Overall, Hsp90α’s
plasticity provides a cautionary tale of overinterpreting individual
crystal structures and motivates an ensemble-based view of drug design.
Collapse
Affiliation(s)
- Timothy R Stachowski
- Department of Chemical Biology & Therapeutics, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| | - Marcus Fischer
- Department of Chemical Biology & Therapeutics, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States.,Department of Structural Biology, St. Jude Children's Research Hospital, Memphis, Tennessee 38105, United States
| |
Collapse
|
13
|
Mukherjee S, Schäfer LV. Spatially Resolved Hydration Thermodynamics in Biomolecular Systems. J Phys Chem B 2022; 126:3619-3631. [PMID: 35534011 PMCID: PMC9150089 DOI: 10.1021/acs.jpcb.2c01088] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Revised: 04/22/2022] [Indexed: 01/17/2023]
Abstract
Water is essential for the structure, dynamics, energetics, and thus the function of biomolecules. It is a formidable challenge to elicit, in microscopic detail, the role of the solvation-related driving forces of biomolecular processes, such as the enthalpy and entropy contributions to the underlying free-energy landscape. In this Perspective, we discuss recent developments and applications of computational methods that provide a spatially resolved map of hydration thermodynamics in biomolecular systems and thus yield atomic-level insights to guide the interpretation of experimental observations. An emphasis is on the challenge of quantifying the hydration entropy, which requires characterization of both the motions of the biomolecules and of the water molecules in their surrounding.
Collapse
Affiliation(s)
- Saumyak Mukherjee
- Theoretical Chemistry, Ruhr
University Bochum, 44801 Bochum, Germany
| | - Lars V. Schäfer
- Theoretical Chemistry, Ruhr
University Bochum, 44801 Bochum, Germany
| |
Collapse
|
14
|
Ge Y, Wych DC, Samways ML, Wall ME, Essex JW, Mobley DL. Enhancing Sampling of Water Rehydration on Ligand Binding: A Comparison of Techniques. J Chem Theory Comput 2022; 18:1359-1381. [PMID: 35148093 PMCID: PMC9241631 DOI: 10.1021/acs.jctc.1c00590] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Water often plays a key role in protein structure, molecular recognition, and mediating protein-ligand interactions. Thus, free energy calculations must adequately sample water motions, which often proves challenging in typical MD simulation time scales. Thus, the accuracy of methods relying on MD simulations ends up limited by slow water sampling. Particularly, as a ligand is removed or modified, bulk water may not have time to fill or rearrange in the binding site. In this work, we focus on several molecular dynamics (MD) simulation-based methods attempting to help rehydrate buried water sites: BLUES, using nonequilibrium candidate Monte Carlo (NCMC); grand, using grand canonical Monte Carlo (GCMC); and normal MD. We assess the accuracy and efficiency of these methods in rehydrating target water sites. We selected a range of systems with varying numbers of waters in the binding site, as well as those where water occupancy is coupled to the identity or binding mode of the ligand. We analyzed the rehydration of buried water sites in binding pockets using both clustering of trajectories and direct analysis of electron density maps. Our results suggest both BLUES and grand enhance water sampling relative to normal MD and grand is more robust than BLUES, but also that water sampling remains a major challenge for all of the methods tested. The lessons we learned for these methods and systems are discussed.
Collapse
Affiliation(s)
- Yunhui Ge
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, United States
| | - David C Wych
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, United States
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Marley L Samways
- School of Chemistry, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Michael E Wall
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, United States
| | - Jonathan W Essex
- School of Chemistry, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, United States
- Department of Chemistry, University of California, Irvine, California 92697, United States
| |
Collapse
|
15
|
Ben-Shalom IY, Lin C, Radak BK, Sherman W, Gilson MK. Fast Equilibration of Water between Buried Sites and the Bulk by Molecular Dynamics with Parallel Monte Carlo Water Moves on Graphical Processing Units. J Chem Theory Comput 2021; 17:7366-7372. [PMID: 34762421 PMCID: PMC8716912 DOI: 10.1021/acs.jctc.1c00867] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Molecular dynamics (MD) simulations of proteins are commonly used to sample from the Boltzmann distribution of conformational states, with wide-ranging applications spanning chemistry, biophysics, and drug discovery. However, MD can be inefficient at equilibrating water occupancy for buried cavities in proteins that are inaccessible to the surrounding solvent. Indeed, the time needed for water molecules to equilibrate between the bulk solvent and the binding site can be well beyond what is practical with standard MD, which typically ranges from hundreds of nanoseconds to a few microseconds. We recently introduced a hybrid Monte Carlo/MD (MC/MD) method, which speeds up the equilibration of water between buried cavities and the surrounding solvent, while sampling from the thermodynamically correct distribution of states. While the initial implementation of the MC functionality led to considerable slowing of the overall simulations, here we address this problem with a parallel MC algorithm implemented on graphical processing units. This results in speed-ups of 10-fold to 1000-fold over the original MC/MD algorithm, depending on the system and simulation parameters. The present method is available for use in the AMBER simulation software.
Collapse
Affiliation(s)
- Ido Y. Ben-Shalom
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 92093 La Jolla, California, USA
| | - Charles Lin
- Roivant Discovery, Boston, Massachusetts, 02110, USA
| | | | - Woody Sherman
- Roivant Discovery, Boston, Massachusetts, 02110, USA
| | - Michael K. Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 92093 La Jolla, California, USA,To whom correspondence should be addressed,
| |
Collapse
|
16
|
Samways ML, Taylor RD, Bruce Macdonald HE, Essex JW. Water molecules at protein-drug interfaces: computational prediction and analysis methods. Chem Soc Rev 2021; 50:9104-9120. [PMID: 34184009 DOI: 10.1039/d0cs00151a] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
The fundamental importance of water molecules at drug-protein interfaces is now widely recognised and a significant feature in structure-based drug design. Experimental methods for analysing the role of water in drug binding have many challenges, including the accurate location of bound water molecules in crystal structures, and problems in resolving specific water contributions to binding thermodynamics. Computational analyses of binding site water molecules provide an alternative, and in principle complete, structural and thermodynamic picture, and their use is now commonplace in the pharmaceutical industry. In this review, we describe the computational methodologies that are available and discuss their strengths and weaknesses. Additionally, we provide a critical analysis of the experimental data used to validate the methods, regarding the type and quality of experimental structural data. We also discuss some of the fundamental difficulties of each method and suggest directions for future study.
Collapse
Affiliation(s)
- Marley L Samways
- School of Chemistry, University of Southampton, Highfield, Southampton SO17 1BJ, UK.
| | | | | | | |
Collapse
|
17
|
Heinzelmann G, Gilson MK. Automation of absolute protein-ligand binding free energy calculations for docking refinement and compound evaluation. Sci Rep 2021; 11:1116. [PMID: 33441879 PMCID: PMC7806944 DOI: 10.1038/s41598-020-80769-1] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Accepted: 12/24/2020] [Indexed: 02/06/2023] Open
Abstract
Absolute binding free energy calculations with explicit solvent molecular simulations can provide estimates of protein-ligand affinities, and thus reduce the time and costs needed to find new drug candidates. However, these calculations can be complex to implement and perform. Here, we introduce the software BAT.py, a Python tool that invokes the AMBER simulation package to automate the calculation of binding free energies for a protein with a series of ligands. The software supports the attach-pull-release (APR) and double decoupling (DD) binding free energy methods, as well as the simultaneous decoupling-recoupling (SDR) method, a variant of double decoupling that avoids numerical artifacts associated with charged ligands. We report encouraging initial test applications of this software both to re-rank docked poses and to estimate overall binding free energies. We also show that it is practical to carry out these calculations cheaply by using graphical processing units in common machines that can be built for this purpose. The combination of automation and low cost positions this procedure to be applied in a relatively high-throughput mode and thus stands to enable new applications in early-stage drug discovery.
Collapse
Affiliation(s)
- Germano Heinzelmann
- Departamento de Física, Universidade Federal de Santa Catarina, Florianópolis, Santa Catarina, Brazil.
| | - Michael K Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, USA
| |
Collapse
|
18
|
Rizzi V, Bonati L, Ansari N, Parrinello M. The role of water in host-guest interaction. Nat Commun 2021; 12:93. [PMID: 33397926 PMCID: PMC7782548 DOI: 10.1038/s41467-020-20310-0] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 11/23/2020] [Indexed: 12/13/2022] Open
Abstract
One of the main applications of atomistic computer simulations is the calculation of ligand binding free energies. The accuracy of these calculations depends on the force field quality and on the thoroughness of configuration sampling. Sampling is an obstacle in simulations due to the frequent appearance of kinetic bottlenecks in the free energy landscape. Very often this difficulty is circumvented by enhanced sampling techniques. Typically, these techniques depend on the introduction of appropriate collective variables that are meant to capture the system's degrees of freedom. In ligand binding, water has long been known to play a key role, but its complex behaviour has proven difficult to fully capture. In this paper we combine machine learning with physical intuition to build a non-local and highly efficient water-describing collective variable. We use it to study a set of host-guest systems from the SAMPL5 challenge. We obtain highly accurate binding free energies and good agreement with experiments. The role of water during the binding process is then analysed in some detail.
Collapse
Affiliation(s)
- Valerio Rizzi
- Department of Chemistry and Applied Biosciences, ETH Zurich, 8092, Zurich, Switzerland
- Facoltà di Informatica, Istituto di Scienze Computazionali, Università della Svizzera Italiana, Via G. Buffi 13, 6900, Lugano, Switzerland
| | - Luigi Bonati
- Facoltà di Informatica, Istituto di Scienze Computazionali, Università della Svizzera Italiana, Via G. Buffi 13, 6900, Lugano, Switzerland
- Department of Physics, ETH Zurich, 8092, Zurich, Switzerland
| | - Narjes Ansari
- Department of Chemistry and Applied Biosciences, ETH Zurich, 8092, Zurich, Switzerland
- Facoltà di Informatica, Istituto di Scienze Computazionali, Università della Svizzera Italiana, Via G. Buffi 13, 6900, Lugano, Switzerland
| | - Michele Parrinello
- Department of Chemistry and Applied Biosciences, ETH Zurich, 8092, Zurich, Switzerland.
- Facoltà di Informatica, Istituto di Scienze Computazionali, Università della Svizzera Italiana, Via G. Buffi 13, 6900, Lugano, Switzerland.
- Italian Institute of Technology, Via Morego 30, 16163, Genova, Italy.
| |
Collapse
|