1
|
Gallicchio E. Relative Binding Free Energy Estimation of Congeneric Ligands and Macromolecular Mutants with the Alchemical Transfer Method with Coordinate Swapping. J Chem Inf Model 2025; 65:3706-3714. [PMID: 40136079 PMCID: PMC12004517 DOI: 10.1021/acs.jcim.5c00207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2025] [Revised: 03/09/2025] [Accepted: 03/14/2025] [Indexed: 03/27/2025]
Abstract
We present the Alchemical Transfer with Coordinate Swapping (ATS) method to enable the calculation of the relative binding free energies between large congeneric ligands and single-point mutant peptides to protein receptors with the Alchemical Transfer Method (ATM) framework. Similarly to ATM, the new method implements the alchemical transformation as a coordinate transformation and works with any unmodified force fields and standard chemical topologies. Unlike ATM, which transfers whole ligands in and out of the receptor binding site, ATS limits the magnitude of the alchemical perturbation by transferring only the portion of the molecules that differ between the bound and unbound ligands. The common region of the two ligands, which can be arbitrarily large, is unchanged and does not contribute to the magnitude and statistical fluctuations of the perturbation energy. Internally, the coordinates of the atoms of the common regions are swapped to maintain the integrity of the covalent bonding data structures of the OpenMM molecular dynamics engine. The work successfully validates the method on protein-ligand and protein-peptide RBFE benchmarks. This advance paves the road for the application of the relative binding free energy Alchemical Transfer Method protocol to study the effect of protein and nucleic acid mutations on the binding affinity and specificity of macromolecular complexes.
Collapse
Affiliation(s)
- Emilio Gallicchio
- Department
of Chemistry and Biochemistry, Brooklyn
College of the City University of New York, New York, New York 11210, United States
- Ph.D.
Program in Chemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
- Ph.D.
Program in Biochemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
| |
Collapse
|
2
|
Bhati A, Wan S, Coveney PV. Equilibrium and Nonequilibrium Ensemble Methods for Accurate, Precise and Reproducible Absolute Binding Free Energy Calculations. J Chem Theory Comput 2025; 21:440-462. [PMID: 39680850 PMCID: PMC11736689 DOI: 10.1021/acs.jctc.4c01389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2024] [Revised: 11/28/2024] [Accepted: 12/02/2024] [Indexed: 12/18/2024]
Abstract
Free energy calculations for protein-ligand complexes have become widespread in recent years owing to several conceptual, methodological and technological advances. Central among these is the use of ensemble methods which permits accurate, precise and reproducible predictions and is necessary for uncertainty quantification. Absolute binding free energies (ABFEs) are challenging to predict using alchemical methods and their routine application in drug discovery has remained out of reach until now. Here, we apply ensemble alchemical ABFE methods to a large data set comprising 219 ligand-protein complexes and obtain statistically robust results with high accuracy (<1 kcal/mol). We compare equilibrium and nonequilibrium methods for ABFE predictions at large scale and provide a systematic critical assessment of each method. The equilibrium method is more accurate, precise, faster, computationally more cost-effective and requires a much simpler protocol, making it preferable for large scale and blind applications. We find that the calculated free energy distributions are non-normal and discuss the consequences. We recommend a definitive protocol to perform ABFE calculations optimally. Using this protocol, it is possible to perform thousands of ABFE calculations within a few hours on modern exascale machines.
Collapse
Affiliation(s)
- Agastya
P. Bhati
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
| | - Shunzhou Wan
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
| | - Peter V. Coveney
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
- Computational
Science Laboratory, Institute for Informatics, Faculty of Science, University of Amsterdam, Amsterdam 1012, The Netherlands
- Advanced
Research Computing Centre, University College
London, London WC1H 9BT, United Kingdom
| |
Collapse
|
3
|
Azimi S, Gallicchio E. Binding Selectivity Analysis from Alchemical Receptor Hopping and Swapping Free Energy Calculations. J Phys Chem B 2024; 128:10841-10852. [PMID: 39468848 PMCID: PMC11551962 DOI: 10.1021/acs.jpcb.4c05732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2024] [Revised: 10/08/2024] [Accepted: 10/10/2024] [Indexed: 10/30/2024]
Abstract
We present receptor hopping and receptor swapping free energy estimation protocols based on the Alchemical Transfer Method (ATM) to model the binding selectivity of a set of ligands to two arbitrary receptors. The receptor hopping protocol, where a ligand is alchemically transferred from one receptor to another in one simulation, directly yields the ligand's binding selectivity free energy (BSFE) for the two receptors, which is the difference between the two individual binding free energies. In the receptor swapping protocol, the first ligand of a pair is transferred from one receptor to another while the second ligand is simultaneously transferred in the opposite direction. The receptor swapping free energy yields the differences in binding selectivity free energies of a set of ligands, which, when combined using a generalized DiffNet algorithm, yield the binding selectivity free energies of the ligands. We test these algorithms on host-guest systems and show that they yield results that agree with experimental data and are consistent with differences in absolute and relative binding free energies obtained by conventional methods. Preliminary applications to the selectivity analysis of molecular fragments binding to the trypsin and thrombin serine protease confirm the potential of the receptor swapping technology in structure-based drug discovery. The novel methodologies presented in this work are a first step toward streamlined and computationally efficient protocols for ligand selectivity optimization between mutants and homologous proteins.
Collapse
Affiliation(s)
- Solmaz Azimi
- Department
of Chemistry and Biochemistry, Brooklyn
College of the City University of New York, New York, New York 11210, United States
- Ph.D.
Program in Biochemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
| | - Emilio Gallicchio
- Department
of Chemistry and Biochemistry, Brooklyn
College of the City University of New York, New York, New York 11210, United States
- Ph.D.
Program in Biochemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
- Ph.D.
Program in Chemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
| |
Collapse
|
4
|
Di Paco G, Macchiagodena M, Procacci P. Identification of Potential Inhibitors of the SARS-CoV-2 NSP13 Helicase via Structure-Based Ligand Design, Molecular Docking and Nonequilibrium Alchemical Simulations. ChemMedChem 2024; 19:e202400095. [PMID: 38456332 DOI: 10.1002/cmdc.202400095] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 03/06/2024] [Accepted: 03/06/2024] [Indexed: 03/09/2024]
Abstract
We have assembled a computational pipeline based on virtual screening, docking techniques, and nonequilibrium molecular dynamics simulations, with the goal of identifying possible inhibitors of the SARS-CoV-2 NSP13 helicase, catalyzing by ATP hydrolysis the unwinding of double or single-stranded RNA in the viral replication process inside the host cell. The druggable sites for broad-spectrum inhibitors are represented by the RNA binding sites at the 5' entrance and 3' exit of the central channel, a structural motif that is highly conserved across coronaviruses. Potential binders were first generated using structure-based ligand techniques. Their potency was estimated by using four popular docking scoring functions. Common docking hits for NSP13 were finally tested using advanced nonequilibrium alchemical techniques for binding free energy calculations on a high-performing parallel cluster. Four potential NSP13 inhibitors with potency from submicrimolar to nanomolar were finally identified.
Collapse
Affiliation(s)
- Giorgio Di Paco
- Dipartimento di Chimica "Ugo Schiff", Universit'a degli Studi di Firenze, Via della Lastruccia 3, 50019, Sesto Fiorentino, Italy
| | - Marina Macchiagodena
- Dipartimento di Chimica "Ugo Schiff", Universit'a degli Studi di Firenze, Via della Lastruccia 3, 50019, Sesto Fiorentino, Italy
| | - Piero Procacci
- Dipartimento di Chimica "Ugo Schiff", Universit'a degli Studi di Firenze, Via della Lastruccia 3, 50019, Sesto Fiorentino, Italy
| |
Collapse
|
5
|
Khuttan S, Gallicchio E. What to Make of Zero: Resolving the Statistical Noise from Conformational Reorganization in Alchemical Binding Free Energy Estimates with Metadynamics Sampling. J Chem Theory Comput 2024; 20:1489-1501. [PMID: 38252868 PMCID: PMC10867849 DOI: 10.1021/acs.jctc.3c01250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Revised: 01/03/2024] [Accepted: 01/03/2024] [Indexed: 01/24/2024]
Abstract
We introduce the self-relative binding free energy (self-RBFE) approach to evaluate the intrinsic statistical variance of dual-topology alchemical binding free energy estimators. The self-RBFE is the relative binding free energy between a ligand and a copy of the same ligand, and its true value is zero. Nevertheless, because the two copies of the ligand move independently, the self-RBFE value produced by a finite-length simulation fluctuates and can be used to measure the variance of the model. The results of this validation provide evidence that a significant fraction of the errors observed in benchmark studies reflect the statistical fluctuations of unconverged estimates rather than the models' accuracy. Furthermore, we find that ligand reorganization is a significant contributing factor to the statistical variance of binding free energy estimates and that metadynamics-accelerated conformational sampling of the torsional degrees of freedom of the ligand can drastically reduce the time to convergence.
Collapse
Affiliation(s)
- Sheenam Khuttan
- Department
of Chemistry and Biochemistry, Brooklyn
College of the City University of New York, New York, New York 11210, United States
- Ph.D.
Program in Biochemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
| | - Emilio Gallicchio
- Department
of Chemistry and Biochemistry, Brooklyn
College of the City University of New York, New York, New York 11210, United States
- Ph.D.
Program in Biochemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
- Ph.D.
Program in Chemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
| |
Collapse
|
6
|
Chen L, Wu Y, Wu C, Silveira A, Sherman W, Xu H, Gallicchio E. Performance and Analysis of the Alchemical Transfer Method for Binding-Free-Energy Predictions of Diverse Ligands. J Chem Inf Model 2024; 64:250-264. [PMID: 38147877 DOI: 10.1021/acs.jcim.3c01705] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
The Alchemical Transfer Method (ATM) is herein validated against the relative binding-free energies (RBFEs) of a diverse set of protein-ligand complexes. We employed a streamlined setup workflow, a bespoke force field, and AToM-OpenMM software to compute the RBFEs of the benchmark set prepared by Schindler and collaborators at Merck KGaA. This benchmark set includes examples of standard small R-group ligand modifications as well as more challenging scenarios, such as large R-group changes, scaffold hopping, formal charge changes, and charge-shifting transformations. The novel coordinate perturbation scheme and a dual-topology approach of ATM address some of the challenges of single-topology alchemical RBFE methods. Specifically, ATM eliminates the need for splitting electrostatic and Lennard-Jones interactions, atom mapping, defining ligand regions, and postcorrections for charge-changing perturbations. Thus, ATM is simpler and more broadly applicable than conventional alchemical methods, especially for scaffold-hopping and charge-changing transformations. Here, we performed well over 500 RBFE calculations for eight protein targets and found that ATM achieves accuracy comparable to that of existing state-of-the-art methods, albeit with larger statistical fluctuations. We discuss insights into the specific strengths and weaknesses of the ATM method that will inform future deployments. This study confirms that ATM can be applied as a production tool for RBFE predictions across a wide range of perturbation types within a unified, open-source framework.
Collapse
Affiliation(s)
- Lieyang Chen
- Roivant Sciences, 151 W 42nd Street, 15th Floor, New York, New York 10036, United States
| | - Yujie Wu
- Roivant Sciences, 151 W 42nd Street, 15th Floor, New York, New York 10036, United States
- Atommap Corporation, New York, New York 10017, United States
| | - Chuanjie Wu
- Roivant Sciences, 151 W 42nd Street, 15th Floor, New York, New York 10036, United States
| | - Ana Silveira
- Psivant Therapeutics, 451 D Street, Boston, Massachusetts 02210, United States
| | - Woody Sherman
- Psivant Therapeutics, 451 D Street, Boston, Massachusetts 02210, United States
| | - Huafeng Xu
- Roivant Sciences, 151 W 42nd Street, 15th Floor, New York, New York 10036, United States
- Atommap Corporation, New York, New York 10017, United States
| | - Emilio Gallicchio
- Department of Chemistry and Biochemistry, Brooklyn College of the City University of New York, New York, New York 11210, United States
- Ph.D. Program in Chemistry, The Graduate Center of the City University of New York, New York, New York 10016, United States
- Ph.D. Program in Biochemistry, The Graduate Center of the City University of New York, New York, New York 10016, United States
| |
Collapse
|
7
|
Herz AM, Kellici T, Morao I, Michel J. Alchemical Free Energy Workflows for the Computation of Protein-Ligand Binding Affinities. Methods Mol Biol 2024; 2716:241-264. [PMID: 37702943 DOI: 10.1007/978-1-0716-3449-3_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/14/2023]
Abstract
Alchemical free energy methods can be used for the efficient computation of relative binding free energies during preclinical drug discovery stages. In recent years, this has been facilitated further by the implementation of workflows that enable non-experts to quickly and consistently set up the required simulations. Given the correct input structures, workflows handle the difficult aspects of setting up perturbations, including consistently defining the perturbable molecule, its atom mapping and topology generation, perturbation network generation, running of the simulations via different sampling methods, and analysis of the results. Different academic and commercial workflows are discussed, including FEW, FESetup, FEPrepare, CHARMM-GUI, Transformato, PMX, QLigFEP, TIES, ProFESSA, PyAutoFEP, BioSimSpace, FEP+, Flare, and Orion. These workflows differ in various aspects, such as mapping algorithms or enhanced sampling methods. Some workflows can accommodate more than one molecular dynamics (MD) engine and use external libraries for tasks. Differences between workflows can present advantages for different use cases, however a lack of interoperability of the workflows' components hinders systematic comparisons.
Collapse
Affiliation(s)
- Anna M Herz
- EaStChem School of Chemistry, Joseph Black Building, University of Edinburgh, Edinburgh, UK
| | - Tahsin Kellici
- Evotec (UK) Ltd., In Silico Research and Development, Abingdon, Oxfordshire, UK
- Merck & Co., Inc., Modelling and Informatics, West Point, PA, USA
| | - Inaki Morao
- Evotec (UK) Ltd., In Silico Research and Development, Abingdon, Oxfordshire, UK
| | - Julien Michel
- EaStChem School of Chemistry, Joseph Black Building, University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
8
|
Wan S, Bhati AP, Wade AD, Coveney PV. Ensemble-Based Approaches Ensure Reliability and Reproducibility. J Chem Inf Model 2023; 63:6959-6963. [PMID: 37965695 PMCID: PMC10685440 DOI: 10.1021/acs.jcim.3c01654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Indexed: 11/16/2023]
Abstract
It is increasingly widely recognized that ensemble-based approaches are required to achieve reliability, accuracy, and precision in molecular dynamics calculations. The purpose of the present article is to address a frequently raised question: what is the optimal way to perform ensemble simulation to calculate quantities of interest?
Collapse
Affiliation(s)
- Shunzhou Wan
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U. K
| | - Agastya P. Bhati
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U. K
| | - Alexander D. Wade
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U. K
| | - Peter V. Coveney
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U. K
- Advanced
Research Computing Centre, University College
London, London WC1H 0AJ, U.K.
- Institute
for Informatics, Faculty of Science, University
of Amsterdam, 1098XH Amsterdam, The Netherlands
| |
Collapse
|
9
|
Wan S, Bhati AP, Coveney PV. Comparison of Equilibrium and Nonequilibrium Approaches for Relative Binding Free Energy Predictions. J Chem Theory Comput 2023; 19:7846-7860. [PMID: 37862058 PMCID: PMC10653111 DOI: 10.1021/acs.jctc.3c00842] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Indexed: 10/21/2023]
Abstract
Alchemical relative binding free energy calculations have recently found important applications in drug optimization. A series of congeneric compounds are generated from a preidentified lead compound, and their relative binding affinities to a protein are assessed in order to optimize candidate drugs. While methods based on equilibrium thermodynamics have been extensively studied, an approach based on nonequilibrium methods has recently been reported together with claims of its superiority. However, these claims pay insufficient attention to the basis and reliability of both methods. Here we report a comparative study of the two approaches across a large data set, comprising more than 500 ligand transformations spanning in excess of 300 ligands binding to a set of 14 diverse protein targets. Ensemble methods are essential to quantify the uncertainty in these calculations, not only for the reasons already established in the equilibrium approach but also to ensure that the nonequilibrium calculations reside within their domain of validity. If and only if ensemble methods are applied, we find that the nonequilibrium method can achieve accuracy and precision comparable to those of the equilibrium approach. Compared to the equilibrium method, the nonequilibrium approach can reduce computational costs but introduces higher computational complexity and longer wall clock times. There are, however, cases where the standard length of a nonequilibrium transition is not sufficient, necessitating a complete rerun of the entire set of transitions. This significantly increases the computational cost and proves to be highly inconvenient during large-scale applications. Our findings provide a key set of recommendations that should be adopted for the reliable implementation of nonequilibrium approaches to relative binding free energy calculations in ligand-protein systems.
Collapse
Affiliation(s)
- Shunzhou Wan
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U.K.
| | - Agastya P. Bhati
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U.K.
| | - Peter V. Coveney
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U.K.
- Advanced
Research Computing Centre, University College
London, London WC1H 0AJ, U.K.
- Computational
Science Laboratory, Institute for Informatics, Faculty of Science, University of Amsterdam, Amsterdam 1012 WP, Netherlands
| |
Collapse
|
10
|
Sabanés Zariquiey F, Pérez A, Majewski M, Gallicchio E, De Fabritiis G. Validation of the Alchemical Transfer Method for the Estimation of Relative Binding Affinities of Molecular Series. J Chem Inf Model 2023; 63:2438-2444. [PMID: 37042797 PMCID: PMC10577236 DOI: 10.1021/acs.jcim.3c00178] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2023]
Abstract
The accurate prediction of protein-ligand binding affinities is crucial for drug discovery. Alchemical free energy calculations have become a popular tool for this purpose. However, the accuracy and reliability of these methods can vary depending on the methodology. In this study, we evaluate the performance of a relative binding free energy protocol based on the alchemical transfer method (ATM), a novel approach based on a coordinate transformation that swaps the positions of two ligands. The results show that ATM matches the performance of more complex free energy perturbation (FEP) methods in terms of Pearson correlation but with marginally higher mean absolute errors. This study shows that the ATM method is competitive compared to more traditional methods in speed and accuracy and offers the advantage of being applicable with any potential energy function.
Collapse
Affiliation(s)
- Francesc Sabanés Zariquiey
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Adrià Pérez
- Acellera Labs, C Dr Trueta 183, 08005 Barcelona, Spain
| | | | - Emilio Gallicchio
- Department of Chemistry, Brooklyn College of the City University of New York, New York, New York 11210, United States
- PhD Program in Chemistry Graduate Center of the City University of New York, New York, New York 10016, United States
- PhD Program in Biochemistry, Graduate Center of the City University of New York, New York, New York 10016, United States
| | - Gianni De Fabritiis
- Computational Science Laboratory, Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C Dr. Aiguader 88, 08003 Barcelona, Spain
- Acellera, Devonshire House 582 Honeypot Lane, Stanmore, Middlesex HA7 1JS, United Kingdom
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
11
|
Bieniek M, Wade AD, Bhati AP, Wan S, Coveney PV. TIES 2.0: A Dual-Topology Open Source Relative Binding Free Energy Builder with Web Portal. J Chem Inf Model 2023; 63:718-724. [PMID: 36719676 PMCID: PMC9930115 DOI: 10.1021/acs.jcim.2c01596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Relative binding free energy (RBFE) calculations are widely used to aid the process of drug discovery. TIES, Thermodynamic Integration with Enhanced Sampling, is a dual-topology approach to RBFE calculations with support for NAMD and OpenMM molecular dynamics engines. The software has been thoroughly validated on publicly available datasets. Here we describe the open source software along with a web portal (https://ccs-ties.org) that enables users to perform such calculations correctly and rapidly.
Collapse
Affiliation(s)
- Mateusz
K. Bieniek
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom,School
of Natural and Environmental Sciences, Newcastle
University, Newcastle upon Tyne NE1 7RU, United
Kingdom
| | - Alexander D. Wade
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
| | - Agastya P. Bhati
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
| | - Shunzhou Wan
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
| | - Peter V. Coveney
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom,Advanced
Research Computing Centre, University College
London, London WC1H 0AJ, United
Kingdom,Institute
for Informatics, Faculty of Science, University
of Amsterdam, 1098XH Amsterdam, The Netherlands,E-mail:
| |
Collapse
|
12
|
The performance of ensemble-based free energy protocols in computing binding affinities to ROS1 kinase. Sci Rep 2022; 12:10433. [PMID: 35729177 PMCID: PMC9211793 DOI: 10.1038/s41598-022-13319-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 05/23/2022] [Indexed: 11/08/2022] Open
Abstract
Optimization of binding affinities for compounds to their target protein is a primary objective in drug discovery. Herein we report on a collaborative study that evaluates a set of compounds binding to ROS1 kinase. We use ESMACS (enhanced sampling of molecular dynamics with approximation of continuum solvent) and TIES (thermodynamic integration with enhanced sampling) protocols to rank the binding free energies. The predicted binding free energies from ESMACS simulations show good correlations with experimental data for subsets of the compounds. Consistent binding free energy differences are generated for TIES and ESMACS. Although an unexplained overestimation exists, we obtain excellent statistical rankings across the set of compounds from the TIES protocol, with a Pearson correlation coefficient of 0.90 between calculated and experimental activities.
Collapse
|
13
|
Wade A, Bhati AP, Wan S, Coveney PV. Alchemical Free Energy Estimators and Molecular Dynamics Engines: Accuracy, Precision, and Reproducibility. J Chem Theory Comput 2022; 18:3972-3987. [PMID: 35609233 PMCID: PMC9202356 DOI: 10.1021/acs.jctc.2c00114] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Indexed: 11/28/2022]
Abstract
The binding free energy between a ligand and its target protein is an essential quantity to know at all stages of the drug discovery pipeline. Assessing this value computationally can offer insight into where efforts should be focused in the pursuit of effective therapeutics to treat a myriad of diseases. In this work, we examine the computation of alchemical relative binding free energies with an eye for assessing reproducibility across popular molecular dynamics packages and free energy estimators. The focus of this work is on 54 ligand transformations from a diverse set of protein targets: MCL1, PTP1B, TYK2, CDK2, and thrombin. These targets are studied with three popular molecular dynamics packages: OpenMM, NAMD2, and NAMD3 alpha. Trajectories collected with these packages are used to compare relative binding free energies calculated with thermodynamic integration and free energy perturbation methods. The resulting binding free energies show good agreement between molecular dynamics packages with an average mean unsigned error between them of 0.50 kcal/mol. The correlation between packages is very good, with the lowest Spearman's, Pearson's and Kendall's tau correlation coefficients being 0.92, 0.91, and 0.76, respectively. Agreement between thermodynamic integration and free energy perturbation is shown to be very good when using ensemble averaging.
Collapse
Affiliation(s)
- Alexander
D. Wade
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, UK
| | - Agastya P. Bhati
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, UK
| | - Shunzhou Wan
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, UK
| | - Peter V. Coveney
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, UK
- Informatics
Institute, University of Amsterdam, Amsterdam 1098XH, The Netherlands
- Advanced
Research Computing Centre, University College
London, London WC1H 0AJ, UK
| |
Collapse
|
14
|
Wan S, Bhati AP, Wright DW, Wall ID, Graves AP, Green D, Coveney PV. Ensemble Simulations and Experimental Free Energy Distributions: Evaluation and Characterization of Isoxazole Amides as SMYD3 Inhibitors. J Chem Inf Model 2022; 62:2561-2570. [PMID: 35508076 PMCID: PMC9131449 DOI: 10.1021/acs.jcim.2c00255] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Optimization of binding affinities for ligands to their target protein is a primary objective in rational drug discovery. Herein, we report on a collaborative study that evaluates various compounds designed to bind to the SET and MYND domain-containing protein 3 (SMYD3). SMYD3 is a histone methyltransferase and plays an important role in transcriptional regulation in cell proliferation, cell cycle, and human carcinogenesis. Experimental measurements using the scintillation proximity assay show that the distributions of binding free energies from a large number of independent measurements exhibit non-normal properties. We use ESMACS (enhanced sampling of molecular dynamics with approximation of continuum solvent) and TIES (thermodynamic integration with enhanced sampling) protocols to predict the binding free energies and to provide a detailed chemical insight into the nature of ligand-protein binding. Our results show that the 1-trajectory ESMACS protocol works well for the set of ligands studied here. Although one unexplained outlier exists, we obtain excellent statistical ranking across the set of compounds from the ESMACS protocol and good agreement between calculations and experiments for the relative binding free energies from the TIES protocol. ESMACS and TIES are again found to be powerful protocols for the accurate comparison of the binding free energies.
Collapse
Affiliation(s)
- Shunzhou Wan
- Centre for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U.K
| | - Agastya P Bhati
- Centre for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U.K
| | - David W Wright
- Centre for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U.K
| | - Ian D Wall
- GlaxoSmithKline, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, U.K
| | - Alan P Graves
- GlaxoSmithKline, 1250 South Collegeville Road, Collegeville, Pennsylvania 19426, United States
| | - Darren Green
- GlaxoSmithKline, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, U.K
| | - Peter V Coveney
- Centre for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U.K.,Advanced Research Computing Centre, University College London, London WC1H 0AJ U.K.,Institute for Informatics, Faculty of Science, University of Amsterdam, 1098XH Amsterdam, The Netherlands
| |
Collapse
|
15
|
Bhati A, Coveney PV. Large Scale Study of Ligand-Protein Relative Binding Free Energy Calculations: Actionable Predictions from Statistically Robust Protocols. J Chem Theory Comput 2022; 18:2687-2702. [PMID: 35293737 PMCID: PMC9009079 DOI: 10.1021/acs.jctc.1c01288] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Indexed: 12/28/2022]
Abstract
The accurate and reliable prediction of protein-ligand binding affinities can play a central role in the drug discovery process as well as in personalized medicine. Of considerable importance during lead optimization are the alchemical free energy methods that furnish an estimation of relative binding free energies (RBFE) of similar molecules. Recent advances in these methods have increased their speed, accuracy, and precision. This is evident from the increasing number of retrospective as well as prospective studies employing them. However, such methods still have limited applicability in real-world scenarios due to a number of important yet unresolved issues. Here, we report the findings from a large data set comprising over 500 ligand transformations spanning over 300 ligands binding to a diverse set of 14 different protein targets which furnish statistically robust results on the accuracy, precision, and reproducibility of RBFE calculations. We use ensemble-based methods which are the only way to provide reliable uncertainty quantification given that the underlying molecular dynamics is chaotic. These are implemented using TIES (Thermodynamic Integration with Enhanced Sampling). Results achieve chemical accuracy in all cases. Ensemble simulations also furnish information on the statistical distributions of the free energy calculations which exhibit non-normal behavior. We find that the "enhanced sampling" method known as replica exchange with solute tempering degrades RBFE predictions. We also report definitively on numerous associated alchemical factors including the choice of ligand charge method, flexibility in ligand structure, and the size of the alchemical region including the number of atoms involved in transforming one ligand into another. Our findings provide a key set of recommendations that should be adopted for the reliable application of RBFE methods.
Collapse
Affiliation(s)
- Agastya
P. Bhati
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
| | - Peter V. Coveney
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
- Informatics
Institute, University of Amsterdam, P.O. Box 94323, 1090 GH Amsterdam, Netherlands
| |
Collapse
|
16
|
Bhati AP, Wan S, Alfè D, Clyde AR, Bode M, Tan L, Titov M, Merzky A, Turilli M, Jha S, Highfield RR, Rocchia W, Scafuri N, Succi S, Kranzlmüller D, Mathias G, Wifling D, Donon Y, Di Meglio A, Vallecorsa S, Ma H, Trifan A, Ramanathan A, Brettin T, Partin A, Xia F, Duan X, Stevens R, Coveney PV. Pandemic drugs at pandemic speed: infrastructure for accelerating COVID-19 drug discovery with hybrid machine learning- and physics-based simulations on high-performance computers. Interface Focus 2021; 11:20210018. [PMID: 34956592 PMCID: PMC8504892 DOI: 10.1098/rsfs.2021.0018] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/07/2021] [Indexed: 12/13/2022] Open
Abstract
The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods, in this case, developed for linear accelerators, and physics-based methods. The two in silico methods, each have their own advantages and limitations which, interestingly, complement each other. Here, we present an innovative infrastructural development that combines both approaches to accelerate drug discovery. The scale of the potential resulting workflow is such that it is dependent on supercomputing to achieve extremely high throughput. We have demonstrated the viability of this workflow for the study of inhibitors for four COVID-19 target proteins and our ability to perform the required large-scale calculations to identify lead antiviral compounds through repurposing on a variety of supercomputers.
Collapse
Affiliation(s)
- Agastya P. Bhati
- Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK
| | - Shunzhou Wan
- Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK
| | - Dario Alfè
- Department of Earth Sciences, London Centre for Nanotechnology and Thomas Young Centre at University College London, University College London, Gower Street, London WC1E 6BT, UK
- Dipartimento di Fisica Ettore Pancini, Università di Napoli Federico II, Monte Sant'Angelo, Napoli 80126, Italy
| | - Austin R. Clyde
- Department of Computer Science, University of Chicago, Chicago, IL, USA
| | - Mathis Bode
- Institute for Combustion Technology, RWTH Aachen University, Aachen 52056, Germany
| | - Li Tan
- Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Mikhail Titov
- Department of Electrical and Computer Engineering, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | - Andre Merzky
- Department of Electrical and Computer Engineering, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | - Matteo Turilli
- Department of Electrical and Computer Engineering, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | - Shantenu Jha
- Brookhaven National Laboratory, Upton, NY 11973, USA
- Department of Electrical and Computer Engineering, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | | | - Walter Rocchia
- Concept Lab, Italian Institute of Technology, Via Melen, Genova, Italy
| | - Nicola Scafuri
- Concept Lab, Italian Institute of Technology, Via Melen, Genova, Italy
| | - Sauro Succi
- Center for Life Nanosciences at La Sapienza, Italian Institute of Technology, viale Regina Elena, Roma, Italy
| | - Dieter Kranzlmüller
- Leibniz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences and Humanities, Boltzmannstrasse 1, Garching bei München 85748, Germany
| | - Gerald Mathias
- Leibniz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences and Humanities, Boltzmannstrasse 1, Garching bei München 85748, Germany
| | - David Wifling
- Leibniz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences and Humanities, Boltzmannstrasse 1, Garching bei München 85748, Germany
| | | | | | | | - Heng Ma
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Anda Trifan
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Arvind Ramanathan
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Tom Brettin
- Computing, Environment and Life Sciences Directorate, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Alexander Partin
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Fangfang Xia
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Xiaotan Duan
- Department of Computer Science, University of Chicago, Chicago, IL, USA
| | - Rick Stevens
- Computing, Environment and Life Sciences Directorate, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Peter V. Coveney
- Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK
- Institute for Informatics, University of Amsterdam, Science Park 904, Amsterdam 1098 XH, The Netherlands
| |
Collapse
|
17
|
Wan S, Kumar D, Ilyin V, Al Homsi U, Sher G, Knuth A, Coveney PV. The effect of protein mutations on drug binding suggests ensuing personalised drug selection. Sci Rep 2021; 11:13452. [PMID: 34188094 PMCID: PMC8241852 DOI: 10.1038/s41598-021-92785-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2021] [Accepted: 06/09/2021] [Indexed: 11/08/2022] Open
Abstract
The advent of personalised medicine promises a deeper understanding of mechanisms and therefore therapies. However, the connection between genomic sequences and clinical treatments is often unclear. We studied 50 breast cancer patients belonging to a population-cohort in the state of Qatar. From Sanger sequencing, we identified several new deleterious mutations in the estrogen receptor 1 gene (ESR1). The effect of these mutations on drug treatment in the protein target encoded by ESR1, namely the estrogen receptor, was achieved via rapid and accurate protein-ligand binding affinity interaction studies which were performed for the selected drugs and the natural ligand estrogen. Four nonsynonymous mutations in the ligand-binding domain were subjected to molecular dynamics simulation using absolute and relative binding free energy methods, leading to the ranking of the efficacy of six selected drugs for patients with the mutations. Our study shows that a personalised clinical decision system can be created by integrating an individual patient's genomic data at the molecular level within a computational pipeline which ranks the efficacy of binding of particular drugs to variant proteins.
Collapse
Affiliation(s)
- Shunzhou Wan
- Department of Chemistry, Centre for Computational Science, University College London, London, WC1H 0AJ, UK
| | - Deepak Kumar
- Computational Biology, Carnegie Mellon University in Qatar (CMU-Q), Doha, Qatar
| | - Valentin Ilyin
- Computational Biology, Carnegie Mellon University in Qatar (CMU-Q), Doha, Qatar
| | - Ussama Al Homsi
- Hematology and Oncology Department, National Center for Cancer Care & Research, Hamad Medical Corporation, Doha, Qatar
| | - Gulab Sher
- Interim Translational Research Institute, Hamad Medical Corporation, Doha, Qatar
| | - Alexander Knuth
- Hematology and Oncology Department, National Center for Cancer Care & Research, Hamad Medical Corporation, Doha, Qatar
| | - Peter V Coveney
- Department of Chemistry, Centre for Computational Science, University College London, London, WC1H 0AJ, UK.
| |
Collapse
|