1
|
Samways ML, Taylor RD, Bruce Macdonald HE, Essex JW. Water molecules at protein-drug interfaces: computational prediction and analysis methods. Chem Soc Rev 2021; 50:9104-9120. [PMID: 34184009 DOI: 10.1039/d0cs00151a] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
The fundamental importance of water molecules at drug-protein interfaces is now widely recognised and a significant feature in structure-based drug design. Experimental methods for analysing the role of water in drug binding have many challenges, including the accurate location of bound water molecules in crystal structures, and problems in resolving specific water contributions to binding thermodynamics. Computational analyses of binding site water molecules provide an alternative, and in principle complete, structural and thermodynamic picture, and their use is now commonplace in the pharmaceutical industry. In this review, we describe the computational methodologies that are available and discuss their strengths and weaknesses. Additionally, we provide a critical analysis of the experimental data used to validate the methods, regarding the type and quality of experimental structural data. We also discuss some of the fundamental difficulties of each method and suggest directions for future study.
Collapse
Affiliation(s)
- Marley L Samways
- School of Chemistry, University of Southampton, Highfield, Southampton SO17 1BJ, UK.
| | | | | | | |
Collapse
|
2
|
Hu X, Maffucci I, Contini A. Advances in the Treatment of Explicit Water Molecules in Docking and Binding Free Energy Calculations. Curr Med Chem 2020; 26:7598-7622. [DOI: 10.2174/0929867325666180514110824] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Revised: 02/26/2018] [Accepted: 04/18/2018] [Indexed: 12/30/2022]
Abstract
Background:
The inclusion of direct effects mediated by water during the ligandreceptor
recognition is a hot-topic of modern computational chemistry applied to drug discovery
and development. Docking or virtual screening with explicit hydration is still debatable,
despite the successful cases that have been presented in the last years. Indeed, how to select
the water molecules that will be included in the docking process or how the included waters
should be treated remain open questions.
Objective:
In this review, we will discuss some of the most recent methods that can be used in
computational drug discovery and drug development when the effect of a single water, or of a
small network of interacting waters, needs to be explicitly considered.
Results:
Here, we analyse the software to aid the selection, or to predict the position, of water
molecules that are going to be explicitly considered in later docking studies. We also present
software and protocols able to efficiently treat flexible water molecules during docking, including
examples of applications. Finally, we discuss methods based on molecular dynamics
simulations that can be used to integrate docking studies or to reliably and efficiently compute
binding energies of ligands in presence of interfacial or bridging water molecules.
Conclusions:
Software applications aiding the design of new drugs that exploit water molecules,
either as displaceable residues or as bridges to the receptor, are constantly being developed.
Although further validation is needed, workflows that explicitly consider water will
probably become a standard for computational drug discovery soon.
Collapse
Affiliation(s)
- Xiao Hu
- Università degli Studi di Milano, Dipartimento di Scienze Farmaceutiche, Sezione di Chimica Generale e Organica “A. Marchesini”, Via Venezian, 21 20133 Milano, Italy
| | - Irene Maffucci
- Pasteur, Département de Chimie, École Normale Supérieure, PSL Research University, Sorbonne Universités, UPMC Univ. Paris 06, CNRS, 75005 Paris, France
| | - Alessandro Contini
- Università degli Studi di Milano, Dipartimento di Scienze Farmaceutiche, Sezione di Chimica Generale e Organica “A. Marchesini”, Via Venezian, 21 20133 Milano, Italy
| |
Collapse
|
3
|
Danishuddin M, Khan AU. Structure based virtual screening to discover putative drug candidates: Necessary considerations and successful case studies. Methods 2015; 71:135-45. [DOI: 10.1016/j.ymeth.2014.10.019] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Revised: 09/25/2014] [Accepted: 10/17/2014] [Indexed: 12/19/2022] Open
|
4
|
Parikh HI, Kellogg GE. Intuitive, but not simple: including explicit water molecules in protein-protein docking simulations improves model quality. Proteins 2013; 82:916-32. [PMID: 24214407 DOI: 10.1002/prot.24466] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2013] [Revised: 10/18/2013] [Accepted: 10/22/2013] [Indexed: 11/06/2022]
Abstract
Characterizing the nature of interaction between proteins that have not been experimentally cocrystallized requires a computational docking approach that can successfully predict the spatial conformation adopted in the complex. In this work, the Hydropathic INTeractions (HINT) force field model was used for scoring docked models in a data set of 30 high-resolution crystallographically characterized "dry" protein-protein complexes and was shown to reliably identify native-like models. However, most current protein-protein docking algorithms fail to explicitly account for water molecules involved in bridging interactions that mediate and stabilize the association of the protein partners, so we used HINT to illuminate the physical and chemical properties of bridging waters and account for their energetic stabilizing contributions. The HINT water Relevance metric identified the "truly" bridging waters at the 30 protein-protein interfaces and we utilized them in "solvated" docking by manually inserting them into the input files for the rigid body ZDOCK program. By accounting for these interfacial waters, a statistically significant improvement of ∼24% in the average hit-count within the top-10 predictions the protein-protein dataset was seen, compared to standard "dry" docking. The results also show scoring improvement, with medium and high accuracy models ranking much better than incorrect ones. These improvements can be attributed to the physical presence of water molecules that alter surface properties and better represent native shape and hydropathic complementarity between interacting partners, with concomitantly more accurate native-like structure predictions.
Collapse
Affiliation(s)
- Hardik I Parikh
- Department of Medicinal Chemistry and Institute for Structural Biology and Drug Discovery, Virginia Commonwealth University, Richmond, Virginia, 23298-0540
| | | |
Collapse
|
5
|
Interfacial water molecules in SH3 interactions: Getting the full picture on polyproline recognition by protein-protein interaction domains. FEBS Lett 2012; 586:2619-30. [DOI: 10.1016/j.febslet.2012.04.057] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2012] [Revised: 04/27/2012] [Accepted: 04/30/2012] [Indexed: 01/16/2023]
|
6
|
Ross GA, Morris GM, Biggin PC. Rapid and accurate prediction and scoring of water molecules in protein binding sites. PLoS One 2012; 7:e32036. [PMID: 22396746 PMCID: PMC3291545 DOI: 10.1371/journal.pone.0032036] [Citation(s) in RCA: 136] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2011] [Accepted: 01/18/2012] [Indexed: 12/21/2022] Open
Abstract
Water plays a critical role in ligand-protein interactions. However, it is still challenging to predict accurately not only where water molecules prefer to bind, but also which of those water molecules might be displaceable. The latter is often seen as a route to optimizing affinity of potential drug candidates. Using a protocol we call WaterDock, we show that the freely available AutoDock Vina tool can be used to predict accurately the binding sites of water molecules. WaterDock was validated using data from X-ray crystallography, neutron diffraction and molecular dynamics simulations and correctly predicted 97% of the water molecules in the test set. In addition, we combined data-mining, heuristic and machine learning techniques to develop probabilistic water molecule classifiers. When applied to WaterDock predictions in the Astex Diverse Set of protein ligand complexes, we could identify whether a water molecule was conserved or displaced to an accuracy of 75%. A second model predicted whether water molecules were displaced by polar groups or by non-polar groups to an accuracy of 80%. These results should prove useful for anyone wishing to undertake rational design of new compounds where the displacement of water molecules is being considered as a route to improved affinity.
Collapse
Affiliation(s)
- Gregory A. Ross
- Structural Bioinformatics and Computational Biochemistry, University of Oxford, Oxford, United Kingdom
| | | | - Philip C. Biggin
- Structural Bioinformatics and Computational Biochemistry, University of Oxford, Oxford, United Kingdom
- * E-mail:
| |
Collapse
|
7
|
Tripathi A, Kellogg GE. A novel and efficient tool for locating and characterizing protein cavities and binding sites. Proteins 2010; 78:825-42. [PMID: 19847777 DOI: 10.1002/prot.22608] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Systematic investigation of a protein and its binding site characteristics are crucial for designing small molecules that modulate protein functions. However, fundamental uncertainties in binding site interactions and insufficient knowledge of the properties of even well-defined binding pockets can make it difficult to design optimal drugs. Herein, we report the development and implementation of a cavity detection algorithm built with HINT toolkit functions that we are naming Vectorial Identification of Cavity Extents (VICE). This very efficient algorithm is based on geometric criteria applied to simple integer grid maps. In testing, we carried out a systematic investigation on a very diverse data set of proteins and protein-protein/protein-polynucleotide complexes for locating and characterizing the indentations, cavities, pockets, grooves, channels, and surface regions. Additionally, we evaluated a curated data set of unbound proteins for which a ligand-bound protein structures are also known; here the VICE algorithm located the actual ligand in the largest cavity in 83% of the cases and in one of the three largest in 90% of the cases. An interactive front-end provides a quick and simple procedure for locating, displaying and manipulating cavities in these structures. Information describing the cavity, including its volume and surface area metrics, and lists of atoms, residues, and/or chains lining the binding pocket, can be easily obtained and analyzed. For example, the relative cross-sectional surface area (to total surface area) of cavity openings in well-enclosed cavities is 0.06 +/- 0.04 and in surface clefts or crevices is 0.25 +/- 0.09. Proteins 2010. (c) 2009 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Ashutosh Tripathi
- Department of Medicinal Chemistry and Institute for Structural Biology and Drug Discovery, Virginia Commonwealth University, Richmond, Virginia 23298-0540, USA
| | | |
Collapse
|
8
|
Marabotti A, Colonna G, Facchiano A. New computational strategy to analyze the interactions of ERalpha and ERbeta with different ERE sequences. J Comput Chem 2007; 28:1031-41. [PMID: 17269124 DOI: 10.1002/jcc.20582] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The importance of computational methods for the simulation and analysis of biological systems has increased during the last years. In particular, methods to predict binding energies are developing not only with the aim of ranking the affinities between two or more complexes, but also to quantify the contribution of different types of interaction. In this work, we present the application of HINT, a non Newtonian force field, to rank the affinities of complexes formed by estrogen receptors (ER) alpha and beta and different estrogen responsive elements (ERE) near the estrogen-regulated genes. We used the crystallographic coordinates of the DNA binding domain of ERalpha complexed to a consensus ERE as a starting point to simulate several complexes in which some nucleotides in the ERE sequence were mutated. Moreover, we used homology modeling methods to create the structure of the complexes between the DNA binding domain of ERbeta (for which no experimental structures are currently available) and the same ERE sequences. Our results show that HINT is able to rank the affinities of ERalpha and ERbeta for different ERE sequences, and to correctly identify the positions on the DNA sequence that are most important for binding affinity. Moreover, the HINT output gives us the opportunity to identify and quantify the role played by each single atom of amino acids and nucleotides in the binding event, as well as to predict the effect on the binding affinity for other nucleotide mutations.
Collapse
Affiliation(s)
- Anna Marabotti
- Laboratory of Bioinformatics and Computational Biology, Institute of Food Science, National Research Council, Avellino, Italy.
| | | | | |
Collapse
|
9
|
Energetics of the protein-DNA-water interaction. BMC STRUCTURAL BIOLOGY 2007; 7:4. [PMID: 17214883 PMCID: PMC1781455 DOI: 10.1186/1472-6807-7-4] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2006] [Accepted: 01/10/2007] [Indexed: 11/30/2022]
Abstract
Background To understand the energetics of the interaction between protein and DNA we analyzed 39 crystallographically characterized complexes with the HINT (Hydropathic INTeractions) computational model. HINT is an empirical free energy force field based on solvent partitioning of small molecules between water and 1-octanol. Our previous studies on protein-ligand complexes demonstrated that free energy predictions were significantly improved by taking into account the energetic contribution of water molecules that form at least one hydrogen bond with each interacting species. Results An initial correlation between the calculated HINT scores and the experimentally determined binding free energies in the protein-DNA system exhibited a relatively poor r2 of 0.21 and standard error of ± 1.71 kcal mol-1. However, the inclusion of 261 waters that bridge protein and DNA improved the HINT score-free energy correlation to an r2 of 0.56 and standard error of ± 1.28 kcal mol-1. Analysis of the water role and energy contributions indicate that 46% of the bridging waters act as linkers between amino acids and nucleotide bases at the protein-DNA interface, while the remaining 54% are largely involved in screening unfavorable electrostatic contacts. Conclusion This study quantifies the key energetic role of bridging waters in protein-DNA associations. In addition, the relevant role of hydrophobic interactions and entropy in driving protein-DNA association is indicated by analyses of interaction character showing that, together, the favorable polar and unfavorable polar/hydrophobic-polar interactions (i.e., desolvation) mostly cancel.
Collapse
|
10
|
Amadasi A, Spyrakis F, Cozzini P, Abraham DJ, Kellogg GE, Mozzarelli A. Mapping the energetics of water-protein and water-ligand interactions with the "natural" HINT forcefield: predictive tools for characterizing the roles of water in biomolecules. J Mol Biol 2006; 358:289-309. [PMID: 16497327 DOI: 10.1016/j.jmb.2006.01.053] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2005] [Revised: 12/30/2005] [Accepted: 01/14/2006] [Indexed: 11/15/2022]
Abstract
The energetics and hydrogen bonding pattern of water molecules bound to proteins were mapped by analyzing structural data (resolution better than 2.3A) for sets of uncomplexed and ligand-complexed proteins. Water-protein and water-ligand interactions were evaluated using hydropatic interactions (HINT), a non-Newtonian forcefield based on experimentally determined logP(octanol/water) values. Potential water hydrogen bonding ability was assessed by a new Rank algorithm. The HINT-derived binding energies and Ranks for second shell water molecules were -0.04 kcal mol(-1) and 0.0, respectively, for first shell water molecules -0.38 kcal mol(-1) and 1.6, for active site water molecules -0.45 kcal mol(-1) and 2.3, for cavity water molecules -0.55 kcal mol(-1) and 3.3, and for buried water molecules -0.56 kcal mol(-1) and 4.4. For the last four classes, similar energies indicate that internal and external water molecules interact with protein almost equally, despite different degrees of hydrogen bonding. The binding energies and Ranks for water molecules bridging ligand-protein were -1.13 kcal mol(-1) and 4.5, respectively. This energetic contribution is shared equally between protein and ligand, whereas Rank favors the protein. Lastly, by comparing the uncomplexed and complexed forms of proteins, guidelines were developed for prediction of the roles played by active site water molecules in ligand binding. A water molecule with high Rank and HINT score is unlikely to make further interactions with the ligand and is largely irrelevant to the binding process, while a water molecule with moderate Rank and high HINT score is available for ligand interaction. Water molecule displaced for steric reasons were characterized by lower Rank and HINT score. These guidelines, tested by calculating HINT score and Rank for 50 water molecules bound in the active site of four uncomplexed proteins (for which the structures of the liganded forms were also available), correctly predicted the ultimate roles (in the complex) for 76% of water molecules. Some failures were likely due to ambiguities in the structural data.
Collapse
Affiliation(s)
- Alessio Amadasi
- Department of Biochemistry and Molecular Biology University of Parma, 43100 Parma, Italy
| | | | | | | | | | | |
Collapse
|