1
|
Can docking scoring functions guarantee success in virtual screening? VIRTUAL SCREENING AND DRUG DOCKING 2022. [DOI: 10.1016/bs.armc.2022.08.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
2
|
Abstract
Computational methods play an increasingly important role in drug discovery. Structure-based drug design (SBDD), in particular, includes techniques that take into account the structure of the macromolecular target to predict compounds that are likely to establish optimal interactions with the binding site. The current interest in machine learning algorithms based on deep neural networks encouraged the application of deep learning to SBDD related problems. This chapter covers selected works in this active area of research.
Collapse
|
3
|
Bonanno E, Ebejer JP. Applying Machine Learning to Ultrafast Shape Recognition in Ligand-Based Virtual Screening. Front Pharmacol 2020; 10:1675. [PMID: 32140104 PMCID: PMC7042174 DOI: 10.3389/fphar.2019.01675] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Accepted: 12/23/2019] [Indexed: 11/13/2022] Open
Abstract
Ultrafast Shape Recognition (USR), along with its derivatives, are Ligand-Based Virtual Screening (LBVS) methods that condense 3-dimensional information about molecular shape, as well as other properties, into a small set of numeric descriptors. These can be used to efficiently compute a measure of similarity between pairs of molecules using a simple inverse Manhattan Distance metric. In this study we explore the use of suitable Machine Learning techniques that can be trained using USR descriptors, so as to improve the similarity detection of potential new leads. We use molecules from the Directory for Useful Decoys-Enhanced to construct machine learning models based on three different algorithms: Gaussian Mixture Models (GMMs), Isolation Forests and Artificial Neural Networks (ANNs). We train models based on full molecule conformer models, as well as the Lowest Energy Conformations (LECs) only. We also investigate the performance of our models when trained on smaller datasets so as to model virtual screening scenarios when only a small number of actives are known a priori. Our results indicate significant performance gains over a state of the art USR-derived method, ElectroShape 5D, with GMMs obtaining a mean performance up to 430% better than that of ElectroShape 5D in terms of Enrichment Factor with a maximum improvement of up to 940%. Additionally, we demonstrate that our models are capable of maintaining their performance, in terms of enrichment factor, within 10% of the mean as the size of the training dataset is successively reduced. Furthermore, we also demonstrate that running times for retrospective screening using the machine learning models we selected are faster than standard USR, on average by a factor of 10, including the time required for training. Our results show that machine learning techniques can significantly improve the virtual screening performance and efficiency of the USR family of methods.
Collapse
Affiliation(s)
- Etienne Bonanno
- Department of Artificial Intelligence, University of Malta, Msida, Malta
| | - Jean-Paul Ebejer
- Centre for Molecular Medicine and Biobanking, University of Malta, Msida, Malta
| |
Collapse
|
4
|
Pérez-Sianes J, Pérez-Sánchez H, Díaz F. Virtual Screening Meets Deep Learning. Curr Comput Aided Drug Des 2019; 15:6-28. [PMID: 30338743 DOI: 10.2174/1573409914666181018141602] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Revised: 10/08/2018] [Accepted: 10/11/2018] [Indexed: 12/27/2022]
Abstract
BACKGROUND Automated compound testing is currently the de facto standard method for drug screening, but it has not brought the great increase in the number of new drugs that was expected. Computer- aided compounds search, known as Virtual Screening, has shown the benefits to this field as a complement or even alternative to the robotic drug discovery. There are different methods and approaches to address this problem and most of them are often included in one of the main screening strategies. Machine learning, however, has established itself as a virtual screening methodology in its own right and it may grow in popularity with the new trends on artificial intelligence. OBJECTIVE This paper will attempt to provide a comprehensive and structured review that collects the most important proposals made so far in this area of research. Particular attention is given to some recent developments carried out in the machine learning field: the deep learning approach, which is pointed out as a future key player in the virtual screening landscape.
Collapse
Affiliation(s)
| | - Horacio Pérez-Sánchez
- Bioinformatics and High Performance Computing Research Group (BIO-HPC), Computer Engineering Department, Universidad Católica San Antonio de Murcia (UCAM), Murcia, Spain
| | - Fernando Díaz
- Departamento de Informática, Escuela de Ingeniería Informática, University of Valladolid, Segovia, Spain
| |
Collapse
|
5
|
Li J, Fu A, Zhang L. An Overview of Scoring Functions Used for Protein-Ligand Interactions in Molecular Docking. Interdiscip Sci 2019; 11:320-328. [PMID: 30877639 DOI: 10.1007/s12539-019-00327-w] [Citation(s) in RCA: 197] [Impact Index Per Article: 32.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2018] [Revised: 02/06/2019] [Accepted: 03/06/2019] [Indexed: 12/17/2022]
Abstract
Currently, molecular docking is becoming a key tool in drug discovery and molecular modeling applications. The reliability of molecular docking depends on the accuracy of the adopted scoring function, which can guide and determine the ligand poses when thousands of possible poses of ligand are generated. The scoring function can be used to determine the binding mode and site of a ligand, predict binding affinity and identify the potential drug leads for a given protein target. Despite intensive research over the years, accurate and rapid prediction of protein-ligand interactions is still a challenge in molecular docking. For this reason, this study reviews four basic types of scoring functions, physics-based, empirical, knowledge-based, and machine learning-based scoring functions, based on an up-to-date classification scheme. We not only discuss the foundations of the four types scoring functions, suitable application areas and shortcomings, but also discuss challenges and potential future study directions.
Collapse
Affiliation(s)
- Jin Li
- College of Computer and Information Science, Southwest University, Chongqing, 400715, China.,School of Medical Information and Engineering, Southwest Medical University, Luzhou, 646000, China
| | - Ailing Fu
- College of Pharmaceutical Sciences, Southwest University, Chongqing, 400715, China
| | - Le Zhang
- College of Computer and Information Science, Southwest University, Chongqing, 400715, China. .,College of Computer Science, Sichuan University, Chengdu, 610065, China. .,Medical Big Data Center, Sichuan University, Chengdu, 610065, China. .,Zdmedical, Information Polytron Technologies Inc Chongqing, Chongqing, 401320, China.
| |
Collapse
|
6
|
Guedes IA, Pereira FSS, Dardenne LE. Empirical Scoring Functions for Structure-Based Virtual Screening: Applications, Critical Aspects, and Challenges. Front Pharmacol 2018; 9:1089. [PMID: 30319422 PMCID: PMC6165880 DOI: 10.3389/fphar.2018.01089] [Citation(s) in RCA: 163] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2018] [Accepted: 09/07/2018] [Indexed: 12/19/2022] Open
Abstract
Structure-based virtual screening (VS) is a widely used approach that employs the knowledge of the three-dimensional structure of the target of interest in the design of new lead compounds from large-scale molecular docking experiments. Through the prediction of the binding mode and affinity of a small molecule within the binding site of the target of interest, it is possible to understand important properties related to the binding process. Empirical scoring functions are widely used for pose and affinity prediction. Although pose prediction is performed with satisfactory accuracy, the correct prediction of binding affinity is still a challenging task and crucial for the success of structure-based VS experiments. There are several efforts in distinct fronts to develop even more sophisticated and accurate models for filtering and ranking large libraries of compounds. This paper will cover some recent successful applications and methodological advances, including strategies to explore the ligand entropy and solvent effects, training with sophisticated machine-learning techniques, and the use of quantum mechanics. Particular emphasis will be given to the discussion of critical aspects and further directions for the development of more accurate empirical scoring functions.
Collapse
Affiliation(s)
- Isabella A Guedes
- Grupo de Modelagem Molecular em Sistemas Biológicos, Laboratório Nacional de Computação Científica, Petrópolis, Brazil
| | - Felipe S S Pereira
- Grupo de Modelagem Molecular em Sistemas Biológicos, Laboratório Nacional de Computação Científica, Petrópolis, Brazil
| | - Laurent E Dardenne
- Grupo de Modelagem Molecular em Sistemas Biológicos, Laboratório Nacional de Computação Científica, Petrópolis, Brazil
| |
Collapse
|
7
|
Prediction of N-Methyl-D-Aspartate Receptor GluN1-Ligand Binding Affinity by a Novel SVM-Pose/SVM-Score Combinatorial Ensemble Docking Scheme. Sci Rep 2017; 7:40053. [PMID: 28059133 PMCID: PMC5216401 DOI: 10.1038/srep40053] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2016] [Accepted: 11/30/2016] [Indexed: 01/24/2023] Open
Abstract
The glycine-binding site of the N-methyl-D-aspartate receptor (NMDAR) subunit GluN1 is a potential pharmacological target for neurodegenerative disorders. A novel combinatorial ensemble docking scheme using ligand and protein conformation ensembles and customized support vector machine (SVM)-based models to select the docked pose and to predict the docking score was generated for predicting the NMDAR GluN1-ligand binding affinity. The predicted root mean square deviation (RMSD) values in pose by SVM-Pose models were found to be in good agreement with the observed values (n = 30, r2 = 0.928–0.988, = 0.894–0.954, RMSE = 0.002–0.412, s = 0.001–0.214), and the predicted pKi values by SVM-Score were found to be in good agreement with the observed values for the training samples (n = 24, r2 = 0.967, = 0.899, RMSE = 0.295, s = 0.170) and test samples (n = 13, q2 = 0.894, RMSE = 0.437, s = 0.202). When subjected to various statistical validations, the developed SVM-Pose and SVM-Score models consistently met the most stringent criteria. A mock test asserted the predictivity of this novel docking scheme. Collectively, this accurate novel combinatorial ensemble docking scheme can be used to predict the NMDAR GluN1-ligand binding affinity for facilitating drug discovery.
Collapse
|
8
|
Yan Z, Wang J. Scoring Functions of Protein-Ligand Interactions. Oncology 2017. [DOI: 10.4018/978-1-5225-0549-5.ch036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Scoring function of protein-ligand interactions is used to recognize the “native” binding pose of a ligand on the protein and to predict the binding affinity, so that the active small molecules can be discriminated from the non-active ones. Scoring function is widely used in computationally molecular docking and structure-based drug discovery. The development and improvement of scoring functions have broad implications in pharmaceutical industry and academic research. During the past three decades, much progress have been made in methodology and accuracy for scoring functions, and many successful cases have be witnessed in virtual database screening. In this chapter, the authors introduced the basic types of scoring functions and their derivations, the commonly-used evaluation methods and benchmarks, as well as the underlying challenges and current solutions. Finally, the authors discussed the promising directions to improve and develop scoring functions for future molecular docking-based drug discovery.
Collapse
|
9
|
Hamzeh-Mivehroud M, Sokouti B, Dastmalchi S. Molecular Docking at a Glance. Oncology 2017. [DOI: 10.4018/978-1-5225-0549-5.ch030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The current chapter introduces different aspects of molecular docking technique in order to give an overview to the readers about the topics which will be dealt with throughout this volume. Like many other fields of science, molecular docking studies has experienced a lagging period of slow and steady increase in terms of acquiring attention of scientific community as well as its frequency of application, followed by a pronounced era of exponential expansion in theory, methodology, areas of application and performance due to developments in related technologies such as computational resources and theoretical as well as experimental biophysical methods. In the following sections the evolution of molecular docking will be reviewed and its different components including methods, search algorithms, scoring functions, validation of the methods, and area of applications plus few case studies will be touched briefly.
Collapse
Affiliation(s)
| | | | - Siavoush Dastmalchi
- Biotechnology Research Center, Tabriz University of Medical Sciences, Iran & School of Pharmacy, Tabriz University of Medical Sciences, Iran
| |
Collapse
|
10
|
Ain QU, Aleksandrova A, Roessler FD, Ballester PJ. Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening. WILEY INTERDISCIPLINARY REVIEWS. COMPUTATIONAL MOLECULAR SCIENCE 2015; 5:405-424. [PMID: 27110292 PMCID: PMC4832270 DOI: 10.1002/wcms.1225] [Citation(s) in RCA: 204] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/03/2015] [Revised: 07/17/2015] [Accepted: 07/18/2015] [Indexed: 12/29/2022]
Abstract
Docking tools to predict whether and how a small molecule binds to a target can be applied if a structural model of such target is available. The reliability of docking depends, however, on the accuracy of the adopted scoring function (SF). Despite intense research over the years, improving the accuracy of SFs for structure-based binding affinity prediction or virtual screening has proven to be a challenging task for any class of method. New SFs based on modern machine-learning regression models, which do not impose a predetermined functional form and thus are able to exploit effectively much larger amounts of experimental data, have recently been introduced. These machine-learning SFs have been shown to outperform a wide range of classical SFs at both binding affinity prediction and virtual screening. The emerging picture from these studies is that the classical approach of using linear regression with a small number of expert-selected structural features can be strongly improved by a machine-learning approach based on nonlinear regression allied with comprehensive data-driven feature selection. Furthermore, the performance of classical SFs does not grow with larger training datasets and hence this performance gap is expected to widen as more training data becomes available in the future. Other topics covered in this review include predicting the reliability of a SF on a particular target class, generating synthetic data to improve predictive performance and modeling guidelines for SF development. WIREs Comput Mol Sci 2015, 5:405-424. doi: 10.1002/wcms.1225 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Qurrat Ul Ain
- Department of Chemistry, Centre for Molecular Informatics University of Cambridge Cambridge UK
| | | | - Florian D Roessler
- Department of Chemistry, Centre for Molecular Informatics University of Cambridge Cambridge UK
| | - Pedro J Ballester
- Cancer Research Center of Marseille, (INSERM U1068, Institut Paoli-Calmettes, Aix-Marseille Université, CNRS UMR7258) Marseille France
| |
Collapse
|
11
|
Ferreira LG, Dos Santos RN, Oliva G, Andricopulo AD. Molecular docking and structure-based drug design strategies. Molecules 2015; 20:13384-421. [PMID: 26205061 PMCID: PMC6332083 DOI: 10.3390/molecules200713384] [Citation(s) in RCA: 1147] [Impact Index Per Article: 114.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2015] [Revised: 07/14/2015] [Accepted: 07/20/2015] [Indexed: 02/07/2023] Open
Abstract
Pharmaceutical research has successfully incorporated a wealth of molecular modeling methods, within a variety of drug discovery programs, to study complex biological and chemical systems. The integration of computational and experimental strategies has been of great value in the identification and development of novel promising compounds. Broadly used in modern drug design, molecular docking methods explore the ligand conformations adopted within the binding sites of macromolecular targets. This approach also estimates the ligand-receptor binding free energy by evaluating critical phenomena involved in the intermolecular recognition process. Today, as a variety of docking algorithms are available, an understanding of the advantages and limitations of each method is of fundamental importance in the development of effective strategies and the generation of relevant results. The purpose of this review is to examine current molecular docking strategies used in drug discovery and medicinal chemistry, exploring the advances in the field and the role played by the integration of structure- and ligand-based methods.
Collapse
Affiliation(s)
- Leonardo G Ferreira
- Laboratório de Química Medicinal e Computacional, Centro de Pesquisa e Inovação em Biodiversidade e Fármacos, Instituto de Física de São Carlos, Universidade de São Paulo, Av. João Dagnone 1100, São Carlos-SP 13563-120, Brazil.
| | - Ricardo N Dos Santos
- Laboratório de Química Medicinal e Computacional, Centro de Pesquisa e Inovação em Biodiversidade e Fármacos, Instituto de Física de São Carlos, Universidade de São Paulo, Av. João Dagnone 1100, São Carlos-SP 13563-120, Brazil.
| | - Glaucius Oliva
- Laboratório de Química Medicinal e Computacional, Centro de Pesquisa e Inovação em Biodiversidade e Fármacos, Instituto de Física de São Carlos, Universidade de São Paulo, Av. João Dagnone 1100, São Carlos-SP 13563-120, Brazil.
| | - Adriano D Andricopulo
- Laboratório de Química Medicinal e Computacional, Centro de Pesquisa e Inovação em Biodiversidade e Fármacos, Instituto de Física de São Carlos, Universidade de São Paulo, Av. João Dagnone 1100, São Carlos-SP 13563-120, Brazil.
| |
Collapse
|
12
|
Chen YC. Beware of docking! Trends Pharmacol Sci 2015; 36:78-95. [DOI: 10.1016/j.tips.2014.12.001] [Citation(s) in RCA: 344] [Impact Index Per Article: 34.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2014] [Revised: 11/23/2014] [Accepted: 12/02/2014] [Indexed: 12/16/2022]
|
13
|
Lauro G, Ferruz N, Fulle S, Harvey MJ, Finn PW, De Fabritiis G. Reranking docking poses using molecular simulations and approximate free energy methods. J Chem Inf Model 2014; 54:2185-9. [PMID: 25046765 DOI: 10.1021/ci500309a] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Fast and accurate identification of active compounds is essential for effective use of virtual screening workflows. Here, we have compared the ligand-ranking efficiency of the linear interaction energy (LIE) method against standard docking approaches. Using a trypsin set of 1549 compounds, we performed 12,250 molecular dynamics simulations. The LIE method proved effective but did not yield results significantly better than those obtained with docking codes. The entire database of simulations is released.
Collapse
Affiliation(s)
- G Lauro
- Dipartimento di Farmacia, Università degli Studi di Salerno , Via Giovanni Paolo II 132, 84084 Fisciano (SA), Italy
| | | | | | | | | | | |
Collapse
|
14
|
Kubrycht J, Sigler K, Souček P. Virtual interactomics of proteins from biochemical standpoint. Mol Biol Int 2012; 2012:976385. [PMID: 22928109 PMCID: PMC3423939 DOI: 10.1155/2012/976385] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2012] [Revised: 05/18/2012] [Accepted: 05/18/2012] [Indexed: 12/24/2022] Open
Abstract
Virtual interactomics represents a rapidly developing scientific area on the boundary line of bioinformatics and interactomics. Protein-related virtual interactomics then comprises instrumental tools for prediction, simulation, and networking of the majority of interactions important for structural and individual reproduction, differentiation, recognition, signaling, regulation, and metabolic pathways of cells and organisms. Here, we describe the main areas of virtual protein interactomics, that is, structurally based comparative analysis and prediction of functionally important interacting sites, mimotope-assisted and combined epitope prediction, molecular (protein) docking studies, and investigation of protein interaction networks. Detailed information about some interesting methodological approaches and online accessible programs or databases is displayed in our tables. Considerable part of the text deals with the searches for common conserved or functionally convergent protein regions and subgraphs of conserved interaction networks, new outstanding trends and clinically interesting results. In agreement with the presented data and relationships, virtual interactomic tools improve our scientific knowledge, help us to formulate working hypotheses, and they frequently also mediate variously important in silico simulations.
Collapse
Affiliation(s)
- Jaroslav Kubrycht
- Department of Physiology, Second Medical School, Charles University, 150 00 Prague, Czech Republic
| | - Karel Sigler
- Laboratory of Cell Biology, Institute of Microbiology, Academy of Sciences of the Czech Republic, 142 20 Prague, Czech Republic
| | - Pavel Souček
- Toxicogenomics Unit, National Institute of Public Health, 100 42 Prague, Czech Republic
| |
Collapse
|
15
|
Tang YT, Gao R, Havranek JJ, Groisman EA, Stock AM, Marshall GR. Inhibition of bacterial virulence: drug-like molecules targeting the Salmonella enterica PhoP response regulator. Chem Biol Drug Des 2012; 79:1007-17. [PMID: 22339993 PMCID: PMC3445336 DOI: 10.1111/j.1747-0285.2012.01362.x] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Two-component signal transduction (TCST) is the predominant signaling scheme used in bacteria to sense and respond to environmental changes in order to survive and thrive. A typical TCST system consists of a sensor histidine kinase to detect external signals and an effector response regulator to respond to external changes. In the signaling scheme, the histidine kinase phosphorylates and activates the response regulator, which functions as a transcription factor to modulate gene expression. One promising strategy toward antibacterial development is to target TCST regulatory systems, specifically the response regulators to disrupt the expression of genes important for virulence. In Salmonella enterica, the PhoQ/PhoP signal transduction system is used to sense and respond to low magnesium levels and regulates the expression for over 40 genes necessary for growth under these conditions, and more interestingly, genes that are important for virulence. In this study, a hybrid approach coupling computational and experimental methods was applied to identify drug-like compounds to target the PhoP response regulator. A computational approach of structure-based virtual screening combined with a series of biochemical and biophysical assays was used to test the predictability of the computational strategy and to characterize the mode of action of the compounds. Eight compounds from virtual screening inhibit the formation of the PhoP-DNA complex necessary for virulence gene regulation. This investigation served as an initial case study for targeting TCST response regulators to modulate the gene expression of a signal transduction pathway important for bacterial virulence. With the increasing resistance of pathogenic bacteria to current antibiotics, targeting TCST response regulators that control virulence is a viable strategy for the development of antimicrobial therapeutics with novel modes of action.
Collapse
Affiliation(s)
- Yat T Tang
- Center for Computational Biology, Department of Biochemistry and Molecular Biophysics, Washington University School of MedicineSt. Louis, MO 63110, USA
| | - Rong Gao
- Howard Hughes Medical Institute, Center for Advanced Biotechnology and Medicine, University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical SchoolPiscataway, NJ 08854, USA
| | - James J Havranek
- Department of Genetics, Washington University School of MedicineSt. Louis, MO 63110, USA
| | - Eduardo A Groisman
- Howard Hughes Medical Institute, Department of Molecular Microbiology, Washington University School of MedicineSt. Louis, MO 63110, USA
| | - Ann M Stock
- Howard Hughes Medical Institute, Center for Advanced Biotechnology and Medicine, University of Medicine and Dentistry of New Jersey-Robert Wood Johnson Medical SchoolPiscataway, NJ 08854, USA
| | - Garland R Marshall
- Center for Computational Biology, Department of Biochemistry and Molecular Biophysics, Washington University School of MedicineSt. Louis, MO 63110, USA
- *Corresponding author: Garland R. Marshall,
| |
Collapse
|
16
|
Are predefined decoy sets of ligand poses able to quantify scoring function accuracy? J Comput Aided Mol Des 2012; 26:185-97. [DOI: 10.1007/s10822-011-9539-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2011] [Accepted: 12/23/2011] [Indexed: 11/26/2022]
|
17
|
Avram S, Pacureanu LM, Seclaman E, Bora A, Kurunczi L. PLS-DA - Docking Optimized Combined Energetic Terms (PLSDA-DOCET) protocol: a brief evaluation. J Chem Inf Model 2011; 51:3169-79. [PMID: 22066983 DOI: 10.1021/ci2002268] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Docking studies have become popular approaches in drug design, where the binding energy of the ligand in the active site of the protein is estimated by a scoring function. Many promising techniques were developed to enhance the performance of scoring functions including the fusion of multiple scoring functions outcomes into a so-called consensus scoring function. Hereby, we evaluated the target oriented consensus technique using the energetic terms of several scoring functions. The approach was denoted PLSDA-DOCET. Optimization strategies for consensus energetic terms and scoring functions based on ROC metric were compared to classical rigid docking and to ligand-based similarity search methods comprising 2D fingerprints and ROCS. The ROCS results indicate large performance variations depending on the biological target. The AUC-based strategy of PLSDA-DOCET outperformed the other docking approaches regarding simple retrieval and scaffold-hopping. The superior performance of PLSDA-DOCET protocol relative to single and combined scoring functions was validated on an external test set. We found a relative low mean correlation of the ranks of the chemotypes retrieved by the PLSDA-DOCET protocol and all the other methods employed here.
Collapse
Affiliation(s)
- Sorin Avram
- Department of Computational Chemistry, Institute of Chemistry of Romanian Academy, Timisoara, Mihai Viteazul Avenue, 24, 300223 Timisoara, Romania
| | | | | | | | | |
Collapse
|
18
|
Smith RD, Dunbar JB, Ung PMU, Esposito EX, Yang CY, Wang S, Carlson HA. CSAR benchmark exercise of 2010: combined evaluation across all submitted scoring functions. J Chem Inf Model 2011; 51:2115-31. [PMID: 21809884 PMCID: PMC3186041 DOI: 10.1021/ci200269q] [Citation(s) in RCA: 117] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
![]()
As part of the Community Structure-Activity Resource (CSAR) center, a set of 343 high-quality, protein–ligand crystal structures were assembled with experimentally determined Kd or Ki information from the literature. We encouraged the community to score the crystallographic poses of the complexes by any method of their choice. The goal of the exercise was to (1) evaluate the current ability of the field to predict activity from structure and (2) investigate the properties of the complexes and methods that appear to hinder scoring. A total of 19 different methods were submitted with numerous parameter variations for a total of 64 sets of scores from 16 participating groups. Linear regression and nonparametric tests were used to correlate scores to the experimental values. Correlation to experiment for the various methods ranged R2 = 0.58–0.12, Spearman ρ = 0.74–0.37, Kendall τ = 0.55–0.25, and median unsigned error = 1.00–1.68 pKd units. All types of scoring functions—force field based, knowledge based, and empirical—had examples with high and low correlation, showing no bias/advantage for any particular approach. The data across all the participants were combined to identify 63 complexes that were poorly scored across the majority of the scoring methods and 123 complexes that were scored well across the majority. The two sets were compared using a Wilcoxon rank-sum test to assess any significant difference in the distributions of >400 physicochemical properties of the ligands and the proteins. Poorly scored complexes were found to have ligands that were the same size as those in well-scored complexes, but hydrogen bonding and torsional strain were significantly different. These comparisons point to a need for CSAR to develop data sets of congeneric series with a range of hydrogen-bonding and hydrophobic characteristics and a range of rotatable bonds.
Collapse
Affiliation(s)
- Richard D Smith
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, Michigan 48109-1065, United States
| | | | | | | | | | | | | |
Collapse
|
19
|
Allosteric inhibition of the hepatitis C virus NS5B polymerase: in silico strategies for drug discovery and development. Future Med Chem 2011; 3:1027-55. [DOI: 10.4155/fmc.11.53] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Chronic infection by hepatitis C virus (HCV) often leads to severe liver disease including cirrhosis, hepatocellular carcinoma and liver failure. Despite it being more than 20 years since the identification of HCV, the current standard of care for treating the infection is based on aspecific therapy often associated with severe side effects and low-sustained virological response. Research is ongoing to develop new and better medications, including a broad range of allosteric NS5B polymerase inhibitors. This article reviews traditional computational methodologies and more recently developed in silico strategies aimed at identifying and optimizing non-nucleoside inhibitors targeting allosteric sites of HCV NS5B polymerase. The drug-discovery approaches reviewed could provide take-home lessons for general computer-aided research projects.
Collapse
|
20
|
Sotriffer C, Matter H. The Challenge of Affinity Prediction: Scoring Functions for Structure-Based Virtual Screening. METHODS AND PRINCIPLES IN MEDICINAL CHEMISTRY 2011. [DOI: 10.1002/9783527633326.ch7] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
21
|
Huang SY, Grinter SZ, Zou X. Scoring functions and their evaluation methods for protein-ligand docking: recent advances and future directions. Phys Chem Chem Phys 2010; 12:12899-908. [PMID: 20730182 PMCID: PMC11103779 DOI: 10.1039/c0cp00151a] [Citation(s) in RCA: 309] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The scoring function is one of the most important components in structure-based drug design. Despite considerable success, accurate and rapid prediction of protein-ligand interactions is still a challenge in molecular docking. In this perspective, we have reviewed three basic types of scoring functions (force-field, empirical, and knowledge-based) and the consensus scoring technique that are used for protein-ligand docking. The commonly-used assessment criteria and publicly available protein-ligand databases for performance evaluation of the scoring functions have also been presented and discussed. We end with a discussion of the challenges faced by existing scoring functions and possible future directions for developing improved scoring functions.
Collapse
Affiliation(s)
- Sheng-You Huang
- Department of Physics and Astronomy, Department of Biochemistry, Dalton Cardiovascular Research Center, and Informatics Institute University of Missouri Columbia, MO 65211
| | - Sam Z. Grinter
- Department of Physics and Astronomy, Department of Biochemistry, Dalton Cardiovascular Research Center, and Informatics Institute University of Missouri Columbia, MO 65211
| | - Xiaoqin Zou
- Department of Physics and Astronomy, Department of Biochemistry, Dalton Cardiovascular Research Center, and Informatics Institute University of Missouri Columbia, MO 65211
| |
Collapse
|
22
|
Salmi-Smail C, Fabre A, Dequiedt F, Restouin A, Castellano R, Garbit S, Roche P, Morelli X, Brunel JM, Collette Y. Modified cap group suberoylanilide hydroxamic acid histone deacetylase inhibitor derivatives reveal improved selective antileukemic activity. J Med Chem 2010; 53:3038-47. [PMID: 20218673 DOI: 10.1021/jm901358y] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A series of SAHA cap derivatives was designed and prepared in good-to-excellent yields that varied from 49% to 95%. These derivatives were evaluated for their antiproliferative activity in several human cancer cell lines. Antiproliferative activity was observed for concentrations varying from 0.12 to >100 microM, and a molecular modeling approach of selected SAHA derivatives, based on available structural information of human HDAC8 in complex with SAHA, was performed. Strikingly, two compounds displayed up to 10-fold improved antileukemic activity with respect to SAHA; however, these compounds displayed antiproliferative activity similar to SAHA when assayed against solid tumor-derived cell lines. A 10-fold improvement in the leukemic vs peripheral blood mononuclear cell therapeutic ratio, with no evident in vivo toxicity toward blood cells, was also observed. The herein-described compounds and method of synthesis will provide invaluable tools to investigate the molecular mechanism responsible for the reported selectively improved antileukemic activity.
Collapse
Affiliation(s)
- Chanaz Salmi-Smail
- Unite 891 INSERM, Centre de Recherche en Cancerologie de Marseille, 27 Bd Lei Roure, 13009 Marseille 09, France
| | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Bar-Haim S, Aharon A, Ben-Moshe T, Marantz Y, Senderowitz H. SeleX-CS: A New Consensus Scoring Algorithm for Hit Discovery and Lead Optimization. J Chem Inf Model 2009; 49:623-33. [DOI: 10.1021/ci800335j] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Affiliation(s)
- Shay Bar-Haim
- Epix Pharmaceuticals Ltd., 3 Hayetzira Street, Ramat Gan 52521, Israel
| | - Ayelet Aharon
- Epix Pharmaceuticals Ltd., 3 Hayetzira Street, Ramat Gan 52521, Israel
| | - Tal Ben-Moshe
- Epix Pharmaceuticals Ltd., 3 Hayetzira Street, Ramat Gan 52521, Israel
| | - Yael Marantz
- Epix Pharmaceuticals Ltd., 3 Hayetzira Street, Ramat Gan 52521, Israel
| | | |
Collapse
|
24
|
Lead Discovery Using Virtual Screening. TOPICS IN MEDICINAL CHEMISTRY 2009. [PMCID: PMC7176223 DOI: 10.1007/7355_2009_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The practice of virtual screening (VS) to identify chemical leads to known or novel targets is becoming a core function of the computational chemist within industry. By employing a range of techniques, when attempting to identify compounds with activity against a biological target, a small focused subset of a larger collection of compounds can be identified and tested, often with results much better than selecting a similar number of compounds at random. We will review the key methods available, their relative success, and provide practical insights into best practices and key gaps. We will also argue that the capability of VS methods has grown to a point where fuller integration with experimental methods, including HTS, could increase the effectiveness of both.
Collapse
|
25
|
Kerzmann A, Fuhrmann J, Kohlbacher O, Neumann D. BALLDock/SLICK: A New Method for Protein-Carbohydrate Docking. J Chem Inf Model 2008; 48:1616-25. [DOI: 10.1021/ci800103u] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Andreas Kerzmann
- Division for Simulation of Biological Systems, Center for Bioinformatics, University of Tübingen, Sand 14, 72076 Tübingen, Germany, and Junior Research Group Drug Transport, Center for Bioinformatics, Saarland University, Building E1 1, 66123 Saarbrücken, Germany
| | - Jan Fuhrmann
- Division for Simulation of Biological Systems, Center for Bioinformatics, University of Tübingen, Sand 14, 72076 Tübingen, Germany, and Junior Research Group Drug Transport, Center for Bioinformatics, Saarland University, Building E1 1, 66123 Saarbrücken, Germany
| | - Oliver Kohlbacher
- Division for Simulation of Biological Systems, Center for Bioinformatics, University of Tübingen, Sand 14, 72076 Tübingen, Germany, and Junior Research Group Drug Transport, Center for Bioinformatics, Saarland University, Building E1 1, 66123 Saarbrücken, Germany
| | - Dirk Neumann
- Division for Simulation of Biological Systems, Center for Bioinformatics, University of Tübingen, Sand 14, 72076 Tübingen, Germany, and Junior Research Group Drug Transport, Center for Bioinformatics, Saarland University, Building E1 1, 66123 Saarbrücken, Germany
| |
Collapse
|
26
|
Waszkowycz B. Towards improving compound selection in structure-based virtual screening. Drug Discov Today 2008; 13:219-26. [PMID: 18342797 DOI: 10.1016/j.drudis.2007.12.002] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2007] [Revised: 12/06/2007] [Accepted: 12/15/2007] [Indexed: 10/22/2022]
Abstract
Structure-based virtual screening is now an established technology for supporting hit finding and lead optimisation in drug discovery. Recent validation studies have highlighted the poor performance of currently used scoring functions in estimating binding affinity and hence in ranking large datasets of docked ligands. Progress in the analysis of large datasets can be made through the use of appropriate data mining techniques and the derivation of a broader range of descriptors relevant to receptor-ligand binding. In addition, simple scoring functions can be supplemented by simulation-based scoring protocols. Developments in workflow design allow the automation of repetitive tasks, and also encourage the routine use of simulation-based methods and the rapid prototyping of novel modelling and analysis procedures.
Collapse
Affiliation(s)
- Bohdan Waszkowycz
- Argenta Discovery Ltd., 8/9 Spire Green Centre, Flex Meadow, Harlow, Essex CM19 5TR, UK.
| |
Collapse
|
27
|
Teramoto R, Fukunishi H. Consensus Scoring with Feature Selection for Structure-Based Virtual Screening. J Chem Inf Model 2008; 48:288-95. [DOI: 10.1021/ci700239t] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Reiji Teramoto
- Bio-IT Center and Nano Electronics Research Laboratories, NEC Corporation, 34, Miyukigaoka, Tsukuba, Ibaraki 305-8501, Japan
| | - Hiroaki Fukunishi
- Bio-IT Center and Nano Electronics Research Laboratories, NEC Corporation, 34, Miyukigaoka, Tsukuba, Ibaraki 305-8501, Japan
| |
Collapse
|
28
|
Betzi S, Restouin A, Opi S, Arold ST, Parrot I, Guerlesquin F, Morelli X, Collette Y. Protein protein interaction inhibition (2P2I) combining high throughput and virtual screening: Application to the HIV-1 Nef protein. Proc Natl Acad Sci U S A 2007; 104:19256-61. [PMID: 18042718 PMCID: PMC2148277 DOI: 10.1073/pnas.0707130104] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2007] [Indexed: 11/18/2022] Open
Abstract
Protein-protein recognition is the cornerstone of multiple cellular and pathological functions. Therefore, protein-protein interaction inhibition (2P2I) is endowed with great therapeutic potential despite the initial belief that 2P2I was refractory to small-molecule intervention. Improved knowledge of complex molecular binding surfaces has recently stimulated renewed interest for 2P2I, especially after identification of "hot spots" and first inhibitory compounds. However, the combination of target complexity and lack of starting compound has thwarted experimental results and created intellectual barriers. Here we combined virtual and experimental screening when no previously known inhibitors can be used as starting point in a structure-based research program that targets an SH3 binding surface of the HIV type I Nef protein. High-throughput docking and application of a pharmacophoric filter on one hand and search for analogy on the other hand identified drug-like compounds that were further confirmed to bind Nef in the micromolar range (isothermal titration calorimetry), to target the Nef SH3 binding surface (NMR experiments), and to efficiently compete for Nef-SH3 interactions (cell-based assay, GST pull-down). Initial identification of these compounds by virtual screening was validated by screening of the very same library of compounds in the cell-based assay, demonstrating that a significant enrichment factor was attained by the in silico screening. To our knowledge, our results identify the first set of drug-like compounds that functionally target the HIV-1 Nef SH3 binding surface and provide the basis for a powerful discovery process that should help to speed up 2P2I strategies and open avenues for new class of antiviral molecules.
Collapse
Affiliation(s)
- Stéphane Betzi
- *Bioénergétique et Ingénierie des Protéines Laboratory, Centre National de la Recherche Scientifique/Institut de Biologie Structurale et Microbiologie, 31 Chemin Joseph Aiguier, 13402 Marseille Cedex 20, France
| | - Audrey Restouin
- Institut National de la Santé et de la Recherche Médicale, Unité Mixte de Recherche 599, Centre de Recherche en Cancérologie de Marseille, F-13009 Marseille, France
- Institut Paoli-Calmettes, F-13009 Marseille, France
- Université de la Méditerranée, F-13007 Marseille, France
| | - Sandrine Opi
- Institut National de la Santé et de la Recherche Médicale, Unité Mixte de Recherche 599, Centre de Recherche en Cancérologie de Marseille, F-13009 Marseille, France
- Institut Paoli-Calmettes, F-13009 Marseille, France
- Université de la Méditerranée, F-13007 Marseille, France
| | - Stefan T. Arold
- Institut National de la Santé et de la Recherche Médicale, Unité 554, and Université de Montpellier, Centre National de la Recherche Scientifique, Unité Mixte de Recherche 5048, Centre de Biochimie Structurale, 29, Rue de Navacelles, 34090 Montpellier Cedex, France; and
| | - Isabelle Parrot
- Institut des Biomolécules Max Mousseron, Unité Mixte de Recherche 5247, Centre National de la Recherche Scientifique–Universités Montpellier I et II, Faculté de Pharmacie, 34093 Montpellier, France
| | - Françoise Guerlesquin
- *Bioénergétique et Ingénierie des Protéines Laboratory, Centre National de la Recherche Scientifique/Institut de Biologie Structurale et Microbiologie, 31 Chemin Joseph Aiguier, 13402 Marseille Cedex 20, France
| | - Xavier Morelli
- *Bioénergétique et Ingénierie des Protéines Laboratory, Centre National de la Recherche Scientifique/Institut de Biologie Structurale et Microbiologie, 31 Chemin Joseph Aiguier, 13402 Marseille Cedex 20, France
| | - Yves Collette
- Institut National de la Santé et de la Recherche Médicale, Unité Mixte de Recherche 599, Centre de Recherche en Cancérologie de Marseille, F-13009 Marseille, France
- Institut Paoli-Calmettes, F-13009 Marseille, France
- Université de la Méditerranée, F-13007 Marseille, France
| |
Collapse
|
29
|
Abstract
Accurate methods of computing the affinity of a small molecule with a protein are needed to speed the discovery of new medications and biological probes. This paper reviews physics-based models of binding, beginning with a summary of the changes in potential energy, solvation energy, and configurational entropy that influence affinity, and a theoretical overview to frame the discussion of specific computational approaches. Important advances are reported in modeling protein-ligand energetics, such as the incorporation of electronic polarization and the use of quantum mechanical methods. Recent calculations suggest that changes in configurational entropy strongly oppose binding and must be included if accurate affinities are to be obtained. The linear interaction energy (LIE) and molecular mechanics Poisson-Boltzmann surface area (MM-PBSA) methods are analyzed, as are free energy pathway methods, which show promise and may be ready for more extensive testing. Ultimately, major improvements in modeling accuracy will likely require advances on multiple fronts, as well as continued validation against experiment.
Collapse
Affiliation(s)
- Michael K Gilson
- Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, Rockville, Maryland 20850, USA.
| | | |
Collapse
|
30
|
Willett P. Enhancing the Effectiveness of Ligand-Based Virtual Screening Using Data Fusion. ACTA ACUST UNITED AC 2006. [DOI: 10.1002/qsar.200610084] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|