1
|
Bedart C, Simoben CV, Schapira M. Emerging structure-based computational methods to screen the exploding accessible chemical space. Curr Opin Struct Biol 2024; 86:102812. [PMID: 38603987 DOI: 10.1016/j.sbi.2024.102812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 03/15/2024] [Accepted: 03/16/2024] [Indexed: 04/13/2024]
Abstract
Structure-based virtual screening can be a valuable approach to computationally select hit candidates based on their predicted interaction with a protein of interest. The recent explosion in the size of chemical libraries increases the chances of hitting high-quality compounds during virtual screening exercises but also poses new challenges as the number of chemically accessible molecules grows faster than the computing power necessary to screen them. We review here two novel approaches rapidly gaining in popularity to address this problem: machine learning-accelerated and synthon-based library screening. We summarize the results from seminal proof-of-concept studies, highlight the latest developments, and discuss limitations and future directions.
Collapse
Affiliation(s)
- Corentin Bedart
- Univ. Lille, Inserm, CHU Lille, U1286 - INFINITE - Institute for Translational Research in Inflammation, F-59000, Lille, France
| | - Conrad Veranso Simoben
- Structural Genomics Consortium, University of Toronto, 101 College Street, MaRS South Tower, Suite 700, Toronto, Ontario M5G 1L7, Canada
| | - Matthieu Schapira
- Structural Genomics Consortium, University of Toronto, 101 College Street, MaRS South Tower, Suite 700, Toronto, Ontario M5G 1L7, Canada; Department of Pharmacology and Toxicology, University of Toronto, 1 King's College Circle, Toronto, Ontario M5S 1A8, Canada.
| |
Collapse
|
2
|
Jones J, Clark RD, Lawless MS, Miller DW, Waldman M. The AI-driven Drug Design (AIDD) platform: an interactive multi-parameter optimization system integrating molecular evolution with physiologically based pharmacokinetic simulations. J Comput Aided Mol Des 2024; 38:14. [PMID: 38499823 DOI: 10.1007/s10822-024-00552-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 02/13/2024] [Indexed: 03/20/2024]
Abstract
Computer-aided drug design has advanced rapidly in recent years, and multiple instances of in silico designed molecules advancing to the clinic have demonstrated the contribution of this field to medicine. Properly designed and implemented platforms can drastically reduce drug development timelines and costs. While such efforts were initially focused primarily on target affinity/activity, it is now appreciated that other parameters are equally important in the successful development of a drug and its progression to the clinic, including pharmacokinetic properties as well as absorption, distribution, metabolic, excretion and toxicological (ADMET) properties. In the last decade, several programs have been developed that incorporate these properties into the drug design and optimization process and to varying degrees, allowing for multi-parameter optimization. Here, we introduce the Artificial Intelligence-driven Drug Design (AIDD) platform, which automates the drug design process by integrating high-throughput physiologically-based pharmacokinetic simulations (powered by GastroPlus) and ADMET predictions (powered by ADMET Predictor) with an advanced evolutionary algorithm that is quite different than current generative models. AIDD uses these and other estimates in iteratively performing multi-objective optimizations to produce novel molecules that are active and lead-like. Here we describe the AIDD workflow and details of the methodologies involved therein. We use a dataset of triazolopyrimidine inhibitors of the dihydroorotate dehydrogenase from Plasmodium falciparum to illustrate how AIDD generates novel sets of molecules.
Collapse
Affiliation(s)
- Jeremy Jones
- Simulations Plus, Inc., 42505 10th Street West, Lancaster, CA, 93534‑7059, USA.
| | - Robert D Clark
- The Indiana University Luddy School of Informatics, Computing and Engineering, 700 N. Woodlawn Avenue, Bloomington, IN, 47408, USA
| | - Michael S Lawless
- Simulations Plus, Inc., 42505 10th Street West, Lancaster, CA, 93534‑7059, USA
| | - David W Miller
- Simulations Plus, Inc., 42505 10th Street West, Lancaster, CA, 93534‑7059, USA
| | - Marvin Waldman
- Simulations Plus, Inc., 42505 10th Street West, Lancaster, CA, 93534‑7059, USA
| |
Collapse
|
3
|
Ashenden SK. Screening Library Design. Methods Enzymol 2018; 610:73-96. [PMID: 30390806 DOI: 10.1016/bs.mie.2018.09.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
Thanks to technological advances and a greater understanding of the biological and chemical natures of targets and related diseases, high-throughput screening (HTS) has been allowed to be faster, cheaper, and more accessible. Yet, despite these increased technologies and understandings, the frequency of novel and drugs are being approved each year has not being increasing over the years. 2017 was considered a "bumper" year with a total of 46 approved drugs, over double that of the previous year. However, it is thought that part of the problem that HTS has not lived up to expectations is because of the contents of current chemical libraries. Therefore, new methods to design screening libraries are of great interest.
Collapse
Affiliation(s)
- Stephanie Kay Ashenden
- Department of Chemistry, Cambridge University, Cambridge, United Kingdom; Discovery Sciences, IMed Biotech Unit, AstraZeneca R&D, Cambridge, United Kingdom.
| |
Collapse
|
4
|
Allen WJ, Fochtman BC, Balius TE, Rizzo RC. Customizable de novo design strategies for DOCK: Application to HIVgp41 and other therapeutic targets. J Comput Chem 2017; 38:2641-2663. [PMID: 28940386 DOI: 10.1002/jcc.25052] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2017] [Accepted: 08/03/2017] [Indexed: 12/12/2022]
Abstract
De novo design can be used to explore vast areas of chemical space in computational lead discovery. As a complement to virtual screening, from-scratch construction of molecules is not limited to compounds in pre-existing vendor catalogs. Here, we present an iterative fragment growth method, integrated into the program DOCK, in which new molecules are built using rules for allowable connections based on known molecules. The method leverages DOCK's advanced scoring and pruning approaches and users can define very specific criteria in terms of properties or features to customize growth toward a particular region of chemical space. The code was validated using three increasingly difficult classes of calculations: (1) Rebuilding known X-ray ligands taken from 663 complexes using only their component parts (focused libraries), (2) construction of new ligands in 57 drug target sites using a library derived from ∼13M drug-like compounds (generic libraries), and (3) application to a challenging protein-protein interface on the viral drug target HIVgp41. The computational testing confirms that the de novo DOCK routines are robust and working as envisioned, and the compelling results highlight the potential utility for designing new molecules against a wide variety of important protein targets. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- William J Allen
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, 11794
| | - Brian C Fochtman
- Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, New York, 11794
| | - Trent E Balius
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California, 94158
| | - Robert C Rizzo
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, 11794.,Institute of Chemical Biology and Drug Discovery, Stony Brook University, Stony Brook, New York, 11794.,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York, 11794
| |
Collapse
|
5
|
Glaab E. Building a virtual ligand screening pipeline using free software: a survey. Brief Bioinform 2016; 17:352-66. [PMID: 26094053 PMCID: PMC4793892 DOI: 10.1093/bib/bbv037] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2015] [Revised: 05/20/2015] [Indexed: 12/17/2022] Open
Abstract
Virtual screening, the search for bioactive compounds via computational methods, provides a wide range of opportunities to speed up drug development and reduce the associated risks and costs. While virtual screening is already a standard practice in pharmaceutical companies, its applications in preclinical academic research still remain under-exploited, in spite of an increasing availability of dedicated free databases and software tools. In this survey, an overview of recent developments in this field is presented, focusing on free software and data repositories for screening as alternatives to their commercial counterparts, and outlining how available resources can be interlinked into a comprehensive virtual screening pipeline using typical academic computing facilities. Finally, to facilitate the set-up of corresponding pipelines, a downloadable software system is provided, using platform virtualization to integrate pre-installed screening tools and scripts for reproducible application across different operating systems.
Collapse
|
6
|
Wu X, Xia Z, Yang X, Xue C, Lu W. Molecular simulation of pyrroloquinoline quinine-dependent glycerol dehydrogenase inGluconobacter oxydans. MOLECULAR SIMULATION 2012. [DOI: 10.1080/08927022.2012.682281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
7
|
Matter H, Sotriffer C. Applications and Success Stories in Virtual Screening. METHODS AND PRINCIPLES IN MEDICINAL CHEMISTRY 2011. [DOI: 10.1002/9783527633326.ch12] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
|
8
|
Rognan D. Docking Methods for Virtual Screening: Principles and Recent Advances. METHODS AND PRINCIPLES IN MEDICINAL CHEMISTRY 2011. [DOI: 10.1002/9783527633326.ch6] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
|
9
|
Abstract
Fragment-based design has significantly modified drug discovery strategies and paradigms in the last decade. Besides technological advances and novel therapeutic avenues, one of the most significant changes brought by this new discipline has occurred in the minds of drug designers. Fragment-based approaches have markedly impacted rational computer-aided design both in method development and in applications. The present review illustrates the importance of molecular fragments in many aspects of rational ligand design, and discusses how thinking in "fragment space" has boosted computational biology and chemistry.
Collapse
|
10
|
Schnur DM, Beno BR, Tebben AJ, Cavallaro C. Methods for combinatorial and parallel library design. Methods Mol Biol 2011; 672:387-434. [PMID: 20838978 DOI: 10.1007/978-1-60761-839-3_16] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Diversity has historically played a critical role in design of combinatorial libraries, screening sets and corporate collections for lead discovery. Large library design dominated the field in the 1990s with methods ranging anywhere from purely arbitrary through property based reagent selection to product based approaches. In recent years, however, there has been a downward trend in library size. This was due to increased information about the desirable targets gleaned from the genomics revolution and to the ever growing availability of target protein structures from crystallography and homology modeling. Creation of libraries directed toward families of receptors such as GPCRs, kinases, nuclear hormone receptors, proteases, etc., replaced the generation of libraries based primarily on diversity while single target focused library design has remained an important objective. Concurrently, computing grids and cpu clusters have facilitated the development of structure based tools that screen hundreds of thousands of molecules. Smaller "smarter" combinatorial and focused parallel libraries replaced those early un-focused large libraries in the twenty-first century drug design paradigm. While diversity still plays a role in lead discovery, the focus of current library design methods has shifted to receptor based methods, scaffold hopping/bio-isostere searching, and a much needed emphasis on synthetic feasibility. Methods such as "privileged substructures based design" and pharmacophore based design still are important methods for parallel and small combinatorial library design. This chapter discusses some of the possible design methods and presents examples where they are available.
Collapse
Affiliation(s)
- Dora M Schnur
- Computer Aided Drug Design, Pharmaceutical Research Institute, Bristol-Myers Squibb Company, Princeton, NJ, USA
| | | | | | | |
Collapse
|
11
|
Abstract
The drug discovery process mainly relies on the experimental high-throughput screening of huge compound libraries in their pursuit of new active compounds. However, spiraling research and development costs and unimpressive success rates have driven the development of more rational, efficient, and cost-effective methods. With the increasing availability of protein structural information, advancement in computational algorithms, and faster computing resources, in silico docking-based methods are increasingly used to design smaller and focused compound libraries in order to reduce screening efforts and costs and at the same time identify active compounds with a better chance of progressing through the optimization stages. This chapter is a primer on the various docking-based methods developed for the purpose of structure-based library design. Our aim is to elucidate some basic terms related to the docking technique and explain the methodology behind several docking-based library design methods. This chapter also aims to guide the novice computational practitioner by laying out the general steps involved for such an exercise. Selected successful case studies conclude this chapter.
Collapse
Affiliation(s)
- Claudio N Cavasotto
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.
| | | |
Collapse
|
12
|
Pfeffer P, Fober T, Hüllermeier E, Klebe G. GARLig: A Fully Automated Tool for Subset Selection of Large Fragment Spaces via a Self-Adaptive Genetic Algorithm. J Chem Inf Model 2010; 50:1644-59. [DOI: 10.1021/ci9003305] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Patrick Pfeffer
- Department of Pharmaceutical Chemistry, Philipps-University, Marbacher Weg 6, 35032 Marburg, Germany, and, Department of Mathematics and Computer Science, Philipps-University, Hans-Meerwein-Strasse, 35032 Marburg, Germany
| | - Thomas Fober
- Department of Pharmaceutical Chemistry, Philipps-University, Marbacher Weg 6, 35032 Marburg, Germany, and, Department of Mathematics and Computer Science, Philipps-University, Hans-Meerwein-Strasse, 35032 Marburg, Germany
| | - Eyke Hüllermeier
- Department of Pharmaceutical Chemistry, Philipps-University, Marbacher Weg 6, 35032 Marburg, Germany, and, Department of Mathematics and Computer Science, Philipps-University, Hans-Meerwein-Strasse, 35032 Marburg, Germany
| | - Gerhard Klebe
- Department of Pharmaceutical Chemistry, Philipps-University, Marbacher Weg 6, 35032 Marburg, Germany, and, Department of Mathematics and Computer Science, Philipps-University, Hans-Meerwein-Strasse, 35032 Marburg, Germany
| |
Collapse
|
13
|
Structure-guided expansion of kinase fragment libraries driven by support vector machine models. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2010; 1804:642-52. [DOI: 10.1016/j.bbapap.2009.12.002] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2009] [Revised: 11/25/2009] [Accepted: 12/02/2009] [Indexed: 01/30/2023]
|
14
|
Kim HJ, Choi MY, Kim HJ, Llinás M. Conformational dynamics and ligand binding in the multi-domain protein PDC109. PLoS One 2010; 5:e9180. [PMID: 20174627 PMCID: PMC2823774 DOI: 10.1371/journal.pone.0009180] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2009] [Accepted: 01/09/2010] [Indexed: 12/03/2022] Open
Abstract
PDC109 is a modular multi-domain protein with two fibronectin type II (Fn2) repeats joined by a linker. It plays a major role in bull sperm binding to the oviductal epithelium through its interactions with phosphorylcholines (PhCs), a head group of sperm cell membrane lipids. The crystal structure of the PDC109-PhC complex shows that each PhC binds to the corresponding Fn2 domain, while the two domains are on the same face of the protein. Long timescale explicit solvent molecular dynamics (MD) simulations of PDC109, in the presence and absence of PhC, suggest that PhC binding strongly correlates with the relative orientation of choline-phospholipid binding sites of the two Fn2 domains; unless the two domains tightly bind PhCs, they tend to change their relative orientation by deforming the flexible linker. The effective PDC109-PhC association constant of 28 M(-1), estimated from their potential of mean force is consistent with the experimental result. Principal component analysis of the long timescale MD simulations was compared to the significantly less expensive normal mode analysis of minimized structures. The comparison indicates that difference between relative domain motions of PDC109 with bound and unbound PhC is captured by the first principal component in the principal component analysis as well as the three lowest normal modes in the normal mode analysis. The present study illustrates the use of detailed MD simulations to clarify the energetics of specific ligand-domain interactions revealed by a static crystallographic model, as well as their influence on relative domain motions in a multi-domain protein.
Collapse
Affiliation(s)
- Hyun Jin Kim
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Moo Young Choi
- Department of Physics and Center for Theoretical Physics, Seoul National University, Seoul, Korea
| | - Hyung J. Kim
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
- School of Computational Sciences, Korea Institute for Advanced Study, Seoul, Korea
| | - Miguel Llinás
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
15
|
Zhou JZ, Shi S, Na J, Peng Z, Thacher T. Combinatorial library-based design with Basis Products. J Comput Aided Mol Des 2009; 23:725-36. [DOI: 10.1007/s10822-009-9297-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2009] [Accepted: 06/26/2009] [Indexed: 10/20/2022]
|
16
|
Nicola G, Abagyan R. Structure-based approaches to antibiotic drug discovery. ACTA ACUST UNITED AC 2009; Chapter 17:Unit17.2. [PMID: 19235149 DOI: 10.1002/9780471729259.mc1702s12] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The development of antimicrobials has advanced tremendously over the past century. However, as our production capacity increases, the threat of resistance is ever-present. To combat this resistance, two main avenues of drug discovery are being pursued: identifying new microbial proteins for which to direct drug discovery efforts, and designing innovative drugs that target existing proteins. The advent of structural genomics research has advanced to the point of rapidly discovering novel microbial protein targets. In addition, modern tools of computational biology greatly enhance the speed and reliability of antimicrobial discovery. The various steps of this process are outlined and discussed, including virtual ligand screening, pocket identification, and compound optimization.
Collapse
Affiliation(s)
- George Nicola
- Department of Molecular Biology, The Scripps Research Institute, La Jolla, California, USA
| | | |
Collapse
|
17
|
Deanda F, Stewart EL, Reno MJ, Drewry DH. Kinase-Targeted Library Design through the Application of the PharmPrint Methodology. J Chem Inf Model 2008; 48:2395-403. [DOI: 10.1021/ci800276t] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Felix Deanda
- GlaxoSmithKline, Five Moore Drive, Research Triangle Park, North Carolina 27709
| | - Eugene L. Stewart
- GlaxoSmithKline, Five Moore Drive, Research Triangle Park, North Carolina 27709
| | - Michael J. Reno
- GlaxoSmithKline, Five Moore Drive, Research Triangle Park, North Carolina 27709
| | - David H. Drewry
- GlaxoSmithKline, Five Moore Drive, Research Triangle Park, North Carolina 27709
| |
Collapse
|
18
|
Koska J, Spassov VZ, Maynard AJ, Yan L, Austin N, Flook PK, Venkatachalam CM. Fully automated molecular mechanics based induced fit protein-ligand docking method. J Chem Inf Model 2008; 48:1965-73. [PMID: 18816046 DOI: 10.1021/ci800081s] [Citation(s) in RCA: 131] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
We describe a method for docking a ligand into a protein receptor while allowing flexibility of the protein binding site. The method employs a multistep procedure that begins with the generation of protein and ligand conformations. An initial placement of the ligand is then performed by computing binding site hotspots. This initial placement is followed by a protein side-chain refinement stage that models protein flexibility. The final step of the process is an energy minimization of the ligand pose in the presence of the rigid receptor. Thus the algorithm models flexibility of the protein at two stages, before and after ligand placement. We validated this method by performing docking and cross docking studies of eight protein systems for which crystal structures were available for at least two bound ligands. The resulting rmsd values of the 21 docked protein-ligand complexes showed values of 2 A or less for all but one of the systems examined. The method has two critical benefits for high throughput virtual screening studies. First, no user intervention is required in the docking once the initial binding site selection has been made in the protein. Second, the initial protein conformation generation needs to be performed only once for a given binding region. Also, the method may be customized in various ways depending on the particular scenario in which dockings are being performed. Each of the individual steps of the method is fully independent making it straightforward to explore different variants of the high level workflow to further improve accuracy and performance.
Collapse
Affiliation(s)
- Jürgen Koska
- Accelrys Inc., 10188 Telesis Court, San Diego, CA 92121, USA.
| | | | | | | | | | | | | |
Collapse
|
19
|
Zhou JZ. Structure-directed combinatorial library design. Curr Opin Chem Biol 2008; 12:379-85. [PMID: 18328830 DOI: 10.1016/j.cbpa.2008.02.007] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2008] [Accepted: 02/11/2008] [Indexed: 11/30/2022]
Abstract
In recent years pharmaceutical companies have utilized structure-based drug design and combinatorial library design techniques to speed up their drug discovery efforts. Both approaches are routinely used in the lead discovery and lead optimization stages of the drug discovery process. Fragment-based drug design, a new power tool in the drug design toolbox, is also gaining acceptance across the pharmaceutical industry. This review will focus on the interplay between these three design techniques and recent developments in computational methodologies that enhance their integration. Examples of successful synergistic applications of these three techniques will be highlighted. Opinion regarding possible future directions of the field will be given.
Collapse
Affiliation(s)
- Joe Zhongxiang Zhou
- Department of Structural and Computational Biology, Pfizer Global Research & Development, La Jolla Laboratories, San Diego, CA 92121, USA.
| |
Collapse
|
20
|
|
21
|
Schellhammer I, Rarey M. TrixX: structure-based molecule indexing for large-scale virtual screening in sublinear time. J Comput Aided Mol Des 2007; 21:223-38. [PMID: 17294247 DOI: 10.1007/s10822-007-9103-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2006] [Accepted: 01/05/2007] [Indexed: 11/25/2022]
Abstract
Structure-based virtual screening today is basically organized as a sequential process where the molecules of a screening library are evaluated for instance with respect to their fit with a biological target. In this paper, we present a novel structure-based screening paradigm avoiding sequential searching and therefore enabling sublinear runtime behavior. We implemented the novel paradigm in the virtual screening tool TrixX and successfully applied it in screening experiments on four targets from relevant therapeutic areas. With the screening paradigm implemented in TrixX, we propose some important extensions and modifications to traditional virtual screening approaches: Instead of processing all compounds in the screening library sequentially, TrixX first analyzes the geometric and physicochemical binding site characteristics and then draws compounds with matching features from a compound catalog. The catalog organizes the compounds by their physicochemical and geometric features making use of relational database technology with indexed tables in order to support efficient queries for compounds with specific features. A key element of the compound catalog is a highly selective geometric descriptor that carries information on the type of functional groups of the compound, their Euclidian distance, the preferred interaction direction of each functional group, and the location of steric bulk around the triangle. In a re-docking experiment with 200 protein-ligand complexes, we could show that TrixX is able to correctly predict the location of ligand functional groups in co-crystallized complexes. In a retrospective virtual screening experiment for four different targets, the enrichment factors of TrixX are comparable to the enrichment factors of FlexX and FlexX-Scan. With computing times clearly below one second per compound, TrixX counts among the fastest virtual screening tools currently available and is nearly two orders of magnitude faster than standard FlexX.
Collapse
Affiliation(s)
- Ingo Schellhammer
- Center for Bioinformatics, Research Group for Computational Molecular Design, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany
| | | |
Collapse
|
22
|
Lewis RA, Pickett SD, Clark DE. Computer-Aided Molecular Diversity Analysis and Combinatorial Library Design. REVIEWS IN COMPUTATIONAL CHEMISTRY 2007. [DOI: 10.1002/9780470125939.ch1] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
|
23
|
Olgen S. Comparison of some 3-(substituted-benzylidene)-1, 3-dihydro-indolin derivatives as ligands of tyrosine kinase based on binding mode studies and biological assay. Arch Pharm Res 2006; 29:1006-17. [PMID: 17146970 DOI: 10.1007/bf02969285] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
A series of 3-(substituted-benylidene)-1, 3-dihydro- indolin-2-one, 3-(substituted-benylidene)-1, 3-dihydro- indolin-2-thione and 2, 2'-dithiobis 3-(substituted-benylidene)-1, 3-dihydro-indole derivatives was investigated as inhibitor of p60c-Src tyrosine kinase by performing receptor docking studies and inhibitory activity toward tyrosine phosphorylation. Some compounds were shown to be docked at the site, where the selective inhibitor PP1 [1-tert-Butyl-3-p-tolyl-1H-pyrazolo[3,4-d]pyrimidine-4-yl-amine] was embedded at the enzyme active site. Evaluation of all compounds for the interactions with the parameters of lowest binding energy levels, capability of hydrogen bond formations and superimposibility on enzyme active site by docking studies, it can be assumed that 3-(substituted- benzylidene)-1, 3-dihydro-indolin-2-one and thione derivatives have better interaction with enzyme active site then 2, 2'-dithiobis 3-(substituted-benzylidene)-1, 3-dihydro indole derivatives. The test results for the inhibitory activity against tyrosine kinase by Elisa method revealed that 3-(substituted-benylidene)-1, 3-dihydro- indolin-2-thione derivatives have more activity then 3-(substituted-benylidene)-1, 3-dihydro- indolin-2-one derivatives.
Collapse
Affiliation(s)
- Süreyya Olgen
- Department of Pharmaceutical Chemistry Faculty of Pharmacy, University of Ankara, 06100 Tandodan, Ankara, Turkey.
| |
Collapse
|
24
|
Herz T, Wolf K, Kraus J, Kramer B. 4SCan/vADME: intelligent library screening as a shortcut from hits to lead compounds. Expert Opin Drug Metab Toxicol 2006; 2:471-84. [PMID: 16863447 DOI: 10.1517/17425255.2.3.471] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Managing to solve the first step in drug discovery - the hit finding - can be a quite elaborate task, but it is only the initial step to the final goal; hit-to-lead optimisation still lies ahead and consumes even more time and resources. The solution is rather simple, that is, to take only the most promising compounds into account; but who is going to decide which ones are the most promising among a list of tens of millions of compounds in a virtual combinatorial library? 4SCan/vADME helps by bridging the gap between virtual (combinatorial) libraries designed by chemists and the in silico methods, docking and alignment, for screening databases. After choosing a random starting set, the implemented learning and prediction algorithm iteratively considers only combinations of fragments that have shown to result in more suitable interactions by the chosen method. ADME properties of the final list are then calculated via several in silico methods, resulting in a combined evaluation of the individual compound's target-specific, as well as ADME, properties. Based on the latter list of evaluated compounds, medicinal chemists can then decide which compounds might be the best ones to synthesise first and to serve as possible lead candidates. Following a brief introduction to virtual high-throughput screening techniques, the 4SCan/vADME method is outlined and discussed in this paper, using an example coming out of the 4SC pipeline.
Collapse
Affiliation(s)
- Thomas Herz
- 4SC AG, Chem & Bioinformatics, Am Klopferspitz 19a, D-82152 Martinsried, Germany.
| | | | | | | |
Collapse
|
25
|
Yoon S, Smellie A, Hartsough D, Filikov A. Surrogate docking: structure-based virtual screening at high throughput speed. J Comput Aided Mol Des 2005; 19:483-97. [PMID: 16292613 DOI: 10.1007/s10822-005-9002-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2005] [Accepted: 07/06/2005] [Indexed: 11/25/2022]
Abstract
Structure-based screening using fully flexible docking is still too slow for large molecular libraries. High quality docking of a million molecule library can take days even on a cluster with hundreds of CPUs. This performance issue prohibits the use of fully flexible docking in the design of large combinatorial libraries. We have developed a fast structure-based screening method, which utilizes docking of a limited number of compounds to build a 2D QSAR model used to rapidly score the rest of the database. We compare here a model based on radial basis functions and a Bayesian categorization model. The number of compounds that need to be actually docked depends on the number of docking hits found. In our case studies reasonable quality models are built after docking of the number of molecules containing approximately 50 docking hits. The rest of the library is screened by the QSAR model. Optionally a fraction of the QSAR-prioritized library can be docked in order to find the true docking hits. The quality of the model only depends on the training set size - not on the size of the library to be screened. Therefore, for larger libraries the method yields higher gain in speed no change in performance. Prioritizing a large library with these models provides a significant enrichment with docking hits: it attains the values of approximately 13 and approximately 35 at the beginning of the score-sorted libraries in our two case studies: screening of the NCI collection and a combinatorial libraries on CDK2 kinase structure. With such enrichments, only a fraction of the database must actually be docked to find many of the true hits. The throughput of the method allows its use in screening of large compound collections and in the design of large combinatorial libraries. The strategy proposed has an important effect on efficiency but does not affect retrieval of actives, the latter being determined by the quality of the docking method itself.
Collapse
Affiliation(s)
- Sukjoon Yoon
- ArQule, Inc, 19 Presidential way, Woburn, MA, 01801, USA
| | | | | | | |
Collapse
|
26
|
Abstract
We present a new software module, FlexX-Scan, for high-throughput, structure-based virtual screening. FlexX-Scan was developed with the aim to further speed up the virtual screening process. Based on the incremental construction docking tool FlexX (Rarey et al., J Mol Biol 1996;261:470-489), a compact descriptor for representing favorable protein interaction spots within the protein binding site has been developed. The descriptor is calculated using special-purpose clustering techniques applied to the usual interaction points created by FlexX. The algorithm automatically detects a small set of interaction spots in the binding site for positioning ligand functional groups. The parametrizations of the base placement and incremental construction algorithms have been adapted to the new interaction model. We tested the software tool on a diverse set of 200 protein-ligand complexes from the protein database (PDB) (Kramer et al., Proteins 1999;37:228-241). On average, the algorithm proposes about 90 interaction spots per binding site compared to about 1000 interaction dots in FlexX. We observe that the docking solutions of FlexX-Scan have a root-mean-square deviation from the crystal structure similar to the deviation of docking solutions of standard FlexX. For further validation we also performed virtual screening experiments for cyclin-dependent kinase 2, thrombin, angiotensin-converting enzyme, and dihydrofolat reductase. In these experiments, we screened a set of 34,000 random compounds and a number of known actives for each target. With FlexX-Scan, we achieved comparable enrichments to standard FlexX, with an averaged computing time of 5-10 s per compound, depending on parametrization.
Collapse
Affiliation(s)
- Ingo Schellhammer
- University of Hamburg, Center for Bioinformatics (ZBH), Research Group for Computational Molecular Design, Hamburg, Germany
| | | |
Collapse
|
27
|
Nikitin S, Zaitseva N, Demina O, Solovieva V, Mazin E, Mikhalev S, Smolov M, Rubinov A, Vlasov P, Lepikhin D, Khachko D, Fokin V, Queen C, Zosimov V. A very large diversity space of synthetically accessible compounds for use with drug design programs. J Comput Aided Mol Des 2005; 19:47-63. [PMID: 16059666 DOI: 10.1007/s10822-005-0097-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2004] [Accepted: 01/03/2005] [Indexed: 10/25/2022]
Abstract
We have constructed a very large virtual diversity space containing more than 10(13) chemical compounds. The diversity space is built from about 400 combinatorial libraries, which have been expanded by choosing sizeable collections of suitable R-groups that can be attached to each link point of their scaffolds. These R-group collections have been created by selecting reagents that have drug-like properties from catalogs of available chemicals. As members of known combinatorial libraries, the compounds in the diversity space are in general synthetically accessible and useful as potential drug leads. Hence, the diversity space can be used as a vast source of compounds by a de novo drug design program. For example, we have used such a program to generate inhibitors of HIV integrase enzyme that exhibited activity in the micromolar range.
Collapse
Affiliation(s)
- Sergey Nikitin
- Algodign LLC, Bolshaya Sadovaya 8, Moscow 103001, Russia.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Chema D, Eren D, Yayon A, Goldblum A, Zaliani A. Identifying the binding mode of a molecular scaffold. J Comput Aided Mol Des 2004; 18:23-40. [PMID: 15143801 DOI: 10.1023/b:jcam.0000022561.76694.5b] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We describe a method for docking of a scaffold-based series and present its advantages over docking of individual ligands, for determining the binding mode of a molecular scaffold in a binding site. The method has been applied to eight different scaffolds of protein kinase inhibitors (PKI). A single analog of each of these eight scaffolds was previously crystallized with different protein kinases. We have used FlexX to dock a set of molecules that share the same scaffold, rather than docking a single molecule. The main mode of binding is determined by the mode of binding of the largest cluster among the docked molecules that share a scaffold. Clustering is based on our 'nearest single neighbor' method [J. Chem. Inf. Comput. Sci., 43 (2003) 208-217]. Additional criteria are applied in those cases in which more than one significant binding mode is found. Using the proposed method, most of the crystallographic binding modes of these scaffolds were reconstructed. Alternative modes, that have not been detected yet by experiments, could also be identified. The method was applied to predict the binding mode of an additional molecular scaffold that was not yet reported and the predicted binding mode has been found to be very similar to experimental results for a closely related scaffold. We suggest that this approach be used as a virtual screening tool for scaffold-based design processes.
Collapse
Affiliation(s)
- Doron Chema
- Department of Medicinal Chemistry, David R. Bloom Center for Pharmacy, School of Pharmacy, Hebrew University of Jerusalem 91120, Israel
| | | | | | | | | |
Collapse
|
29
|
Abstract
Virtual screening of virtual libraries (VSVL) is a rapidly changing area of research. Great efforts are being made to produce better algorithms, selection methods and infrastructure. Yet, the number of successful examples in the literature is not impressive, although the quality of work certainly is high. Why is this? One reason is that these methods tend to be applied at the lead generation stage and therefore there is a large lead-time before successful examples appear in the literature. However, any computational chemist would confirm that these methods are successful and there exists a glut of start-up companies specialising in virtual screening. Moreover, the scientific community would not be focussing so much attention on this area if it were not yielding results. Even so, the paucity of literature data is certainly a hindrance to the development of better methods. The VSVL process is unique within the discovery process, in that it is the only method that can screen the > 10(30) genuinely novel molecules out there. Already, some VSVL methods are evaluating 10(13) compounds, a capacity that high throughput screening can only dream of. There is a huge potential advantage for the company that develops efficient and effective methods, for lead generation, lead hopping and optimization of both potency and ADME properties. To do this, it requires more than the software, it requires confidence to exploit the methodology, to commit synthesis on the basis of it, and to build this approach into the medicinal chemistry strategy. It is a fact that these tools remain quite daunting for the majority of scientists working at the bench. The routine use of these methods is not simply a matter of education and training. Integration of these methods into accessible and robust end user software, without dilution of the science, must be a priority. We have reached a coincidence, where several technologies have the required level of maturity predictive computational chemistry methods, algorithms that manage the combinatorial explosion, high throughput crystallography and ADME measurements and the massive increase in computational horsepower from distributed computing. The author is confident that the synergy of these technologies will bring great benefit to the industry, with more efficient production of higher quality clinical candidates. The future is bright. The future is virtual!
Collapse
Affiliation(s)
- Darren V S Green
- GlaxoSmithKline, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, U.K
| |
Collapse
|
30
|
van Drie JH, Rohrer DC, Blinn JR, Gao H. Structure-based design of combinatorial libraries. EXS 2003:203-21. [PMID: 12613178 DOI: 10.1007/978-3-0348-7997-2_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Affiliation(s)
- John H van Drie
- Vertex Pharmaceuticals, 130 Waverly St, Cambridge, MA 02139, USA
| | | | | | | |
Collapse
|
31
|
Affiliation(s)
- Timothy R Geistlinger
- Departments of Pharmaceutical Chemistry and Cellular and Molecular Pharmacology, University of California San Francisco, California 94143-0446, USA
| | | |
Collapse
|
32
|
Venkatachalam CM, Jiang X, Oldfield T, Waldman M. LigandFit: a novel method for the shape-directed rapid docking of ligands to protein active sites. J Mol Graph Model 2003; 21:289-307. [PMID: 12479928 DOI: 10.1016/s1093-3263(02)00164-x] [Citation(s) in RCA: 674] [Impact Index Per Article: 32.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We present a new shape-based method, LigandFit, for accurately docking ligands into protein active sites. The method employs a cavity detection algorithm for detecting invaginations in the protein as candidate active site regions. A shape comparison filter is combined with a Monte Carlo conformational search for generating ligand poses consistent with the active site shape. Candidate poses are minimized in the context of the active site using a grid-based method for evaluating protein-ligand interaction energies. Errors arising from grid interpolation are dramatically reduced using a new non-linear interpolation scheme. Results are presented for 19 diverse protein-ligand complexes. The method appears quite promising, reproducing the X-ray structure ligand pose within an RMS of 2A in 14 out of the 19 complexes. A high-throughput screening study applied to the thymidine kinase receptor is also presented in which LigandFit, when combined with LigScore, an internally developed scoring function, yields very good hit rates for a ligand pool seeded with known actives.
Collapse
|
33
|
Pitts WJ, Guo J, Dhar TGM, Shen Z, Gu HH, Watterson SH, Bednarz MS, Chen BC, Barrish JC, Bassolino D, Cheney D, Fleener CA, Rouleau KA, Hollenbaugh DL, Iwanowicz EJ. Rapid synthesis of triazine inhibitors of inosine monophosphate dehydrogenase. Bioorg Med Chem Lett 2002; 12:2137-40. [PMID: 12127522 DOI: 10.1016/s0960-894x(02)00351-7] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
A series of novel triazine-based small molecule inhibitors (IV) of inosine monophosphate dehydrogenase was prepared. The synthesis and the structure-activity relationships (SAR) derived from in vitro studies are described.
Collapse
Affiliation(s)
- William J Pitts
- Bristol-Myers Squibb Pharmaceutical Research Institute, Princeton, NJ 08543-4000, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
Rational design of small focused libraries that are biased toward specific therapeutic targets is currently at the forefront of combinatorial library design. Various structure-based design strategies can be implemented in focused library design when the 3D structure of the target is available through X-ray or NMR determination. This review discusses the major methods and programs specifically developed for the purpose of designing combinatorial libraries under the constraint of the binding site of a biological target, with emphasis on their advantages and disadvantages. Examples of the successful application of these methodologies are highlighted, demonstrating their performances within the practical drug discovery process.
Collapse
Affiliation(s)
- Mary Pat Beavers
- Computer Assisted Drug Discovery, R.W. Johnson Pharmaceutical Research Institute, Raritan, NJ 08869, USA
| | | |
Collapse
|
35
|
Agrafiotis DK, Lobanov VS, Salemme FR. Combinatorial informatics in the post-genomics ERA. Nat Rev Drug Discov 2002; 1:337-46. [PMID: 12120409 DOI: 10.1038/nrd791] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The multitude of potential drug targets emerging from genome sequencing demands new approaches to drug discovery. A chemogenomics strategy, which involves the generation of small-molecule compounds that can be used both as tools to probe biological mechanisms and as leads for drug-property optimization, provides a highly parallel, industrialized solution. Key to the success of this strategy is an integrated suite of chemi-informatics applications that can allow the rapid and directed optimization of chemical compounds with drug-like properties using 'just-in-time' combinatorial chemical synthesis. An effective embodiment of this process requires new computational and data-mining tools that cover all aspects of library generation, compound selection and experimental design, and work effectively on a massive scale.
Collapse
Affiliation(s)
- Dimitris K Agrafiotis
- 3-Dimensional Pharmaceuticals, Inc., 665 Stockton Drive, Exton, Pennsylvania 19341, USA.
| | | | | |
Collapse
|
36
|
Abstract
Pharmacogenomics requires the integration and analysis of genomic, molecular, cellular, and clinical data, and it thus offers a remarkable set of challenges to biomedical informatics. These include infrastructural challenges such as the creation of data models and databases for storing these data, the integration of these data with external databases, the extraction of information from natural language text, and the protection of databases with sensitive information. There are also scientific challenges in creating tools to support gene expression analysis, three-dimensional structural analysis, and comparative genomic analysis. In this review, we summarize the current uses of informatics within pharmacogenomics and show how the technical challenges that remain for biomedical informatics are typical of those that will be confronted in the postgenomic era.
Collapse
Affiliation(s)
- Russ B Altman
- Stanford Medical Informatics, Stanford, California 94305-5479, USA.
| | | |
Collapse
|
37
|
Abstract
Recent improvements in flexible docking technology may lead to a bigger role for computational methods in lead discovery. Although fast and accurate computational prediction of binding affinities for an arbitrary molecule is still beyond the limits of current methods, the docking and screening procedures can select small sets of likely lead candidates from large libraries of either commercially or synthetically available compounds.
Collapse
Affiliation(s)
- R Abagyan
- Department of Molecular Biology, The Scripps Research Institute, 10550 North Torrey Pines, TCP-28, La Jolla, CA 92037, USA.
| | | |
Collapse
|
38
|
Lamb ML, Burdick KW, Toba S, Young MM, Skillman AG, Zou X, Arnold JR, Kuntz ID. Design, docking, and evaluation of multiple libraries against multiple targets. Proteins 2001; 42:296-318. [PMID: 11151003 DOI: 10.1002/1097-0134(20010215)42:3<296::aid-prot20>3.0.co;2-f] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
We present a general approach to the design, docking, and virtual screening of multiple combinatorial libraries against a family of proteins. The method consists of three main stages: docking the scaffold, selecting the best substituents at each site of diversity, and comparing the resultant molecules within and between the libraries. The core "divide-and-conquer" algorithm for side-chain selection, developed from an earlier version (Sun et al., J Comp Aided Mol Design 1998;12:597-604), provides a way to explore large lists of substituents with linear rather than combinatorial time dependence. We have applied our method to three combinatorial libraries and three serine proteases: trypsin, chymotrypsin, and elastase. We show that the scaffold docking procedure, in conjunction with a novel vector-based orientation filter, reproduces crystallographic binding modes. In addition, the free-energy-based scoring procedure (Zou et al., J Am Chem Soc 1999;121:8033-8043) is able to reproduce experimental binding data for P1 mutants of macromolecular protease inhibitors. Finally, we show that our method discriminates between a peptide library and virtual libraries built on benzodiazepine and tetrahydroisoquinolinone scaffolds. Implications of the docking results for library design are explored.
Collapse
Affiliation(s)
- M L Lamb
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, California, USA
| | | | | | | | | | | | | | | |
Collapse
|
39
|
Monte Carlo simulations of HIV-1 protease binding dynamics and thermodynamics with ensembles of protein conformations: Incorporating protein flexibility in deciphering mechanisms of molecular recognition. ACTA ACUST UNITED AC 2001. [DOI: 10.1016/s1380-7323(01)80009-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
|
40
|
Leach AR, Bryce RA, Robinson AJ. Synergy between combinatorial chemistry and de novo design. J Mol Graph Model 2000; 18:358-67, 526. [PMID: 11143555 DOI: 10.1016/s1093-3263(00)00062-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Traditional de novo design algorithms are able to generate many thousands of ligand structures that meet the constraints of a protein structure, but these structures are often not synthetically tractable. In this article, we describe how concepts from structure-based de novo design can be used to explore the search space in library design. A key feature of the approach is the requirement that specific templates are included within the designed structures. Each template corresponds to the "central core" of a combinatorial library. The template is positioned within an acyclic chain whose length and bond orders are systematically varied, and the conformational space of each structure that results (core plus chain) is explored to determine whether it is able to link together two or more strongly interacting functional groups or pharmacophores located within a protein binding site. This fragment connection algorithm provides "generic" 3D molecules in the sense that the linking part (minus the template) is built from an all-carbon chain whose synthesis may not be easily achieved. Thus, in the second phase, 2D queries are derived from the molecular skeletons and used to identify possible reagents from a database. Each potential reagent is checked to ensure that it is compatible with the conformation of its parent 3D conformation and the constraints of the binding site. Combinations of these reagents according to the combinatorial library reaction scheme give product molecules that contain the desired core template and the key functional/pharmacophoric groups, and would be able to adopt a conformation compatible with the original molecular skeleton without any unfavorable intermolecular or intramolecular interactions. We discuss how this strategy compares with and relates to alternative approaches to both structure-based library design and de novo design.
Collapse
Affiliation(s)
- A R Leach
- Glaxo Wellcome Medicines Research Centre, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, United Kingdom.
| | | | | |
Collapse
|
41
|
Mason JS, Beno BR. Library design using BCUT chemistry-space descriptors and multiple four-point pharmacophore fingerprints: simultaneous optimization and structure-based diversity. J Mol Graph Model 2000; 18:438-51, 538. [PMID: 11143561 DOI: 10.1016/s1093-3263(00)00073-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
New applications of fingerprints of multiple potential 4-point three-dimensional (3D) pharmacophores in combinatorial library design and virtual screening are presented. Preliminary results demonstrating the feasibility of a simulated annealing process for combinatorial reagent selection that concurrently optimizes product diversity in BCUT chemistry space and in terms of unique 4-point pharmacophores are discussed, and the advantage of using a customized chemistry-space derived for the library design is demonstrated. In addition, an extension to the multiple pharmacophore method for structure-based design that uses the shape of the target site as an additional constraint is presented. This development enables the docking process to be quantified in terms of the number and identities of the pharmacophoric hypotheses that can be matched by a compound or a library of compounds. The design of an example combinatorial library based on the Ugi condensation reaction and a serine protease active site is described.
Collapse
Affiliation(s)
- J S Mason
- Department of Macromolecular Structure and Biopharmaceuticals, Bristol-Myers Squibb Pharmaceutical Research Institute, P.O. Box 4000, Princeton, NJ 08543, USA.
| | | |
Collapse
|
42
|
Böhm HJ, Stahl M. Structure-based library design: molecular modelling merges with combinatorial chemistry. Curr Opin Chem Biol 2000; 4:283-6. [PMID: 10826972 DOI: 10.1016/s1367-5931(00)00090-9] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Recent advances in both computational and experimental techniques now allow a very fruitful interplay of computational and combinatorial chemistry in the structure-based design of combinatorial libraries.
Collapse
Affiliation(s)
- H J Böhm
- Pharmaceuticals Division, Hoffmann-La Roche Ltd, Basel, CH 4070, Switzerland.
| | | |
Collapse
|
43
|
Verkhivker GM, Rejto PA, Bouzida D, Arthurs S, Colson AB, Freer ST, Gehlhaar DK, Larson V, Luty BA, Marrone T, Rose PW. Towards understanding the mechanisms of molecular recognition by computer simulations of ligand-protein interactions. J Mol Recognit 1999; 12:371-89. [PMID: 10611647 DOI: 10.1002/(sici)1099-1352(199911/12)12:6<371::aid-jmr479>3.0.co;2-o] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The thermodynamic and kinetic aspects of molecular recognition for the methotrexate (MTX)-dihydrofolate reductase (DHFR) ligand-protein system are investigated by the binding energy landscape approach. The impact of 'hot' and 'cold' errors in ligand mutations on the thermodynamic stability of the native MTX-DHFR complex is analyzed, and relationships between the molecular recognition mechanism and the degree of ligand optimization are discussed. The nature and relative stability of intermediates and thermodynamic phases on the ligand-protein association pathway are studied, providing new insights into connections between protein folding and molecular recognition mechanisms, and cooperativity of ligand-protein binding. The results of kinetic docking simulations are rationalized based on the thermodynamic properties determined from equilibrium simulations and the shape of the underlying binding energy landscape. We show how evolutionary ligand selection for a receptor active site can produce well-optimized ligand-protein systems such as MTX-DHFR complex with the thermodynamically stable native structure and a direct transition mechanism of binding from unbound conformations to the unique native structure.
Collapse
Affiliation(s)
- G M Verkhivker
- Agouron Pharmaceuticals Inc., 3301 North Torrey Pines Court, La Jolla, CA 92037-1022, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Makino S, Ewing TJ, Kuntz ID. DREAM++: flexible docking program for virtual combinatorial libraries. J Comput Aided Mol Des 1999; 13:513-32. [PMID: 10483532 DOI: 10.1023/a:1008066310669] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We present a set of programs, DREAM+2 (Docking and Reaction programs using Efficient seArch Methods written in C++), for docking computationally generated ligands into macromolecular binding sites. DREAM++ is composed of three programs: ORIENT++, REACT++ and SEARCH++. The program ORIENT++ positions molecules in a binding site with the DOCK algorithm. Its output can be used as input to REACT++ and SEARCH+2. The program REACT++ performs user-specific chemical reactions on a docked molecule, so that reaction products can be evaluated for three dimensional complementarity with the macromolecular site. The program SEARCH++ performs an efficient conformation search on the reaction products using a hybrid backtrack and incremental construction algorithm. We have applied the programs to HIV protease-inhibitor complexes as test systems. We found that we can differentiate high-affinity ligands based on several measures: interaction energies, occupancy of protein subsites and the number of successfully docked conformations for each product. Encouraged by the results in the test case, we applied the programs to propose novel inhibitors of HIV protease. These inhibitors can be generated by organic reactions using commercially available reagents. They are alternatives to the inhibitors synthesized by Glaxo.
Collapse
Affiliation(s)
- S Makino
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco 94143-0446, USA
| | | | | |
Collapse
|
45
|
|