1
|
Buehler Y, Reymond JL. A View on Molecular Complexity from the GDB Chemical Space. J Chem Inf Model 2025. [PMID: 40372811 DOI: 10.1021/acs.jcim.5c00334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/17/2025]
Abstract
One recurring question when choosing which molecules to select for investigation is that of molecular complexity: is there a price to pay for complexity in terms of synthesis difficulty, and does complexity have anything to do with biological properties? In the chemical space of small organic molecules enumerated from mathematical graphs in the GDBs (Generated DataBases), most compounds are too complex and challenging for synthesis despite containing only standard functional groups and ring types. For these GDB molecules, we find that an increasing fraction (MC1) or number (MC2) of non-divalent nodes in the molecular graph represent simple measures of molecular complexity, which we interpret in terms of potential synthesis difficulties. We also show that MC1 and MC2 are applicable to commercial screening compounds (ZINC), bioactive molecules (ChEMBL) and natural products (COCONUT) and compare them with previously reported measures of molecular complexity and synthetic accessibility.
Collapse
Affiliation(s)
- Ye Buehler
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Freiestrasse 3, 3012 Bern, Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry, Biochemistry and Pharmaceutical Sciences, University of Bern, Freiestrasse 3, 3012 Bern, Switzerland
| |
Collapse
|
2
|
Purohit D, Saini V, Kumar S, Kumar A, Narasimhan B. Three-dimensional Quantitative Structure-activity Relationship (3DQSAR) and Molecular Docking Study of 2-((pyridin-3-yloxy)methyl) Piperazines as α7 Nicotinic Acetylcholine Receptor Modulators for the Treatment of Inflammatory Disorders. Mini Rev Med Chem 2019; 20:1031-1041. [PMID: 31483229 DOI: 10.2174/1389557519666190904151227] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Revised: 04/27/2019] [Accepted: 05/25/2019] [Indexed: 11/22/2022]
Abstract
BACKGROUND & OBJECTIVE Comparative molecular field analysis (CoMFA) of 27 analogues of 2-((pyridin-3-yloxy)methyl)piperazine derivatives was carried out using software Tripos SYBYL X. Optimal r2 (0.854) and q2 (0.541) values were obtained for the developed 3D-QSAR model. The contour plots obtained from CoMFA analysis have shown 13.84% steric contribution and 66.14% electrostatic contribution towards an anti-inflammatory activity. METHODS The homology model of the receptor protein, α7 nicotinic acetylcholine, was generated in SWISS MODELLER using auto template mode and was analysed for the quality using Procheck, QMEAN Z-score, Anolea and GROMOS plots. The QMEAN score for the model was observed to be - 3.862. The generated model of alpha 7 nicotinic acetylcholine receptor was used for docking study of 27 piperazine analogues using Auto-Dock 4.2.5.1. RESULTS The dock score obtained from docking analysis was then correlated with experimental pIC50 values for in-silico validation of the developed CoMFA model and a good correlation was obtained with correlation coefficient (r2) value of -0.7378. CONCLUSION The present investigation suggests an optimal 3D-QSAR with CoMFA model for further evaluating new chemical entities based on piperazine skeleton.
Collapse
Affiliation(s)
- Deepika Purohit
- Faculty of Pharmaceutical Sciences, Maharshi Dayanand University, Rohtak-124001, India
| | - Vandana Saini
- Centre For Bioinformatics, Maharshi Dayanand University, Rohtak-124001, India
| | - Sanjiv Kumar
- Faculty of Pharmaceutical Sciences, Maharshi Dayanand University, Rohtak-124001, India
| | - Ajit Kumar
- Centre For Bioinformatics, Maharshi Dayanand University, Rohtak-124001, India
| | | |
Collapse
|
3
|
Competitive docking model for prediction of the human nicotinic acetylcholine receptor α7 binding of tobacco constituents. Oncotarget 2018; 9:16899-16916. [PMID: 29682193 PMCID: PMC5908294 DOI: 10.18632/oncotarget.24458] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Accepted: 02/01/2018] [Indexed: 12/21/2022] Open
Abstract
The detrimental health effects associated with tobacco use constitute a major public health concern. The addiction associated with nicotine found in tobacco products has led to difficulty in quitting among users. Nicotinic acetylcholine receptors (nAChRs) are the targets of nicotine and are responsible for addiction to tobacco products. However, it is unknown if the other >8000 tobacco constituents are addictive. Since it is time-consuming and costly to experimentally assess addictive potential of such larger number of chemicals, computationally predicting human nAChRs binding is important for in silico evaluation of addiction potential of tobacco constituents and needs structures of human nAChRs. Therefore, we constructed three-dimensional structures of the ligand binding domain of human nAChR α7 subtype and then developed a predictive model based on the constructed structures to predict human nAChR α7 binding activity of tobacco constituents. The predictive model correctly predicted 11 out of 12 test compounds to be binders of nAChR α7. The model is a useful tool for high-throughput screening of potential addictive tobacco constituents. These results could inform regulatory science research by providing a new validated predictive tool using cutting-edge computational methodology to high-throughput screen tobacco additives and constituents for their binding interaction with the human α7 nicotinic receptor. The tool represents a prediction model capable of screening thousands of chemicals found in tobacco products for addiction potential, which improves the understanding of the potential effects of additives.
Collapse
|
4
|
Awale M, Probst D, Reymond JL. WebMolCS: A Web-Based Interface for Visualizing Molecules in Three-Dimensional Chemical Spaces. J Chem Inf Model 2017; 57:643-649. [PMID: 28316236 DOI: 10.1021/acs.jcim.6b00690] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The concept of chemical space provides a convenient framework to analyze large collections of molecules by placing them in property spaces where distances represent similarities. Here we report webMolCS, a new type of web-based interface visualizing up to 5000 user-defined molecules in six different three-dimensional (3D) chemical spaces obtained by principal component analysis or similarity mapping of multidimensional property spaces describing composition (MQN: 42D molecular quantum numbers, SMIfp: 34D SMILES fingerprint), shapes and pharmacophores (APfp: 20D atom pair fingerprint, Xfp: 55D category extended atom pair fingerprint), and substructures (Sfp: 1024D binary substructure fingerprint, ECfp4:1024D extended connectivity fingerprint). Each molecule is shown as a sphere, and its structure appears on mouse over. The sphere is color-coded by similarity to the first compound in the list, by the list rank, or by a user-defined value, which reveals the relationship between any property encoded by these values and structural similarities. WebMolCS is freely available at www.gdb.unibe.ch .
Collapse
Affiliation(s)
- Mahendra Awale
- Department of Chemistry and Biochemistry, National Center of Competence in Research NCCR TransCure, University of Berne , Freiestrasse 3, 3012 Berne, Switzerland
| | - Daniel Probst
- Department of Chemistry and Biochemistry, National Center of Competence in Research NCCR TransCure, University of Berne , Freiestrasse 3, 3012 Berne, Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry, National Center of Competence in Research NCCR TransCure, University of Berne , Freiestrasse 3, 3012 Berne, Switzerland
| |
Collapse
|
5
|
Awale M, Reymond JL. The polypharmacology browser: a web-based multi-fingerprint target prediction tool using ChEMBL bioactivity data. J Cheminform 2017; 9:11. [PMID: 28270862 PMCID: PMC5319934 DOI: 10.1186/s13321-017-0199-x] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2016] [Accepted: 02/10/2017] [Indexed: 12/31/2022] Open
Abstract
Background Several web-based tools have been reported recently which predict the possible targets of a small molecule by similarity to compounds of known bioactivity using molecular fingerprints (fps), however predictions in each case rely on similarities computed from only one or two fps. Considering that structural similarity and therefore the predicted targets strongly depend on the method used for comparison, it would be highly desirable to predict targets using a broader set of fps simultaneously. Results Herein, we present the polypharmacology browser (PPB), a web-based platform which predicts possible targets for small molecules by searching for nearest neighbors using ten different fps describing composition, substructures, molecular shape and pharmacophores. PPB searches through 4613 groups of at least 10 same target annotated bioactive molecules from ChEMBL and returns a list of predicted targets ranked by consensus voting scheme and p value. A validation study across 670 drugs with up to 20 targets showed that combining the predictions from all 10 fps gives the best results, with on average 50% of the known targets of a drug being correctly predicted with a hit rate of 25%. Furthermore, when profiling a new inhibitor of the calcium channel TRPV6 against 24 targets taken from a safety screen panel, we observed inhibition in 5 out of 5 targets predicted by PPB and in 7 out of 18 targets not predicted by PPB. The rate of correct (5/12) and incorrect (0/12) predictions for this compound by PPB was comparable to that of other web-based prediction tools. Conclusion PPB offers a versatile platform for target prediction based on multi-fingerprint comparisons, and is freely accessible at www.gdb.unibe.ch as a valuable support for drug discovery.. ![]() Electronic supplementary material The online version of this article (doi:10.1186/s13321-017-0199-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Mahendra Awale
- Department of Chemistry and Biochemistry, National Center of Competence in Research NCCR Chemical Biology and NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry, National Center of Competence in Research NCCR Chemical Biology and NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| |
Collapse
|
6
|
|
7
|
Abstract
One of the simplest questions that can be asked about molecular diversity is how many organic molecules are possible in total? To answer this question, my research group has computationally enumerated all possible organic molecules up to a certain size to gain an unbiased insight into the entire chemical space. Our latest database, GDB-17, contains 166.4 billion molecules of up to 17 atoms of C, N, O, S, and halogens, by far the largest small molecule database reported to date. Molecules allowed by valency rules but unstable or nonsynthesizable due to strained topologies or reactive functional groups were not considered, which reduced the enumeration by at least 10 orders of magnitude and was essential to arrive at a manageable database size. Despite these restrictions, GDB-17 is highly relevant with respect to known molecules. Beyond enumeration, understanding and exploiting GDBs (generated databases) led us to develop methods for virtual screening and visualization of very large databases in the form of a "periodic system of molecules" comprising six different fingerprint spaces, with web-browsers for nearest neighbor searches, and the MQN- and SMIfp-Mapplet application for exploring color-coded principal component maps of GDB and other large databases. Proof-of-concept applications of GDB for drug discovery were realized by combining virtual screening with chemical synthesis and activity testing for neurotransmitter receptor and transporter ligands. One surprising lesson from using GDB for drug analog searches is the incredible depth of chemical space, that is, the fact that millions of very close analogs of any molecule can be readily identified by nearest-neighbor searches in the MQN-space of the various GDBs. The chemical space project has opened an unprecedented door on chemical diversity. Ongoing and yet unmet challenges concern enumerating molecules beyond 17 atoms and synthesizing GDB molecules with innovative scaffolds and pharmacophores.
Collapse
Affiliation(s)
- Jean-Louis Reymond
- Department of Chemistry and
Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| |
Collapse
|
8
|
CS-MINER: A Tool for Association Mining in Binding-Database. Mol Inform 2015; 34:185-96. [DOI: 10.1002/minf.201400142] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2014] [Accepted: 01/26/2015] [Indexed: 11/07/2022]
|
9
|
Rupakheti C, Virshup A, Yang W, Beratan DN. Strategy to discover diverse optimal molecules in the small molecule universe. J Chem Inf Model 2015; 55:529-37. [PMID: 25594586 PMCID: PMC4372820 DOI: 10.1021/ci500749q] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
The small molecule universe (SMU) is defined as a set of over 10(60) synthetically feasible organic molecules with molecular weight less than ∼500 Da. Exhaustive enumerations and evaluation of all SMU molecules for the purpose of discovering favorable structures is impossible. We take a stochastic approach and extend the ACSESS framework ( Virshup et al. J. Am. Chem. Soc. 2013 , 135 , 7296 - 7303 ) to develop diversity oriented molecular libraries that can generate a set of compounds that is representative of the small molecule universe and that also biases the library toward favorable physical property values. We show that the approach is efficient compared to exhaustive enumeration and to existing evolutionary algorithms for generating such libraries by testing in the NKp fitness landscape model and in the fully enumerated GDB-9 chemical universe containing 3 × 10(5) molecules.
Collapse
Affiliation(s)
- Chetan Rupakheti
- †Program in Computational Biology and Bioinformatics, ‡Department of Chemistry, §Department of Physics, and ∥Department of Biochemistry, Duke University, Durham, North Carolina 27708, United States
| | - Aaron Virshup
- †Program in Computational Biology and Bioinformatics, ‡Department of Chemistry, §Department of Physics, and ∥Department of Biochemistry, Duke University, Durham, North Carolina 27708, United States
| | - Weitao Yang
- †Program in Computational Biology and Bioinformatics, ‡Department of Chemistry, §Department of Physics, and ∥Department of Biochemistry, Duke University, Durham, North Carolina 27708, United States
| | - David N Beratan
- †Program in Computational Biology and Bioinformatics, ‡Department of Chemistry, §Department of Physics, and ∥Department of Biochemistry, Duke University, Durham, North Carolina 27708, United States
| |
Collapse
|
10
|
Li GB, Yang LL, Yuan Y, Zou J, Cao Y, Yang SY, Xiang R, Xiang M. Virtual screening in small molecule discovery for epigenetic targets. Methods 2015; 71:158-66. [DOI: 10.1016/j.ymeth.2014.11.010] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Revised: 10/27/2014] [Accepted: 11/14/2014] [Indexed: 12/31/2022] Open
|
11
|
|
12
|
Awale M, Reymond JL. Atom Pair 2D-Fingerprints Perceive 3D-Molecular Shape and Pharmacophores for Very Fast Virtual Screening of ZINC and GDB-17. J Chem Inf Model 2014; 54:1892-907. [DOI: 10.1021/ci500232g] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Mahendra Awale
- Department of Chemistry and
Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry and
Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne Switzerland
| |
Collapse
|
13
|
Ruddigkeit L, Awale M, Reymond JL. Expanding the fragrance chemical space for virtual screening. J Cheminform 2014; 6:27. [PMID: 24876890 PMCID: PMC4037718 DOI: 10.1186/1758-2946-6-27] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2014] [Accepted: 05/12/2014] [Indexed: 12/30/2022] Open
Abstract
The properties of fragrance molecules in the public databases SuperScent and Flavornet were analyzed to define a “fragrance-like” (FL) property range (Heavy Atom Count ≤ 21, only C, H, O, S, (O + S) ≤ 3, Hydrogen Bond Donor ≤ 1) and the corresponding chemical space including FL molecules from PubChem (NIH repository of molecules), ChEMBL (bioactive molecules), ZINC (drug-like molecules), and GDB-13 (all possible organic molecules up to 13 atoms of C, N, O, S, Cl). The FL subsets of these databases were classified by MQN (Molecular Quantum Numbers, a set of 42 integer value descriptors of molecular structure) and formatted for fast MQN-similarity searching and interactive exploration of color-coded principal component maps in form of the FL-mapplet and FL-browser applications freely available at http://www.gdb.unibe.ch. MQN-similarity is shown to efficiently recover 15 different fragrance molecule families from the different FL subsets, demonstrating the relevance of the MQN-based tool to explore the fragrance chemical space.
Collapse
Affiliation(s)
- Lars Ruddigkeit
- Department of Chemistry and Biochemistry, University of Bern, Freiestrasse 3, 3012 Bern, Switzerland
| | - Mahendra Awale
- Department of Chemistry and Biochemistry, University of Bern, Freiestrasse 3, 3012 Bern, Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry, University of Bern, Freiestrasse 3, 3012 Bern, Switzerland
| |
Collapse
|
14
|
Bürgi JJ, Awale M, Boss SD, Schaer T, Marger F, Viveros-Paredes JM, Bertrand S, Gertsch J, Bertrand D, Reymond JL. Discovery of potent positive allosteric modulators of the α3β2 nicotinic acetylcholine receptor by a chemical space walk in ChEMBL. ACS Chem Neurosci 2014; 5:346-59. [PMID: 24593915 DOI: 10.1021/cn4002297] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
While a plethora of ligands are known for the well studied α7 and α4β2 nicotinic acetylcholine receptor (nAChR), only very few ligands address the related α3β2 nAChR expressed in the central nervous system and at the neuromuscular junction. Starting with the public database ChEMBL organized in the chemical space of Molecular Quantum Numbers (MQN, a series of 42 integer value descriptors of molecular structure), a visual survey of nearest neighbors of the α7 nAChR partial agonist N-(3R)-1-azabicyclo[2.2.2]oct-3-yl-4-chlorobenzamide (PNU-282,987) pointed to N-(2-halobenzyl)-3-aminoquinuclidines as possible nAChR modulators. This simple "chemical space walk" was performed using a web-browser available at www.gdb.unibe.ch . Electrophysiological recordings revealed that these ligands represent a new and to date most potent class of positive allosteric modulators (PAMs) of the α3β2 nAChR, which also exert significant effects in vivo. The present discovery highlights the value of surveying chemical space neighbors of known drugs within public databases to uncover new pharmacology.
Collapse
Affiliation(s)
- Justus J. Bürgi
- Department
of Chemistry and Biochemistry, NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Mahendra Awale
- Department
of Chemistry and Biochemistry, NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Silvan D. Boss
- Department
of Chemistry and Biochemistry, NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Tifany Schaer
- HiQScreen, 6 rte de Compois, 1222 Vésenaz, Geneva, Switzerland
| | - Fabrice Marger
- HiQScreen, 6 rte de Compois, 1222 Vésenaz, Geneva, Switzerland
| | - Juan M. Viveros-Paredes
- Departamento
de Farmacobiología CUCEI, Universidad de Guadalajara, 44430 Guadalajara, Jalisco, México
| | - Sonia Bertrand
- HiQScreen, 6 rte de Compois, 1222 Vésenaz, Geneva, Switzerland
| | - Jürg Gertsch
- Institute
of Biochemistry and Molecular Medicine, University of Berne, Bühlstrasse 28, 3012 Berne, Switzerland
| | - Daniel Bertrand
- HiQScreen, 6 rte de Compois, 1222 Vésenaz, Geneva, Switzerland
| | - Jean-Louis Reymond
- Department
of Chemistry and Biochemistry, NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| |
Collapse
|
15
|
Abstract
To confirm the activity of an initial small molecule 'hit compound' from an activity screening, one needs to probe the structure-activity relationships by testing close analogs. The multi-fingerprint browser presented here (http://dcb-reymond23.unibe.ch:8080/MCSS/) enables one to rapidly identify such close analogs among commercially available compounds in the ZINC database (>13 million molecules). The browser retrieves nearest neighbors of any query molecule in multi-dimensional chemical spaces defined by four different fingerprints, each of which represents relevant structural and pharmacophoric features in a different way: sFP (substructure fingerprint), ECFP4 (extended connectivity fingerprint), MQNs (molecular quantum numbers) and SMIfp (SMILES fingerprint). Distances are calculated using the city-block distance, a similarity measure that performs as well as Tanimoto similarity but is much faster to compute. The list of up to 1000 nearest neighbors of any query molecule is retrieved by the browser and can be then clustered using the K-means clustering algorithm to produce a focused list of analogs with likely similar bioactivity to be considered for experimental evaluation.
Collapse
Affiliation(s)
- Mahendra Awale
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, Berne-3012, Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, Berne-3012, Switzerland
| |
Collapse
|
16
|
Kombo DC, Bencherif M. Comparative study on the use of docking and Bayesian categorization to predict ligand binding to nicotinic acetylcholine receptors (nAChRs) subtypes. J Chem Inf Model 2013; 53:3212-22. [PMID: 24328365 DOI: 10.1021/ci400493a] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
We have carried out a comparative study between docking into homology models and Bayesian categorization, as applied to virtual screening of nicotinic ligands for binding at various nAChRs subtypes (human and rat α4β2, α7, α3β4, and α6β2β3). We found that although results vary with receptor subtype, Bayesian categorization exhibits higher accuracy and enrichment than unconstrained docking into homology models. However, docking accuracy is improved when one sets up a hydrogen-bond (HB) constraint between the cationic center of the ligand and the main-chain carbonyl group of the conserved Trp-149 or its homologue (a residue involved in cation-π interactions with the ligand basic nitrogen atom). This finding suggests that this HB is a hallmark of nicotinic ligands binding to nAChRs. Best predictions using either docking or Bayesian were obtained with the human α7 nAChR, when 100 nM was used as cutoff for biological activity. We also found that ligand-based Bayesian-derived enrichment factors and structure-based docking-derived enrichment factors highly correlate to each other. Moreover, they correlate with the mean molecular fractional polar surface area of actives ligands and the fractional hydrophobic/hydrophilic surface area of the binding site, respectively. This result is in agreement with the fact that hydrophobicity strongly contributes in promoting nicotinic ligands binding to their cognate nAChRs.
Collapse
Affiliation(s)
- David C Kombo
- Targacept, Inc., 100 North Main Street, Winston-Salem, North Carolina 27101, United States
| | | |
Collapse
|
17
|
Molgó J, Aráoz R, Benoit E, Iorga BI. Physical and virtual screening methods for marine toxins and drug discovery targeting nicotinic acetylcholine receptors. Expert Opin Drug Discov 2013; 8:1203-23. [DOI: 10.1517/17460441.2013.822365] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
18
|
Schwartz J, Awale M, Reymond JL. SMIfp (SMILES fingerprint) chemical space for virtual screening and visualization of large databases of organic molecules. J Chem Inf Model 2013; 53:1979-89. [PMID: 23845040 DOI: 10.1021/ci400206h] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
SMIfp (SMILES fingerprint) is defined here as a scalar fingerprint describing organic molecules by counting the occurrences of 34 different symbols in their SMILES strings, which creates a 34-dimensional chemical space. Ligand-based virtual screening using the city-block distance CBD(SMIfp) as similarity measure provides good AUC values and enrichment factors for recovering series of actives from the directory of useful decoys (DUD-E) and from ZINC. DrugBank, ChEMBL, ZINC, PubChem, GDB-11, GDB-13, and GDB-17 can be searched by CBD(SMIfp) using an online SMIfp-browser at www.gdb.unibe.ch. Visualization of the SMIfp chemical space was performed by principal component analysis and color-coded maps of the (PC1, PC2)-planes, with interactive access to the molecules enabled by the Java application SMIfp-MAPPLET available from www.gdb.unibe.ch. These maps spread molecules according to their fraction of aromatic atoms, size and polarity. SMIfp provides a new and relevant entry to explore the small molecule chemical space.
Collapse
Affiliation(s)
- Julian Schwartz
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | | | | |
Collapse
|
19
|
Awale M, van Deursen R, Reymond JL. MQN-mapplet: visualization of chemical space with interactive maps of DrugBank, ChEMBL, PubChem, GDB-11, and GDB-13. J Chem Inf Model 2013; 53:509-18. [PMID: 23297797 DOI: 10.1021/ci300513m] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
The MQN-mapplet is a Java application giving access to the structure of small molecules in large databases via color-coded maps of their chemical space. These maps are projections from a 42-dimensional property space defined by 42 integer value descriptors called molecular quantum numbers (MQN), which count different categories of atoms, bonds, polar groups, and topological features and categorize molecules by size, rigidity, and polarity. Despite its simplicity, MQN-space is relevant to biological activities. The MQN-mapplet allows localization of any molecule on the color-coded images, visualization of the molecules, and identification of analogs as neighbors on the MQN-map or in the original 42-dimensional MQN-space. No query molecule is necessary to start the exploration, which may be particularly attractive for nonchemists. To our knowledge, this type of interactive exploration tool is unprecedented for very large databases such as PubChem and GDB-13 (almost one billion molecules). The application is freely available for download at www.gdb.unibe.ch.
Collapse
Affiliation(s)
- Mahendra Awale
- Department of Chemistry and Biochemistry, NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | | | | |
Collapse
|
20
|
Ruddigkeit L, Blum LC, Reymond JL. Visualization and virtual screening of the chemical universe database GDB-17. J Chem Inf Model 2013; 53:56-65. [PMID: 23259841 DOI: 10.1021/ci300535x] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The chemical universe database GDB-17 contains 166.4 billion molecules of up to 17 atoms of C, N, O, S, and halogens obeying rules for chemical stability, synthetic feasibility, and medicinal chemistry. GDB-17 was analyzed using 42 integer value descriptors of molecular structure which we term "Molecular Quantum Numbers" (MQN). Principal component analysis and representation of the (PC1, PC2)-plane provided a graphical overview of the GDB-17 chemical space. Rapid ligand-based virtual screening (LBVS) of GDB-17 using the city-block distance CBD(MQN) as a similarity search measure was enabled by a hashed MQN-fingerprint. LBVS of the entire GDB-17 and of selected subsets identified shape similar, scaffold hopping analogs (ROCS > 1.6 and T(SF) < 0.5) of 15 drugs. Over 97% of these analogs occurred within CBD(MQN) ≤ 12 from each drug, a constraint which might help focus advanced virtual screening. An MQN-searchable 50 million subset of GDB-17 is publicly available at www.gdb.unibe.ch .
Collapse
Affiliation(s)
- Lars Ruddigkeit
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | | | | |
Collapse
|
21
|
Abstract
Understanding structure-activity relationships (SARs) for a given set of molecules allows one to rationally explore chemical space and develop a chemical series optimizing multiple physicochemical and biological properties simultaneously, for instance, improving potency, reducing toxicity, and ensuring sufficient bioavailability. In silico methods allow rapid and efficient characterization of SARs and facilitate building a variety of models to capture and encode one or more SARs, which can then be used to predict activities for new molecules. By coupling these methods with in silico modifications of structures, one can easily prioritize large screening decks or even generate new compounds de novo and ascertain whether they belong to the SAR being studied. Computational methods can provide a guide for the experienced user by integrating and summarizing large amounts of preexisting data to suggest useful structural modifications. This chapter highlights the different types of SAR modeling methods and how they support the task of exploring chemical space to elucidate and optimize SARs in a drug discovery setting. In addition to considering modeling algorithms, I briefly discuss how to use databases as a source of SAR data to inform and enhance the exploration of SAR trends. I also review common modeling techniques that are used to encode SARs, recent work in the area of structure-activity landscapes, the role of SAR databases, and alternative approaches to exploring SAR data that do not involve explicit model development.
Collapse
Affiliation(s)
- Rajarshi Guha
- NIH Center for Advancing Translational Science, Rockville, MD, USA
| |
Collapse
|
22
|
Ruddigkeit L, van Deursen R, Blum LC, Reymond JL. Enumeration of 166 Billion Organic Small Molecules in the Chemical Universe Database GDB-17. J Chem Inf Model 2012; 52:2864-75. [DOI: 10.1021/ci300415d] [Citation(s) in RCA: 629] [Impact Index Per Article: 48.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Affiliation(s)
- Lars Ruddigkeit
- Department
of Chemistry and Biochemistry, NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Ruud van Deursen
- Biomolecular Screening Facility,
NCCR Chemical Biology, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, 1015
Lausanne, Switzerland
| | - Lorenz C. Blum
- Department
of Chemistry and Biochemistry, NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Jean-Louis Reymond
- Department
of Chemistry and Biochemistry, NCCR TransCure, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| |
Collapse
|
23
|
Reymond JL, Awale M. Exploring chemical space for drug discovery using the chemical universe database. ACS Chem Neurosci 2012; 3:649-57. [PMID: 23019491 DOI: 10.1021/cn3000422] [Citation(s) in RCA: 178] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Accepted: 04/25/2012] [Indexed: 01/20/2023] Open
Abstract
Herein we review our recent efforts in searching for bioactive ligands by enumeration and virtual screening of the unknown chemical space of small molecules. Enumeration from first principles shows that almost all small molecules (>99.9%) have never been synthesized and are still available to be prepared and tested. We discuss open access sources of molecules, the classification and representation of chemical space using molecular quantum numbers (MQN), its exhaustive enumeration in form of the chemical universe generated databases (GDB), and examples of using these databases for prospective drug discovery. MQN-searchable GDB, PubChem, and DrugBank are freely accessible at www.gdb.unibe.ch.
Collapse
Affiliation(s)
- Jean-Louis Reymond
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Mahendra Awale
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| |
Collapse
|
24
|
Awale M, Reymond JL. Cluster analysis of the DrugBank chemical space using molecular quantum numbers. Bioorg Med Chem 2012; 20:5372-8. [DOI: 10.1016/j.bmc.2012.03.017] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2012] [Revised: 03/01/2012] [Accepted: 03/05/2012] [Indexed: 11/15/2022]
|
25
|
Bréthous L, Garcia-Delgado N, Schwartz J, Bertrand S, Bertrand D, Reymond JL. Synthesis and Nicotinic Receptor Activity of Chemical Space Analogues of N-(3R)-1-Azabicyclo[2.2.2]oct-3-yl-4-chlorobenzamide (PNU-282,987) and 1,4-Diazabicyclo[3.2.2]nonane-4-carboxylic Acid 4-Bromophenyl Ester (SSR180711). J Med Chem 2012; 55:4605-18. [DOI: 10.1021/jm300030r] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Lise Bréthous
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Noemi Garcia-Delgado
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Julian Schwartz
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| | - Sonia Bertrand
- HiQScreen, 15 rue de l'Athénée, 1206 Geneva, Switzerland
| | | | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry, University of Berne, Freiestrasse 3, 3012 Berne, Switzerland
| |
Collapse
|
26
|
Reymond JL, Ruddigkeit L, Blum L, van Deursen R. The enumeration of chemical space. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2012. [DOI: 10.1002/wcms.1104] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|