1
|
Saigiridharan L, Hassen AK, Lai H, Torren-Peraire P, Engkvist O, Genheden S. AiZynthFinder 4.0: developments based on learnings from 3 years of industrial application. J Cheminform 2024; 16:57. [PMID: 38778382 PMCID: PMC11112899 DOI: 10.1186/s13321-024-00860-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 05/15/2024] [Indexed: 05/25/2024] Open
Abstract
We present an updated overview of the AiZynthFinder package for retrosynthesis planning. Since the first version was released in 2020, we have added a substantial number of new features based on user feedback. Feature enhancements include policies for filter reactions, support for any one-step retrosynthesis model, a scoring framework and several additional search algorithms. To exemplify the typical use-cases of the software and highlight some learnings, we perform a large-scale analysis on several hundred thousand target molecules from diverse sources. This analysis looks at for instance route shape, stock usage and exploitation of reaction space, and points out strengths and weaknesses of our retrosynthesis approach. The software is released as open-source for educational purposes as well as to provide a reference implementation of the core algorithms for synthesis prediction. We hope that releasing the software as open-source will further facilitate innovation in developing novel methods for synthetic route prediction. AiZynthFinder is a fast, robust and extensible open-source software and can be downloaded from https://github.com/MolecularAI/aizynthfinder .
Collapse
Affiliation(s)
| | - Alan Kai Hassen
- Leiden Institute of Advanced Computer Science, Leiden University, Leiden, The Netherlands
| | - Helen Lai
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Cambridge, UK
| | - Paula Torren-Peraire
- Institute of Structural Biology, Molecular Targets and Therapeutics Center, Helmholtz Zentrum München, Neuherberg, Germany
| | - Ola Engkvist
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | - Samuel Genheden
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden.
| |
Collapse
|
2
|
Abstract
A few months before the COVID-19 pandemic, Pierre Vogel and Kendall N. Houk published with a new textbook Wiley-VCH, “Organic Chemistry: Theory, Reactivity, and Mechanisms in Modern Synthesis”, with a foreword from the late Roberts H. Grubbs. The book demonstrates how catalytic processes dominate all fields of modern organic chemistry and synthesis, and how invention combines thermodynamics, kinetics, spectroscopy, quantum mechanics, and thermochemical data libraries. Here, the authors present a few case studies that should be of interest to teachers, practitioners of organic and organometallic chemistry, and the engineers of molecules. The Vogel–Houk book is both textbook and reference manual; it provides a modern way to think about chemical reactivity and a powerful toolbox to inventors of new reactions and new procedures.
Collapse
|
3
|
Warr WA, Nicklaus MC, Nicolaou CA, Rarey M. Exploration of Ultralarge Compound Collections for Drug Discovery. J Chem Inf Model 2022; 62:2021-2034. [PMID: 35421301 DOI: 10.1021/acs.jcim.2c00224] [Citation(s) in RCA: 45] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Designing new medicines more cheaply and quickly is tightly linked to the quest of exploring chemical space more widely and efficiently. Chemical space is monumentally large, but recent advances in computer software and hardware have enabled researchers to navigate virtual chemical spaces containing billions of chemical structures. This review specifically concerns collections of many millions or even billions of enumerated chemical structures as well as even larger chemical spaces that are not fully enumerated. We present examples of chemical libraries and spaces and the means used to construct them, and we discuss new technologies for searching huge libraries and for searching combinatorially in chemical space. We also cover space navigation techniques and consider new approaches to de novo drug design and the impact of the "autonomous laboratory" on synthesis of designed compounds. Finally, we summarize some other challenges and opportunities for the future.
Collapse
Affiliation(s)
- Wendy A Warr
- Wendy Warr & Associates, 6 Berwick Court, Holmes Chapel, Crewe, Cheshire CW4 7HZ, United Kingdom
| | - Marc C Nicklaus
- NCI, NIH, CADD Group, NCI-Frederick, Frederick, Maryland 21702, United States
| | - Christos A Nicolaou
- Discovery Chemistry, Lilly Research Laboratories, Eli Lilly and Company, Indianapolis, Indiana 46285, United States
| | - Matthias Rarey
- Universität Hamburg, ZBH Center for Bioinformatics, 20146 Hamburg, Germany
| |
Collapse
|
4
|
Su A, Cheng Y, Xue H, She Y, Rajan K. Artificial intelligence informed toxicity screening of amine chemistries used in the synthesis of hybrid
organic–inorganic
perovskites. AIChE J 2022. [DOI: 10.1002/aic.17699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- An Su
- College of Chemical Engineering Zhejiang University of Technology Hangzhou China
- Department of Materials Design and Innovation University at Buffalo Buffalo New York USA
| | - Yingying Cheng
- College of Chemical Engineering Zhejiang University of Technology Hangzhou China
| | - Haotian Xue
- Collaborative Innovation Center of Yangtze River Delta Region Green Pharmaceuticals Zhejiang University of Technology Hangzhou China
| | - Yuanbin She
- College of Chemical Engineering Zhejiang University of Technology Hangzhou China
| | - Krishna Rajan
- Department of Materials Design and Innovation University at Buffalo Buffalo New York USA
| |
Collapse
|
5
|
Genheden S, Engkvist O, Bjerrum E. Fast prediction of distances between synthetic routes with deep learning. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022. [DOI: 10.1088/2632-2153/ac4a91] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Abstract
We expand the recent work on clustering of synthetic routes and train a deep learning model to predict the distances between arbitrary routes. The model is based on a long short-term memory representation of a synthetic route and is trained as a twin network to reproduce the tree edit distance (TED) between two routes. The machine learning approach is approximately two orders of magnitude faster than the TED approach and enables clustering many more routes from a retrosynthesis route prediction. The clusters have a high degree of similarity to the clusters given by the TED-based approach and are accordingly intuitive and explainable. We provide the developed model as open-source.
Collapse
|
6
|
Pujol‐Giménez J, Poirier M, Bühlmann S, Schuppisser C, Bhardwaj R, Awale M, Visini R, Javor S, Hediger MA, Reymond J. Inhibitors of Human Divalent Metal Transporters DMT1 (SLC11A2) and ZIP8 (SLC39A8) from a GDB-17 Fragment Library. ChemMedChem 2021; 16:3306-3314. [PMID: 34309203 PMCID: PMC8596699 DOI: 10.1002/cmdc.202100467] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Indexed: 11/06/2022]
Abstract
Solute carrier proteins (SLCs) are membrane proteins controlling fluxes across biological membranes and represent an emerging class of drug targets. Here we searched for inhibitors of divalent metal transporters in a library of 1,676 commercially available 3D-shaped fragment-like molecules from the generated database GDB-17, which lists all possible organic molecules up to 17 atoms of C, N, O, S and halogen following simple criteria for chemical stability and synthetic feasibility. While screening against DMT1 (SLC11A2), an iron transporter associated with hemochromatosis and for which only very few inhibitors are known, only yielded two weak inhibitors, our approach led to the discovery of the first inhibitor of ZIP8 (SLC39A8), a zinc transporter associated with manganese homeostasis and osteoarthritis but with no previously reported pharmacology, demonstrating that this target is druggable.
Collapse
Affiliation(s)
- Jonai Pujol‐Giménez
- Department of Biomedical Research and Department of Nephrology and Hypertension Membrane Transport Discovery Lab Inselspital, Bern University HospitalUniversity of BernCH-3010BernSwitzerland
| | - Marion Poirier
- Department of Chemistry Biochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| | - Sven Bühlmann
- Department of Chemistry Biochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| | - Céline Schuppisser
- Department of Chemistry Biochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| | - Rajesh Bhardwaj
- Department of Biomedical Research and Department of Nephrology and Hypertension Membrane Transport Discovery Lab Inselspital, Bern University HospitalUniversity of BernCH-3010BernSwitzerland
| | - Mahendra Awale
- Department of Chemistry Biochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| | - Ricardo Visini
- Department of Chemistry Biochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| | - Sacha Javor
- Department of Chemistry Biochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| | - Matthias A. Hediger
- Department of Biomedical Research and Department of Nephrology and Hypertension Membrane Transport Discovery Lab Inselspital, Bern University HospitalUniversity of BernCH-3010BernSwitzerland
| | - Jean‐Louis Reymond
- Department of Chemistry Biochemistry and Pharmaceutical SciencesUniversity of BernFreiestrasse 33012BernSwitzerland
| |
Collapse
|
7
|
Meier K, Arús‐Pous J, Reymond J. A Potent and Selective Janus Kinase Inhibitor with a Chiral 3D‐Shaped Triquinazine Ring System from Chemical Space. Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202012049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Kris Meier
- Department of Chemistry and Biochemistry University of Bern Freiestrasse 3 3012 Bern Switzerland
| | - Josep Arús‐Pous
- Department of Chemistry and Biochemistry University of Bern Freiestrasse 3 3012 Bern Switzerland
| | - Jean‐Louis Reymond
- Department of Chemistry and Biochemistry University of Bern Freiestrasse 3 3012 Bern Switzerland
| |
Collapse
|
8
|
Thakkar A, Chadimová V, Bjerrum EJ, Engkvist O, Reymond JL. Retrosynthetic accessibility score (RAscore) - rapid machine learned synthesizability classification from AI driven retrosynthetic planning. Chem Sci 2021; 12:3339-3349. [PMID: 34164104 DOI: 10.26434/chemrxiv.13019993.v1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2023] Open
Abstract
Computer aided synthesis planning (CASP) is part of a suite of artificial intelligence (AI) based tools that are able to propose synthesis routes to a wide range of compounds. However, at present they are too slow to be used to screen the synthetic feasibility of millions of generated or enumerated compounds before identification of potential bioactivity by virtual screening (VS) workflows. Herein we report a machine learning (ML) based method capable of classifying whether a synthetic route can be identified for a particular compound or not by the CASP tool AiZynthFinder. The resulting ML models return a retrosynthetic accessibility score (RAscore) of any molecule of interest, and computes at least 4500 times faster than retrosynthetic analysis performed by the underlying CASP tool. The RAscore should be useful for pre-screening millions of virtual molecules from enumerated databases or generative models for synthetic accessibility and produce higher quality databases for virtual screening of biological activity.
Collapse
Affiliation(s)
- Amol Thakkar
- Hit Discovery, Discovery Sciences, R&D, AstraZeneca Gothenburg 431 50 Sweden
- Department of Chemistry and Biochemistry, University of Bern Bern CH-3012 Switzerland
| | - Veronika Chadimová
- Hit Discovery, Discovery Sciences, R&D, AstraZeneca Gothenburg 431 50 Sweden
| | | | - Ola Engkvist
- Hit Discovery, Discovery Sciences, R&D, AstraZeneca Gothenburg 431 50 Sweden
| | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry, University of Bern Bern CH-3012 Switzerland
| |
Collapse
|
9
|
Thakkar A, Chadimová V, Bjerrum EJ, Engkvist O, Reymond JL. Retrosynthetic accessibility score (RAscore) - rapid machine learned synthesizability classification from AI driven retrosynthetic planning. Chem Sci 2021; 12:3339-3349. [PMID: 34164104 PMCID: PMC8179384 DOI: 10.1039/d0sc05401a] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Computer aided synthesis planning (CASP) is part of a suite of artificial intelligence (AI) based tools that are able to propose synthesis routes to a wide range of compounds. However, at present they are too slow to be used to screen the synthetic feasibility of millions of generated or enumerated compounds before identification of potential bioactivity by virtual screening (VS) workflows. Herein we report a machine learning (ML) based method capable of classifying whether a synthetic route can be identified for a particular compound or not by the CASP tool AiZynthFinder. The resulting ML models return a retrosynthetic accessibility score (RAscore) of any molecule of interest, and computes at least 4500 times faster than retrosynthetic analysis performed by the underlying CASP tool. The RAscore should be useful for pre-screening millions of virtual molecules from enumerated databases or generative models for synthetic accessibility and produce higher quality databases for virtual screening of biological activity. The retrosynthetic accessibility score (RAscore) is based on AI driven retrosynthetic planning, and is useful for rapid scoring of synthetic feasability and pre-screening of large datasets of virtual/generated molecules.![]()
Collapse
Affiliation(s)
- Amol Thakkar
- Hit Discovery, Discovery Sciences, R&D, AstraZeneca Gothenburg 431 50 Sweden .,Department of Chemistry and Biochemistry, University of Bern Bern CH-3012 Switzerland
| | - Veronika Chadimová
- Hit Discovery, Discovery Sciences, R&D, AstraZeneca Gothenburg 431 50 Sweden
| | | | - Ola Engkvist
- Hit Discovery, Discovery Sciences, R&D, AstraZeneca Gothenburg 431 50 Sweden
| | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry, University of Bern Bern CH-3012 Switzerland
| |
Collapse
|
10
|
Yang T, Li Z, Chen Y, Feng D, Wang G, Fu Z, Ding X, Tan X, Zhao J, Luo X, Chen K, Jiang H, Zheng M. DrugSpaceX: a large screenable and synthetically tractable database extending drug space. Nucleic Acids Res 2021; 49:D1170-D1178. [PMID: 33104791 PMCID: PMC7778939 DOI: 10.1093/nar/gkaa920] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Revised: 09/11/2020] [Accepted: 10/05/2020] [Indexed: 02/07/2023] Open
Abstract
One of the most prominent topics in drug discovery is efficient exploration of the vast drug-like chemical space to find synthesizable and novel chemical structures with desired biological properties. To address this challenge, we created the DrugSpaceX (https://drugspacex.simm.ac.cn/) database based on expert-defined transformations of approved drug molecules. The current version of DrugSpaceX contains >100 million transformed chemical products for virtual screening, with outstanding characteristics in terms of structural novelty, diversity and large three-dimensional chemical space coverage. To illustrate its practical application in drug discovery, we used a case study of discoidin domain receptor 1 (DDR1), a kinase target implicated in fibrosis and other diseases, to show DrugSpaceX performing a quick search of initial hit compounds. Additionally, for ligand identification and optimization purposes, DrugSpaceX also provides several subsets for download, including a 10% diversity subset, an extended drug-like subset, a drug-like subset, a lead-like subset, and a fragment-like subset. In addition to chemical properties and transformation instructions, DrugSpaceX can locate the position of transformation, which will enable medicinal chemists to easily integrate strategy planning and protection design.
Collapse
Affiliation(s)
- Tianbiao Yang
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Pharmacy, University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, China
- School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, UCAS, Hangzhou 310024, China
| | - Zhaojun Li
- School of Information Management, Dezhou University, No. 566 University Rd. West, Dezhou 253023, Shandong, China
| | - Yingjia Chen
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Pharmacy, University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, China
| | - Dan Feng
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai, China
| | - Guangchao Wang
- School of Information Management, Dezhou University, No. 566 University Rd. West, Dezhou 253023, Shandong, China
| | - Zunyun Fu
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Nanjing University of Chinese Medicine, 138 Xianlin Road, Jiangsu, Nanjing 210023, China
| | - Xiaoyu Ding
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Pharmacy, University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, China
| | - Xiaoqin Tan
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Pharmacy, University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, China
| | - Jihui Zhao
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Pharmacy, University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, China
| | - Xiaomin Luo
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Pharmacy, University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, China
| | - Kaixian Chen
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Pharmacy, University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, China
| | - Hualiang Jiang
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Pharmacy, University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, China
- School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, UCAS, Hangzhou 310024, China
- School of Life Science and Technology, ShanghaiTech University, 393 Huaxiazhong Road, Shanghai 200031, China
| | - Mingyue Zheng
- Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
- Department of Pharmacy, University of Chinese Academy of Sciences, No.19A Yuquan Road, Beijing 100049, China
| |
Collapse
|
11
|
Meier K, Arús‐Pous J, Reymond J. A Potent and Selective Janus Kinase Inhibitor with a Chiral 3D‐Shaped Triquinazine Ring System from Chemical Space. Angew Chem Int Ed Engl 2020; 60:2074-2077. [DOI: 10.1002/anie.202012049] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Revised: 09/25/2020] [Indexed: 01/31/2023]
Affiliation(s)
- Kris Meier
- Department of Chemistry and Biochemistry University of Bern Freiestrasse 3 3012 Bern Switzerland
| | - Josep Arús‐Pous
- Department of Chemistry and Biochemistry University of Bern Freiestrasse 3 3012 Bern Switzerland
| | - Jean‐Louis Reymond
- Department of Chemistry and Biochemistry University of Bern Freiestrasse 3 3012 Bern Switzerland
| |
Collapse
|
12
|
Poirier M, Pujol-Giménez J, Manatschal C, Bühlmann S, Embaby A, Javor S, Hediger MA, Reymond JL. Pyrazolyl-pyrimidones inhibit the function of human solute carrier protein SLC11A2 (hDMT1) by metal chelation. RSC Med Chem 2020; 11:1023-1031. [PMID: 33479694 PMCID: PMC7649969 DOI: 10.1039/d0md00085j] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Accepted: 05/06/2020] [Indexed: 12/22/2022] Open
Abstract
Solute carrier proteins (SLCs) control fluxes of ions and molecules across biological membranes and represent an emerging class of drug targets. SLC11A2 (hDMT1) mediates intestinal iron uptake and its inhibition might be used to treat iron overload diseases such as hereditary hemochromatosis. Here we report a micromolar (IC50 = 1.1 μM) pyrazolyl-pyrimidone inhibitor of radiolabeled iron uptake in hDMT1 overexpressing HEK293 cells acting by a non-competitive mechanism, which however does not affect the electrophysiological properties of the transporter. Isothermal titration calorimetry, competition with calcein, induced precipitation of radioactive iron and cross inhibition of the unrelated iron transporter SLC39A8 (hZIP8) indicate that inhibition is mediated by metal chelation. Mapping the chemical space of thousands of pyrazolo-pyrimidones and similar 2,2'-diazabiaryls in ChEMBL suggests that their reported activities might partly reflect metal chelation. Such metal chelating groups are not listed in pan-assay interference compounds (PAINS) but should be checked when addressing SLCs.
Collapse
Affiliation(s)
- Marion Poirier
- Department of Chemistry and Biochemistry , University of Bern , Freiestrasse 3 , 3012 Bern , Switzerland .
| | - Jonai Pujol-Giménez
- Institute of Biochemistry and Molecular Medicine , University of Bern , Bühlstrasse 28 , 3012 Bern , Switzerland
- Membrane Transport Discovery Lab , Department of Nephrology and Hypertension , Inselspital , University of Bern Kinderklinik , Freiburgstrasse 15 , 3010 Bern , Switzerland .
- Department of Biomedical Research , University of Bern , Murtenstrasse 35 , 3008 Bern , Switzerland
| | - Cristina Manatschal
- Department of Biochemistry , University of Zürich , Winterthurerstrasse 190 , Zürich , Switzerland
| | - Sven Bühlmann
- Department of Chemistry and Biochemistry , University of Bern , Freiestrasse 3 , 3012 Bern , Switzerland .
| | - Ahmed Embaby
- Department of Chemistry and Biochemistry , University of Bern , Freiestrasse 3 , 3012 Bern , Switzerland .
| | - Sacha Javor
- Department of Chemistry and Biochemistry , University of Bern , Freiestrasse 3 , 3012 Bern , Switzerland .
| | - Matthias A Hediger
- Institute of Biochemistry and Molecular Medicine , University of Bern , Bühlstrasse 28 , 3012 Bern , Switzerland
- Membrane Transport Discovery Lab , Department of Nephrology and Hypertension , Inselspital , University of Bern Kinderklinik , Freiburgstrasse 15 , 3010 Bern , Switzerland .
- Department of Biomedical Research , University of Bern , Murtenstrasse 35 , 3008 Bern , Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry , University of Bern , Freiestrasse 3 , 3012 Bern , Switzerland .
| |
Collapse
|
13
|
Bühlmann S, Reymond JL. ChEMBL-Likeness Score and Database GDBChEMBL. Front Chem 2020; 8:46. [PMID: 32117874 PMCID: PMC7010641 DOI: 10.3389/fchem.2020.00046] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 01/15/2020] [Indexed: 01/02/2023] Open
Abstract
The generated database GDB17 enumerates 166.4 billion molecules up to 17 atoms of C, N, O, S and halogens following simple rules of chemical stability and synthetic feasibility. However, most molecules in GDB17 are too complex to be considered for chemical synthesis. To address this limitation, we report GDBChEMBL as a subset of GDB17 featuring 10 million molecules selected according to a ChEMBL-likeness score (CLscore) calculated from the frequency of occurrence of circular substructures in ChEMBL, followed by uniform sampling across molecular size, stereocenters and heteroatoms. Compared to the previously reported subsets FDB17 and GDBMedChem selected from GDB17 by fragment-likeness, respectively, medicinal chemistry criteria, our new subset features molecules with higher synthetic accessibility and possibly bioactivity yet retains a broad and continuous coverage of chemical space typical of the entire GDB17. GDBChEMBL is accessible at http://gdb.unibe.ch for download and for browsing using an interactive chemical space map at http://faerun.gdb.tools.
Collapse
Affiliation(s)
- Sven Bühlmann
- Department of Chemistry and Biochemistry, University of Bern, Bern, Switzerland
| | - Jean-Louis Reymond
- Department of Chemistry and Biochemistry, University of Bern, Bern, Switzerland
| |
Collapse
|