1
|
P V, Mohanan M, U K S, E Pa S, U C A J. Graph Attention Network based mapping of knowledge relations between chemical spaces of Nuclear factor kappa B and Centella asiatica. Comput Biol Chem 2023; 107:107955. [PMID: 37734134 DOI: 10.1016/j.compbiolchem.2023.107955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 08/02/2023] [Accepted: 09/07/2023] [Indexed: 09/23/2023]
Abstract
The confounding nature of the innate immunity target Nuclear Factor kappa B (NF-κB) and its interaction with Centella asiatica (CA) molecules necessitate the intervention of advanced technologies, such as deep learning methods. The integration of chemical space concepts with deep learning technologies is a new way of knowledge mapping used to explore drug-target interactions, especially in molecular libraries derived from traditional medicine based molecular sources. The current constraint of virtual screening for mechanistic target hunting is the use of a binary classification model that includes active and inactive molecules from in vitro experiments to explore drug-target interaction. This study aims to explore the regulatory nature of the molecules from the inhibition and activation of the NF-κB bioassay data set and map this information for a knowledge-based analysis against the molecules of CA, a low-growing tropical plant. This finding has led to a new direction in the field, transitioning from the conventional active-inactive framework to a more comprehensive active-inactive-regulatory model. This approach can be thoroughly explored by leveraging a graph-based deep learning system. The study presents an innovative approach using a Graph Attention Network (GAT) to rank CA molecules in chemical space based on their similarity with NF-κB bioassay molecules, enabling the efficient analysis of complex relationships between molecules and their regulatory function. Graph Attention Network (GAT) overcomes the limitations of traditional deep learning models such as Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) in handling non-Euclidean graph data and allows for a more precise understanding of similarity ranking by utilizing molecular graphs and attention behavior. By measuring similarity and arranging a matrix of similarity ranking based on GAT, deep neural ranking-based algorithms confirmed the regulatory behaviour of an innate immunity target NF-κB with the support of underlying inverse mapping in the surjective chemical spaces of NF-κB bioassays and CA molecular spaces. Overall, the study introduces new techniques for exploring the regulatory behaviour of complex targets like NF-κB. We then used t-SNE for clustering in chemical space and scaffold hunting for scaffold property analysis and identified nine CA molecules that exhibit regulatory behavior of NF-κB target and are recommended for further investigation.
Collapse
Affiliation(s)
- Vivek P
- UL Research Center, UL Cyber Park Calicut, India
| | | | | | - Sandesh E Pa
- UL Research Center, UL Cyber Park Calicut, India
| | - Jaleel U C A
- OSPF-NIAS Drug DIscovery Lab, National Institute of Advanced Studies, Indian Institute of Science Campus, Bengaluru, India
| |
Collapse
|
2
|
Azad I, Khan T, Ahmad N, Khan AR, Akhter Y. Updates on drug designing approach through computational strategies: a review. Future Sci OA 2023; 9:FSO862. [PMID: 37180609 PMCID: PMC10167725 DOI: 10.2144/fsoa-2022-0085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 04/12/2023] [Indexed: 05/16/2023] Open
Abstract
The drug discovery and development (DDD) process in pursuit of novel drug candidates is a challenging procedure requiring lots of time and resources. Therefore, computer-aided drug design (CADD) methodologies are used extensively to promote proficiency in drug development in a systematic and time-effective manner. The point in reference is SARS-CoV-2 which has emerged as a global pandemic. In the absence of any confirmed drug moiety to treat the infection, the science fraternity adopted hit and trial methods to come up with a lead drug compound. This article is an overview of the virtual methodologies, which assist in finding novel hits and help in the progression of drug development in a short period with a specific medicinal solution.
Collapse
Affiliation(s)
- Iqbal Azad
- Department of Chemistry, Integral University, Dasauli, P.O. Bas-ha, Kursi Road, Lucknow, 226026, UP, India
| | - Tahmeena Khan
- Department of Chemistry, Integral University, Dasauli, P.O. Bas-ha, Kursi Road, Lucknow, 226026, UP, India
| | - Naseem Ahmad
- Department of Chemistry, Integral University, Dasauli, P.O. Bas-ha, Kursi Road, Lucknow, 226026, UP, India
| | - Abdul Rahman Khan
- Department of Chemistry, Integral University, Dasauli, P.O. Bas-ha, Kursi Road, Lucknow, 226026, UP, India
| | - Yusuf Akhter
- Department of Biotechnology, Babasaheb Bhimrao Ambedkar University, Vidya Vihar, Raebareli Road, Lucknow, UP, 2260025, India
| |
Collapse
|
3
|
Computational Approaches to Enzyme Inhibition by Marine Natural Products in the Search for New Drugs. Mar Drugs 2023; 21:md21020100. [PMID: 36827141 PMCID: PMC9961086 DOI: 10.3390/md21020100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 01/26/2023] [Accepted: 01/28/2023] [Indexed: 02/03/2023] Open
Abstract
The exploration of biologically relevant chemical space for the discovery of small bioactive molecules present in marine organisms has led not only to important advances in certain therapeutic areas, but also to a better understanding of many life processes. The still largely untapped reservoir of countless metabolites that play biological roles in marine invertebrates and microorganisms opens new avenues and poses new challenges for research. Computational technologies provide the means to (i) organize chemical and biological information in easily searchable and hyperlinked databases and knowledgebases; (ii) carry out cheminformatic analyses on natural products; (iii) mine microbial genomes for known and cryptic biosynthetic pathways; (iv) explore global networks that connect active compounds to their targets (often including enzymes); (v) solve structures of ligands, targets, and their respective complexes using X-ray crystallography and NMR techniques, thus enabling virtual screening and structure-based drug design; and (vi) build molecular models to simulate ligand binding and understand mechanisms of action in atomic detail. Marine natural products are viewed today not only as potential drugs, but also as an invaluable source of chemical inspiration for the development of novel chemotypes to be used in chemical biology and medicinal chemistry research.
Collapse
|
4
|
Wang Q, Huang K, Chandak P, Zitnik M, Gehlenborg N. Extending the Nested Model for User-Centric XAI: A Design Study on GNN-based Drug Repurposing. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2023; 29:1266-1276. [PMID: 36223348 DOI: 10.1109/tvcg.2022.3209435] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Whether AI explanations can help users achieve specific tasks efficiently (i.e., usable explanations) is significantly influenced by their visual presentation. While many techniques exist to generate explanations, it remains unclear how to select and visually present AI explanations based on the characteristics of domain users. This paper aims to understand this question through a multidisciplinary design study for a specific problem: explaining graph neural network (GNN) predictions to domain experts in drug repurposing, i.e., reuse of existing drugs for new diseases. Building on the nested design model of visualization, we incorporate XAI design considerations from a literature review and from our collaborators' feedback into the design process. Specifically, we discuss XAI-related design considerations for usable visual explanations at each design layer: target user, usage context, domain explanation, and XAI goal at the domain layer; format, granularity, and operation of explanations at the abstraction layer; encodings and interactions at the visualization layer; and XAI and rendering algorithm at the algorithm layer. We present how the extended nested model motivates and informs the design of DrugExplorer, an XAI tool for drug repurposing. Based on our domain characterization, DrugExplorer provides path-based explanations and presents them both as individual paths and meta-paths for two key XAI operations, why and what else. DrugExplorer offers a novel visualization design called MetaMatrix with a set of interactions to help domain users organize and compare explanation paths at different levels of granularity to generate domain-meaningful insights. We demonstrate the effectiveness of the selected visual presentation and DrugExplorer as a whole via a usage scenario, a user study, and expert interviews. From these evaluations, we derive insightful observations and reflections that can inform the design of XAI visualizations for other scientific applications.
Collapse
|
5
|
Schaub J, Zander J, Zielesny A, Steinbeck C. Scaffold Generator: a Java library implementing molecular scaffold functionalities in the Chemistry Development Kit (CDK). J Cheminform 2022; 14:79. [PMID: 36357931 PMCID: PMC9650898 DOI: 10.1186/s13321-022-00656-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Accepted: 10/30/2022] [Indexed: 11/12/2022] Open
Abstract
The concept of molecular scaffolds as defining core structures of organic molecules is utilised in many areas of chemistry and cheminformatics, e.g. drug design, chemical classification, or the analysis of high-throughput screening data. Here, we present Scaffold Generator, a comprehensive open library for the generation, handling, and display of molecular scaffolds, scaffold trees and networks. The new library is based on the Chemistry Development Kit (CDK) and highly customisable through multiple settings, e.g. five different structural framework definitions are available. For display of scaffold hierarchies, the open GraphStream Java library is utilised. Performance snapshots with natural products (NP) from the COCONUT (COlleCtion of Open Natural prodUcTs) database and drug molecules from DrugBank are reported. The generation of a scaffold network from more than 450,000 NP can be achieved within a single day.
Collapse
Affiliation(s)
- Jonas Schaub
- grid.9613.d0000 0001 1939 2794Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University Jena, Lessing Strasse 8, 07743 Jena, Germany
| | - Julian Zander
- grid.454254.60000 0004 0647 4362Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, August-Schmidt-Ring 10, 45665 Recklinghausen, Germany
| | - Achim Zielesny
- grid.454254.60000 0004 0647 4362Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, August-Schmidt-Ring 10, 45665 Recklinghausen, Germany
| | - Christoph Steinbeck
- grid.9613.d0000 0001 1939 2794Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University Jena, Lessing Strasse 8, 07743 Jena, Germany
| |
Collapse
|
6
|
Biofilm- i: A Platform for Predicting Biofilm Inhibitors Using Quantitative Structure-Relationship (QSAR) Based Regression Models to Curb Antibiotic Resistance. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27154861. [PMID: 35956807 PMCID: PMC9369795 DOI: 10.3390/molecules27154861] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 07/16/2022] [Accepted: 07/17/2022] [Indexed: 11/19/2022]
Abstract
Antibiotic drug resistance has emerged as a major public health threat globally. One of the leading causes of drug resistance is the colonization of microorganisms in biofilm mode. Hence, there is an urgent need to design novel and highly effective biofilm inhibitors that can work either synergistically with antibiotics or individually. Therefore, we have developed a recursive regression-based platform “Biofilm-i” employing a quantitative structure–activity relationship approach for making generalized predictions, along with group and species-specific predictions of biofilm inhibition efficiency of chemical(s). The platform encompasses eight predictors, three analysis tools, and data visualization modules. The experimentally validated biofilm inhibitors for model development were retrieved from the “aBiofilm” resource and processed using a 10-fold cross-validation approach using the support vector machine and andom forest machine learning techniques. The data was further sub-divided into training/testing and independent validation sets. From training/testing data sets the Pearson’s correlation coefficient of overall chemicals, Gram-positive bacteria, Gram-negative bacteria, fungus, Pseudomonas aeruginosa, Staphylococcus aureus, Candida albicans, and Escherichia coli was 0.60, 0.77, 0.62, 0.77, 0.73, 0.83, 0.70, and 0.71 respectively via Support Vector Machine. Further, all the QSAR models performed equally well on independent validation data sets. Additionally, we also checked the performance of the random forest machine learning technique for the above datasets. The integrated analysis tools can convert the chemical structure into different formats, search for a similar chemical in the aBiofilm database and design the analogs. Moreover, the data visualization modules check the distribution of experimentally validated biofilm inhibitors according to their common scaffolds. The Biofilm-i platform would be of immense help to researchers engaged in designing highly efficacious biofilm inhibitors for tackling the menace of antibiotic drug resistance.
Collapse
|
7
|
Moshawih S, Goh HP, Kifli N, Idris AC, Yassin H, Kotra V, Goh KW, Liew KB, Ming LC. Synergy between machine learning and natural products cheminformatics: Application to the lead discovery of anthraquinone derivatives. Chem Biol Drug Des 2022; 100:185-217. [PMID: 35490393 DOI: 10.1111/cbdd.14062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Revised: 04/15/2022] [Accepted: 04/23/2022] [Indexed: 11/28/2022]
Abstract
Cheminformatics utilizing machine learning (ML) techniques have opened up a new horizon in drug discovery. This is owing to vast chemical space expansion with rocketing numbers of expected hits and lead compounds that match druggable macromolecular targets, in particular from natural compounds. Due to the natural products' (NP) structural complexity, uniqueness, and diversity, they could occupy a bigger space in pharmaceuticals, allowing the industry to pursue more selective leads in the nanomolar range of binding affinity. ML is an essential part of each step of the drug design pipeline, such as target prediction, compound library preparation, and lead optimization. Notably, molecular mechanic and dynamic simulations, induced docking, and free energy perturbations are essential in predicting best binding poses, binding free energy values, and molecular mechanics force fields. Those applications have leveraged from artificial intelligence (AI), which decreases the computational costs required for such costly simulations. This review aimed to describe chemical space and compound libraries related to NPs. High-throughput screening utilized for fractionating NPs and high-throughput virtual screening and their strategies, and significance, are reviewed. Particular emphasis was given to AI approaches, ML tools, algorithms, and techniques, especially in drug discovery of macrocyclic compounds and approaches in computer-aided and ML-based drug discovery. Anthraquinone derivatives were discussed as a source of new lead compounds that can be developed using ML tools for diverse medicinal uses such as cancer, infectious diseases, and metabolic disorders. Furthermore, the power of principal component analysis in understanding relevant protein conformations, and molecular modeling of protein-ligand interaction were also presented. Apart from being a concise reference for cheminformatics, this review is a useful text to understand the application of ML-based algorithms to molecular dynamics simulation and in silico absorption, distribution, metabolism, excretion, and toxicity prediction.
Collapse
Affiliation(s)
- Said Moshawih
- PAP Rashidah Sa'adatul Bolkiah Institute of Health Sciences, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Hui Poh Goh
- PAP Rashidah Sa'adatul Bolkiah Institute of Health Sciences, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Nurolaini Kifli
- PAP Rashidah Sa'adatul Bolkiah Institute of Health Sciences, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Azam Che Idris
- Faculty of Integrated Technologies, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Hayati Yassin
- Faculty of Integrated Technologies, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| | - Vijay Kotra
- Faculty of Pharmacy, Quest International University, Perak, Malaysia
| | - Khang Wen Goh
- Faculty of Data Science and Information Technology, INTI International University, Nilai, Malaysia
| | - Kai Bin Liew
- Faculty of Pharmacy, University of Cyberjaya, Cyberjaya, Malaysia
| | - Long Chiau Ming
- PAP Rashidah Sa'adatul Bolkiah Institute of Health Sciences, Universiti Brunei Darussalam, Gadong, Brunei Darussalam
| |
Collapse
|
8
|
A new approach to the design of acyclic chemical compounds using skeleton trees and integer linear programming. APPL INTELL 2022. [DOI: 10.1007/s10489-021-03088-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
AbstractIntelligent systems are applied in a wide range of areas, and computer-aided drug design is a highly important one. One major approach to drug design is the inverse QSAR/QSPR (quantitative structure-activity and structure-property relationship), for which a method that uses both artificial neural networks (ANN) and mixed integer linear programming (MILP) has been proposed recently. This method consists of two phases: a forward prediction phase, and an inverse, inference phase. In the prediction phase, a feature function f over chemical compounds is defined, whereby a chemical compound G is represented as a vector f(G) of descriptors. Following, for a given chemical property $$\pi$$, using a dataset of chemical compounds with known values for property $$\pi$$, a regressive prediction function $$\psi$$ is computed by an ANN. It is desired that $$\psi (f(G))$$ takes a value that is close to the true value of property $$\pi$$ for the compound G for many of the compounds in the dataset. In the inference phase, one starts with a target value $$y^*$$ of the chemical property $$\pi$$, and then a chemical structure $$G^*$$ such that $$\psi (f(G^*))$$ is within a certain tolerance level of $$y^*$$ is constructed from the solution to a specially formulated MILP. This method has been used for the case of inferring acyclic chemical compounds. With this paper, we propose a new concept on acyclic chemical graphs, called a skeleton tree, and based on it develop a new MILP formulation for inferring acyclic chemical compounds. Our computational experiments indicate that our newly proposed method significantly outperforms the existing method when the diameter of graphs is up to 8. In a particular example where we inferred acyclic chemical compounds with 38 non-hydrogen atoms from the set {C, O, S} times faster.
Collapse
|
9
|
Saldívar-González FI, Aldas-Bulos VD, Medina-Franco JL, Plisson F. Natural product drug discovery in the artificial intelligence era. Chem Sci 2022; 13:1526-1546. [PMID: 35282622 PMCID: PMC8827052 DOI: 10.1039/d1sc04471k] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Accepted: 12/10/2021] [Indexed: 12/19/2022] Open
Abstract
Natural products (NPs) are primarily recognized as privileged structures to interact with protein drug targets. Their unique characteristics and structural diversity continue to marvel scientists for developing NP-inspired medicines, even though the pharmaceutical industry has largely given up. High-performance computer hardware, extensive storage, accessible software and affordable online education have democratized the use of artificial intelligence (AI) in many sectors and research areas. The last decades have introduced natural language processing and machine learning algorithms, two subfields of AI, to tackle NP drug discovery challenges and open up opportunities. In this article, we review and discuss the rational applications of AI approaches developed to assist in discovering bioactive NPs and capturing the molecular "patterns" of these privileged structures for combinatorial design or target selectivity.
Collapse
Affiliation(s)
- F I Saldívar-González
- DIFACQUIM Research Group, School of Chemistry, Department of Pharmacy, Universidad Nacional Autónoma de México Avenida Universidad 3000 04510 Mexico Mexico
| | - V D Aldas-Bulos
- Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Centro de Investigación y de Estudios Avanzados del IPN Irapuato Guanajuato Mexico
| | - J L Medina-Franco
- DIFACQUIM Research Group, School of Chemistry, Department of Pharmacy, Universidad Nacional Autónoma de México Avenida Universidad 3000 04510 Mexico Mexico
| | - F Plisson
- CONACYT - Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Centro de Investigación y de Estudios Avanzados del IPN Irapuato Guanajuato Mexico
| |
Collapse
|
10
|
Simm J, Humbeck L, Zalewski A, Sturm N, Heyndrickx W, Moreau Y, Beck B, Schuffenhauer A. Splitting chemical structure data sets for federated privacy-preserving machine learning. J Cheminform 2021; 13:96. [PMID: 34876230 PMCID: PMC8650276 DOI: 10.1186/s13321-021-00576-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 11/22/2021] [Indexed: 11/10/2022] Open
Abstract
With the increase in applications of machine learning methods in drug design and related fields, the challenge of designing sound test sets becomes more and more prominent. The goal of this challenge is to have a realistic split of chemical structures (compounds) between training, validation and test set such that the performance on the test set is meaningful to infer the performance in a prospective application. This challenge is by its own very interesting and relevant, but is even more complex in a federated machine learning approach where multiple partners jointly train a model under privacy-preserving conditions where chemical structures must not be shared between the different participating parties. In this work we discuss three methods which provide a splitting of a data set and are applicable in a federated privacy-preserving setting, namely: a. locality-sensitive hashing (LSH), b. sphere exclusion clustering, c. scaffold-based binning (scaffold network). For evaluation of these splitting methods we consider the following quality criteria (compared to random splitting): bias in prediction performance, classification label and data imbalance, similarity distance between the test and training set compounds. The main findings of the paper are a. both sphere exclusion clustering and scaffold-based binning result in high quality splitting of the data sets, b. in terms of compute costs sphere exclusion clustering is very expensive in the case of federated privacy-preserving setting.
Collapse
Affiliation(s)
- Jaak Simm
- KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, 3001, Heverlee, Belgium
| | - Lina Humbeck
- Medicinal Chemistry Department, Boehringer Ingelheim Pharma GmbH & Co. KG, Birkendorfer Str. 65, 88397, Biberach an der Riss, Germany
| | - Adam Zalewski
- Amgen Research (Munich) GmbH, Staffelseestraße 2, 81477, Munich, Germany
| | - Noe Sturm
- Novartis Institutes for BioMedical Research, Novartis Campus, CH-4002, Basel, Switzerland
| | - Wouter Heyndrickx
- Janssen Pharmaceutica N.V., Janssen Pharmaceutica, Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Yves Moreau
- KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, 3001, Heverlee, Belgium
| | - Bernd Beck
- Medicinal Chemistry Department, Boehringer Ingelheim Pharma GmbH & Co. KG, Birkendorfer Str. 65, 88397, Biberach an der Riss, Germany
| | - Ansgar Schuffenhauer
- Novartis Institutes for BioMedical Research, Novartis Campus, CH-4002, Basel, Switzerland.
| |
Collapse
|
11
|
Dafniet B, Cerisier N, Boezio B, Clary A, Ducrot P, Dorval T, Gohier A, Brown D, Audouze K, Taboureau O. Development of a chemogenomics library for phenotypic screening. J Cheminform 2021; 13:91. [PMID: 34819133 PMCID: PMC8611952 DOI: 10.1186/s13321-021-00569-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Accepted: 11/06/2021] [Indexed: 12/03/2022] Open
Abstract
With the development of advanced technologies in cell-based phenotypic screening, phenotypic drug discovery (PDD) strategies have re-emerged as promising approaches in the identification and development of novel and safe drugs. However, phenotypic screening does not rely on knowledge of specific drug targets and needs to be combined with chemical biology approaches to identify therapeutic targets and mechanisms of actions induced by drugs and associated with an observable phenotype. In this study, we developed a system pharmacology network integrating drug-target-pathway-disease relationships as well as morphological profile from an existing high content imaging-based high-throughput phenotypic profiling assay known as “Cell Painting”. Furthermore, from this network, a chemogenomic library of 5000 small molecules that represent a large and diverse panel of drug targets involved in diverse biological effects and diseases has been developed. Such a platform and a chemogenomic library could assist in the target identification and mechanism deconvolution of some phenotypic assays. The usefulness of the platform is illustrated through examples.
Collapse
Affiliation(s)
- Bryan Dafniet
- Université de Paris, INSERM U1133, CNRS UMR8251, 75006, Paris, France
| | - Natacha Cerisier
- Université de Paris, INSERM U1133, CNRS UMR8251, 75006, Paris, France
| | - Batiste Boezio
- Université de Paris, INSERM U1133, CNRS UMR8251, 75006, Paris, France
| | - Anaelle Clary
- Institut de Recherche Servier, 125 Chemin de Ronde, 78290, Croissy-sur-Seine, France
| | - Pierre Ducrot
- Institut de Recherche Servier, 125 Chemin de Ronde, 78290, Croissy-sur-Seine, France
| | - Thierry Dorval
- Institut de Recherche Servier, 125 Chemin de Ronde, 78290, Croissy-sur-Seine, France
| | - Arnaud Gohier
- Institut de Recherche Servier, 125 Chemin de Ronde, 78290, Croissy-sur-Seine, France
| | - David Brown
- Institut de Recherche Servier, 125 Chemin de Ronde, 78290, Croissy-sur-Seine, France
| | - Karine Audouze
- Université de Paris, INSERM UMR S-1124, 75006, Paris, France
| | - Olivier Taboureau
- Université de Paris, INSERM U1133, CNRS UMR8251, 75006, Paris, France.
| |
Collapse
|
12
|
Wang Z, Ran T, Xu F, Wen C, Song S, Zhou Y, Chen H, Lu X. Deep learning-driven scaffold hopping in the discovery of Akt kinase inhibitors. Chem Commun (Camb) 2021; 57:10588-10591. [PMID: 34560776 DOI: 10.1039/d1cc03392a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Scaffold hopping has been widely used in drug discovery and is a topic of high interest. Here a deep conditional transformer neural network, SyntaLinker, was applied for the scaffold hopping of a phase III clinical Akt inhibitor, AZD5363. A number of novel scaffolds were generated and compound 1a as a proof-of-concept was synthesized and validated by biochemical assay. Further structure-based optimization of 1a led to a novel Akt inhibitor with high potency (Akt1 IC50 = 88 nM) and in vitro antitumor activities.
Collapse
Affiliation(s)
- Zuqin Wang
- College of Pharmacy, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China.
| | - Ting Ran
- Bioland Laboratory (Guangzhou Regenerative Medicine and Health - Guangdong Laboratory), Guangzhou 510530, China.
| | - Fang Xu
- College of Pharmacy, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China.
| | - Chang Wen
- Bioland Laboratory (Guangzhou Regenerative Medicine and Health - Guangdong Laboratory), Guangzhou 510530, China.
| | - Shukai Song
- College of Pharmacy, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China.
| | - Yang Zhou
- College of Pharmacy, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China.
| | - Hongming Chen
- Bioland Laboratory (Guangzhou Regenerative Medicine and Health - Guangdong Laboratory), Guangzhou 510530, China.
| | - Xiaoyun Lu
- College of Pharmacy, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China.
| |
Collapse
|
13
|
Kumar S, Nair AS, Bhashkar V, Sudevan ST, Koyiparambath VP, Khames A, Abdelgawad MA, Mathew B. Navigating into the Chemical Space of Monoamine Oxidase Inhibitors by Artificial Intelligence and Cheminformatics Approach. ACS OMEGA 2021; 6:23399-23411. [PMID: 34549139 PMCID: PMC8444296 DOI: 10.1021/acsomega.1c03250] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 08/06/2021] [Indexed: 05/20/2023]
Abstract
The monoamine oxidase (MAO) enzyme class is a prevalent target for many neurodegenerative and depressive disorders. Even though scrutinization of many promising drugs for the treatment of MAO inhibition has been carried out in recent times, a conclusive structural requirement for potent activity needs to be developed. Numerous approaches have been examined for the identification of structural features for potent MAO inhibitors (MAOIs) that mainly involve an array of computational studies, synthetic approaches, and biological evaluation. In this paper, we have analyzed ∼2200 well-known MAOIs to expand perceptions in the chemical space of MAOIs. The physicochemical properties of the MAOIs disclosed a discernible hydrophobic feature making a bunch discrete from the central nervous system (CNS) acting drugs, as exposed using the principal component analysis (PCA). The Murcko scaffold structure study revealed unfavorable and favorable scaffold structures, in both data sets, with the highest biological activity shown by the 3-phenyl-2H-chromen-2-one scaffold. This scaffold showed a polypharmacological effect. R-group disintegration and automatic structure-activity relationship (SAR) study resulted in identification of substructures responsible for the inhibitory bioactivity of the MAO-A and MAO-B enzymes. Moreover, with activity cliff analysis, significant biological activity was detected by simple molecular conversion in the chemical compound structure. In addition, we used the machine learning tool to generate a hypothesis wherein pyrazole, benzene ring, and amide containing structural functionalities can exhibit potential biological activities. This hypothesis revealed that CNS target drugs, C4155, C13390, C21265, C43862, C31524, C24810, C37100, C42075, and C43644, could be repurposed as valuable candidates for the MAO-B enzyme. For researchers, this study will bring new perceptions in the discovery and development of MAOIs and direct lead and hit optimization for the progress of small molecules beneficial for MAO-targeting associated diseases.
Collapse
Affiliation(s)
- Sunil Kumar
- Department
of Pharmaceutical Chemistry and Analysis, Amrita School of Pharmacy, Amrita Vishwa Vidyapeetham, AIMS Health Sciences Campus, Kochi 682041, India
| | - Aathira Sujathan Nair
- Department
of Pharmaceutical Chemistry and Analysis, Amrita School of Pharmacy, Amrita Vishwa Vidyapeetham, AIMS Health Sciences Campus, Kochi 682041, India
| | - Vaishnav Bhashkar
- Department
of Pharmaceutical Chemistry and Analysis, Amrita School of Pharmacy, Amrita Vishwa Vidyapeetham, AIMS Health Sciences Campus, Kochi 682041, India
| | - Sachithra Thazhathuveedu Sudevan
- Department
of Pharmaceutical Chemistry and Analysis, Amrita School of Pharmacy, Amrita Vishwa Vidyapeetham, AIMS Health Sciences Campus, Kochi 682041, India
| | - Vishal Payyalot Koyiparambath
- Department
of Pharmaceutical Chemistry and Analysis, Amrita School of Pharmacy, Amrita Vishwa Vidyapeetham, AIMS Health Sciences Campus, Kochi 682041, India
| | - Ahmed Khames
- Department
of Pharmaceutics and Industrial Pharmacy, College of Pharmacy, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia
| | - Mohamed A. Abdelgawad
- Department
of Pharmaceutical Chemistry, College of Pharmacy, Jouf University, Sakaka, Al Jouf 72341, Saudi Arabia
| | - Bijo Mathew
- Department
of Pharmaceutical Chemistry and Analysis, Amrita School of Pharmacy, Amrita Vishwa Vidyapeetham, AIMS Health Sciences Campus, Kochi 682041, India
- ,
| |
Collapse
|
14
|
Rajput A, Kumar M. Anti-Ebola: an initiative to predict Ebola virus inhibitors through machine learning. Mol Divers 2021; 26:1635-1644. [PMID: 34357513 PMCID: PMC8343361 DOI: 10.1007/s11030-021-10291-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Accepted: 07/28/2021] [Indexed: 01/17/2023]
Abstract
Ebola virus is a deadly pathogen responsible for a frequent series of outbreaks since 1976. Despite various efforts from researchers worldwide, its mortality and fatality are quite high. For antiviral drug discovery, the computational efforts are considered highly useful. Therefore, we have developed an 'anti-Ebola' web server, through quantitative structure-activity relationship information of available molecules with experimental anti-Ebola activities. Three hundred and five unique anti-Ebola compounds with their respective IC50 values were extracted from the 'DrugRepV' database. Later, the compounds were used to extract the molecular descriptors, which were subjected to regression-based model development. The robust machine learning techniques, namely support vector machine, random forest and artificial neural network, were employed using tenfold cross-validation. After a randomization approach, the best predictive model showed Pearson's correlation coefficient ranges from 0.83 to 0.98 on training/testing (T274) dataset. The robustness of the developed models was cross-evaluated using William's plot. The highly robust computational models are integrated into the web server. The 'anti-Ebola' web server is freely available at https://bioinfo.imtech.res.in/manojk/antiebola . We anticipate this will serve the scientific community for developing effective inhibitors against the Ebola virus.
Collapse
Affiliation(s)
- Akanksha Rajput
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh, 160036, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh, 160036, India. .,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, India.
| |
Collapse
|
15
|
Humbeck L, Pretzel J, Spitzer S, Koch O. Discovery of an Unexpected Similarity in Ligand Binding between BRD4 and PPARγ. ACS Chem Biol 2021; 16:1255-1265. [PMID: 34180651 DOI: 10.1021/acschembio.1c00323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Knowledge about interrelationships between different proteins is crucial in fundamental research for the elucidation of protein networks and pathways. Furthermore, it is especially critical in chemical biology to identify further key regulators of a disease and to take advantage of polypharmacology effects. Here, we present a new concept that combines a scaffold-based analysis of bioactivity data with a subsequent screening to identify novel inhibitors for a protein target of interest. The initial scaffold-based analysis revealed a flavone-like scaffold that can be found in ligands of different unrelated proteins indicating a similarity in ligand binding. This similarity was further investigated by testing compounds on bromodomain-containing protein 4 (BRD4) that were similar to known ligands of the other identified protein targets. Several new BRD4 inhibitors were identified and proven to be validated hits based on orthogonal assays and X-ray crystallography. The most important discovery was an unexpected relationship between BRD4 and peroxisome-proliferator activated receptor gamma (PPARγ). Both proteins share binding site similarities near a common hydrophobic subpocket which should allow the design of a polypharmacology-based ligand targeting both proteins. Such dual-BRD4-PPARγ modulators open up new therapeutic opportunities, because both are important drug targets for cancer therapy and many more important diseases. Thereon, a complex structure of sulfasalazine was obtained that involves two bromodomains and could be a potential starting point for the design of a bivalent BRD4 inhibitor.
Collapse
Affiliation(s)
- Lina Humbeck
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Straße 6, 44227 Dortmund, Germany
| | - Jette Pretzel
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Straße 6, 44227 Dortmund, Germany
| | - Saskia Spitzer
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Straße 6, 44227 Dortmund, Germany
| | - Oliver Koch
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Straße 6, 44227 Dortmund, Germany
| |
Collapse
|
16
|
Schaduangrat N, Malik AA, Nantasenamat C. ERpred: a web server for the prediction of subtype-specific estrogen receptor antagonists. PeerJ 2021; 9:e11716. [PMID: 34285834 PMCID: PMC8274494 DOI: 10.7717/peerj.11716] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 06/11/2021] [Indexed: 11/22/2022] Open
Abstract
Estrogen receptors alpha and beta (ERα and ERβ) are responsible for breast cancer metastasis through their involvement of clinical outcomes. Estradiol and hormone replacement therapy targets both ERs, but this often leads to an increased risk of breast and endometrial cancers as well as thromboembolism. A major challenge is posed for the development of compounds possessing ER subtype specificity. Herein, we present a large-scale classification structure-activity relationship (CSAR) study of inhibitors from the ChEMBL database which consisted of an initial set of 11,618 compounds for ERα and 7,810 compounds for ERβ. The IC50 was selected as the bioactivity unit for further investigation and after the data curation process, this led to a final data set of 1,593 and 1,281 compounds for ERα and ERβ, respectively. We employed the random forest (RF) algorithm for model building and of the 12 fingerprint types, models built using the PubChem fingerprint was the most robust (Ac of 94.65% and 92.25% and Matthews correlation coefficient (MCC) of 89% and 76% for ERα and ERβ, respectively) and therefore selected for feature interpretation. Results indicated the importance of features pertaining to aromatic rings, nitrogen-containing functional groups and aliphatic hydrocarbons. Finally, the model was deployed as the publicly available web server called ERpred at http://codes.bio/erpred where users can submit SMILES notation as the input query for prediction of the bioactivity against ERα and ERβ.
Collapse
Affiliation(s)
- Nalini Schaduangrat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - Aijaz Ahmad Malik
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| |
Collapse
|
17
|
Egieyeh S, Malan SF, Christoffels A. Cheminformatics techniques in antimalarial drug discovery and development from natural products 2: Molecular scaffold and machine learning approaches. PHYSICAL SCIENCES REVIEWS 2021. [DOI: 10.1515/psr-2019-0029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
A large number of natural products, especially those used in ethnomedicine of malaria, have shown varying in-vitro antiplasmodial activities. Cheminformatics involves the organization, integration, curation, standardization, simulation, mining and transformation of pharmacology data (compounds and bioactivity) into knowledge that can drive rational and viable drug development decisions. This chapter will review the application of two cheminformatics techniques (including molecular scaffold analysis and bioactivity predictive modeling via Machine learning) to natural products with in-vitro and in-vivo antiplasmodial activities in order to facilitate their development into antimalarial drug candidates and design of new potential antimalarial compounds.
Collapse
Affiliation(s)
- Samuel Egieyeh
- School of Pharmacy , University of the Western Cape Faculty of Natural Science , Belville , South Africa
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute , University of the Western Cape Faculty of Natural Science , Belville , South Africa
| | - Sarel F. Malan
- School of Pharmacy , University of the Western Cape Faculty of Natural Science , Belville , South Africa
| | - Alan Christoffels
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute , University of the Western Cape Faculty of Natural Science , Belville , South Africa
| |
Collapse
|
18
|
Molecule Set Comparator (MSC): a CDK-based open rich-client tool for molecule set similarity evaluations. J Cheminform 2021; 13:5. [PMID: 33526050 PMCID: PMC7852119 DOI: 10.1186/s13321-021-00485-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 12/21/2020] [Accepted: 01/16/2021] [Indexed: 11/10/2022] Open
Abstract
The open rich-client Molecule Set Comparator (MSC) application enables a versatile and fast comparison of large molecule sets with a unique inter-set molecule-to-molecule mapping obtained e.g. by molecular-recognition-oriented machine learning approaches. The molecule-to-molecule comparison is based on chemical descriptors obtained with the Chemistry Development Kit (CDK), such as Tanimoto similarities, atom/bond/ring counts or physicochemical properties like logP. The results are summarized and presented graphically by interactive histogram charts that can be examined in detail and exported in publication quality.
Collapse
|
19
|
Lai J, Li X, Wang Y, Yin S, Zhou J, Liu Z. AIScaffold: A Web-Based Tool for Scaffold Diversification Using Deep Learning. J Chem Inf Model 2020; 61:1-6. [PMID: 33356237 DOI: 10.1021/acs.jcim.0c00867] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Molecular scaffolds are widely used in drug design. Many methods and tools have been developed to utilize the information in scaffolds. Scaffold diversification is frequently used by medicinal chemists in tasks such as lead compound optimization, but tools for scaffold diversification are still lacking. Here, we propose AIScaffold (https://iaidrug.stonewise.cn), a web-based tool for scaffold diversification using the deep generative model. This tool can perform large-scale (up to 500,000 molecules) diversification in several minutes and recommend the top 500 (top 0.1%) molecules. Features such as site-specific diversification are also supported. This tool can facilitate the scaffold diversification process for medicinal chemists, thereby accelerating drug design.
Collapse
Affiliation(s)
- Junyong Lai
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, 100191 Beijing, P. R. China
| | - Xiangbin Li
- Stonewise, No. 19 Zhongguancun Street, Haidian District, 100080 Beijing, P. R. China
| | - Yanxing Wang
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, 100191 Beijing, P. R. China
| | - Shiqiu Yin
- Stonewise, No. 19 Zhongguancun Street, Haidian District, 100080 Beijing, P. R. China
| | - Jielong Zhou
- Stonewise, No. 19 Zhongguancun Street, Haidian District, 100080 Beijing, P. R. China
| | - Zhenming Liu
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, 100191 Beijing, P. R. China
| |
Collapse
|
20
|
Chen Y, Kirchmair J. Cheminformatics in Natural Product-based Drug Discovery. Mol Inform 2020; 39:e2000171. [PMID: 32725781 PMCID: PMC7757247 DOI: 10.1002/minf.202000171] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 07/28/2020] [Indexed: 12/20/2022]
Abstract
This review seeks to provide a timely survey of the scope and limitations of cheminformatics methods in natural product-based drug discovery. Following an overview of data resources of chemical, biological and structural information on natural products, we discuss, among other aspects, in silico methods for (i) data curation and natural products dereplication, (ii) analysis, visualization, navigation and comparison of the chemical space, (iii) quantification of natural product-likeness, (iv) prediction of the bioactivities (virtual screening, target prediction), ADME and safety profiles (toxicity) of natural products, (v) natural products-inspired de novo design and (vi) prediction of natural products prone to cause interference with biological assays. Among the many methods discussed are rule-based, similarity-based, shape-based, pharmacophore-based and network-based approaches, docking and machine learning methods.
Collapse
Affiliation(s)
- Ya Chen
- Center for Bioinformatics (ZBH)Department of Computer ScienceFaculty of MathematicsInformatics and Natural SciencesUniversität Hamburg20146HamburgGermany
| | - Johannes Kirchmair
- Center for Bioinformatics (ZBH)Department of Computer ScienceFaculty of MathematicsInformatics and Natural SciencesUniversität Hamburg20146HamburgGermany
- Department of Pharmaceutical ChemistryFaculty of Life SciencesUniversity of Vienna1090ViennaAustria
| |
Collapse
|
21
|
Kim JH, Cheng LW, Chan KL, Tam CC, Mahoney N, Friedman M, Shilman MM, Land KM. Antifungal Drug Repurposing. Antibiotics (Basel) 2020; 9:antibiotics9110812. [PMID: 33203147 PMCID: PMC7697925 DOI: 10.3390/antibiotics9110812] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 10/30/2020] [Accepted: 11/13/2020] [Indexed: 12/19/2022] Open
Abstract
Control of fungal pathogens is increasingly problematic due to the limited number of effective drugs available for antifungal therapy. Conventional antifungal drugs could also trigger human cytotoxicity associated with the kidneys and liver, including the generation of reactive oxygen species. Moreover, increased incidences of fungal resistance to the classes of azoles, such as fluconazole, itraconazole, voriconazole, or posaconazole, or echinocandins, including caspofungin, anidulafungin, or micafungin, have been documented. Of note, certain azole fungicides such as propiconazole or tebuconazole that are applied to agricultural fields have the same mechanism of antifungal action as clinical azole drugs. Such long-term application of azole fungicides to crop fields provides environmental selection pressure for the emergence of pan-azole-resistant fungal strains such as Aspergillus fumigatus having TR34/L98H mutations, specifically, a 34 bp insertion into the cytochrome P450 51A (CYP51A) gene promoter region and a leucine-to-histidine substitution at codon 98 of CYP51A. Altogether, the emerging resistance of pathogens to currently available antifungal drugs and insufficiency in the discovery of new therapeutics engender the urgent need for the development of new antifungals and/or alternative therapies for effective control of fungal pathogens. We discuss the current needs for the discovery of new clinical antifungal drugs and the recent drug repurposing endeavors as alternative methods for fungal pathogen control.
Collapse
Affiliation(s)
- Jong H. Kim
- Foodborne Toxin Detection and Prevention Research Unit, Western Regional Research Center, Agricultural Research Service, United States Department of Agriculture, Albany, CA 94710, USA; (L.W.C.); (K.L.C.); (C.C.T.); (N.M.)
- Correspondence: ; Tel.: +1-510-559-5841
| | - Luisa W. Cheng
- Foodborne Toxin Detection and Prevention Research Unit, Western Regional Research Center, Agricultural Research Service, United States Department of Agriculture, Albany, CA 94710, USA; (L.W.C.); (K.L.C.); (C.C.T.); (N.M.)
| | - Kathleen L. Chan
- Foodborne Toxin Detection and Prevention Research Unit, Western Regional Research Center, Agricultural Research Service, United States Department of Agriculture, Albany, CA 94710, USA; (L.W.C.); (K.L.C.); (C.C.T.); (N.M.)
| | - Christina C. Tam
- Foodborne Toxin Detection and Prevention Research Unit, Western Regional Research Center, Agricultural Research Service, United States Department of Agriculture, Albany, CA 94710, USA; (L.W.C.); (K.L.C.); (C.C.T.); (N.M.)
| | - Noreen Mahoney
- Foodborne Toxin Detection and Prevention Research Unit, Western Regional Research Center, Agricultural Research Service, United States Department of Agriculture, Albany, CA 94710, USA; (L.W.C.); (K.L.C.); (C.C.T.); (N.M.)
| | - Mendel Friedman
- Healthy Processed Foods Research Unit, Western Regional Research Center, Agricultural Research Service, United States Department of Agriculture, Albany, CA 94710, USA;
| | | | - Kirkwood M. Land
- Department of Biological Sciences, University of the Pacific, Stockton, CA 95211, USA;
| |
Collapse
|
22
|
Schaub J, Zielesny A, Steinbeck C, Sorokina M. Too sweet: cheminformatics for deglycosylation in natural products. J Cheminform 2020; 12:67. [PMID: 33292501 PMCID: PMC7641802 DOI: 10.1186/s13321-020-00467-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 10/08/2020] [Indexed: 12/13/2022] Open
Abstract
Sugar units in natural products are pharmacokinetically important but often redundant and therefore obstructing the study of the structure and function of the aglycon. Therefore, it is recommended to remove the sugars before a theoretical or experimental study of a molecule. Deglycogenases, enzymes that specialized in sugar removal from small molecules, are often used in laboratories to perform this task. However, there is no standardized computational procedure to perform this task in silico. In this work, we present a systematic approach for in silico removal of ring and linear sugars from molecular structures. Particular attention is given to molecules of biological origin and to their structural specificities. This approach is made available in two forms, through a free and open web application and as standalone open-source software.
Collapse
Affiliation(s)
- Jonas Schaub
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller University, Lessing Strasse 8, 07743, Jena, Germany
| | - Achim Zielesny
- Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, August-Schmidt-Ring 10, 45665, Recklinghausen, Germany
| | - Christoph Steinbeck
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller University, Lessing Strasse 8, 07743, Jena, Germany.
| | - Maria Sorokina
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller University, Lessing Strasse 8, 07743, Jena, Germany.
| |
Collapse
|
23
|
Hsiao Y, Su BH, Tseng YJ. Current development of integrated web servers for preclinical safety and pharmacokinetics assessments in drug development. Brief Bioinform 2020; 22:5881374. [PMID: 32770190 DOI: 10.1093/bib/bbaa160] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 06/22/2020] [Accepted: 06/24/2020] [Indexed: 12/27/2022] Open
Abstract
In drug development, preclinical safety and pharmacokinetics assessments of candidate drugs to ensure the safety profile are a must. While in vivo and in vitro tests are traditionally used, experimental determinations have disadvantages, as they are usually time-consuming and costly. In silico predictions of these preclinical endpoints have each been developed in the past decades. However, only a few web-based tools have integrated different models to provide a simple one-step platform to help researchers thoroughly evaluate potential drug candidates. To efficiently achieve this approach, a platform for preclinical evaluation must not only predict key ADMET (absorption, distribution, metabolism, excretion and toxicity) properties but also provide some guidance on structural modifications to improve the undesired properties. In this review, we organized and compared several existing integrated web servers that can be adopted in preclinical drug development projects to evaluate the subject of interest. We also introduced our new web server, Virtual Rat, as an alternative choice to profile the properties of drug candidates. In Virtual Rat, we provide not only predictions of important ADMET properties but also possible reasons as to why the model made those structural predictions. Multiple models were implemented into Virtual Rat, including models for predicting human ether-a-go-go-related gene (hERG) inhibition, cytochrome P450 (CYP) inhibition, mutagenicity (Ames test), blood-brain barrier penetration, cytotoxicity and Caco-2 permeability. Virtual Rat is free and has been made publicly available at https://virtualrat.cmdm.tw/.
Collapse
|
24
|
Li Y, Hu J, Wang Y, Zhou J, Zhang L, Liu Z. DeepScaffold: A Comprehensive Tool for Scaffold-Based De Novo Drug Discovery Using Deep Learning. J Chem Inf Model 2019; 60:77-91. [PMID: 31809029 DOI: 10.1021/acs.jcim.9b00727] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The ultimate goal of drug design is to find novel compounds with desirable pharmacological properties. Designing molecules retaining particular scaffolds as their core structures is an efficient way to obtain potential drug candidates. We propose a scaffold-based molecular generative model for drug discovery, which performs molecule generation based on a wide spectrum of scaffold definitions, including Bemis-Murcko scaffolds, cyclic skeletons, and scaffolds with specifications on side-chain properties. The model can generalize the learned chemical rules of adding atoms and bonds to a given scaffold. The generated compounds were evaluated by molecular docking in DRD2 targets, and the results demonstrated that this approach can be effectively applied to solve several drug design problems, including the generation of compounds containing a given scaffold and de novo drug design of potential drug candidates with specific docking scores.
Collapse
Affiliation(s)
- Yibo Li
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences , Peking University , Xueyuan Road 38 , Haidian District, 100191 Beijing , China.,Stonewise , Haidian Middle Street 15 , Haidian District, 100080 Beijing , China
| | - Jianxing Hu
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences , Peking University , Xueyuan Road 38 , Haidian District, 100191 Beijing , China
| | - Yanxing Wang
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences , Peking University , Xueyuan Road 38 , Haidian District, 100191 Beijing , China
| | - Jielong Zhou
- Stonewise , Haidian Middle Street 15 , Haidian District, 100080 Beijing , China
| | - Liangren Zhang
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences , Peking University , Xueyuan Road 38 , Haidian District, 100191 Beijing , China
| | - Zhenming Liu
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences , Peking University , Xueyuan Road 38 , Haidian District, 100191 Beijing , China
| |
Collapse
|
25
|
|
26
|
Construction of Quantitative Structure Activity Relationship (QSAR) Models to Predict Potency of Structurally Diversed Janus Kinase 2 Inhibitors. Molecules 2019; 24:molecules24234393. [PMID: 31805692 PMCID: PMC6930640 DOI: 10.3390/molecules24234393] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 11/24/2019] [Accepted: 11/28/2019] [Indexed: 12/22/2022] Open
Abstract
Janus kinase 2 (JAK2) inhibitors represent a promising therapeutic class of anticancer agents against many myeloproliferative disorders. Bioactivity data on pIC50 of 2229 JAK2 inhibitors were employed in the construction of quantitative structure-activity relationship (QSAR) models. The models were built from 100 data splits using decision tree (DT), support vector machine (SVM), deep neural network (DNN) and random forest (RF). The predictive power of RF models were assessed via 10-fold cross validation, which afforded excellent predictive performance with R2 and RMSE of 0.74 ± 0.05 and 0.63 ± 0.05, respectively. Moreover, test set has excellent performance of R2 (0.75 ± 0.03) and RMSE (0.62 ± 0.04). In addition, Y-scrambling was utilized to evaluate the possibility of chance correlation of the predictive model. A thorough analysis of the substructure fingerprint count was conducted to provide insights on the inhibitory properties of JAK2 inhibitors. Molecular cluster analysis revealed that pyrazine scaffolds have nanomolar potency against JAK2.
Collapse
|
27
|
Cheminformatics Explorations of Natural Products. PROGRESS IN THE CHEMISTRY OF ORGANIC NATURAL PRODUCTS 2019; 110:1-35. [PMID: 31621009 DOI: 10.1007/978-3-030-14632-0_1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The chemistry of natural products is fascinating and has continuously attracted the attention of the scientific community for many reasons including, but not limited to, biosynthesis pathways, chemical diversity, the source of bioactive compounds and their marked impact on drug discovery. There is a broad range of experimental and computational techniques (molecular modeling and cheminformatics) that have evolved over the years and have assisted the investigation of natural products. Herein, we discuss cheminformatics strategies to explore the chemistry and applications of natural products. Since the potential synergisms between cheminformatics and natural products are vast, we will focus on three major aspects: (1) exploration of the chemical space of natural products to identify bioactive compounds, with emphasis on drug discovery; (2) assessment of the toxicity profile of natural products; and (3) diversity analysis of natural product collections and the design of chemical collections inspired by natural sources.
Collapse
|
28
|
Naveja JJ, Pilón-Jiménez BA, Bajorath J, Medina-Franco JL. A general approach for retrosynthetic molecular core analysis. J Cheminform 2019; 11:61. [PMID: 33430974 PMCID: PMC6760108 DOI: 10.1186/s13321-019-0380-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Accepted: 08/04/2019] [Indexed: 11/13/2022] Open
Abstract
Scaffold analysis of compound data sets has reemerged as a chemically interpretable alternative to machine learning for chemical space and structure–activity relationships analysis. In this context, analog series-based scaffolds (ASBS) are synthetically relevant core structures that represent individual series of analogs. As an extension to ASBS, we herein introduce the development of a general conceptual framework that considers all putative cores of molecules in a compound data set, thus softening the often applied “single molecule–single scaffold” correspondence. A putative core is here defined as any substructure of a molecule complying with two basic rules: (a) the size of the core is a significant proportion of the whole molecule size and (b) the substructure can be reached from the original molecule through a succession of retrosynthesis rules. Thereafter, a bipartite network consisting of molecules and cores can be constructed for a database of chemical structures. Compounds linked to the same cores are considered analogs. We present case studies illustrating the potential of the general framework. The applications range from inter- and intra-core diversity analysis of compound data sets, structure–property relationships, and identification of analog series and ASBS. The molecule–core network herein presented is a general methodology with multiple applications in scaffold analysis. New statistical methods are envisioned that will be able to draw quantitative conclusions from these data. The code to use the method presented in this work is freely available as an additional file. Follow-up applications include analog searching and core structure–property relationships analyses.![]()
Collapse
Affiliation(s)
- J Jesús Naveja
- PECEM, School of Medicine, Universidad Nacional Autónoma de México, Avenida Universidad 3000, 04510, Mexico City, Mexico. .,Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, 04510, Mexico City, Mexico.
| | - B Angélica Pilón-Jiménez
- Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, 04510, Mexico City, Mexico
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Endenicher Allee 19c, 53115, Bonn, Germany
| | - José L Medina-Franco
- Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, 04510, Mexico City, Mexico.
| |
Collapse
|
29
|
Medina-Franco JL, Naveja JJ, López-López E. Reaching for the bright StARs in chemical space. Drug Discov Today 2019; 24:2162-2169. [PMID: 31557448 DOI: 10.1016/j.drudis.2019.09.013] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2019] [Revised: 09/10/2019] [Accepted: 09/17/2019] [Indexed: 02/07/2023]
Abstract
Visualization of activity data in chemical space is common in drug discovery. Navigating the space in a systematic manner is not trivial, given its size and huge coverage. To this end, methods for data visualization have been developed charting biological activity into chemical space. Herein, we review the progress in different visualization approaches to explore the chemical space aiming at reaching insightful structure-activity relationships (SARs) in the chemical space. We discuss recent methods including consensus diversity plots, ChemMaps, and constellation plots. Several of the methods we review can be extended to analyze other properties of interest in medicinal chemistry, such as structure-toxicity relationships, and can be adapted to postprocess results of virtual screening (VS) of large compound libraries.
Collapse
Affiliation(s)
- José L Medina-Franco
- Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico.
| | - J Jesús Naveja
- Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico; PECEM, School of Medicine, Universidad Nacional Autónoma de México, Mexico City 04510, Mexico
| | - Edgar López-López
- Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, Mexico City 04510, Mexico
| |
Collapse
|
30
|
Smith KP, Dowgiallo MG, Chiaraviglio L, Parvatkar P, Kim C, Manetsch R, Kirby JE. A Whole-Cell Screen for Adjunctive and Direct Antimicrobials Active against Carbapenem-Resistant Enterobacteriaceae. SLAS DISCOVERY 2019; 24:842-853. [PMID: 31268804 DOI: 10.1177/2472555219859592] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Carbapenem-resistant Enterobacteriaceae (CRE) are an emerging antimicrobial resistance threat for which few if any therapeutic options remain. Identification of new agents that either inhibit CRE or restore activity of existing antimicrobials is highly desirable. Therefore, a high-throughput screen of 182,427 commercially available compounds was used to identify small molecules which either enhanced activity of meropenem against a carbapenem-resistant Klebsiella pneumoniae ST258 screening strain and/or directly inhibited its growth. The primary screening methodology was a whole-cell screen/counterscreen combination assay that tested for reduction of microbial growth in the presence or absence of meropenem, respectively. Screening hits demonstrating eukaryotic cell toxicity based on an orthogonal screening effort or identified as pan-assay interference compounds (PAINS) by computational methods were triaged. Primary screening hits were then clustered and ranked according to favorable physicochemical properties. Among remaining hits, we found 10 compounds that enhanced activity of carbapenems against a subset of CRE. Direct antimicrobials that passed toxicity and PAINS filters were not, however, identified in this relatively large screening effort. It was previously shown that the same screening strategy was productive for identifying candidates for further development when screening known bioactive libraries inclusive of natural products. Our findings therefore further highlight liabilities of commercially available small-molecule screening libraries in the Gram-negative antimicrobial space. In particular, there was especially low yield in identifying compelling activity against a representative, highly multidrug-resistant, carbapenemase-producing K. pneumoniae strain.
Collapse
Affiliation(s)
- Kenneth P Smith
- 1 Department of Pathology, Beth Israel Deaconess Medical Center, Boston, MA, USA.,2 Harvard Medical School, Boston, MA, USA
| | - Matthew G Dowgiallo
- 3 Department of Chemistry and Chemical Biology, Northeastern University, Boston, MA, USA
| | - Lucius Chiaraviglio
- 1 Department of Pathology, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Prakash Parvatkar
- 3 Department of Chemistry and Chemical Biology, Northeastern University, Boston, MA, USA
| | - Chungsik Kim
- 3 Department of Chemistry and Chemical Biology, Northeastern University, Boston, MA, USA
| | - Roman Manetsch
- 3 Department of Chemistry and Chemical Biology, Northeastern University, Boston, MA, USA.,4 Department of Pharmaceutical Sciences, Northeastern University, Boston, MA, USA
| | - James E Kirby
- 1 Department of Pathology, Beth Israel Deaconess Medical Center, Boston, MA, USA.,2 Harvard Medical School, Boston, MA, USA
| |
Collapse
|
31
|
Comprehensive TCM molecular networking based on MS/MS in silico spectra with integration of virtual screening and affinity MS screening for discovering functional ligands from natural herbs. Anal Bioanal Chem 2019; 411:5785-5797. [DOI: 10.1007/s00216-019-01962-4] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2019] [Revised: 05/29/2019] [Accepted: 06/04/2019] [Indexed: 12/15/2022]
|
32
|
Receptor-based pharmacophore modeling, virtual screening, and molecular docking studies for the discovery of novel GSK-3β inhibitors. J Mol Model 2019; 25:171. [PMID: 31129879 DOI: 10.1007/s00894-019-4032-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2019] [Accepted: 04/07/2019] [Indexed: 10/26/2022]
Abstract
Considering the emerging importance of glycogen synthase kinase 3 beta (GSK-3β) inhibitors in treatment of Alzheimer's disease, multi-protein structure receptor-based pharmacophore modeling was adopted to generate a 3D pharmacophore model for (GSK-3β) inhibitors. The generated 3D pharmacophore was then validated using a test set of 1235 compounds. The ZINCPharmer web tool was used to virtually screen the public ZINC database using the generated 3D pharmacophore. A set of 12,251 hits was produced and then filtered according to their lead-like properties, predicted central nervous system (CNS) activity, and Pan-assay interference compounds (PAINS) fragments to 630 compounds. Scaffold Hunter was then used to cluster the filtered compounds according to their chemical structure framework. From the different clusters, 123 compounds were selected to cover the whole chemical space of the obtained hits. The SwissADME online tool was then used to filter out the compounds with undesirable pharmacokinetic properties giving a set of 91 compounds with promising predicted pharmacodynamic and pharmacokinetic properties. To confirm their binding capability to the GSK-3β binding site, molecular docking simulations were performed for the final 91 compounds in the GSK-3β binding site. Twenty-five compounds showed acceptable binding poses that bind to the key amino acids in the binding site Asp133 and Val135 with good binding scores. The quinolin-2-one derivative ZINC67773573 was found to be a promising lead for designing new GSK-3β inhibitors for Alzheimer's disease treatment. Graphical abstract A 3D pharmacophore model for the discovery of novel (GSK-3β) inhibitors.
Collapse
|
33
|
Saldívar-González FI, Pilón-Jiménez BA, Medina-Franco JL. Chemical space of naturally occurring compounds. PHYSICAL SCIENCES REVIEWS 2019. [DOI: 10.1515/psr-2018-0103] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
AbstractThe chemical space of naturally occurring compounds is vast and diverse. Other than biologics, naturally occurring small molecules include a large variety of compounds covering natural products from different sources such as plant, marine, and fungi, to name a few, and several food chemicals. The systematic exploration of the chemical space of naturally occurring compounds have significant implications in many areas of research including but not limited to drug discovery, nutrition, bio- and chemical diversity analysis. The exploration of the coverage and diversity of the chemical space of compound databases can be carried out in different ways. The approach will largely depend on the criteria to define the chemical space that is commonly selected based on the goals of the study. This chapter discusses major compound databases of natural products and cheminformatics strategies that have been used to characterize the chemical space of natural products. Recent exemplary studies of the chemical space of natural products from different sources and their relationships with other compounds are also discussed. We also present novel chemical descriptors and data mining approaches that are emerging to characterize the chemical space of naturally occurring compounds.
Collapse
|
34
|
Capoci IRG, Faria DR, Sakita KM, Rodrigues-Vendramini FAV, Bonfim-Mendonça PDS, Becker TCA, Kioshima ÉS, Svidzinski TIE, Maigret B. Repurposing approach identifies new treatment options for invasive fungal disease. Bioorg Chem 2018; 84:87-97. [PMID: 30496872 DOI: 10.1016/j.bioorg.2018.11.019] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2018] [Revised: 10/17/2018] [Accepted: 11/16/2018] [Indexed: 12/29/2022]
Abstract
Drug repositioning is the process of discovery, validation and marketing of previously approved drugs for new indications. Our aim was drug repositioning, using ligand-based and structure-based computational methods, of compounds that are similar to two hit compounds previously selected by our group that show promising antifungal activity. Through the ligand-based method, 100 compounds from each of three databases (MDDR, DrugBank and TargetMol) were selected by the Tanimoto coefficient, as similar to LMM5 or LMM11. These compounds were analyzed by the scaffold trees, and up to 10 compounds from each database were selected. The structure-based method (molecular docking) using thioredoxin reductase as the target drug was performed as a complementary approach, resulting in six compounds that were tested in an in vitro assay. All compounds, particularly raltegravir, showed antifungal activity against the genus Paracoccidioides. Raltegravir, an antiviral drug, showed promising antifungal activity against the experimental murine paracoccidioidomycosis, with significant reduction of the fungal burden and decreased alterations in the lung structure of mice treated with 1 mg/kg of raltegravir. In conclusion, the combination of two in silico methods for drug repositioning was able to select an antiviral drug with promising antifungal activity for treatment of paracoccidioidomycosis.
Collapse
Affiliation(s)
| | - Daniella Renata Faria
- Department of Clinical Analysis and Biomedicine, State University of Maringá, Maringá, Paraná, Brazil
| | - Karina Mayumi Sakita
- Department of Clinical Analysis and Biomedicine, State University of Maringá, Maringá, Paraná, Brazil
| | | | | | | | - Érika Seki Kioshima
- Department of Clinical Analysis and Biomedicine, State University of Maringá, Maringá, Paraná, Brazil
| | | | | |
Collapse
|
35
|
Ghosh D, Koch U, Hadian K, Sattler M, Tetko IV. Luciferase Advisor: High-Accuracy Model To Flag False Positive Hits in Luciferase HTS Assays. J Chem Inf Model 2018; 58:933-942. [PMID: 29667823 DOI: 10.1021/acs.jcim.7b00574] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Firefly luciferase is an enzyme that has found ubiquitous use in biological assays in high-throughput screening (HTS) campaigns. The inhibition of luciferase in such assays could lead to a false positive result. This issue has been known for a long time, and there have been significant efforts to identify luciferase inhibitors in order to enhance recognition of false positives in screening assays. However, although a large amount of publicly accessible luciferase counterscreen data is available, to date little effort has been devoted to building a chemoinformatic model that can identify such molecules in a given data set. In this study we developed models to identify these molecules using various methods, such as molecular docking, SMARTS screening, pharmacophores, and machine learning methods. Among the structure-based methods, the pharmacophore-based method showed promising results, with a balanced accuracy of 74.2%. However, machine-learning approaches using associative neural networks outperformed all of the other methods explored, producing a final model with a balanced accuracy of 89.7%. The high predictive accuracy of this model is expected to be useful for advising which compounds are potential luciferase inhibitors present in luciferase HTS assays. The models developed in this work are freely available at the OCHEM platform at http://ochem.eu .
Collapse
Affiliation(s)
- Dipan Ghosh
- Institute of Structural Biology , Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) , Ingolstaedter Landstrasse 1 , 85764 Neuherberg , Germany
| | - Uwe Koch
- Lead Discovery Center GmbH , Otto-Hahn-Straße 15 , 44227 Dortmund , Germany
| | - Kamyar Hadian
- Assay Development and Screening Platform , Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) , Ingolstaedter Landstrasse 1 , 85764 Neuherberg , Germany
| | - Michael Sattler
- Bayerisches NMR-Zentrum, Department of Chemistry , Technical University of Munich , Ernst-Otto-Fischer-Straße 2 , 85747 Garching , Germany
| | - Igor V Tetko
- Institute of Structural Biology , Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) , Ingolstaedter Landstrasse 1 , 85764 Neuherberg , Germany.,BIGCHEM GmbH , Ingolstaedter Landstrasse 1 b. 60w , 85764 Neuherberg , Germany
| |
Collapse
|
36
|
Jasper JB, Humbeck L, Brinkjost T, Koch O. A novel interaction fingerprint derived from per atom score contributions: exhaustive evaluation of interaction fingerprint performance in docking based virtual screening. J Cheminform 2018; 10:15. [PMID: 29549526 PMCID: PMC5856854 DOI: 10.1186/s13321-018-0264-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2017] [Accepted: 02/17/2018] [Indexed: 01/28/2023] Open
Abstract
Protein ligand interaction fingerprints are a powerful approach for the analysis and assessment of docking poses to improve docking performance in virtual screening. In this study, a novel interaction fingerprint approach (PADIF, protein per atom score contributions derived interaction fingerprint) is presented which was specifically designed for utilising the GOLD scoring functions’ atom contributions together with a specific scoring scheme. This allows the incorporation of known protein–ligand complex structures for a target-specific scoring. Unlike many other methods, this approach uses weighting factors reflecting the relative frequency of a specific interaction in the references and penalizes destabilizing interactions. In addition, and for the first time, an exhaustive validation study was performed that assesses the performance of PADIF and two other interaction fingerprints in virtual screening. Here, PADIF shows superior results, and some rules of thumb for a successful use of interaction fingerprints could be identified.![]()
Collapse
Affiliation(s)
- Julia B Jasper
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Str. 6, 44227, Dortmund, Germany
| | - Lina Humbeck
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Str. 6, 44227, Dortmund, Germany
| | - Tobias Brinkjost
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Str. 6, 44227, Dortmund, Germany.,Department of Computer Science, TU Dortmund University, Otto-Hahn-Str. 14, 44227, Dortmund, Germany
| | - Oliver Koch
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Str. 6, 44227, Dortmund, Germany.
| |
Collapse
|
37
|
Behdju M, Meyer U. DFG Priority Programme SPP 1736: Algorithms for Big Data. KUNSTLICHE INTELLIGENZ 2018. [DOI: 10.1007/s13218-017-0518-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
38
|
Papageorgiou L, Eleni P, Raftopoulou S, Mantaiou M, Megalooikonomou V, Vlachakis D. Genomic big data hitting the storage bottleneck. EMBNET.JOURNAL 2018; 24:e910. [PMID: 29782620 PMCID: PMC5958914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
During the last decades, there is a vast data explosion in bioinformatics. Big data centres are trying to face this data crisis, reaching high storage capacity levels. Although several scientific giants examine how to handle the enormous pile of information in their cupboards, the problem remains unsolved. On a daily basis, there is a massive quantity of permanent loss of extensive information due to infrastructure and storage space problems. The motivation for sequencing has fallen behind. Sometimes, the time that is spent to solve storage space problems is longer than the one dedicated to collect and analyse data. To bring sequencing to the foreground, scientists have to slide over such obstacles and find alternative ways to approach the issue of data volume. Scientific community experiences the data crisis era, where, out of the box solutions may ease the typical research workflow, until technological development meets the needs of Bioinformatics.
Collapse
Affiliation(s)
- Louis Papageorgiou
- Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens, Athens, Greece,Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece
| | - Picasi Eleni
- Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens, Athens, Greece
| | - Sofia Raftopoulou
- Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens, Athens, Greece,Sotiria Chest Diseases Hospital, Athens, Greece,Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | | | - Vasileios Megalooikonomou
- Computer Engineering and Informatics Department, School of Engineering, University of Patras, Patras, Greece
| | - Dimitrios Vlachakis
- Laboratory of Genetics, Department of Biotechnology, School of Food, Biotechnology and Development, Agricultural University of Athens, Athens, Greece,Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece,Computer Engineering and Informatics Department, School of Engineering, University of Patras, Patras, Greece
| |
Collapse
|