1
|
Diéguez-Santana K, Casañola-Martin GM, Torres R, Rasulev B, Green JR, González-Díaz H. Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds. Mol Pharm 2022; 19:2151-2163. [PMID: 35671399 PMCID: PMC9986951 DOI: 10.1021/acs.molpharmaceut.2c00029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. In this study, we employed the IFPTML = Information Fusion (IF) + Perturbation Theory (PT) + Machine Learning (ML) algorithm on a huge dataset from the ChEMBL database, which contains >155,000 AD assays vs >40 MNs of multiple bacteria species. We built a linear discriminant analysis (LDA) and 17 ML models centered on the linear index and based on atoms to predict antibacterial compounds. The IFPTML-LDA model presented the following results for the training subset: specificity (Sp) = 76% out of 70,000 cases, sensitivity (Sn) = 70%, and Accuracy (Acc) = 73%. The same model also presented the following results for the validation subsets: Sp = 76%, Sn = 70%, and Acc = 73.1%. Among the IFPTML nonlinear models, the k nearest neighbors (KNN) showed the best results with Sn = 99.2%, Sp = 95.5%, Acc = 97.4%, and Area Under Receiver Operating Characteristic (AUROC) = 0.998 in training sets. In the validation series, the Random Forest had the best results: Sn = 93.96% and Sp = 87.02% (AUROC = 0.945). The IFPTML linear and nonlinear models regarding the ADs vs MNs have good statistical parameters, and they could contribute toward finding new metabolic mutations in antibiotic resistance and reducing time/costs in antibacterial drug research.
Collapse
Affiliation(s)
- Karel Diéguez-Santana
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain.,Universidad Regional Amazónica IKIAM, Tena, Napo 150150, Ecuador
| | - Gerardo M Casañola-Martin
- Department of Coatings and Polymeric Materials, North Dakota State University, Fargo, North Dakota 58102, United States.,Department of Systems and Computer Engineering, Carleton University, K1S5B6 Ottawa, Ontario, Canada
| | - Roldan Torres
- Universidad Regional Amazónica IKIAM, Tena, Napo 150150, Ecuador
| | - Bakhtiyor Rasulev
- Department of Coatings and Polymeric Materials, North Dakota State University, Fargo, North Dakota 58102, United States
| | - James R Green
- Department of Systems and Computer Engineering, Carleton University, K1S5B6 Ottawa, Ontario, Canada
| | - Humbert González-Díaz
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain.,BIOFISIKA, Basque Center for Biophysics CSIC-UPVEH, 48940 Leioa, Spain.,IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Biscay, Spain
| |
Collapse
|
2
|
Khan A, Poojary SS, Bhave KK, Nandan SR, Iyer KR, Coutinho EC. Prediction of QcrB Inhibition as a Measure of Antitubercular Activity with Machine Learning Protocols. ACS Omega 2022; 7:18094-18102. [PMID: 35664614 PMCID: PMC9161412 DOI: 10.1021/acsomega.2c01613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 05/06/2022] [Indexed: 05/07/2023]
Abstract
It has always been a challenge to develop interventional therapies for Mycobacterium tuberculosis. Over the years, several attempts at developing such therapies have hit a dead-end owing to rapid mutation rates of the tubercular bacilli and their ability to lay dormant for years. Recently, cytochrome bcc complex (QcrB) has shown some promise as a novel target against the tubercular bacilli, with Q203 being the first molecule acting on this target. In this paper, we report the deployment of several ML-based approaches to design molecules against QcrB. Machine learning (ML) models were developed based on a data set of 350 molecules using three different sets of molecular features, i.e., MACCS keys, ECFP6 fingerprints, and Mordred descriptors. Each feature set was trained on eight ML classifier algorithms and optimized to classify molecules accurately. The support vector machine-based classifier using the ECFP6 feature set was found to be the best classifier in this study. Further, screening of the known imidazopyridine amide inhibitors demonstrated that the model correctly classified the most potent molecules as actives, hence validating the model for future applications.
Collapse
Affiliation(s)
- Afreen
A. Khan
- Department
of Pharmaceutical Chemistry, Vasvik Research Centre, Bombay College of Pharmacy, Kalina, Santacruz (E), Mumbai 400 098, India
| | - Sannidhi S. Poojary
- Department
of Pharmaceutical Chemistry, Vasvik Research Centre, Bombay College of Pharmacy, Kalina, Santacruz (E), Mumbai 400 098, India
| | - Ketki K. Bhave
- Department
of Pharmaceutical Chemistry, Vasvik Research Centre, Bombay College of Pharmacy, Kalina, Santacruz (E), Mumbai 400 098, India
| | - Santosh R. Nandan
- Ambernath
Organics Pvt. Ltd., 222,
The Summit Business Bay, Andheri (E), Mumbai 400 093, India
| | - Krishna R. Iyer
- Department
of Pharmaceutical Chemistry, Vasvik Research Centre, Bombay College of Pharmacy, Kalina, Santacruz (E), Mumbai 400 098, India
| | - Evans C. Coutinho
- Department
of Pharmaceutical Chemistry, Vasvik Research Centre, Bombay College of Pharmacy, Kalina, Santacruz (E), Mumbai 400 098, India
| |
Collapse
|
3
|
Halder AK, Moura AS, Cordeiro MNDS. Moving Average-Based Multitasking In Silico Classification Modeling: Where Do We Stand and What Is Next? Int J Mol Sci 2022; 23:ijms23094937. [PMID: 35563327 PMCID: PMC9099502 DOI: 10.3390/ijms23094937] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 04/24/2022] [Accepted: 04/28/2022] [Indexed: 01/27/2023] Open
Abstract
Conventional in silico modeling is often viewed as 'one-target' or 'single-task' computer-aided modeling since it mainly relies on forecasting an endpoint of interest from similar input data. Multitasking or multitarget in silico modeling, in contrast, embraces a set of computational techniques that efficiently integrate multiple types of input data for setting up unique in silico models able to predict the outcome(s) relating to various experimental and/or theoretical conditions. The latter, specifically, based upon the Box-Jenkins moving average approach, has been applied in the last decade to several research fields including drug and materials design, environmental sciences, and nanotechnology. The present review discusses the current status of multitasking computer-aided modeling efforts, meanwhile describing both the existing challenges and future opportunities of its underlying techniques. Some important applications are also discussed to exemplify the ability of multitasking modeling in deriving holistic and reliable in silico classification-based models as well as in designing new chemical entities, either through fragment-based design or virtual screening. Focus will also be given to some software recently developed to automate and accelerate such types of modeling. Overall, this review may serve as a guideline for researchers to grasp the scope of multitasking computer-aided modeling as a promising in silico tool.
Collapse
Affiliation(s)
- Amit Kumar Halder
- LAQV@REQUIMTE, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal; (A.K.H.); (A.S.M.)
- Dr. B. C. Roy College of Pharmacy and Allied Health Sciences, Dr. Meghnad Saha Sarani, Bidhannagar, Durgapur 713212, West Bengal, India
| | - Ana S. Moura
- LAQV@REQUIMTE, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal; (A.K.H.); (A.S.M.)
| | - Maria Natália D. S. Cordeiro
- LAQV@REQUIMTE, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal; (A.K.H.); (A.S.M.)
- Correspondence: ; Tel.: +35-12-2040-2502
| |
Collapse
|
4
|
Galati S, Di Stefano M, Martinelli E, Poli G, Tuccinardi T. Recent Advances in In Silico Target Fishing. Molecules 2021; 26:5124. [PMID: 34500568 DOI: 10.3390/molecules26175124] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 08/14/2021] [Accepted: 08/18/2021] [Indexed: 12/24/2022] Open
Abstract
In silico target fishing, whose aim is to identify possible protein targets for a query molecule, is an emerging approach used in drug discovery due its wide variety of applications. This strategy allows the clarification of mechanism of action and biological activities of compounds whose target is still unknown. Moreover, target fishing can be employed for the identification of off targets of drug candidates, thus recognizing and preventing their possible adverse effects. For these reasons, target fishing has increasingly become a key approach for polypharmacology, drug repurposing, and the identification of new drug targets. While experimental target fishing can be lengthy and difficult to implement, due to the plethora of interactions that may occur for a single small-molecule with different protein targets, an in silico approach can be quicker, less expensive, more efficient for specific protein structures, and thus easier to employ. Moreover, the possibility to use it in combination with docking and virtual screening studies, as well as the increasing number of web-based tools that have been recently developed, make target fishing a more appealing method for drug discovery. It is especially worth underlining the increasing implementation of machine learning in this field, both as a main target fishing approach and as a further development of already applied strategies. This review reports on the main in silico target fishing strategies, belonging to both ligand-based and receptor-based approaches, developed and applied in the last years, with a particular attention to the different web tools freely accessible by the scientific community for performing target fishing studies.
Collapse
|
5
|
Kleandrova VV, Speck-Planche A. The QSAR Paradigm in Fragment-Based Drug Discovery: From the Virtual Generation of Target Inhibitors to Multi-Scale Modeling. Mini Rev Med Chem 2021; 20:1357-1374. [PMID: 32013845 DOI: 10.2174/1389557520666200204123156] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 10/21/2019] [Accepted: 10/28/2019] [Indexed: 12/24/2022]
Abstract
Fragment-Based Drug Design (FBDD) has established itself as a promising approach in modern drug discovery, accelerating and improving lead optimization, while playing a crucial role in diminishing the high attrition rates at all stages in the drug development process. On the other hand, FBDD has benefited from the application of computational methodologies, where the models derived from the Quantitative Structure-Activity Relationships (QSAR) have become consolidated tools. This mini-review focuses on the evolution and main applications of the QSAR paradigm in the context of FBDD in the last five years. This report places particular emphasis on the QSAR models derived from fragment-based topological approaches to extract physicochemical and/or structural information, allowing to design potentially novel mono- or multi-target inhibitors from relatively large and heterogeneous databases. Here, we also discuss the need to apply multi-scale modeling, to exemplify how different datasets based on target inhibition can be simultaneously integrated and predicted together with other relevant endpoints such as the biological activity against non-biomolecular targets, as well as in vitro and in vivo toxicity and pharmacokinetic properties. In this context, seminal papers are briefly analyzed. As huge amounts of data continue to accumulate in the domains of the chemical, biological and biomedical sciences, it has become clear that drug discovery must be viewed as a multi-scale optimization process. An ideal multi-scale approach should integrate diverse chemical and biological data and also serve as a knowledge generator, enabling the design of potentially optimal chemicals that may become therapeutic agents.
Collapse
Affiliation(s)
- Valeria V Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production, Volokolamskoe Shosse 11, 125080, Moscow, Russian Federation
| | - Alejandro Speck-Planche
- Department of Chemistry, Institute of Pharmacy, I.M. Sechenov First Moscow State Medical University, Trubetskaya Str., 8, b. 2, 119992, Moscow, Russian Federation
| |
Collapse
|
6
|
Affiliation(s)
- Su‐Qing Yang
- Xiangya School of Pharmaceutical Sciences Central South University Changsha Hunan China
| | - Qing Ye
- College of Pharmaceutical Sciences Innovation Institute for Artificial Intelligence in Medicine, Zhejiang University Hangzhou, Zhejiang China
| | - Jun‐Jie Ding
- Beijing Institute of Pharmaceutical Chemistry Beijing China
| | - Yin
- Department of Dermatology, Hunan Engineering Research Center of Skin Health and Disease, Hunan Key Laboratory of Skin Cancer and Psoriasis, Xiangya Hospital Central South University Changsha Hunan China
| | - Ai‐Ping Lu
- Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine Hong Kong Baptist University Hong Kong China
| | - Xiang Chen
- Department of Dermatology, Hunan Engineering Research Center of Skin Health and Disease, Hunan Key Laboratory of Skin Cancer and Psoriasis, Xiangya Hospital Central South University Changsha Hunan China
| | - Ting‐Jun Hou
- College of Pharmaceutical Sciences Innovation Institute for Artificial Intelligence in Medicine, Zhejiang University Hangzhou, Zhejiang China
| | - Dong‐Sheng Cao
- Xiangya School of Pharmaceutical Sciences Central South University Changsha Hunan China
- Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine Hong Kong Baptist University Hong Kong China
| |
Collapse
|
7
|
Zanni R, Galvez-Llompart M, Garcia-Domenech R, Galvez J. What place does molecular topology have in today’s drug discovery? Expert Opin Drug Discov 2020; 15:1133-1144. [DOI: 10.1080/17460441.2020.1770223] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Riccardo Zanni
- Molecular Topology and Drug Design Unit, Department of Physical Chemistry, University of Valencia, Valencia, Spain
- Departamento de Microbiologia, Facultad de Ciencias, Universidad de Malaga, Málaga, Spain
| | - Maria Galvez-Llompart
- Molecular Topology and Drug Design Unit, Department of Physical Chemistry, University of Valencia, Valencia, Spain
- Instituto de Tecnología Química, UPV-CSIC, Universidad Politécnica de Valencia, Valencia, Spain
| | - Ramon Garcia-Domenech
- Molecular Topology and Drug Design Unit, Department of Physical Chemistry, University of Valencia, Valencia, Spain
| | - Jorge Galvez
- Molecular Topology and Drug Design Unit, Department of Physical Chemistry, University of Valencia, Valencia, Spain
| |
Collapse
|
8
|
Toropov AA, Toropova AP, Marzo M, Carnesecchi E, Selvestrel G, Benfenati E. Pesticides, cosmetics, drugs: identical and opposite influences of various molecular features as measures of endpoints similarity and dissimilarity. Mol Divers 2020; 25:1137-1144. [PMID: 32323128 DOI: 10.1007/s11030-020-10085-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2020] [Accepted: 04/06/2020] [Indexed: 11/26/2022]
Abstract
The similarity is an important category in natural sciences. A measure of similarity for a group of various biochemical endpoints is suggested. The list of examined endpoints contains (1) toxicity of pesticides towards rainbow trout; (2) human skin sensitization; (3) mutagenicity; (4) toxicity of psychotropic drugs; and (5) anti HIV activity. Further applying and evolution of the suggested approach is discussed. In particular, the conception of the similarity (dissimilarity) of endpoints can play the role of a "useful bridge" between quantitative structure property/activity relationships (QSPRs/QSARs) and read-across technique.
Collapse
Affiliation(s)
- Andrey A Toropov
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Science, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
| | - Alla P Toropova
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Science, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy.
| | - Marco Marzo
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Science, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
| | - Edoardo Carnesecchi
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Science, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
- Institute for Risk Assessment Sciences (IRAS), Utrecht University, P.O. Box 80177, 3508 TD, Utrecht, The Netherlands
| | - Gianluca Selvestrel
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Science, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
| | - Emilio Benfenati
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Science, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Mario Negri 2, 20156, Milan, Italy
| |
Collapse
|
9
|
Toropova AP, Toropov AA, Benfenati E, Leszczynska D, Leszczynski J. Virtual Screening of Anti-Cancer Compounds: Application of Monte Carlo Technique. Anticancer Agents Med Chem 2019; 19:148-153. [PMID: 30360729 DOI: 10.2174/1871520618666181025122318] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2017] [Revised: 09/08/2017] [Accepted: 03/21/2018] [Indexed: 01/27/2023]
Abstract
Possibility and necessity of standardization of predictive models for anti-cancer activity are discussed. The hypothesis about rationality of common quantitative analysis of anti-cancer activity and carcinogenicity is developed. Potential of optimal descriptors to be used as a tool to build up predictive models for anti-cancer activity is examined from practical point of view. Various perspectives of application of optimal descriptors are reviewed. Stochastic nature of phenomena which are related to carcinogenic potential of various substances can be successfully detected and interpreted by the Monte Carlo technique. Hypothesises related to practical strategy and tactics of the searching for new anticancer agents are suggested.
Collapse
Affiliation(s)
- Alla P Toropova
- Department of Environmental Health Science, Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa 19, 20156 Milano, Italy
| | - Andrey A Toropov
- Department of Environmental Health Science, Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa 19, 20156 Milano, Italy
| | - Emilio Benfenati
- Department of Environmental Health Science, Laboratory of Environmental Chemistry and Toxicology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa 19, 20156 Milano, Italy
| | - Danuta Leszczynska
- Interdisciplinary Nanotoxicity Center, Department of Civil and Environmental; Engineering, Jackson State University, 1325 Lynch Street, Jackson, MS 39217-0510, United States
| | - Jerzy Leszczynski
- Interdisciplinary Nanotoxicity Center, Department of Chemistry and Biochemistry, Jackson State University, 1400 J. R. Lynch Street, P.O. Box 17910, Jackson, MS 39217, United States
| |
Collapse
|
10
|
Andrade CH, Neves BJ, Melo-Filho CC, Rodrigues J, Silva DC, Braga RC, Cravo PVL. In Silico Chemogenomics Drug Repositioning Strategies for Neglected Tropical Diseases. Curr Med Chem 2019. [DOI: 10.2174/0929867325666180309114824] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Only ~1% of all drug candidates against Neglected Tropical Diseases (NTDs)
have reached clinical trials in the last decades, underscoring the need for new, safe and effective
treatments. In such context, drug repositioning, which allows finding novel indications
for approved drugs whose pharmacokinetic and safety profiles are already known,
emerging as a promising strategy for tackling NTDs. Chemogenomics is a direct descendent
of the typical drug discovery process that involves the systematic screening of chemical
compounds against drug targets in high-throughput screening (HTS) efforts, for the identification
of lead compounds. However, different to the one-drug-one-target paradigm, chemogenomics
attempts to identify all potential ligands for all possible targets and diseases. In
this review, we summarize current methodological development efforts in drug repositioning
that use state-of-the-art computational ligand- and structure-based chemogenomics approaches.
Furthermore, we highlighted the recent progress in computational drug repositioning
for some NTDs, based on curation and modeling of genomic, biological, and chemical data.
Additionally, we also present in-house and other successful examples and suggest possible solutions
to existing pitfalls.
Collapse
Affiliation(s)
- Carolina Horta Andrade
- LabMol - Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Bruno Junior Neves
- LabMol - Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Cleber Camilo Melo-Filho
- LabMol - Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Juliana Rodrigues
- LabMol - Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Diego Cabral Silva
- LabMol - Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Rodolpho Campos Braga
- LabMol - Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goias, Goiania, GO, 74605-170, Brazil
| | - Pedro Vitor Lemos Cravo
- Laboratory of Cheminformatics, Centro Universitario de Anapolis (UniEVANGELICA), Anapolis, GO, 75083-515, Brazil
| |
Collapse
|
11
|
Speck-Planche A. Combining Ensemble Learning with a Fragment-Based Topological Approach To Generate New Molecular Diversity in Drug Discovery: In Silico Design of Hsp90 Inhibitors. ACS Omega 2018; 3:14704-14716. [PMID: 30555986 PMCID: PMC6289491 DOI: 10.1021/acsomega.8b02419] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 10/23/2018] [Indexed: 05/05/2023]
Abstract
Machine learning methods have revolutionized modern science, providing fast and accurate solutions to multiple problems. However, they are commonly treated as "black boxes". Therefore, in important scientific fields such as medicinal chemistry and drug discovery, machine learning methods are restricted almost exclusively to the task of performing predictions of large and heterogeneous data sets of chemicals. The lack of interpretability prevents the full exploitation of the machine learning models as generators of new chemical knowledge. This work focuses on the development of an ensemble learning model for the prediction and design of potent dual heat shock protein 90 (Hsp90) inhibitors. The model displays accuracy higher than 80% in both training and test sets. To use the ensemble model as a generator of new chemical knowledge, three steps were followed. First, a physicochemical and/or structural interpretation was provided for each molecular descriptor present in the ensemble learning model. Second, the term "pseudolinear equation" was introduced within the context of machine learning to calculate the relative quantitative contributions of different molecular fragments to the inhibitory activity against the two Hsp90 isoforms studied here. Finally, by assembling the fragments with positive contributions, new molecules were designed, being predicted as potent Hsp90 inhibitors. According to Lipinski's rule of five, the designed molecules were found to exhibit potentially good oral bioavailability, a primordial property that chemicals must have to pass early stages in drug discovery. The present approach based on the combination of ensemble learning and fragment-based topological design holds great promise in drug discovery, and it can be adapted and applied to many different scientific disciplines.
Collapse
|
12
|
Chen C, Lee MH, Weng CF, Leong MK. Theoretical Prediction of the Complex P-Glycoprotein Substrate Efflux Based on the Novel Hierarchical Support Vector Regression Scheme. Molecules 2018; 23:E1820. [PMID: 30037151 PMCID: PMC6100076 DOI: 10.3390/molecules23071820] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2018] [Revised: 07/19/2018] [Accepted: 07/20/2018] [Indexed: 12/13/2022] Open
Abstract
P-glycoprotein (P-gp), a membrane-bound transporter, can eliminate xenobiotics by transporting them out of the cells or blood⁻brain barrier (BBB) at the expense of ATP hydrolysis. Thus, P-gp mediated efflux plays a pivotal role in altering the absorption and disposition of a wide range of substrates. Nevertheless, the mechanism of P-gp substrate efflux is rather complex since it can take place through active transport and passive permeability in addition to multiple P-gp substrate binding sites. A nonlinear quantitative structure⁻activity relationship (QSAR) model was developed in this study using the novel machine learning-based hierarchical support vector regression (HSVR) scheme to explore the perplexing relationships between descriptors and efflux ratio. The predictions by HSVR were found to be in good agreement with the observed values for the molecules in the training set (n = 50, r² = 0.96, qCV2 = 0.94, RMSE = 0.10, s = 0.10) and test set (n = 13, q² = 0.80⁻0.87, RMSE = 0.21, s = 0.22). When subjected to a variety of statistical validations, the developed HSVR model consistently met the most stringent criteria. A mock test also asserted the predictivity of HSVR. Consequently, this HSVR model can be adopted to facilitate drug discovery and development.
Collapse
Affiliation(s)
- Chun Chen
- Department of Chemistry, National Dong Hwa University, Shoufeng, Hualien 97401, Taiwan.
| | - Ming-Han Lee
- Department of Chemistry, National Dong Hwa University, Shoufeng, Hualien 97401, Taiwan.
| | - Ching-Feng Weng
- Department of Life Science and Institute of Biotechnology, National Dong Hwa University, Shoufeng, Hualien 97401, Taiwan.
| | - Max K Leong
- Department of Chemistry, National Dong Hwa University, Shoufeng, Hualien 97401, Taiwan.
- Department of Life Science and Institute of Biotechnology, National Dong Hwa University, Shoufeng, Hualien 97401, Taiwan.
| |
Collapse
|
13
|
Fu RG, Sun Y, Sheng WB, Liao DF. Designing multi-targeted agents: An emerging anticancer drug discovery paradigm. Eur J Med Chem 2017; 136:195-211. [PMID: 28494256 DOI: 10.1016/j.ejmech.2017.05.016] [Citation(s) in RCA: 133] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2017] [Revised: 04/30/2017] [Accepted: 05/04/2017] [Indexed: 12/11/2022]
Abstract
The dominant paradigm in drug discovery is to design ligands with maximum selectivity to act on individual drug targets. With the target-based approach, many new chemical entities have been discovered, developed, and further approved as drugs. However, there are a large number of complex diseases such as cancer that cannot be effectively treated or cured only with one medicine to modulate the biological function of a single target. As simultaneous intervention of two (or multiple) cancer progression relevant targets has shown improved therapeutic efficacy, the innovation of multi-targeted drugs has become a promising and prevailing research topic and numerous multi-targeted anticancer agents are currently at various developmental stages. However, most multi-pharmacophore scaffolds are usually discovered by serendipity or screening, while rational design by combining existing pharmacophore scaffolds remains an enormous challenge. In this review, four types of multi-pharmacophore modes are discussed, and the examples from literature will be used to introduce attractive lead compounds with the capability of simultaneously interfering with different enzyme or signaling pathway of cancer progression, which will reveal the trends and insights to help the design of the next generation multi-targeted anticancer agents.
Collapse
|
14
|
Speck-Planche A, Cordeiro MNDS. Fragment-based in silico modeling of multi-target inhibitors against breast cancer-related proteins. Mol Divers 2017; 21:511-23. [PMID: 28194627 DOI: 10.1007/s11030-017-9731-1] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2016] [Accepted: 01/16/2017] [Indexed: 10/20/2022]
Abstract
Breast cancer is the most frequent cancer reported in women, being responsible for hundreds of thousands of deaths. Chemotherapy has proven to be effective against this malignant neoplasm depending on different biological factors such as the histopathology, grade, and stage, among others. However, breast cancer cells have become resistant to current chemotherapeutic regimens, urging the discovery of new anti-breast cancer drugs. Computational approaches have the potential to offer promising alternatives to accelerate the search for potent and versatile anti-breast cancer agents. In the present work, we introduce the first multitasking (mtk) computational model devoted to the in silico fragment-based design of new molecules with high inhibitory activity against 19 different proteins involved in breast cancer. The mtk-computational model was created from a dataset formed by 24,285 cases, and it exhibited accuracy around 93% in both training and prediction (test) sets. Several molecular fragments were extracted from the molecules present in the dataset, and their quantitative contributions to the inhibitory activities against all the proteins under study were calculated. The combined use of the fragment contributions and the physicochemical interpretations of the different molecular descriptors in the mtk-computational model allowed the design of eight new molecular entities not reported in our dataset. These molecules were predicted as potent multi-target inhibitors against all the proteins, and they exhibited a desirable druglikeness according to the Lipinski's rule of five and its variants.
Collapse
|
15
|
Kleandrova VV, Ruso JM, Speck-Planche A, Dias Soeiro Cordeiro MN. Enabling the Discovery and Virtual Screening of Potent and Safe Antimicrobial Peptides. Simultaneous Prediction of Antibacterial Activity and Cytotoxicity. ACS Comb Sci 2016; 18:490-8. [PMID: 27280735 DOI: 10.1021/acscombsci.6b00063] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Antimicrobial peptides (AMPs) represent promising alternatives to fight against bacterial pathogens. However, cellular toxicity remains one of the main concerns in the early development of peptide-based drugs. This work introduces the first multitasking (mtk) computational model focused on performing simultaneous predictions of antibacterial activities, and cytotoxicities of peptides. The model was created from a data set containing 3592 cases, and it displayed accuracy higher than 96% for classifying/predicting peptides in both training and prediction (test) sets. The technique known as alanine scanning was computationally applied to illustrate the calculation of the quantitative contributions of the amino acids (in their respective positions of the sequence) to the biological effects of a defined peptide. A small library formed by 10 peptides was generated, where peptides were designed by considering the interpretations of the different descriptors in the mtk-computational model. All the peptides were predicted to exhibit high antibacterial activities against multiple bacterial strains, and low cytotoxicity against various cell types. The present mtk-computational model can be considered a very useful tool to support high throughput research for the discovery of potent and safe AMPs.
Collapse
Affiliation(s)
- Valeria V. Kleandrova
- Faculty
of Technology and Production Management, Moscow State University of Food Production, Volokolamskoe shosse 11, Moscow, Russia
| | - Juan M. Ruso
- Department
of Applied Physics, University of Santiago de Compostela (USC), 15782 Santiago de Compostela, Spain
| | - Alejandro Speck-Planche
- Department
of Applied Physics, University of Santiago de Compostela (USC), 15782 Santiago de Compostela, Spain
- LAQV@REQUIMTE,
Department of Chemistry and Biochemistry, University of Porto, 4169-007 Porto, Portugal
| | | |
Collapse
|
16
|
Abstract
Computational methods for Target Fishing (TF), also known as Target Prediction or Polypharmacology Prediction, can be used to discover new targets for small-molecule drugs. This may result in repositioning the drug in a new indication or improving our current understanding of its efficacy and side effects. While there is a substantial body of research on TF methods, there is still a need to improve their validation, which is often limited to a small part of the available targets and not easily interpretable by the user. Here we discuss how target-centric TF methods are inherently limited by the number of targets that can possibly predict (this number is by construction much larger in ligand-centric techniques). We also propose a new benchmark to validate TF methods, which is particularly suited to analyse how predictive performance varies with the query molecule. On average over approved drugs, we estimate that only five predicted targets will have to be tested to find two true targets with submicromolar potency (a strong variability in performance is however observed). In addition, we find that an approved drug has currently an average of eight known targets, which reinforces the notion that polypharmacology is a common and strong event. Furthermore, with the assistance of a control group of randomly-selected molecules, we show that the targets of approved drugs are generally harder to predict. The benchmark and a simple target prediction method to use as a performance baseline are available at http://ballester.marseille.inserm.fr/TF-benchmark.tar.gz.
Collapse
Affiliation(s)
- Antonio Peón
- Cancer Research Center of Marseille (Institut National de la Santé et de la Recherche Médicale U1068, Institut Paoli-Calmettes, Aix-Marseille Université, Centre National de la Recherche Scientifique UMR7258) Marseille, France
| | - Cuong C Dang
- Cancer Research Center of Marseille (Institut National de la Santé et de la Recherche Médicale U1068, Institut Paoli-Calmettes, Aix-Marseille Université, Centre National de la Recherche Scientifique UMR7258) Marseille, France
| | - Pedro J Ballester
- Cancer Research Center of Marseille (Institut National de la Santé et de la Recherche Médicale U1068, Institut Paoli-Calmettes, Aix-Marseille Université, Centre National de la Recherche Scientifique UMR7258) Marseille, France
| |
Collapse
|
17
|
Halder A, Goodarzi M. Recent Advances in Multi-Task QSAR Modeling for Drug Design. Pharm Sci 2015. [DOI: 10.15171/ps.2015.33] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
|