1
|
Mukherjee J, Sharma R, Dutta P, Bhunia B. Artificial intelligence in healthcare: a mastery. Biotechnol Genet Eng Rev 2024; 40:1659-1708. [PMID: 37013913 DOI: 10.1080/02648725.2023.2196476] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 03/22/2023] [Indexed: 04/05/2023]
Abstract
There is a vast development of artificial intelligence (AI) in recent years. Computational technology, digitized data collection and enormous advancement in this field have allowed AI applications to penetrate the core human area of specialization. In this review article, we describe current progress achieved in the AI field highlighting constraints on smooth development in the field of medical AI sector, with discussion of its implementation in healthcare from a commercial, regulatory and sociological standpoint. Utilizing sizable multidimensional biological datasets that contain individual heterogeneity in genomes, functionality and milieu, precision medicine strives to create and optimize approaches for diagnosis, treatment methods and assessment. With the arise of complexity and expansion of data in the health-care industry, AI can be applied more frequently. The main application categories include indications for diagnosis and therapy, patient involvement and commitment and administrative tasks. There has recently been a sharp rise in interest in medical AI applications due to developments in AI software and technology, particularly in deep learning algorithms and in artificial neural network (ANN). In this overview, we enlisted the major categories of issues that AI systems are ideally equipped to resolve followed by clinical diagnostic tasks. It also includes a discussion of the future potential of AI, particularly for risk prediction in complex diseases, and the difficulties, constraints and biases that must be meticulously addressed for the effective delivery of AI in the health-care sector.
Collapse
Affiliation(s)
- Jayanti Mukherjee
- Department of Pharmaceutical Chemistry, CMR College of Pharmacy Affiliated to Jawaharlal Nehru Technological University, Hyderabad, Telangana, India
| | - Ramesh Sharma
- Department of Bioengineering, National Institute of Technology, Agartala, India
| | - Prasenjit Dutta
- Department of Production Engineering, National Institute of Technology, Agartala, India
| | - Biswanath Bhunia
- Department of Bioengineering, National Institute of Technology, Agartala, India
| |
Collapse
|
2
|
Al-Fakih AM, Qasim MK, Algamal ZY, Alharthi AM, Zainal-Abidin MH. QSAR classification model for diverse series of antifungal agents based on binary coyote optimization algorithm. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023; 34:285-298. [PMID: 37157994 DOI: 10.1080/1062936x.2023.2208374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
One of the recently developed metaheuristic algorithms, the coyote optimization algorithm (COA), has shown to perform better in a number of difficult optimization tasks. The binary form, BCOA, is used in this study as a solution to the descriptor selection issue in classifying diverse antifungal series. Z-shape transfer functions (ZTF) are evaluated to verify their efficiency in improving BCOA performance in QSAR classification based on classification accuracy (CA), the geometric mean of sensitivity and specificity (G-mean), and the area under the curve (AUC). The Kruskal-Wallis test is also applied to show the statistical differences between the functions. The efficacy of the best suggested transfer function, ZTF4, is further assessed by comparing it to the most recent binary algorithms. The results prove that ZTF, especially ZTF4, significantly improves the performance of the original BCOA. The ZTF4 function yields the best CA and G-mean of 99.03% and 0.992%, respectively. It shows the fastest convergence behaviour compared to other binary algorithms. It takes the fewest iterations to reach high classification performance and selects the fewest descriptors. In conclusion, the obtained results indicate the ability of the ZTF4-based BCOA to find the smallest subset of descriptors while maintaining the best classification accuracy performance.
Collapse
Affiliation(s)
- A M Al-Fakih
- Department of Chemistry, Faculty of Science, Universiti Teknologi Malaysia, Johor, Malaysia
- Department of Chemistry, Faculty of Science, Sana'a University, Sana'a, Yemen
| | - M K Qasim
- Department of General Science, University of Mosul, Mosul, Iraq
| | - Z Y Algamal
- Department of Statistics and Informatics, University of Mosul, Mosul, Iraq
- College of Engineering, University of Warith Al-Anbiyaa, Karbala, Iraq
| | - A M Alharthi
- Department of Mathematics, Turabah University College, Taif University, Taif, Saudi Arabia
| | - M H Zainal-Abidin
- Department of Chemistry, Faculty of Science, Universiti Teknologi Malaysia, Johor, Malaysia
| |
Collapse
|
3
|
Searching glycolate oxidase inhibitors based on QSAR, molecular docking, and molecular dynamic simulation approaches. Sci Rep 2022; 12:19969. [PMID: 36402831 PMCID: PMC9675741 DOI: 10.1038/s41598-022-24196-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Accepted: 11/11/2022] [Indexed: 11/21/2022] Open
Abstract
Primary hyperoxaluria type 1 (PHT1) treatment is mainly focused on inhibiting the enzyme glycolate oxidase, which plays a pivotal role in the production of glyoxylate, which undergoes oxidation to produce oxalate. When the renal secretion capacity exceeds, calcium oxalate forms stones that accumulate in the kidneys. In this respect, detailed QSAR analysis, molecular docking, and dynamics simulations of a series of inhibitors containing glycolic, glyoxylic, and salicylic acid groups have been performed employing different regression machine learning techniques. Three robust models with less than 9 descriptors-based on a tenfold cross (Q2 CV) and external (Q2 EXT) validation-were found i.e., MLR1 (Q2 CV = 0.893, Q2 EXT = 0.897), RF1 (Q2 CV = 0.889, Q2 EXT = 0.907), and IBK1 (Q2 CV = 0.891, Q2 EXT = 0.907). An ensemble model was built by averaging the predicted pIC50 of the three models, obtaining a Q2 EXT = 0.933. Physicochemical properties such as charge, electronegativity, hardness, softness, van der Waals volume, and polarizability were considered as attributes to build the models. To get more insight into the potential biological activity of the compouds studied herein, docking and dynamic analysis were carried out, finding the hydrophobic and polar residues show important interactions with the ligands. A screening of the DrugBank database V.5.1.7 was performed, leading to the proposal of seven commercial drugs within the applicability domain of the models, that can be suggested as possible PHT1 treatment.
Collapse
|
4
|
PTML Modeling for Pancreatic Cancer Research: In Silico Design of Simultaneous Multi-Protein and Multi-Cell Inhibitors. Biomedicines 2022; 10:biomedicines10020491. [PMID: 35203699 PMCID: PMC8962338 DOI: 10.3390/biomedicines10020491] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 02/10/2022] [Accepted: 02/15/2022] [Indexed: 02/07/2023] Open
Abstract
Pancreatic cancer (PANC) is a dangerous type of cancer that is a major cause of mortality worldwide and exhibits a remarkably poor prognosis. To date, discovering anti-PANC agents remains a very complex and expensive process. Computational approaches can accelerate the search for anti-PANC agents. We report for the first time two models that combined perturbation theory with machine learning via a multilayer perceptron network (PTML-MLP) to perform the virtual design and prediction of molecules that can simultaneously inhibit multiple PANC cell lines and PANC-related proteins, such as caspase-1, tumor necrosis factor-alpha (TNF-alpha), and the insulin-like growth factor 1 receptor (IGF1R). Both PTML-MLP models exhibited accuracies higher than 78%. Using the interpretation from one of the PTML-MLP models as a guideline, we extracted different molecular fragments desirable for the inhibition of the PANC cell lines and the aforementioned PANC-related proteins and then assembled some of those fragments to form three new molecules. The two PTML-MLP models predicted the designed molecules as potentially versatile anti-PANC agents through inhibition of the three PANC-related proteins and multiple PANC cell lines. Conclusions: This work opens new horizons for the application of the PTML modeling methodology to anticancer research.
Collapse
|
5
|
Computational Drug Repurposing for Antituberculosis Therapy: Discovery of Multi-Strain Inhibitors. Antibiotics (Basel) 2021; 10:antibiotics10081005. [PMID: 34439055 PMCID: PMC8388932 DOI: 10.3390/antibiotics10081005] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Revised: 08/15/2021] [Accepted: 08/17/2021] [Indexed: 12/13/2022] Open
Abstract
Tuberculosis remains the most afflicting infectious disease known by humankind, with one quarter of the population estimated to have it in the latent state. Discovering antituberculosis drugs is a challenging, complex, expensive, and time-consuming task. To overcome the substantial costs and accelerate drug discovery and development, drug repurposing has emerged as an attractive alternative to find new applications for “old” drugs and where computational approaches play an essential role by filtering the chemical space. This work reports the first multi-condition model based on quantitative structure–activity relationships and an ensemble of neural networks (mtc-QSAR-EL) for the virtual screening of potential antituberculosis agents able to act as multi-strain inhibitors. The mtc-QSAR-EL model exhibited an accuracy higher than 85%. A physicochemical and fragment-based structural interpretation of this model was provided, and a large dataset of agency-regulated chemicals was virtually screened, with the mtc-QSAR-EL model identifying already proven antituberculosis drugs while proposing chemicals with great potential to be experimentally repurposed as antituberculosis (multi-strain inhibitors) agents. Some of the most promising molecules identified by the mtc-QSAR-EL model as antituberculosis agents were also confirmed by another computational approach, supporting the capabilities of the mtc-QSAR-EL model as an efficient tool for computational drug repurposing.
Collapse
|
6
|
Kleandrova VV, Scotti L, Bezerra Mendonça Junior FJ, Muratov E, Scotti MT, Speck-Planche A. QSAR Modeling for Multi-Target Drug Discovery: Designing Simultaneous Inhibitors of Proteins in Diverse Pathogenic Parasites. Front Chem 2021; 9:634663. [PMID: 33777898 PMCID: PMC7987820 DOI: 10.3389/fchem.2021.634663] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2020] [Accepted: 01/22/2021] [Indexed: 11/21/2022] Open
Abstract
Parasitic diseases remain as unresolved health issues worldwide. While for some parasites the treatments involve drug combinations with serious side effects, for others, chemical therapies are inefficient due to the emergence of drug resistance. This urges the search for novel antiparasitic agents able to act through multiple mechanisms of action. Here, we report the first multi-target model based on quantitative structure-activity relationships and a multilayer perceptron neural network (mt-QSAR-MLP) to virtually design and predict versatile inhibitors of proteins involved in the survival and/or infectivity of different pathogenic parasites. The mt-QSAR-MLP model exhibited high accuracy (>80%) in both training and test sets for the classification/prediction of protein inhibitors. Several fragments were directly extracted from the physicochemical and structural interpretations of the molecular descriptors in the mt-QSAR-MLP model. Such interpretations enabled the generation of four molecules that were predicted as multi-target inhibitors against at least three of the five parasitic proteins reported here with two of the molecules being predicted to inhibit all the proteins. Docking calculations converged with the mt-QSAR-MLP model regarding the multi-target profile of the designed molecules. The designed molecules exhibited drug-like properties, complying with Lipinski’s rule of five, as well as Ghose’s filter and Veber’s guidelines.
Collapse
Affiliation(s)
- Valeria V Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production, Moscow, Russian Federation
| | - Luciana Scotti
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa, Brazil
| | | | - Eugene Muratov
- Laboratory for Molecular Modeling, The UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Marcus T Scotti
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa, Brazil
| | - Alejandro Speck-Planche
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa, Brazil
| |
Collapse
|
7
|
Kleandrova VV, Scotti MT, Scotti L, Nayarisseri A, Speck-Planche A. Cell-based multi-target QSAR model for design of virtual versatile inhibitors of liver cancer cell lines. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2020; 31:815-836. [PMID: 32967475 DOI: 10.1080/1062936x.2020.1818617] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 08/31/2020] [Indexed: 06/11/2023]
Abstract
Liver cancers are one of the leading fatal diseases among malignant neoplasms. Current chemotherapeutic treatments used to fight these illnesses have become less efficient in terms of both efficacy and safety. Therefore, there is a great need of search for new anti-liver cancer agents and this can be accelerated by using computer-aided drug discovery approaches. In this work, we report the development of the first cell-based multi-target model based on quantitative structure-activity relationships (CBMT-QSAR) for the design and prediction of chemicals as anticancer agents against 17 liver cancer cell lines. While having a good quality and predictive power (accuracy higher than 80%) in the training and test sets, respectively, the CBMT-QSAR model was employed as a tool to directly extract suitable fragments from the physicochemical and structural interpretations of the molecular descriptors. Some of these desirable fragments were assembled, leading to the virtual design of eight molecules with drug-like properties, with six of them being predicted as versatile anticancer agents against the 17 liver cancer cell lines reported here.
Collapse
Affiliation(s)
- V V Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production , Moscow, Russian Federation
| | - M T Scotti
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba , João Pessoa, Brazil
| | - L Scotti
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba , João Pessoa, Brazil
| | - A Nayarisseri
- In Silico Research Laboratory, Eminent Biosciences , Indore, Madhya Pradesh, India
| | - A Speck-Planche
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba , João Pessoa, Brazil
| |
Collapse
|
8
|
Kleandrova VV, Speck-Planche A. PTML Modeling for Alzheimer’s Disease: Design and Prediction of Virtual Multi-Target Inhibitors of GSK3B, HDAC1, and HDAC6. Curr Top Med Chem 2020; 20:1661-1676. [DOI: 10.2174/1568026620666200607190951] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 12/12/2019] [Accepted: 01/05/2020] [Indexed: 01/23/2023]
Abstract
Background:
Alzheimer’s disease is characterized by a progressive pattern of cognitive and
functional impairment, which ultimately leads to death. Computational approaches have played an important
role in the context of drug discovery for anti-Alzheimer's therapies. However, most of the computational
models reported to date have been focused on only one protein associated with Alzheimer's,
while relying on small datasets of structurally related molecules.
Objective:
We introduce the first model combining perturbation theory and machine learning based on
artificial neural networks (PTML-ANN) for simultaneous prediction and design of inhibitors of three
Alzheimer’s disease-related proteins, namely glycogen synthase kinase 3 beta (GSK3B), histone deacetylase
1 (HDAC1), and histone deacetylase 6 (HDAC6).
Methods:
The PTML-ANN model was obtained from a dataset retrieved from ChEMBL, and it relied on
a classification approach to predict chemicals as active or inactive.
Results:
The PTML-ANN model displayed sensitivity and specificity higher than 85% in both training
and test sets. The physicochemical and structural interpretation of the molecular descriptors in the model
permitted the direct extraction of fragments suggested to favorably contribute to enhancing the multitarget
inhibitory activity. Based on this information, we assembled ten molecules from several fragments
with positive contributions. Seven of these molecules were predicted as triple target inhibitors while the
remaining three were predicted as dual-target inhibitors. The estimated physicochemical properties of
the designed molecules complied with Lipinski’s rule of five and its variants.
Conclusion:
This work opens new horizons toward the design of multi-target inhibitors for anti- Alzheimer's
therapies.
Collapse
Affiliation(s)
- Valeria V. Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production, Volokolamskoe Shosse 11, 125080, Moscow, Russian Federation
| | - Alejandro Speck-Planche
- Programa Institucional de Fomento a la Investigacion, Desarrollo e Innovacion, Universidad Tecnologica Metropolitana, Ignacio Valdivieso 2409, P.O. Box 8940577, San Joaquin, Santiago, Chile
| |
Collapse
|
9
|
Alharthi AM, Lee MH, Algamal ZY, Al-Fakih AM. Quantitative structure-activity relationship model for classifying the diverse series of antifungal agents using ratio weighted penalized logistic regression. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2020; 31:571-583. [PMID: 32628042 DOI: 10.1080/1062936x.2020.1782467] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 06/10/2020] [Indexed: 06/11/2023]
Abstract
One of the most challenging issues when facing a Quantitative structure-activity relationship (QSAR) classification model is to deal with the descriptor selection. Penalized methods have been adapted and have gained popularity as a key for simultaneously performing descriptor selection and QSAR classification model estimation. However, penalized methods have drawbacks such as having biases and inconsistencies that make they lack the oracle properties. This paper proposes an adaptive penalized logistic regression (APLR) to overcome these drawbacks. This is done by employing a ratio (BWR) of the descriptors between-groups sum of squares (BSS) to the within-groups sum of squares (WSS) for each descriptor as a weight inside the L1-norm. The proposed method was applied to one dataset that consists of a diverse series of antimicrobial agents with their respective bioactivities against Candida albicans. By experimental study, it has been shown that the proposed method (APLR) was more efficient in the selection of descriptors and classification accuracy than the other competitive methods that could be used in developing QSAR classification models. Another dataset was also successfully experienced. Therefore, it can be concluded that the APLR method had significant impact on QSAR analysis and studies.
Collapse
Affiliation(s)
- A M Alharthi
- Department of Mathematical Sciences, Universiti Teknologi Malaysia , Skudai, Malaysia
| | - M H Lee
- Department of Mathematical Sciences, Universiti Teknologi Malaysia , Skudai, Malaysia
| | - Z Y Algamal
- Department of Statistics and Informatics, University of Mosul , Mosul, Iraq
| | - A M Al-Fakih
- Department of Chemistry, Faculty of Science, Universiti Teknologi Malaysia , Johor, Malaysia
| |
Collapse
|
10
|
Hu S, Chen P, Gu P, Wang B. A Deep Learning-Based Chemical System for QSAR Prediction. IEEE J Biomed Health Inform 2020; 24:3020-3028. [PMID: 32142459 DOI: 10.1109/jbhi.2020.2977009] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Research on quantitative structure-activity relationships (QSAR) provides an effective approach to determine new hits and promising lead compounds during drug discovery. In the past decades, various works have gained good performance for QSAR with the development of machine learning. The rise of deep learning, along with massive accessible chemical databases, made improvement on the QSAR performance. This article proposes a novel deep-learning-based method to implement QSAR prediction by the concatenation of end-to-end encoder-decoder model and convolutional neural network (CNN) architecture. The encoder-decoder model is mainly used to generate fixed-size latent features to represent chemical molecules; while these features are then input into CNN framework to train a robust and stable model and finally to predict active chemicals. Two models with different schemes are investigated to evaluate the validity of our proposed model on the same data sets. Experimental results showed that our proposed method outperforms other state-of-the-art methods in successful identification of chemical molecule whether it is active.
Collapse
|
11
|
Halder AK, Giri AK, Cordeiro MNDS. Multi-Target Chemometric Modelling, Fragment Analysis and Virtual Screening with ERK Inhibitors as Potential Anticancer Agents. Molecules 2019; 24:molecules24213909. [PMID: 31671605 PMCID: PMC6864583 DOI: 10.3390/molecules24213909] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Revised: 10/21/2019] [Accepted: 10/25/2019] [Indexed: 02/07/2023] Open
Abstract
Two isoforms of extracellular regulated kinase (ERK), namely ERK-1 and ERK-2, are associated with several cellular processes, the aberration of which leads to cancer. The ERK-1/2 inhibitors are thus considered as potential agents for cancer therapy. Multitarget quantitative structure–activity relationship (mt-QSAR) models based on the Box–Jenkins approach were developed with a dataset containing 6400 ERK inhibitors assayed under different experimental conditions. The first mt-QSAR linear model was built with linear discriminant analysis (LDA) and provided information regarding the structural requirements for better activity. This linear model was also utilised for a fragment analysis to estimate the contributions of ring fragments towards ERK inhibition. Then, the random forest (RF) technique was employed to produce highly predictive non-linear mt-QSAR models, which were used for screening the Asinex kinase library and identify the most potential virtual hits. The fragment analysis results justified the selection of the hits retrieved through such virtual screening. The latter were subsequently subjected to molecular docking and molecular dynamics simulations to understand their possible interactions with ERK enzymes. The present work, which utilises in-silico techniques such as multitarget chemometric modelling, fragment analysis, virtual screening, molecular docking and dynamics, may provide important guidelines to facilitate the discovery of novel ERK inhibitors.
Collapse
Affiliation(s)
- Amit Kumar Halder
- Department of Chemistry and Biochemistry, University of Porto, 4169-007 Porto, Portugal.
| | - Amal Kanta Giri
- Department of Chemistry and Biochemistry, University of Porto, 4169-007 Porto, Portugal.
| | | |
Collapse
|
12
|
Qian T, Zhu S, Hoshida Y. Use of big data in drug development for precision medicine: an update. EXPERT REVIEW OF PRECISION MEDICINE AND DRUG DEVELOPMENT 2019; 4:189-200. [PMID: 31286058 PMCID: PMC6613936 DOI: 10.1080/23808993.2019.1617632] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 05/08/2019] [Indexed: 02/08/2023]
Abstract
INTRODUCTION Big-data-driven drug development resources and methodologies have been evolving with ever-expanding data from large-scale biological experiments, clinical trials, and medical records from participants in data collection initiatives. The enrichment of biological- and clinical-context-specific large-scale data has enabled computational inference more relevant to real-world biomedical research, particularly identification of therapeutic targets and drugs for specific diseases and clinical scenarios. AREAS COVERED Here we overview recent progresses made in the fields: new big-data-driven approach to therapeutic target discovery, candidate drug prioritization, inference of clinical toxicity, and machine-learning methods in drug discovery. EXPERT OPINION In the near future, much larger volumes and complex datasets for precision medicine will be generated, e.g., individual and longitudinal multi-omic, and direct-to-consumer datasets. Closer collaborations between experts with different backgrounds would also be required to better translate analytic results into prognosis and treatment in the clinical practice. Meanwhile, cloud computing with protected patient privacy would become more routine analytic practice to fill the gaps within data integration along with the advent of big-data. To conclude, integration of multitudes of data generated for each individual along with techniques tailored for big-data analytics may eventually enable us to achieve precision medicine.
Collapse
Affiliation(s)
- Tongqi Qian
- Department of Genetics and Genomic Sciences and Icahn
Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount
Sinai, New York, NY, USA
| | - Shijia Zhu
- Liver Tumor Translational Research Program, Simmons
Comprehensive Cancer Center, Division of Digestive and Liver Diseases, Department of
Internal Medicine, University of Texas Southwestern Medical Center, Dallas, TX
75390, USA
| | - Yujin Hoshida
- Liver Tumor Translational Research Program, Simmons
Comprehensive Cancer Center, Division of Digestive and Liver Diseases, Department of
Internal Medicine, University of Texas Southwestern Medical Center, Dallas, TX
75390, USA
| |
Collapse
|
13
|
García-Jacas CR, Marrero-Ponce Y, Cortés-Guzmán F, Suárez-Lezcano J, Martinez-Rios FO, García-González LA, Pupo-Meriño M, Martinez-Mayorga K. Enhancing Acute Oral Toxicity Predictions by using Consensus Modeling and Algebraic Form-Based 0D-to-2D Molecular Encodes. Chem Res Toxicol 2019; 32:1178-1192. [PMID: 31066547 DOI: 10.1021/acs.chemrestox.9b00011] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Quantitative structure-activity relationships (QSAR) are introduced to predict acute oral toxicity (AOT), by using the QuBiLS-MAS (acronym for quadratic, bilinear and N-Linear maps based on graph-theoretic electronic-density matrices and atomic weightings) framework for the molecular encoding. Three training sets were employed to build the models: EPA training set (5931 compounds), EPA-full training set (7413 compounds), and Zhu training set (10 152 compounds). Additionally, the EPA test set (1482 compounds) was used for the validation of the QSAR models built on the EPA training set, while the ProTox (425 compounds) and T3DB (284 compounds) external sets were employed for the assessment of all the models. The k-nearest neighbor, multilayer perceptron, random forest, and support vector machine procedures were employed to build several base (individual) models. The base models with REPA-training ≥ 0.75 ( R = correlation coefficient) and MAEEPA-training ≤ 0.5 (MAE = mean absolute error) were retained to build consensus models. As a result, two consensus models based on the minimum operator and denoted as M19 and M22, as well as a consensus model based on the weighted average operator and denoted as M24, were selected as the best ones for each training set considered. According to the applicability domain (AD) analysis performed, model M19 (built on the EPA training set) has MAEtest-AD = 0.4044, MAEProTox-AD = 0.4067 and MAET3DB-AD = 0.2586 on the EPA test set, ProTox external set, and T3DB external set, respectively; whereas model M22 (built on the EPA-full set) and model M24 (built on the Zhu set) present MAEProTox-AD = 0.3992 and MAET3DB-AD = 0.2286, and MAEProTox-AD = 0.3773 and MAET3DB-AD = 0.2471 on the two external sets accounted for, respectively. These outcomes were compared and statistically validated with respect to 14 QSAR methods (e.g., admetSAR, ProTox-II) from the literature. As a result, model M22 presents the best overall performance. In addition, a retrospective study on 261 withdrawn drugs due to their toxic/side effects was performed, to assess the usefulness of prospectively using the QSAR models proposed in the labeling of chemicals. A comparison with regard to the methods from the literature was also made. As a result, model M22 has the best ability of labeling a compound as toxic according to the globally harmonized system of classification and labeling of chemicals. Therefore, it can be concluded that the models proposed, especially model M22, constitute prominent tools for studying AOT, at providing the best results among all the methods examined. A freely available software was also developed to be used in virtual screening tasks ( http://tomocomd.com/apps/ptoxra ).
Collapse
Affiliation(s)
- César R García-Jacas
- Departamento de Ciencias de la Computación , Centro de Investigación Científica y de Educación Superior de Ensenada , Ensenada , Baja California , México
| | - Yovani Marrero-Ponce
- Universidad San Francisco de Quito, Grupo de Medicina Molecular y Traslacional, Colegio de Ciencias de la Salud , Escuela de Medicina, Edificio de Especialidades Médicas , Quito , Pichincha , Ecuador.,Grupo de Investigación Ambiental, Programas Ambientales, Facultad de Ingenierías , Fundacion Universitaria Tecnologico Comfenalco-Cartagena , Cr44 DN 30 A, 91 , Cartagena , Bolívar , Colombia
| | - Fernando Cortés-Guzmán
- Instituto de Química , Universidad Nacional Autónoma de México , Ciudad de México , México
| | - José Suárez-Lezcano
- Pontificia Universidad Católica del Ecuador Sede Esmeraldas , Esmeraldas , Ecuador
| | | | - Luis A García-González
- Grupo de Investigación de Bioinformática , Universidad de las Ciencias Informáticas , La Habana , Cuba
| | - Mario Pupo-Meriño
- Grupo de Investigación de Bioinformática , Universidad de las Ciencias Informáticas , La Habana , Cuba
| | | |
Collapse
|
14
|
Al-Fakih AM, Algamal ZY, Lee MH, Aziz M, Ali HTM. QSAR classification model for diverse series of antifungal agents based on improved binary differential search algorithm. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2019; 30:131-143. [PMID: 30734580 DOI: 10.1080/1062936x.2019.1568298] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 01/08/2019] [Indexed: 06/09/2023]
Abstract
An improved binary differential search (improved BDS) algorithm is proposed for QSAR classification of diverse series of antimicrobial compounds against Candida albicans inhibitors. The transfer functions is the most important component of the BDS algorithm, and converts continuous values of the donor into discrete values. In this paper, the eight types of transfer functions are investigated to verify their efficiency in improving BDS algorithm performance in QSAR classification. The performance was evaluated using three metrics: classification accuracy (CA), geometric mean of sensitivity and specificity (G-mean), and area under the curve. The Kruskal-Wallis test was also applied to show the statistical differences between the functions. Two functions, S1 and V4, show the best classification achievement, with a slightly better performance of V4 than S1. The V4 function takes the lowest iterations and selects the fewest descriptors. In addition, the V4 function yields the best CA and G-mean of 98.07% and 0.977%, respectively. The results prove that the V4 transfer function significantly improves the performance of the original BDS.
Collapse
Affiliation(s)
- A M Al-Fakih
- a Department of Chemistry , Universiti Teknologi Malaysia , Johor , Malaysia
- b Department of Chemistry , Sana'a University , Sana'a , Yemen
| | - Z Y Algamal
- c Department of Statistics and Informatics , University of Mosul , Mosul , Iraq
| | - M H Lee
- d Department of Mathematical Sciences , Universiti Teknologi Malaysia , Johor , Malaysia
| | - M Aziz
- a Department of Chemistry , Universiti Teknologi Malaysia , Johor , Malaysia
- e Advanced Membrane Technology Centre, Universiti Teknologi Malaysia , Johor , Malaysia
| | - H T M Ali
- f College of Computers and Information Technology, Nawroz University , Kurdistan region , Iraq
| |
Collapse
|
15
|
Speck-Planche A. Combining Ensemble Learning with a Fragment-Based Topological Approach To Generate New Molecular Diversity in Drug Discovery: In Silico Design of Hsp90 Inhibitors. ACS OMEGA 2018; 3:14704-14716. [PMID: 30555986 PMCID: PMC6289491 DOI: 10.1021/acsomega.8b02419] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 10/23/2018] [Indexed: 05/05/2023]
Abstract
Machine learning methods have revolutionized modern science, providing fast and accurate solutions to multiple problems. However, they are commonly treated as "black boxes". Therefore, in important scientific fields such as medicinal chemistry and drug discovery, machine learning methods are restricted almost exclusively to the task of performing predictions of large and heterogeneous data sets of chemicals. The lack of interpretability prevents the full exploitation of the machine learning models as generators of new chemical knowledge. This work focuses on the development of an ensemble learning model for the prediction and design of potent dual heat shock protein 90 (Hsp90) inhibitors. The model displays accuracy higher than 80% in both training and test sets. To use the ensemble model as a generator of new chemical knowledge, three steps were followed. First, a physicochemical and/or structural interpretation was provided for each molecular descriptor present in the ensemble learning model. Second, the term "pseudolinear equation" was introduced within the context of machine learning to calculate the relative quantitative contributions of different molecular fragments to the inhibitory activity against the two Hsp90 isoforms studied here. Finally, by assembling the fragments with positive contributions, new molecules were designed, being predicted as potent Hsp90 inhibitors. According to Lipinski's rule of five, the designed molecules were found to exhibit potentially good oral bioavailability, a primordial property that chemicals must have to pass early stages in drug discovery. The present approach based on the combination of ensemble learning and fragment-based topological design holds great promise in drug discovery, and it can be adapted and applied to many different scientific disciplines.
Collapse
|
16
|
BET bromodomain inhibitors: fragment-based in silico design using multi-target QSAR models. Mol Divers 2018; 23:555-572. [PMID: 30421269 DOI: 10.1007/s11030-018-9890-8] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 10/30/2018] [Indexed: 12/17/2022]
Abstract
Epigenetics has become a focus of interest in drug discovery. In this sense, bromodomain-containing proteins have emerged as potential epigenetic targets in cancer research and other therapeutic areas. Several computational approaches have been applied to the prediction of bromodomain inhibitors. Nevertheless, such approaches have several drawbacks such as the fact that they predict activity against only one bromodomain-containing protein, using structurally related compounds. Also, there are no reports focused on meaningfully analyzing the physicochemical/structural features that are necessary for the design of a bromodomain inhibitor. This work describes the development of two different multi-target models based on quantitative structure-activity relationships (mt-QSAR) for the prediction and in silico design of multi-target bromodomain inhibitors against the proteins BRD2, BRD3, and BRD4. The first model relied on linear discriminant analysis (LDA) while the second focused on artificial neural networks. Both models exhibited accuracies higher than 85% in the dataset. Several molecular fragments were extracted, and their contributions to the inhibitory activity against the three BET proteins were calculated by the LDA model. Six molecules were designed by assembling the fragments with positive contributions, and they were predicted as multi-target BET bromodomain inhibitors by the two mt-QSAR models. Molecular docking calculations converged with the predictions performed by the mt-QSAR models, suggesting that the designed molecules can exhibit potent activity against the three BET proteins. These molecules complied with the Lipinski's rule of five.
Collapse
|
17
|
García-Jacas CR, Cabrera-Leyva L, Marrero-Ponce Y, Suárez-Lezcano J, Cortés-Guzmán F, Pupo-Meriño M, Vivas-Reyes R. Choquet integral-based fuzzy molecular characterizations: when global definitions are computed from the dependency among atom/bond contributions (LOVIs/LOEIs). J Cheminform 2018; 10:51. [PMID: 30362050 PMCID: PMC6755596 DOI: 10.1186/s13321-018-0306-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2018] [Accepted: 10/15/2018] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Several topological (2D) and geometric (3D) molecular descriptors (MDs) are calculated from local vertex/edge invariants (LOVIs/LOEIs) by performing an aggregation process. To this end, norm-, mean- and statistic-based (non-fuzzy) operators are used, under the assumption that LOVIs/LOEIs are independent (orthogonal) values of one another. These operators are based on additive and/or linear measures and, consequently, they cannot be used to encode information from interrelated criteria. Thus, as LOVIs/LOEIs are not orthogonal values, then non-additive (fuzzy) measures can be used to encode the interrelation among them. RESULTS General approaches to compute fuzzy 2D/3D-MDs from the contribution of each atom (LOVIs) or covalent bond (LOEIs) within a molecule are proposed, by using the Choquet integral as fuzzy aggregation operator. The Choquet integral-based operator is rather different from the other operators often used for the 2D/3D-MDs calculation. It performs a reordering step to fuse the LOVIs/LOEIs according to their magnitudes and, in addition, it considers the interrelation among them through a fuzzy measure. With this operator, fuzzy definitions can be derived from traditional or recent MDs; for instance, fuzzy Randic-like connectivity indices, fuzzy Balaban-like indices, fuzzy Kier-Hall connectivity indices, among others. To demonstrate the feasibility of using this operator, the QuBiLS-MIDAS 3D-MDs were used as study case and, as a result, a module was built into the corresponding software to compute them ( http://tomocomd.com/qubils-midas ). Thus, it is the only software reported in the literature that can be employed to determine Choquet integral-based fuzzy MDs. Moreover, regression models were created on eight chemical datasets. In this way, a comparison between the results achieved by the models based on the non-fuzzy QuBiLS-MIDAS 3D-MDs with regard to the ones achieved by the models based on the fuzzy QuBiLS-MIDAS 3D-MDs was made. As a result, the models built with the fuzzy QuBiLS-MIDAS 3D-MDs achieved the best performance, which was statistically corroborated through the Wilcoxon signed-rank test. CONCLUSIONS All in all, it can be concluded that the Choquet integral constitutes a prominent alternative to compute fuzzy 2D/3D-MDs from LOVIs/LOEIs. In this way, better characterizations of the compounds can be obtained, which will be ultimately useful in enhancing the modelling ability of existing traditional 2D/3D-MDs.
Collapse
Affiliation(s)
- César R. García-Jacas
- Instituto de Química, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, México
| | - Lisset Cabrera-Leyva
- Grupo de Investigación de Inteligencia Artificial (AIRES), Facultad de Informática, Universidad de Camagüey, Camagüey, Cuba
| | - Yovani Marrero-Ponce
- Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas, Universidad San Francisco de Quito (USFQ), Quito, Pichincha Ecuador
- Grupo de Investigación Ambiental (GIA), Programas Ambientales, Facultad de Ingenierías, Fundacion Universitaria Tecnologico Comfenalco – Cartagena, Cr 44 DN 30 A, 91, Cartagena, Bolívar Colombia
| | - José Suárez-Lezcano
- Pontificia Universidad Católica del Ecuador Sede Esmeraldas (PUCESE), Esmeraldas, Ecuador
| | - Fernando Cortés-Guzmán
- Instituto de Química, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, México
| | - Mario Pupo-Meriño
- Grupo de Investigación de Bioinformática, Universidad de las Ciencias Informáticas (UCI), La Habana, Cuba
| | - Ricardo Vivas-Reyes
- Grupo de Química Cuántica y Teórica, Facultad de Ciencias Exactas y Naturales, Programa de Química, Universidad de Cartagena, Campus de San Pablo, Cartagena, Colombia
- Grupo CipTec, Facultad de Ingenierias, Fundacion Universitaria Tecnologico Comfenalco – Cartagena, Cr 44 DN 30 A, 91, Cartagena, Bolívar Colombia
| |
Collapse
|
18
|
Zhang L, Tan J, Han D, Zhu H. From machine learning to deep learning: progress in machine intelligence for rational drug discovery. Drug Discov Today 2017; 22:1680-1685. [PMID: 28881183 DOI: 10.1016/j.drudis.2017.08.010] [Citation(s) in RCA: 308] [Impact Index Per Article: 38.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Revised: 07/13/2017] [Accepted: 08/30/2017] [Indexed: 01/29/2023]
Abstract
Machine intelligence, which is normally presented as artificial intelligence, refers to the intelligence exhibited by computers. In the history of rational drug discovery, various machine intelligence approaches have been applied to guide traditional experiments, which are expensive and time-consuming. Over the past several decades, machine-learning tools, such as quantitative structure-activity relationship (QSAR) modeling, were developed that can identify potential biological active molecules from millions of candidate compounds quickly and cheaply. However, when drug discovery moved into the era of 'big' data, machine learning approaches evolved into deep learning approaches, which are a more powerful and efficient way to deal with the massive amounts of data generated from modern drug discovery approaches. Here, we summarize the history of machine learning and provide insight into recently developed deep learning approaches and their applications in rational drug discovery. We suggest that this evolution of machine intelligence now provides a guide for early-stage drug design and discovery in the current big data era.
Collapse
Affiliation(s)
- Lu Zhang
- College of Life Science and Bio-engineering, Beijing University of Technology, Beijing, 100124, China
| | - Jianjun Tan
- College of Life Science and Bio-engineering, Beijing University of Technology, Beijing, 100124, China.
| | - Dan Han
- College of Life Science and Bio-engineering, Beijing University of Technology, Beijing, 100124, China
| | - Hao Zhu
- College of Life Science and Bio-engineering, Beijing University of Technology, Beijing, 100124, China; Department of Chemistry, Rutgers University, Camden, NJ 08102, USA; The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA.
| |
Collapse
|
19
|
Perfect JR, Tenor JL, Miao Y, Brennan RG. Trehalose pathway as an antifungal target. Virulence 2017; 8:143-149. [PMID: 27248439 PMCID: PMC5383216 DOI: 10.1080/21505594.2016.1195529] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Revised: 05/18/2016] [Accepted: 05/20/2016] [Indexed: 01/23/2023] Open
Abstract
With an increasing immunocompromised population which is linked to invasive fungal infections, it is clear that our present 3 classes of antifungal agents may not be sufficient to provide optimal management to these fragile patients. Furthermore, with widespread use of antifungal agents, drug-resistant fungal infections are on the rise. Therefore, there is some urgency to develop the antifungal pipeline with the goal of new antifungal agent discovery. In this review, a simple metabolic pathway, which forms the disaccharide, trehalose, will be characterized and its potential as a focus for antifungal target(s) explained. It possesses several important features for development of antifungal agents. First, it appears to have fungicidal characteristics and second, it is broad spectrum with importance across both ascomycete and basidiomycete species. Finally, this pathway is not found in mammals so theoretically specific inhibitors of the trehalose pathway and its enzymes in fungi should be relatively non-toxic for mammals. The trehalose pathway and its critical enzymes are now in a position to have directed antifungal discovery initiated in order to find a new class of antifungal drugs.
Collapse
Affiliation(s)
- John R. Perfect
- Departments of Medicine and Biochemistry, Duke University Medical Center, Durham, NC, USA
| | - Jennifer L. Tenor
- Departments of Medicine and Biochemistry, Duke University Medical Center, Durham, NC, USA
| | - Yi Miao
- Departments of Medicine and Biochemistry, Duke University Medical Center, Durham, NC, USA
| | - Richard G. Brennan
- Departments of Medicine and Biochemistry, Duke University Medical Center, Durham, NC, USA
| |
Collapse
|
20
|
Shoombuatong W, Prathipati P, Owasirikul W, Worachartcheewan A, Simeon S, Anuwongcharoen N, Wikberg JES, Nantasenamat C. Towards the Revival of Interpretable QSAR Models. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2017. [DOI: 10.1007/978-3-319-56850-8_1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
21
|
Abstract
In recent years, the increase of invasive fungal infections and the emergence of antifungal resistance stressed the need for new antifungal drugs. Peptides have shown to be good candidates for the development of alternative antimicrobial agents through high-throughput screening, and subsequent optimization according to a rational approach. This review presents a brief overview on antifungal natural peptides of different sources (animals, plants, micro-organisms), peptide fragments derived by proteolytic cleavage of precursor physiological proteins (cryptides), synthetic unnatural peptides and peptide derivatives. Antifungal peptides are schematically reported based on their structure, antifungal spectrum and reported effects. Natural or synthetic peptides and their modified derivatives may represent the basis for new compounds active against fungal infections.
Collapse
|