1
|
Backenköhler M, Groß J, Wolf V, Volkamer A. Guided Docking as a Data Generation Approach Facilitates Structure-Based Machine Learning on Kinases. J Chem Inf Model 2024; 64:4009-4020. [PMID: 38751014 DOI: 10.1021/acs.jcim.4c00055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2024]
Abstract
Drug discovery pipelines nowadays rely on machine learning models to explore and evaluate large chemical spaces. While including 3D structural information is considered beneficial, structural models are hindered by the availability of protein-ligand complex structures. Exemplified for kinase drug discovery, we address this issue by generating kinase-ligand complex data using template docking for the kinase compound subset of available ChEMBL assay data. To evaluate the benefit of the created complex data, we use it to train a structure-based E(3)-invariant graph neural network. Our evaluation shows that binding affinities can be predicted with significantly higher precision by models that take synthetic binding poses into account compared to ligand- or drug-target interaction models alone.
Collapse
Affiliation(s)
- Michael Backenköhler
- Data Driven Drug Design, Center for Bioinformatics, Saarland University, Saarbrücken 66123, Germany
| | - Joschka Groß
- Modeling and Simulation, Saarland University, Saarbrücken 66123, Germany
| | - Verena Wolf
- Modeling and Simulation, Saarland University, Saarbrücken 66123, Germany
| | - Andrea Volkamer
- Data Driven Drug Design, Center for Bioinformatics, Saarland University, Saarbrücken 66123, Germany
- Structural Bioinformatics and in Silico Toxicology Institute of Physiology, Universitätsmedizin Berlin, Berlin 10117, Germany
| |
Collapse
|
2
|
Kumar S, Bhowmik R, Oh JM, Abdelgawad MA, Ghoneim MM, Al-Serwi RH, Kim H, Mathew B. Machine learning driven web-based app platform for the discovery of monoamine oxidase B inhibitors. Sci Rep 2024; 14:4868. [PMID: 38418571 PMCID: PMC10901862 DOI: 10.1038/s41598-024-55628-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 02/26/2024] [Indexed: 03/01/2024] Open
Abstract
Monoamine oxidases (MAOs), specifically MAO-A and MAO-B, play important roles in the breakdown of monoamine neurotransmitters. Therefore, MAO inhibitors are crucial for treating various neurodegenerative disorders, including Parkinson's disease (PD), Alzheimer's disease (AD), and amyotrophic lateral sclerosis (ALS). In this study, we developed a novel cheminformatics pipeline by generating three diverse molecular feature-based machine learning-assisted quantitative structural activity relationship (ML-QSAR) models concerning MAO-B inhibition. PubChem fingerprints, substructure fingerprints, and one-dimensional (1D) and two-dimensional (2D) molecular descriptors were implemented to unravel the structural insights responsible for decoding the origin of MAO-B inhibition in 249 non-reductant molecules. Based on a random forest ML algorithm, the final PubChem fingerprint, substructure fingerprint, and 1D and 2D molecular descriptor prediction models demonstrated significant robustness, with correlation coefficients of 0.9863, 0.9796, and 0.9852, respectively. The significant features of each predictive model responsible for MAO-B inhibition were extracted using a comprehensive variance importance plot (VIP) and correlation matrix analysis. The final predictive models were further developed as a web application, MAO-B-pred ( https://mao-b-pred.streamlit.app/ ), to allow users to predict the bioactivity of molecules against MAO-B. Molecular docking and dynamics studies were conducted to gain insight into the atomic-level molecular interactions between the ligand-receptor complexes. These findings were compared with the structural features obtained from the ML-QSAR models, which supported the mechanistic understanding of the binding phenomena. The presented models have the potential to serve as tools for identifying crucial molecular characteristics for the rational design of MAO-B target inhibitors, which may be used to develop effective drugs for neurodegenerative disorders.
Collapse
Affiliation(s)
- Sunil Kumar
- Department of Pharmaceutical Chemistry, Amrita School of Pharmacy, Amrita Vishwa Vidyapeetham, AIMS Health Sciences Campus, Kochi, India
| | - Ratul Bhowmik
- Department of Pharmaceutical Chemistry, School of Pharmaceutical Education and Research, Jamia Hamdard, New Delhi, India
| | - Jong Min Oh
- Department of Pharmacy, and Research Institute of Life Pharmaceutical Sciences, Sunchon National University, Suncheon, 57922, Republic of Korea
| | - Mohamed A Abdelgawad
- Department of Pharmaceutical Chemistry, College of Pharmacy, Jouf University, 72341, Sakaka, Aljouf, Saudi Arabia
| | - Mohammed M Ghoneim
- Department of Pharmacy Practice, College of Pharmacy, AlMaarefa University, 13713, Ad Diriyah, Riyadh, Saudi Arabia
| | - Rasha Hamed Al-Serwi
- Department of Basic Dental Sciences, College of Dentistry, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, 11671, Riyadh, Saudi Arabia
| | - Hoon Kim
- Department of Pharmacy, and Research Institute of Life Pharmaceutical Sciences, Sunchon National University, Suncheon, 57922, Republic of Korea.
| | - Bijo Mathew
- Department of Pharmaceutical Chemistry, Amrita School of Pharmacy, Amrita Vishwa Vidyapeetham, AIMS Health Sciences Campus, Kochi, India.
| |
Collapse
|
3
|
Akinola LK, Uzairu A, Shallangwa GA, Abechi SE, Umar AB. Identification of estrogen receptor agonists among hydroxylated polychlorinated biphenyls using classification-based quantitative structure-activity relationship models. Curr Res Toxicol 2024; 6:100158. [PMID: 38435023 PMCID: PMC10907392 DOI: 10.1016/j.crtox.2024.100158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 02/22/2024] [Accepted: 02/22/2024] [Indexed: 03/05/2024] Open
Abstract
Identification of estrogen receptor (ER) agonists among environmental toxicants is essential for assessing the potential impact of toxicants on human health. Using 2D autocorrelation descriptors as predictor variables, two binary logistic regression models were developed to identify active ER agonists among hydroxylated polychlorinated biphenyls (OH-PCBs). The classifications made by the two models on the training set compounds resulted in accuracy, sensitivity and specificity of 95.9 %, 93.9 % and 97.6 % for ERα dataset and 91.9 %, 90.9 % and 92.7 % for ERβ dataset. The areas under the ROC curves, constructed with the training set data, were found to be 0.985 and 0.987 for the two models. Predictions made by models I and II correctly classified 84.0 % and 88.0 % of the test set compounds and 89.8 % and 85.8% of the cross-validation set compounds respectively. The two classification-based QSAR models proposed in this paper are considered robust and reliable for rapid identification of ERα and ERβ agonists among OH-PCB congeners.
Collapse
Affiliation(s)
- Lukman K. Akinola
- Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria
- Department of Chemistry, Bauchi State University, Gadau, Nigeria
| | - Adamu Uzairu
- Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria
| | | | | | | |
Collapse
|
4
|
Jawarkar RD, Zaki MEA, Al-Hussain SA, Al-Mutairi AA, Samad A, Masand V, Humane V, Mali S, Alzahrani AYA, Rashid S, Elossaily GM. Mechanistic QSAR modeling derived virtual screening, drug repurposing, ADMET and in- vitro evaluation to identify anticancer lead as lysine-specific demethylase 5a inhibitor. J Biomol Struct Dyn 2024:1-31. [PMID: 38385447 DOI: 10.1080/07391102.2024.2319104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 02/11/2024] [Indexed: 02/23/2024]
Abstract
A lysine-specific demethylase is an enzyme that selectively eliminates methyl groups from lysine residues. KDM5A, also known as JARID1A or RBP2, belongs to the KDM5 Jumonji histone demethylase subfamily. To identify novel molecules that interact with the LSD5A receptor, we created a quantitative structure-activity relationship (QSAR) model. A group of 435 compounds was used in a study of the quantitative relationship between structure and activity to guess the IC50 values for blocking LASD5A. We used a genetic algorithm-multilinear regression-based quantitative structure-activity connection model to forecast the bioactivity (PIC50) of 1615 food and drug administration pharmaceuticals from the zinc database with the goal of repurposing clinically used medications. We used molecular docking, molecular dynamic simulation modelling, and molecular mechanics generalised surface area analysis to investigate the molecule's binding mechanism. A genetic algorithm and multi-linear regression method were used to make six variable-based quantitative structure-activity relationship models that worked well (R2 = 0.8521, Q2LOO = 0.8438, and Q2LMO = 0.8414). ZINC000000538621 was found to be a new hit against LSD5A after a quantitative structure-activity relationship-based virtual screening of 1615 zinc food and drug administration compounds. The docking analysis revealed that the hit molecule 11 in the KDM5A binding pocket adopted a conformation similar to the pdb-6bh1 ligand (docking score: -8.61 kcal/mol). The results from molecular docking and the quantitative structure-activity relationship were complementary and consistent. The most active lead molecule 11, which has shown encouraging results, has good absorption, distribution, metabolism, and excretion (ADME) properties, and its toxicity has been shown to be minimal. In addition, the MTT assay of ZINC000000538621 with MCF-7 cell lines backs up the in silico studies. We used molecular mechanics generalise borne surface area analysis and a 200-ns molecular dynamics simulation to find structural motifs for KDM5A enzyme interactions. Thus, our strategy will likely expand food and drug administration molecule repurposing research to find better anticancer drugs and therapies.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Rahul D Jawarkar
- Department of Medicinal Chemistry and Drug discovery, Dr. Rajendra Gode Institute of Pharmacy, Amravati, Maharashtra, India
| | - Magdi E A Zaki
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Sami A Al-Hussain
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Aamal A Al-Mutairi
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Abdul Samad
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Tishk International University, Erbil, Kurdistan Region, Iraq
| | - Vijay Masand
- Department of Chemistry, Amravati, Maharashtra, India
| | - Vivek Humane
- Department of Chemistry, Shri R. R. Lahoti Science college, Morshi District: Amravati, Maharashtra, India
| | - Suraj Mali
- School of Pharmacy, D.Y. Patil University (Deemed to be University), Nerul, Navi Mumbai, India
| | | | - Summya Rashid
- Department of Pharmacology & Toxicology, College of Pharmacy, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi Arabia
| | - Gehan M Elossaily
- Department of Basic Medical Sciences, College of Medicine, AlMaarefa University, Riyadh, Saudi Arabia
| |
Collapse
|
5
|
El Rhabori S, El Aissouq A, Daoui O, Elkhattabi S, Chtita S, Khalil F. Design of new molecules against cervical cancer using DFT, theoretical spectroscopy, 2D/3D-QSAR, molecular docking, pharmacophore and ADMET investigations. Heliyon 2024; 10:e24551. [PMID: 38318045 PMCID: PMC10839811 DOI: 10.1016/j.heliyon.2024.e24551] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 12/13/2023] [Accepted: 01/10/2024] [Indexed: 02/07/2024] Open
Abstract
Cervical cancer is a major health problem of women. Hormone therapy, via aromatase inhibition, has been proposed as a promising way of blocking estrogen production as well as treating the progression of estrogen-dependent cancer. To overcome the challenging complexities of costly drug design, in-silico strategy, integrating Structure-Based Drug Design (SBDD) and Ligand-Based Drug Design (LBDD), was applied to large representative databases of 39 quinazoline and thioquinazolinone compound derivatives. Quantum chemical and physicochemical descriptors have been investigated using density functional theory (DFT) and MM2 force fields, respectively, to develop 2D-QSAR models, while CoMSIA and CoMFA descriptors were used to build 3D-QSAR models. The robustness and predictive power of the reliable models were verified, via several validation methods, leading to the design of 6 new drug-candidates. Afterwards, 2 ligands were carefully selected using virtual screening methods, taking into account the applicability domain, synthetic accessibility, and Lipinski's criteria. Molecular docking and pharmacophore modelling studies were performed to examine potential interactions with aromatase (PDB ID: 3EQM). Finally, the ADMET properties were investigated in order to select potential drug-candidates against cervical cancer for experimental in vitro and in vivo testing.
Collapse
Affiliation(s)
- Said El Rhabori
- Laboratory of Processes, Materials and Environment (LPME), Sidi Mohamed Ben Abdellah University, Faculty of Science and Technology - Fez, Morocco
| | - Abdellah El Aissouq
- Laboratory of Processes, Materials and Environment (LPME), Sidi Mohamed Ben Abdellah University, Faculty of Science and Technology - Fez, Morocco
| | - Ossama Daoui
- Laboratory of Engineering, Systems and Applications, National School of Applied Sciences, Sidi Mohamed Ben Abdellah-Fez University, Fez, Morocco
| | - Souad Elkhattabi
- Laboratory of Engineering, Systems and Applications, National School of Applied Sciences, Sidi Mohamed Ben Abdellah-Fez University, Fez, Morocco
| | - Samir Chtita
- Laboratory of Analytical and Molecular Chemistry, Faculty of Sciences Ben M'Sik, Hassan II University of Casablanca, Morocco
| | - Fouad Khalil
- Laboratory of Processes, Materials and Environment (LPME), Sidi Mohamed Ben Abdellah University, Faculty of Science and Technology - Fez, Morocco
| |
Collapse
|
6
|
Zaki MEA, AL-Hussain SA, Al-Mutairi AA, Samad A, Masand VH, Ingle RG, Rathod VD, Gaikwad NM, Rashid S, Khatale PN, Burakale PV, Jawarkar RD. Application of in-silico drug discovery techniques to discover a novel hit for target-specific inhibition of SARS-CoV-2 Mpro's revealed allosteric binding with MAO-B receptor: A theoretical study to find a cure for post-covid neurological disorder. PLoS One 2024; 19:e0286848. [PMID: 38227609 PMCID: PMC10790994 DOI: 10.1371/journal.pone.0286848] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 05/24/2023] [Indexed: 01/18/2024] Open
Abstract
Several studies have revealed that SARS-CoV-2 damages brain function and produces significant neurological disability. The SARS-CoV-2 coronavirus, which causes COVID-19, may infect the heart, kidneys, and brain. Recent research suggests that monoamine oxidase B (MAO-B) may be involved in metabolomics variations in delirium-prone individuals and severe SARS-CoV-2 infection. In light of this situation, we have employed a variety of computational to develop suitable QSAR model using PyDescriptor and genetic algorithm-multilinear regression (GA-MLR) models (R2 = 0.800-793, Q2LOO = 0.734-0.727, and so on) on the data set of 106 molecules whose anti-SARS-CoV-2 activity was empirically determined. QSAR models generated follow OECD standards and are predictive. QSAR model descriptors were also observed in x-ray-resolved structures. After developing a QSAR model, we did a QSAR-based virtual screening on an in-house database of 200 compounds and found a potential hit molecule. The new hit's docking score (-8.208 kcal/mol) and PIC50 (7.85 M) demonstrated a significant affinity for SARS-CoV-2's main protease. Based on post-covid neurodegenerative episodes in Alzheimer's and Parkinson's-like disorders and MAO-B's role in neurodegeneration, the initially disclosed hit for the SARS-CoV-2 main protease was repurposed against the MAO-B receptor using receptor-based molecular docking, which yielded a docking score of -12.0 kcal/mol. This shows that the compound that inhibits SARS-CoV-2's primary protease may bind allosterically to the MAO-B receptor. We then did molecular dynamic simulations and MMGBSA tests to confirm molecular docking analyses and quantify binding free energy. The drug-receptor complex was stable during the 150-ns MD simulation. The first computational effort to show in-silico inhibition of SARS-CoV-2 Mpro and allosteric interaction of novel inhibitors with MAO-B in post-covid neurodegenerative symptoms and other disorders. The current study seeks a novel compound that inhibits SAR's COV-2 Mpro and perhaps binds MAO-B allosterically. Thus, this study will enable scientists design a new SARS-CoV-2 Mpro that inhibits the MAO-B receptor to treat post-covid neurological illness.
Collapse
Affiliation(s)
- Magdi E. A. Zaki
- Faculty of Science, Department of Chemistry, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Sami A. AL-Hussain
- Faculty of Science, Department of Chemistry, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Aamal A. Al-Mutairi
- Faculty of Science, Department of Chemistry, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Abdul Samad
- Faculty of Pharmacy, Department of Pharmaceutical Chemistry, Tishk International University, Erbil, Kurdistan Region, Iraq
| | - Vijay H. Masand
- Department of Chemistry, Vidya Bharti Mahavidyalaya, Amravati, Maharashtra, India
| | - Rahul G. Ingle
- Datta Meghe College of Pharmacy, DMIHER Deemed University, Wardha, India
| | - Vivek Digamber Rathod
- Department of Chemical Technology, Dr Babasaheb Ambedkar Marathwada University, Aurangabad, India
| | | | - Summya Rashid
- Department of Pharmacology & Toxicology, College of Pharmacy, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi Arabia
| | - Pravin N. Khatale
- Department of Medicinal Chemistry and Drug Discovery, Dr Rajendra Gode Institute of Pharmacy, University Mardi Road, Amravati, Maharashtra, India
| | - Pramod V. Burakale
- Department of Medicinal Chemistry and Drug Discovery, Dr Rajendra Gode Institute of Pharmacy, University Mardi Road, Amravati, Maharashtra, India
| | - Rahul D. Jawarkar
- Department of Medicinal Chemistry and Drug Discovery, Dr Rajendra Gode Institute of Pharmacy, University Mardi Road, Amravati, Maharashtra, India
| |
Collapse
|
7
|
Veríssimo GC, Pantaleão SQ, Fernandes PDO, Gertrudes JC, Kronenberger T, Honorio KM, Maltarollo VG. MASSA Algorithm: an automated rational sampling of training and test subsets for QSAR modeling. J Comput Aided Mol Des 2023; 37:735-754. [PMID: 37804393 DOI: 10.1007/s10822-023-00536-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 09/14/2023] [Indexed: 10/09/2023]
Abstract
QSAR models capable of predicting biological, toxicity, and pharmacokinetic properties were widely used to search lead bioactive molecules in chemical databases. The dataset's preparation to build these models has a strong influence on the quality of the generated models, and sampling requires that the original dataset be divided into training (for model training) and test (for statistical evaluation) sets. This sampling can be done randomly or rationally, but the rational division is superior. In this paper, we present MASSA, a Python tool that can be used to automatically sample datasets by exploring the biological, physicochemical, and structural spaces of molecules using PCA, HCA, and K-modes. The proposed algorithm is very useful when the variables used for QSAR are not available or to construct multiple QSAR models with the same training and test sets, producing models with lower variability and better values for validation metrics. These results were obtained even when the descriptors used in the QSAR/QSPR were different from those used in the separation of training and test sets, indicating that this tool can be used to build models for more than one QSAR/QSPR technique. Finally, this tool also generates useful graphical representations that can provide insights into the data.
Collapse
Affiliation(s)
- Gabriel Corrêa Veríssimo
- Department of Pharmaceutical Products, Faculty of Pharmacy, Federal University of Minas Gerais, Belo Horizonte, MG, 31270-901, Brazil
| | | | - Philipe de Olveira Fernandes
- Department of Pharmaceutical Products, Faculty of Pharmacy, Federal University of Minas Gerais, Belo Horizonte, MG, 31270-901, Brazil
| | - Jadson Castro Gertrudes
- Department of Computing, Institute of Exact and Biological Sciences, Federal University of Ouro Preto, Ouro Preto, MG, 35400-000, Brazil
| | - Thales Kronenberger
- Department of Pharmaceutical and Medicinal Chemistry, University of Tübingen, Tübingen, BW, 72076, Germany
| | - Kathia Maria Honorio
- Federal University of ABC, Santo André, SP, 09210-170, Brazil
- School of Arts, Sciences and Humanities, University of São Paulo, São Paulo, SP, 03828-000, Brazil
| | - Vinícius Gonçalves Maltarollo
- Department of Pharmaceutical Products, Faculty of Pharmacy, Federal University of Minas Gerais, Belo Horizonte, MG, 31270-901, Brazil.
| |
Collapse
|
8
|
Nath A, Ojha PK, Roy K. QSAR assessment of aquatic toxicity potential of diverse agrochemicals. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023:1-20. [PMID: 37941423 DOI: 10.1080/1062936x.2023.2278074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Accepted: 10/24/2023] [Indexed: 11/10/2023]
Abstract
The fast-increasing number of commercially produced chemicals challenges the experimental ecotoxicity assessment methods, which are costly, time-consuming, and dependent on the sacrifice of animals. In this regard, Quantitative Structure-Property/Activity Relationships (QSPR/QSAR) have led the way in developing ecotoxicity assessment models. In this study, QSAR models have been developed using the pEC50 values of 82 diverse agrochemicals or agro-molecules against a planktonic crustacean Daphnia magna with easily interpretable 2D descriptors. Moreover, a link among octanol-water partition coefficient (KOW), bio-concentration factor (BCF), and critical body residue (CBR) has been addressed, and their imputation for the prediction of the toxicity endpoint (EC50) has been done with an objective of the advanced exploration of several ecotoxicological parameters for toxic chemicals. The developed partial least squares (PLS) models were validated rigorously and proved to be robust, sound, and immensely well-predictive. The final Daphnia toxicity model derived from experimental derived properties along with computed descriptors emerged better in statistical quality and predictivity than those obtained solely from computed descriptors. Additionally, the pEC50 and other important properties (log KOW, log BCF, and log CBR) for a set of external agro-molecules, not employed in model development, were predicted to show the predictive ability of the models.
Collapse
Affiliation(s)
| | - P K Ojha
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| | - K Roy
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| |
Collapse
|
9
|
Akinola LK, Uzairu A, Shallangwa GA, Abechi SE. Development of binary classification models for grouping hydroxylated polychlorinated biphenyls into active and inactive thyroid hormone receptor agonists. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023; 34:267-284. [PMID: 37139950 DOI: 10.1080/1062936x.2023.2207039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Some adverse effects of hydroxylated polychlorinated biphenyls (OH-PCBs) in humans are presumed to be initiated via thyroid hormone receptor (TR) binding. Due to the trial-and-error approach adopted for OH-PCB selection in previous studies, experiments designed to test the TR binding hypothesis mostly utilized inactive OH-PCBs, leading to considerable waste of time, effort and other material resources. In this paper, linear discriminant analysis (LDA) and binary logistic regression (LR) were used to develop classification models to group OH-PCBs into active and inactive TR agonists using radial distribution function (RDF) descriptors as predictor variables. The classifications made by both LDA and LR models on the training set compounds resulted in an accuracy of 84.3%, sensitivity of 72.2% and specificity of 90.9%. The areas under the ROC curves, constructed with the training set data, were found to be 0.872 and 0.880 for LDA and LR models, respectively. External validation of the models revealed that 76.5% of the test set compounds were correctly classified by both LDA and LR models. These findings suggest that the two models reported in this paper are good and reliable for classifying OH-PCB congeners into active and inactive TR agonists.
Collapse
Affiliation(s)
- L K Akinola
- Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria
- Department of Chemistry, Bauchi State University, Gadau, Nigeria
| | - A Uzairu
- Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria
| | - G A Shallangwa
- Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria
| | - S E Abechi
- Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria
| |
Collapse
|
10
|
Free volume in physical absorption of carbon dioxide in ionic liquids: Molecular dynamics supported modeling. Sep Purif Technol 2023. [DOI: 10.1016/j.seppur.2023.123464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/04/2023]
|
11
|
Chung E, Russo DP, Ciallella HL, Wang YT, Wu M, Aleksunes LM, Zhu H. Data-Driven Quantitative Structure-Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:6573-6588. [PMID: 37040559 PMCID: PMC10134506 DOI: 10.1021/acs.est.3c00648] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 03/28/2023] [Accepted: 03/29/2023] [Indexed: 06/19/2023]
Abstract
Traditional methodologies for assessing chemical toxicity are expensive and time-consuming. Computational modeling approaches have emerged as low-cost alternatives, especially those used to develop quantitative structure-activity relationship (QSAR) models. However, conventional QSAR models have limited training data, leading to low predictivity for new compounds. We developed a data-driven modeling approach for constructing carcinogenicity-related models and used these models to identify potential new human carcinogens. To this goal, we used a probe carcinogen dataset from the US Environmental Protection Agency's Integrated Risk Information System (IRIS) to identify relevant PubChem bioassays. Responses of 25 PubChem assays were significantly relevant to carcinogenicity. Eight assays inferred carcinogenicity predictivity and were selected for QSAR model training. Using 5 machine learning algorithms and 3 types of chemical fingerprints, 15 QSAR models were developed for each PubChem assay dataset. These models showed acceptable predictivity during 5-fold cross-validation (average CCR = 0.71). Using our QSAR models, we can correctly predict and rank 342 IRIS compounds' carcinogenic potentials (PPV = 0.72). The models predicted potential new carcinogens, which were validated by a literature search. This study portends an automated technique that can be applied to prioritize potential toxicants using validated QSAR models based on extensive training sets from public data resources.
Collapse
Affiliation(s)
- Elena Chung
- Department
of Chemistry and Biochemistry, Rowan University, 201 Mullica Hill Road, Glassboro, New Jersey 08028, United States
| | - Daniel P. Russo
- Department
of Chemistry and Biochemistry, Rowan University, 201 Mullica Hill Road, Glassboro, New Jersey 08028, United States
| | - Heather L. Ciallella
- Department
of Toxicology, Cuyahoga County Medical Examiner’s
Office, 11001 Cedar Avenue, Cleveland, Ohio 44106, United States
| | - Yu-Tang Wang
- Institute
of Agro-Products Processing Science and Technology, Chinese Academy of Agricultural Sciences/Key Laboratory of Agro-Products
Processing, Ministry of Agriculture, Beijing 100193, China
| | - Min Wu
- School
of Life Science and Technology, China Pharmaceutical
University, No. 24, Tong Jia Xiang, Nanjing 210009, China
| | - Lauren M. Aleksunes
- Department
of Pharmacology and Toxicology, Rutgers
University, Ernest Mario School of Pharmacy, 170 Frelinghuysen Road, Piscataway, New Jersey 08854, United States
| | - Hao Zhu
- Department
of Chemistry and Biochemistry, Rowan University, 201 Mullica Hill Road, Glassboro, New Jersey 08028, United States
| |
Collapse
|
12
|
Jasinski G, Salas-Sarduy E, Vega D, Fabian L, Florencia Martini M, Moglioni AG. Design, synthesis and biological evaluation of novel thiosemicarbazones as cruzipain inhibitors. Eur J Med Chem 2023; 254:115345. [PMID: 37054562 DOI: 10.1016/j.ejmech.2023.115345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 03/23/2023] [Accepted: 04/03/2023] [Indexed: 04/15/2023]
Abstract
Based on the activity of 23 TSCs on CZ taken from the literature, we have developed a QSAR model for predicting the activity of TSCs. New TSCs were designed and then tested against CZP, resulting in inhibitors with IC50 values in the nanomolar range. The modelling of the corresponding TSC-CZ complexes by molecular docking and QM/QM ONIOM refinement indicates a binding mode compatible with what was expected for active TSCs, according to a geometry-based theoretical model previously developed by our research group. Kinetic experiments on CZP suggest that the new TSCs act by a mechanism that involves the formation of a reversible covalent adduct with slow association and dissociation kinetics. These results demonstrate the strong inhibitory effect of the new TSCs and the benefit of the combined use of QSAR and molecular modelling techniques in the design of new and potent CZ/CZP inhibitors.
Collapse
Affiliation(s)
- Gabriel Jasinski
- Universidad de Buenos Aires, Facultad de Farmacia y Bioquímica, Cátedra de Química Medicinal, Buenos Aires, C1113AAD, Argentina; CONICET-Universidad de Buenos Aires, Instituto de la Química y el Metabolismo del Fármaco (IQUIMEFA), Buenos Aires, 1113, Argentina
| | - Emir Salas-Sarduy
- Instituto de Investigaciones Biotecnológicas "Dr. Rodolfo Ugalde" (IIBIO), CONICET, San Martín, Buenos Aires, 1650, Argentina; Escuela de Bio y Nanotecnología (EByN), Universidad Nacional de San Martín (UNSAM), San Martín, Buenos Aires, 1650, Argentina
| | - Daniel Vega
- Departamento de Física de la Materia Condensada, GIyA, CAC, CNEA, Buenos Aires, B1650KNA, Argentina; Escuela de Ciencia y Tecnología, UNSAM, San Martín, Buenos Aires, B1650KNA, Argentina
| | - Lucas Fabian
- CONICET-Universidad de Buenos Aires, Instituto de la Química y el Metabolismo del Fármaco (IQUIMEFA), Buenos Aires, 1113, Argentina
| | - M Florencia Martini
- Universidad de Buenos Aires, Facultad de Farmacia y Bioquímica, Cátedra de Química Medicinal, Buenos Aires, C1113AAD, Argentina; CONICET-Universidad de Buenos Aires, Instituto de la Química y el Metabolismo del Fármaco (IQUIMEFA), Buenos Aires, 1113, Argentina
| | - Albertina G Moglioni
- Universidad de Buenos Aires, Facultad de Farmacia y Bioquímica, Cátedra de Química Medicinal, Buenos Aires, C1113AAD, Argentina; CONICET-Universidad de Buenos Aires, Instituto de la Química y el Metabolismo del Fármaco (IQUIMEFA), Buenos Aires, 1113, Argentina.
| |
Collapse
|
13
|
Akinola LK, Uzairu A, Shallangwa GA, Abechi SE. Development and Validation of Predictive Quantitative Structure-Activity Relationship Models for Estrogenic Activities of Hydroxylated Polychlorinated Biphenyls. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2023; 42:823-834. [PMID: 36692119 DOI: 10.1002/etc.5566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 11/17/2022] [Accepted: 01/18/2023] [Indexed: 06/17/2023]
Abstract
Disruption of the endocrine system by hydroxylated polychlorinated biphenyls (OH-PCBs) is hypothesized, among other potential mechanisms, to be mediated via nuclear receptor binding. Due to the high cost and lengthy time required to produce high-quality experimental data, empirical data to support the nuclear receptor binding hypothesis are in short supply. In the present study, two quantitative structure-activity relationship models were developed for predicting the estrogenic activities of OH-PCBs. Findings revealed that model I (for the estrogen receptor α dataset) contained five two-dimensional (2D) descriptors belonging to the classes autocorrelation, Burden modified eigenvalues, chi path, and atom type electrotopological state, whereas model II (for the estrogen receptor β dataset) contained three 2D and three 3D descriptors belonging to the classes autocorrelation, atom type electrotopological state, and Radial Distribution Function descriptors. The internal and external validation metrics reported for models I and II indicate that both models are robust, reliable, and suitable for predicting the estrogenic activities of untested OH-PCB congeners. Environ Toxicol Chem 2023;42:823-834. © 2023 SETAC.
Collapse
Affiliation(s)
- Lukman K Akinola
- Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria
- Department of Chemistry, Bauchi State University, Gadau, Nigeria
| | - Adamu Uzairu
- Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria
| | | | - Stephen E Abechi
- Department of Chemistry, Ahmadu Bello University, Zaria, Nigeria
| |
Collapse
|
14
|
Lowe CN, Charest N, Ramsland C, Chang DT, Martin TM, Williams AJ. Transparency in Modeling through Careful Application of OECD's QSAR/QSPR Principles via a Curated Water Solubility Data Set. Chem Res Toxicol 2023; 36:465-478. [PMID: 36877669 PMCID: PMC10357388 DOI: 10.1021/acs.chemrestox.2c00379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2023]
Abstract
The need for careful assembly, training, and validation of quantitative structure-activity/property models (QSAR/QSPR) is more significant than ever as data sets become larger and sophisticated machine learning tools become increasingly ubiquitous and accessible to the scientific community. Regulatory agencies such as the United States Environmental Protection Agency must carefully scrutinize each aspect of a resulting QSAR/QSPR model to determine its potential use in environmental exposure and hazard assessment. Herein, we revisit the goals of the Organisation for Economic Cooperation and Development (OECD) in our application and discuss the validation principles for structure-activity models. We apply these principles to a model for predicting water solubility of organic compounds derived using random forest regression, a common machine learning approach in the QSA/PR literature. Using public sources, we carefully assembled and curated a data set consisting of 10,200 unique chemical structures with associated water solubility measurements. This data set was then used as a focal narrative to methodically consider the OECD's QSA/PR principles and how they can be applied to random forests. Despite some expert, mechanistically informed supervision of descriptor selection to enhance model interpretability, we achieved a model of water solubility with comparable performance to previously published models (5-fold cross validated performance 0.81 R2 and 0.98 RMSE). We hope this work will catalyze a necessary conversation around the importance of cautiously modernizing and explicitly leveraging OECD principles while pursuing state-of-the-art machine learning approaches to derive QSA/PR models suitable for regulatory consideration.
Collapse
Affiliation(s)
- Charles N. Lowe
- Center for Computational Toxicology and Exposure, Office of Research and Development, United States Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| | - Nathaniel Charest
- ORAU Student Services Contractor to Center for Computational Toxicology and Exposure, Office of Research and Development, United States Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| | - Christian Ramsland
- ORAU Student Services Contractor to Center for Computational Toxicology and Exposure, Office of Research and Development, United States Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| | - Daniel T. Chang
- Center for Computational Toxicology and Exposure, Office of Research and Development, United States Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| | - Todd M. Martin
- Center for Computational Toxicology and Exposure, Office of Research and Development, United States Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| | - Antony J. Williams
- Center for Computational Toxicology and Exposure, Office of Research and Development, United States Environmental Protection Agency, Research Triangle Park, North Carolina 27711, United States
| |
Collapse
|
15
|
Development of QSPR-ANN models for the estimation of critical properties of pure hydrocarbons. J Mol Graph Model 2023; 121:108450. [PMID: 36907016 DOI: 10.1016/j.jmgm.2023.108450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 02/21/2023] [Accepted: 03/06/2023] [Indexed: 03/09/2023]
Abstract
The current work aimed to predict three critical properties: critical temperature (Tc), critical volume (Vc), and critical pressure (Pc) of pure hydrocarbons. A multi-layer perceptron artificial neural network (MLP-ANN) has been adopted as a nonlinear modeling technique and computational approach based on a few relevant molecular descriptors. A set of diverse data points was used to build three QSPR-ANN models, including 223 points for Tc, Vc, and 221 for Pc. The entire database was randomly split into two subsets: 80% for the training set and 20% for the testing set. A large number of 1666 molecular descriptors were calculated and then reduced by a statistical methodology based on several phases to retain them into a reasonable number of relevant descriptors, wherein about 99% of initial descriptors were excluded. Thus, the Quasi-Newton backpropagation (BFGS) algorithm was applied to train the ANN structure. The results of three QSPR-ANN models showed good precision, confirmed by the high values of determination coefficient (R2) ranging from 0.9990 to 0.9945, and the low values of calculated errors, such as the Mean Absolute Percentage Error (MAPE) that ranged from 2.2497 to 0.7424% for the best three models of Tc, Vc, and Pc. The weight sensitivity analysis method was applied to know the contribution of each input descriptor individually or by class on each appropriate QSPR-ANN model. Moreover, the applicability domain (AD) method was also used with a strict limit of standardized residual values (di = ±2). However, the results were promising, with nearly 88% of the data points validated within the AD range. Finally, the results of the proposed QSPR-ANN models were compared with other well-known QSPR or ANN models for each property. Consequently, our three models provided satisfactory results, outperforming most of the models mentioned in this comparison. This computational approach can be applied in petroleum engineering and other related fields to accurately determine the critical properties of pure hydrocarbons: Tc, Vc, and Pc.
Collapse
|
16
|
Zaki MEA, Al-Hussain SA, Al-Mutairi AA, Samad A, Ghosh A, Chaudhari S, Khatale PN, Ajmire P, Jawarkar RD. In-silico studies to recognize repurposing therapeutics toward arginase-I inhibitors as a potential onco-immunomodulators. Front Pharmacol 2023; 14:1129997. [PMID: 37144217 PMCID: PMC10151555 DOI: 10.3389/fphar.2023.1129997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 02/27/2023] [Indexed: 05/06/2023] Open
Abstract
Rudolf Virchow was the first person to point out the important link between immune function and cancer. He did this by noticing that leukocytes were often found in tumors. Overexpression of arginase 1 (ARG1) and inducible nitric oxide synthase (iNOS) in myeloid-derived suppressor cells (MDSCs) and tumour-associated macrophages (TAMs) depletes both intracellular and extracellular arginine. TCR signalling is slowed as a result, and the same types of cells produce reactive oxygen and nitrogen species (ROS and RNS), which aggravates the situation. Human arginase I is a double-stranded manganese metalloenzyme that helps L-arginine break down into L-ornithine and urea. Thus, a quantitative structure-activity relationship (QSAR) analysis was performed to unearth the unrecognised structural aspects crucial for arginase-I inhibition. In this work, a balanced QSAR model with good prediction performance and clear mechanistic interpretation was developed using a dataset of 149 molecules encompassing a broad range of structural scaffolds and compositions. The model was made to meet OECD standards, and all of its validation parameters have values that are higher than the minimum requirements (R2 tr = 0.89, Q2 LMO = 0.86, and R2 ex = 0.85). The present QSAR study linked structural factors to arginase-I inhibitory action, including the proximity of lipophilic atoms to the molecule's centre of mass (within 3A), the position of the donor to the ring nitrogen (exactly 3 bonds away), and the surface area ratio. As OAT-1746 and two others are the only arginase-I inhibitors in development at the time, we have performed a QSAR-based virtual screening with 1650 FDA compounds taken from the zinc database. In this screening, 112 potential hit compounds were found to have a PIC50 value of less than 10 nm against the arginase-I receptor. The created QSAR model's application domain was evaluated in relation to the most active hit molecules identified using QSAR-based virtual screening, utilising a training set of 149 compounds and a prediction set of 112 hit molecules. As shown in the Williams plot, the top hit molecule, ZINC000252286875, has a low leverage value of HAT i/i h* = 0.140, placing it towards the boundary of the usable range. Furthermore, one of 112 hit molecules with a docking score of -10.891 kcal/mol (PIC50 = 10.023 M) was isolated from a study of arginase-I using molecular docking. Protonated ZINC000252286875-linked arginase-1 showed 2.9 RMSD, whereas non-protonated had 1.8. RMSD plots illustrate protein stability in protonated and non-protonated ZINC000252286875-bound states. Protonated-ZINC000252286875-bound proteins contain 25 Rg. The non-protonated protein-ligand combination exhibits a 25.2-Rg, indicating compactness. Protonated and non-protonated ZINC000252286875 stabilised protein targets in binding cavities posthumously. Significant root mean square fluctuations (RMSF) were seen in the arginase-1 protein at a small number of residues for a time function of 500 ns in both the protonated and unprotonated states. Protonated and non-protonated ligands interacted with proteins throughout the simulation. ZINC000252286875 bound Lys64, Asp124, Ala171, Arg222, Asp232, and Gly250. Aspartic acid residue 232 exhibited 200% ionic contact. 500-ns simulations-maintained ions. Salt bridges for ZINC000252286875 aided docking. ZINC000252286875 created six ionic bonds with Lys68, Asp117, His126, Ala171, Lys224, and Asp232 residues. Asp117, His126, and Lys224 showed 200% ionic interactions. In protonated and deprotonated states, GbindvdW, GbindLipo, and GbindCoulomb energies played crucial role. Moreover, ZINC000252286875 meets all of the ADMET standards to serve as a drug. As a result, the current analyses were successful in locating a novel and potent hit molecule that inhibits arginase-I effectively at nanomolar concentrations. The results of this investigation can be used to develop brand-new arginase I inhibitors as an alternative immune-modulating cancer therapy.
Collapse
Affiliation(s)
- Magdi E. A. Zaki
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
- *Correspondence: Magdi E. A. Zaki, ; Rahul D. Jawarkar,
| | - Sami A. Al-Hussain
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Aamal A. Al-Mutairi
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - Abdul Samad
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Tishk International University, Erbil, Kurdistan Region, Iraq
| | - Arabinda Ghosh
- Microbiology Division, Department of Botany, Gauhati University, Guwahati, India
| | - Somdatta Chaudhari
- Department of Pharmaceutical Chemistry, Progressive Education Society’s Modern College of Pharmacy, Pune, India
| | - Pravin N. Khatale
- Department of Medicinal Chemistry, Dr Rajendra Gode Institute of Pharmacy, Amravati, Maharashtra, India
| | - Prashant Ajmire
- Department of Medicinal Chemistry, Dr Rajendra Gode Institute of Pharmacy, Amravati, Maharashtra, India
| | - Rahul D. Jawarkar
- Department of Medicinal Chemistry, Dr Rajendra Gode Institute of Pharmacy, Amravati, Maharashtra, India
- *Correspondence: Magdi E. A. Zaki, ; Rahul D. Jawarkar,
| |
Collapse
|
17
|
Das BK, Chakraborty D. Deciphering the competitive inhibition of dihydropteroate synthase by 8 marcaptoguanine analogs: enhanced potency in phenylsulfonyl fragments. J Biomol Struct Dyn 2022; 40:13083-13102. [PMID: 34581241 DOI: 10.1080/07391102.2021.1981452] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The emergence of sulfa-drug resistance and reduced efficacy of pterin-based analogs towards Dihydropteroate synthase (DHPS) inhibition dictate a pressing need of developing novel antimicrobial agents for immune-compromised patients. Recently, a series of 8-Marcaptoguanin (8-MG) derivatives synthesized for 6-Hydroxymethyl-7,8-dihydropterin pyrophosphokinase (experimental KD ∼ 100-.0.36) showed remarkable homology with the pteroic-acid and serve as a template for product antagonism in DHPS. The present work integrates ligand-based drug discovery techniques with structure-based docking, enhanced MD simulation, and MM/PBSA techniques to demonstrate the essential features of 8-MG analogs which make it a potent inhibitor for DHPS. The delicate balance in hydrophilic, hydrophobic substitutions on the 8-MG core is the crucial signature for DHPS inhibition. It is found that the dynamic interactions of active compounds are mainly dominated by consistent hydrogen bonding network with Asp 96, Asn 115, Asp 185, Ser 222, Arg 255 and π-π stacking, π-cation interactions with Phe 190, Lys 221. Further, two new 8-MG compounds containing N-phenylacetamide (compound S1, ΔGbind-eff = -62.03 kJ/mol) and phenylsulfonyl (compound S3, ΔGbind-eff = -71.29 kJ/mol) fragments were found to be the most potent inhibitor of DHPS, which stabilize the flexible pABA binding loop, thereby increasing their binding affinity. MM/PBSA calculation shows electrostatic energy contribution to be the principal component in stabilizing the inhibitors in the binding pocket. This fact is further confirmed by the higher energy barrier obtained in umbrella sampling for this class of inhibitors.
Collapse
Affiliation(s)
- Bratin Kumar Das
- Biophysical and Computational Chemistry Laboratory, Department of Chemistry, National Institute of Technology Karnataka, Mangalore, India
| | - Debashree Chakraborty
- Biophysical and Computational Chemistry Laboratory, Department of Chemistry, National Institute of Technology Karnataka, Mangalore, India
| |
Collapse
|
18
|
Metwally AA, Nayel AA, Hathout RM. In silico prediction of siRNA ionizable-lipid nanoparticles In vivo efficacy: Machine learning modeling based on formulation and molecular descriptors. Front Mol Biosci 2022; 9:1042720. [PMID: 36619167 PMCID: PMC9811823 DOI: 10.3389/fmolb.2022.1042720] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 12/07/2022] [Indexed: 12/24/2022] Open
Abstract
In silico prediction of the in vivo efficacy of siRNA ionizable-lipid nanoparticles is desirable as it can save time and resources dedicated to wet-lab experimentation. This study aims to computationally predict siRNA nanoparticles in vivo efficacy. A data set containing 120 entries was prepared by combining molecular descriptors of the ionizable lipids together with two nanoparticles formulation characteristics. Input descriptor combinations were selected by an evolutionary algorithm. Artificial neural networks, support vector machines and partial least squares regression were used for QSAR modeling. Depending on how the data set is split, two training sets and two external validation sets were prepared. Training and validation sets contained 90 and 30 entries respectively. The results showed the successful predictions of validation set log (siRNA dose) with Rval 2= 0.86-0.89 and 0.75-80 for validation sets one and two, respectively. Artificial neural networks resulted in the best Rval 2 for both validation sets. For predictions that have high bias, improvement of Rval 2 from 0.47 to 0.96 was achieved by selecting the training set lipids lying within the applicability domain. In conclusion, in vivo performance of siRNA nanoparticles was successfully predicted by combining cheminformatics with machine learning techniques.
Collapse
Affiliation(s)
- Abdelkader A. Metwally
- Department of Pharmaceutics, Faculty of Pharmacy, Health Sciences Center, Kuwait University, Kuwait City, Kuwait,Department of Pharmaceutics and Industrial Pharmacy, Faculty of Pharmacy, Ain Shams University, Cairo, Egypt,*Correspondence: Abdelkader A. Metwally,
| | - Amira A. Nayel
- Clinical Pharmacy Department, Alexandria Ophthalmology Hospital, Alexandria, Egypt,Department of Clinical Pharmacy and Pharmacy Practice, Faculty of Pharmacy, Alexandria University, Alexandria, Egypt
| | - Rania M. Hathout
- Department of Pharmaceutics and Industrial Pharmacy, Faculty of Pharmacy, Ain Shams University, Cairo, Egypt
| |
Collapse
|
19
|
Fernandes PDO, Martins JPA, de Melo EB, de Oliveira RB, Kronenberger T, Maltarollo VG. Quantitative structure-activity relationship and machine learning studies of 2-thiazolylhydrazone derivatives with anti- Cryptococcus neoformans activity. J Biomol Struct Dyn 2022; 40:9789-9800. [PMID: 34121616 DOI: 10.1080/07391102.2021.1935321] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Cryptococcus neoformans is a fungus responsible for infections in humans with a significant number of cases in immunosuppressed patients, mainly in underdeveloped countries. In this context, the thiazolylhydrazones are a promising class of compounds with activity against C. neoformans. The understanding of the structure-activity relationship of these derivatives could lead to the design of robust compounds that could be promising drug candidates for fungal infections. Specifically, modern techniques such as 4D-QSAR and machine learning methods were employed in this work to generate two QSAR models (one 2D and one 4D) with high predictive power (r2 for the test set equals to 0.934 and 0.831, respectively), and one random forest classification model was reported with Matthews correlation coefficient equals to 1 and 0.62 for internal and external validations, respectively. The physicochemical interpretation of selected models, indicated the importance of aliphatic substituents at the hydrazone moiety to antifungal activity, corroborating experimental data.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Philipe de Oliveira Fernandes
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - João Paulo A Martins
- Departamento de Química, Instituto de Ciências Exatas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Eduardo B de Melo
- Laboratório de Química Medicinal e Ambiental Teórica, Universidade Estadual do Oeste do Paraná, Cascavel, Paraná, Brazil
| | - Renata Barbosa de Oliveira
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| | - Thales Kronenberger
- Department of Pneumonology and Oncology, Internal Medicine VIII, University Hospital of Tübingen, Tübingen, Baden-Württemberg, Germany
| | - Vinícius Gonçalves Maltarollo
- Departamento de Produtos Farmacêuticos, Faculdade de Farmácia, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
| |
Collapse
|
20
|
Dhingra N, Kapoor K, Sharma S, Saxena A. Towards further understanding the structural insights of isoxazoles analogues against leishmaniasis using QSAR, molecular docking and molecular dynamics model. J INDIAN CHEM SOC 2022. [DOI: 10.1016/j.jics.2022.100847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
21
|
Banerjee A, De P, Kumar V, Kar S, Roy K. Quick and efficient quantitative predictions of androgen receptor binding affinity for screening Endocrine Disruptor Chemicals using 2D-QSAR and Chemical Read-Across. CHEMOSPHERE 2022; 309:136579. [PMID: 36174732 DOI: 10.1016/j.chemosphere.2022.136579] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 09/18/2022] [Accepted: 09/20/2022] [Indexed: 06/16/2023]
Abstract
Endocrine Disruptor Chemicals are synthetic or natural molecules in the environment that promote adverse modifications of endogenous hormone regulation in humans and/or in animals. In the present research, we have applied two-dimensional quantitative structure-activity relationship (2D-QSAR) modeling to analyze the structural features of these chemicals responsible for binding to the androgen receptors (logRBA) in rats. We have collected the receptor binding data from the EDKB database (https://www.fda.gov/science-research/endocrine-disruptor-knowledge-base/accessing-edkb-database) and then employed the DTC-QSAR tool, available from https://dtclab.webs.com/software-tools, for dataset division, feature selection, and model development. The final partial least squares model was evaluated using various stringent validation criteria. From the model, we interpreted that hydrophobicity, steroidal nucleus, bulkiness and a hydrogen bond donor at an appropriate position contribute to the receptor binding affinity, while presence of electron rich features like aromaticity and polar groups decrease the receptor binding affinity. Additionally we have also performed chemical Read-Across predictions using Read-Across-v3.1 available from https://sites.google.com/jadavpuruniversity.in/dtc-lab-software/home, and the results for the external validation metrics were found to be better than the QSAR-derived predictions. The best quality of external predictions emerged from the q-RASAR approach which combines both read-across and QSAR. To explore the essential features responsible for the receptor binding, pharmacophore mapping, molecular docking along with molecular dynamics simulation were also performed, and the results are in accordance with the QSAR/q-RASAR findings.
Collapse
Affiliation(s)
- Arkaprava Banerjee
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India
| | - Priyanka De
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India
| | - Vinay Kumar
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India
| | - Supratik Kar
- Interdisciplinary Center for Nanotoxicity, Department of Chemistry, Physics and Atmospheric Sciences, Jackson State University, Jackson, MS 39217, United States
| | - Kunal Roy
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India.
| |
Collapse
|
22
|
Masand VH, Al-Hussain SA, Rathore MM, Thakur SD, Akasapu S, Samad A, Al-Mutairi AA, Zaki MEA. Pharmacophore Synergism in Diverse Scaffold Clinches in Aurora Kinase B. Int J Mol Sci 2022; 23:ijms232314527. [PMID: 36498857 PMCID: PMC9739353 DOI: 10.3390/ijms232314527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 11/01/2022] [Accepted: 11/11/2022] [Indexed: 11/23/2022] Open
Abstract
Aurora kinase B (AKB) is a crucial signaling kinase with an important role in cell division. Therefore, inhibition of AKB is an attractive approach to the treatment of cancer. In the present work, extensive quantitative structure-activity relationships (QSAR) analysis has been performed using a set of 561 structurally diverse aurora kinase B inhibitors. The Organization for Economic Cooperation and Development (OECD) guidelines were used to develop a QSAR model that has high statistical performance (R2tr = 0.815, Q2LMO = 0.808, R2ex = 0.814, CCCex = 0.899). The seven-variable-based newly developed QSAR model has an excellent balance of external predictive ability (Predictive QSAR) and mechanistic interpretation (Mechanistic QSAR). The QSAR analysis successfully identifies not only the visible pharmacophoric features but also the hidden features. The analysis indicates that the lipophilic and polar groups-especially the H-bond capable groups-must be present at a specific distance from each other. Moreover, the ring nitrogen and ring carbon atoms play important roles in determining the inhibitory activity for AKB. The analysis effectively captures reported as well as unreported pharmacophoric features. The results of the present analysis are also supported by the reported crystal structures of inhibitors bound to AKB.
Collapse
Affiliation(s)
- Vijay H. Masand
- Department of Chemistry, Vidya Bharati Mahavidyalaya, Amravati 444602, Maharashtra, India
- Correspondence: (V.H.M.); (M.E.A.Z.)
| | - Sami A. Al-Hussain
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh 11623, Saudi Arabia
| | - Mithilesh M. Rathore
- Department of Chemistry, Vidya Bharati Mahavidyalaya, Amravati 444602, Maharashtra, India
| | - Sumer D. Thakur
- Department of Chemistry, RDIK and NKD College, Badnera, Amravati 444701, Maharashtra, India
| | | | - Abdul Samad
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Tishk International University, Erbil 44001, Iraq
| | - Aamal A. Al-Mutairi
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh 11623, Saudi Arabia
| | - Magdi E. A. Zaki
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh 11623, Saudi Arabia
- Correspondence: (V.H.M.); (M.E.A.Z.)
| |
Collapse
|
23
|
Zhu T, Tao C, Cheng H, Cong H. Versatile in silico modelling of microplastics adsorption capacity in aqueous environment based on molecular descriptor and machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 846:157455. [PMID: 35863580 DOI: 10.1016/j.scitotenv.2022.157455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 07/10/2022] [Accepted: 07/13/2022] [Indexed: 06/15/2023]
Abstract
To comprehensively evaluate the hazards of microplastics and their coexisting organic pollutants, the sorption capacity of microplastics is a major issue that is quantified through the microplastic-aqueous sorption coefficient (Kd). Almost all quantitative structure-property relationship (QSPR) models that describe Kd apply only to narrow, relatively homogeneous groups of reactants. Herein, non-hybrid QSPR-based models were developed to predict PE-water (KPE-w), PE-seawater (KPE-sw), PVC-water (KPVC-w) and PP-seawater (KPP-sw) sorption coefficients at different temperatures, with eight machine learning algorithms. Moreover, novel hybrid intelligent models for predicting Kd more accurately were innovatively developed by applying GA, PSO and AdaBoost algorithms to optimize MLP and ELM models. The results indicated that all three optimization algorithms could improve the robustness and predictability of the standalone MLP and ELM models. In all models trained with KPE-w, KPE-sw, KPVC-w and KPP-sw data sets, GBDT-1 and XGBoost-1 models, MLP-GA-2 and MLP-PSO-2 models, MLR-3 and MLR-4 models performed better in terms of goodness of fit (Radj2: 0.907-0.999), robustness (QBOOT2: 0.900-0.937) and predictability (Rext2: 0.889-0.970), respectively. Analyzing the descriptors revealed that temperature, lipophilicity, ionization potential and molecular size were correlated closely with the adsorption capacity of microplastics to organic pollutants. The proposed QSPR models may assist in initial environmental exposure assessments without imposing heavy costs in the early experimental phase.
Collapse
Affiliation(s)
- Tengyi Zhu
- School of Environmental Science and Engineering, Yangzhou University, Yangzhou 225127, Jiangsu, China
| | - Cuicui Tao
- School of Environmental Science and Engineering, Yangzhou University, Yangzhou 225127, Jiangsu, China
| | - Haomiao Cheng
- School of Environmental Science and Engineering, Yangzhou University, Yangzhou 225127, Jiangsu, China
| | - Haibing Cong
- School of Environmental Science and Engineering, Yangzhou University, Yangzhou 225127, Jiangsu, China.
| |
Collapse
|
24
|
Ahmadi S, Abdolmaleki A, Jebeli Javan M. In silico study of natural antioxidants. VITAMINS AND HORMONES 2022; 121:1-43. [PMID: 36707131 DOI: 10.1016/bs.vh.2022.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Antioxidants are the body's defense system against the damage of reactive oxygen species, which are usually produced in the body through various physiological processes. There are various sources of these antioxidants such as endogenous antioxidants in the body and exogenous food sources. This chapter provides important information on methods used to investigate antioxidant activity and sources of plant antioxidants. Over the past two decades, numerous studies have demonstrated the importance of in silico research in the development of novel natural and synthesized antioxidants. In silico methods such as quantitative structure-activity relationships (QSAR), pharmacophore, docking, and virtual screenings are play critical roles in designing effective antioxidants that may be synthesized and tested later. This chapter introduces the available in silico approaches for different classes of antioxidants. Many successful applications of in silico methods in the development and design of novel antioxidants are thoroughly discussed. The QSAR, pharmacophore, molecular docking techniques, and virtual screenings process summarized here would help readers to find out the proper mechanism for the interaction between the free radicals and antioxidant compounds. Furthermore, this chapter focuses on introducing new QSAR models in combination with other in silico methods to predict antioxidants activity and design more active antioxidants. In silico studies are essential to explore largely unknown plant tissue, food sources for antioxidant synthesis, as well as saving time and money in such studies.
Collapse
Affiliation(s)
- Shahin Ahmadi
- Department of Chemistry, Faculty of Pharmaceutical Chemistry, Tehran Medical Sciences, Islamic Azad University, Tehran, Iran.
| | - Azizeh Abdolmaleki
- Department of Chemistry, Tuyserkan Branch, Islamic Azad University, Tuyserkan, Iran
| | - Marjan Jebeli Javan
- Department of Chemistry, Faculty of Pharmaceutical Chemistry, Tehran Medical Sciences, Islamic Azad University, Tehran, Iran
| |
Collapse
|
25
|
Wang X, Li F, Chen J, Teng Y, Ji C, Wu H. Critical features identification for chemical chronic toxicity based on mechanistic forecast models. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2022; 307:119584. [PMID: 35688391 DOI: 10.1016/j.envpol.2022.119584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 05/03/2022] [Accepted: 06/03/2022] [Indexed: 06/15/2023]
Abstract
Facing billions of tons of pollutants entering the ocean each year, aquatic toxicity is becoming a crucial endpoint for evaluating chemical adverse effects on ecosystems. Notably, huge amount of toxic chemicals at environmental relevant doses can cause potential adverse effects. However, chronic aquatic toxicity effects of chemicals are much scarcer, especially at population level. Rotifers are highly sensitive to toxicants even at chronic low-doses and their communities are usually considered as effective indicators for assessing the status of aquatic ecosystems. Therefore, the no observed effect concentration (NOEC) for population abundance of rotifers were selected as endpoints to develop machine learning models for the prediction of chemical aquatic chronic toxicity. In this study, forty-eight binary models were built by eight types of chemical descriptors combined with six machine learning algorithms. The best binary model was 1D & 2D molecular descriptors - random trees model (RT) with high balanced accuracy (BA) (0.83 for training and 0.83 for validation set), and Matthews correlation coefficient (MCC) (0.72 for training set and 0.67 for validation set). Moreover, the optimal model identified the primary factors (SpMAD_Dzp, AMW, MATS2v) and filtered out three high alerting substructures [c1cc(Cl)cc1, CNCO, CCOP(=S)(OCC)O] influencing the chronic aquatic toxicity. These results showed that the compounds with low molecular volume, high polarity and molecular weight could contribute to adverse effects on rotifers, facilitating the deeper understanding of chronic toxicity mechanisms. In addition, forecast models had better performances than the common models embedded into ECOSAR software. This study provided insights into structural features responsible for the toxicity of different groups of chemicals and thereby allowed for the rational design of green and safer alternatives.
Collapse
Affiliation(s)
- Xiaoqing Wang
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; University of Chinese Academy of Sciences, Beijing, 100049, PR China
| | - Fei Li
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, 266071, PR China.
| | - Jingwen Chen
- Key Laboratory of Industrial Ecology and Environmental Engineering (MOE), School of Environmental Science and Technology, Dalian University of Technology, Linggong Road 2, Dalian, 116024, China
| | - Yuefa Teng
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; University of Chinese Academy of Sciences, Beijing, 100049, PR China
| | - Chenglong Ji
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, 266071, PR China
| | - Huifeng Wu
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, 266071, PR China
| |
Collapse
|
26
|
Jawarkar RD, Sharma P, Jain N, Gandhi A, Mukerjee N, Al-Mutairi AA, Zaki MEA, Al-Hussain SA, Samad A, Masand VH, Ghosh A, Bakal RL. QSAR, Molecular Docking, MD Simulation and MMGBSA Calculations Approaches to Recognize Concealed Pharmacophoric Features Requisite for the Optimization of ALK Tyrosine Kinase Inhibitors as Anticancer Leads. Molecules 2022; 27:molecules27154951. [PMID: 35956900 PMCID: PMC9370430 DOI: 10.3390/molecules27154951] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 07/15/2022] [Accepted: 07/22/2022] [Indexed: 12/04/2022] Open
Abstract
ALK tyrosine kinase ALK TK is an important target in the development of anticancer drugs. In the present work, we have performed a QSAR analysis on a dataset of 224 molecules in order to quickly predict anticancer activity on query compounds. Double cross validation assigns an upward plunge to the genetic algorithm−multi linear regression (GA-MLR) based on robust univariate and multivariate QSAR models with high statistical performance reflected in various parameters like, fitting parameters; R2 = 0.69−0.87, F = 403.46−292.11, etc., internal validation parameters; Q2LOO = 0.69−0.86, Q2LMO = 0.69−0.86, CCCcv = 0.82−0.93, etc., or external validation parameters Q2F1 = 0.64−0.82, Q2F2 = 0.63−0.82, Q2F3 = 0.65−0.81, R2ext = 0.65−0.83 including RMSEtr < RMSEcv. The present QSAR evaluation successfully identified certain distinct structural features responsible for ALK TK inhibitory potency, such as planar Nitrogen within four bonds from the Nitrogen atom, Fluorine atom within five bonds beside the non-ring Oxygen atom, lipophilic atoms within two bonds from the ring Carbon atoms. Molecular docking, MD simulation, and MMGBSA computation results are in consensus with and complementary to the QSAR evaluations. As a result, the current study assists medicinal chemists in prioritizing compounds for experimental detection of anticancer activity, as well as their optimization towards more potent ALK tyrosine kinase inhibitor.
Collapse
Affiliation(s)
- Rahul D. Jawarkar
- Faculty of Pharmacy, Oriental University, Indore 453555, Madhya Pradesh, India; (P.S.); (N.J.)
- Correspondence: (R.D.J.); (M.E.A.Z.); Tel.: +91-7385178762 (R.D.J.)
| | - Praveen Sharma
- Faculty of Pharmacy, Oriental University, Indore 453555, Madhya Pradesh, India; (P.S.); (N.J.)
| | - Neetesh Jain
- Faculty of Pharmacy, Oriental University, Indore 453555, Madhya Pradesh, India; (P.S.); (N.J.)
| | - Ajaykumar Gandhi
- Department of Chemistry, Government College of Arts and Science, Aurangabad 431004, Maharashtra, India;
| | - Nobendu Mukerjee
- Department of Microbiology, Ramakrishna Mission Vivekananda Centenary College, Kolkata 700118, West Bengal, India;
| | - Aamal A. Al-Mutairi
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh 13318, Saudi Arabia; (A.A.A.-M.); (S.A.A.-H.)
| | - Magdi E. A. Zaki
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh 13318, Saudi Arabia; (A.A.A.-M.); (S.A.A.-H.)
- Correspondence: (R.D.J.); (M.E.A.Z.); Tel.: +91-7385178762 (R.D.J.)
| | - Sami A. Al-Hussain
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh 13318, Saudi Arabia; (A.A.A.-M.); (S.A.A.-H.)
| | - Abdul Samad
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Tishk International University, Erbil 44001, Kurdistan Region, Iraq;
| | - Vijay H. Masand
- Department of Chemistry, Vidyabharati Mahavidyalalya, Camp Road, Amravati 444602, Maharashtra, India;
| | - Arabinda Ghosh
- Microbiology Division, Department of Botany, Gauhati University, Guwahati 781014, Assam, India;
| | - Ravindra L. Bakal
- Department of Medicinal Chemistry, Dr. Rajendra Gode Institute of Pharmacy, University-Mardi Road, Amravati 444603, Maharashtra, India;
| |
Collapse
|
27
|
Zhong C, Ai J, Yang Y, Ma F, Sun W. Small Molecular Drug Screening Based on Clinical Therapeutic Effect. Molecules 2022; 27:molecules27154807. [PMID: 35956770 PMCID: PMC9369618 DOI: 10.3390/molecules27154807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Revised: 07/22/2022] [Accepted: 07/25/2022] [Indexed: 11/16/2022] Open
Abstract
Virtual screening can significantly save experimental time and costs for early drug discovery. Drug multi-classification can speed up virtual screening and quickly predict the most likely class for a drug. In this study, 1019 drug molecules with actual therapeutic effects are collected from multiple databases and documents, and molecular sets are grouped according to therapeutic effect and mechanism of action. Molecular descriptors and molecular fingerprints are obtained through SMILES to quantify molecular structures. After using the Kennard–Stone method to divide the data set, a better combination can be obtained by comparing the combined results of five classification algorithms and a fusion method. Furthermore, for a specific data set, the model with the best performance is used to predict the validation data set. The test set shows that prediction accuracy can reach 0.862 and kappa coefficient can reach 0.808. The highest classification accuracy of the validation set is 0.873. The more reliable molecular set has been found, which could be used to predict potential attributes of unknown drug compounds and even to discover new use for old drugs. We hope this research can provide a reference for virtual screening of multiple classes of drugs at the same time in the future.
Collapse
Affiliation(s)
| | | | | | | | - Wei Sun
- Correspondence: ; Tel.: +86-10-64445826
| |
Collapse
|
28
|
QSAR Evaluations to Unravel the Structural Features in Lysine-Specific Histone Demethylase 1A Inhibitors for Novel Anticancer Lead Development Supported by Molecular Docking, MD Simulation and MMGBSA. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27154758. [PMID: 35897936 PMCID: PMC9332886 DOI: 10.3390/molecules27154758] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/11/2022] [Accepted: 07/19/2022] [Indexed: 12/05/2022]
Abstract
Using 84 structurally diverse and experimentally validated LSD1/KDM1A inhibitors, quantitative structure–activity relationship (QSAR) models were built by OECD requirements. In the QSAR analysis, certainly significant and understated pharmacophoric features were identified as critical for LSD1 inhibition, such as a ring Carbon atom with exactly six bonds from a Nitrogen atom, partial charges of lipophilic atoms within eight bonds from a ring Sulphur atom, a non-ring Oxygen atom exactly nine bonds from the amide Nitrogen, etc. The genetic algorithm–multi-linear regression (GA-MLR) and double cross-validation criteria were used to create robust QSAR models with high predictability. In this study, two QSAR models were developed, with fitting parameters like R2 = 0.83–0.81, F = 61.22–67.96, internal validation parameters such as Q2LOO = 0.79–0.77, Q2LMO = 0.78–0.76, CCCcv = 0.89–0.88, and external validation parameters such as, R2ext = 0.82 and CCCex = 0.90. In terms of mechanistic interpretation and statistical analysis, both QSAR models are well-balanced. Furthermore, utilizing the pharmacophoric features revealed by QSAR modelling, molecular docking experiments corroborated with the most active compound’s binding to the LSD1 receptor. The docking results are then refined using Molecular dynamic simulation and MMGBSA analysis. As a consequence, the findings of the study can be used to produce LSD1/KDM1A inhibitors as anticancer leads.
Collapse
|
29
|
Bukhari SNA, Elsherif MA, Junaid K, Ejaz H, Alam P, Samad A, Jawarkar RD, Masand VH. Perceiving the Concealed and Unreported Pharmacophoric Features of the 5-Hydroxytryptamine Receptor Using Balanced QSAR Analysis. Pharmaceuticals (Basel) 2022; 15:ph15070834. [PMID: 35890133 PMCID: PMC9316833 DOI: 10.3390/ph15070834] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 06/12/2022] [Accepted: 06/25/2022] [Indexed: 02/04/2023] Open
Abstract
The 5-hydroxytryptamine receptor 6 (5-HT6) has gained attention as a target for developing therapeutics for Alzheimer’s disease, schizophrenia, cognitive dysfunctions, anxiety, and depression, to list a few. In the present analysis, a larger and diverse dataset of 1278 molecules covering a broad chemical and activity space was used to identify visual and concealed structural features associated with binding affinity for 5-HT6. For this, quantitative structure–activity relationships (QSAR) and molecular docking analyses were executed. This led to the development of a statistically robust QSAR model with a balance of excellent predictivity (R2tr = 0.78, R2ex = 0.77), the identification of unreported aspects of known features, and also novel mechanistic interpretations. Molecular docking and QSAR provided similar as well as complementary results. The present analysis indicates that the partial charges on ring carbons present within four bonds from a sulfur atom, the occurrence of sp3-hybridized carbon atoms bonded with donor atoms, and a conditional occurrence of lipophilic atoms/groups from nitrogen atoms, which are prominent but unreported pharmacophores that should be considered while optimizing a molecule for 5-HT6. Thus, the present analysis led to identification of some novel unreported structural features that govern the binding affinity of a molecule. The results could be beneficial in optimizing the molecules for 5-HT6.
Collapse
Affiliation(s)
- Syed Nasir Abbas Bukhari
- Department of Pharmaceutical Chemistry, College of Pharmacy, Jouf University, Sakaka 72388, Saudi Arabia
| | | | - Kashaf Junaid
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Jouf University, Sakaka 72388, Saudi Arabia
| | - Hasan Ejaz
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Jouf University, Sakaka 72388, Saudi Arabia
| | - Pravej Alam
- Department of Biology, College of Science and Humanities, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia
| | - Abdul Samad
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Tishk International University, Erbil 44001, Iraq
| | - Rahul D Jawarkar
- Department of Medicinal Chemistry, Dr. Rajendra Gode Institute of Pharmacy, University-Mardi Road, Amravati 444603, Maharashtra, India
| | - Vijay H Masand
- Department of Chemistry, Vidya Bharati Mahavidyalaya, Amravati 444602, Maharashtra, India
| |
Collapse
|
30
|
Akinola LK, Uzairu A, Shallangwa GA, Abechi SE. Quantitative structure–activity relationship modeling of hydroxylated polychlorinated biphenyls as constitutive androstane receptor agonists. Struct Chem 2022. [DOI: 10.1007/s11224-022-01992-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
31
|
Gensch T, Smith SR, Colacot TJ, Timsina YN, Xu G, Glasspoole BW, Sigman MS. Design and Application of a Screening Set for Monophosphine Ligands in Cross-Coupling. ACS Catal 2022. [DOI: 10.1021/acscatal.2c01970] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Tobias Gensch
- Department of Chemistry, TU Berlin, Straße des 17. Juni 135, Sekr. C2, 10623 Berlin, Germany
| | - Sleight R. Smith
- Department of Chemistry, University of Utah, 315 South 1400 East, Salt Lake City, Utah 84112, United States
| | - Thomas J. Colacot
- MilliporeSigma, 6000 N. Teutonia Ave, Milwaukee, Wisconsin 53209, United States
| | - Yam N. Timsina
- MilliporeSigma, 6000 N. Teutonia Ave, Milwaukee, Wisconsin 53209, United States
| | - Guolin Xu
- MilliporeSigma, 6000 N. Teutonia Ave, Milwaukee, Wisconsin 53209, United States
| | - Ben W. Glasspoole
- MilliporeSigma, 6000 N. Teutonia Ave, Milwaukee, Wisconsin 53209, United States
| | - Matthew S. Sigman
- Department of Chemistry, University of Utah, 315 South 1400 East, Salt Lake City, Utah 84112, United States
| |
Collapse
|
32
|
Mechanistic Analysis of Chemically Diverse Bromodomain-4 Inhibitors Using Balanced QSAR Analysis and Supported by X-ray Resolved Crystal Structures. Pharmaceuticals (Basel) 2022; 15:ph15060745. [PMID: 35745664 PMCID: PMC9231298 DOI: 10.3390/ph15060745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 06/07/2022] [Accepted: 06/07/2022] [Indexed: 11/17/2022] Open
Abstract
Bromodomain-4 (BRD-4) is a key enzyme in post-translational modifications, transcriptional activation, and many other cellular processes. Its inhibitors find their therapeutic usage in cancer, acute heart failure, and inflammation to name a few. In the present study, a dataset of 980 molecules with a significant diversity of structural scaffolds and composition was selected to develop a balanced QSAR model possessing high predictive capability and mechanistic interpretation. The model was built as per the OECD (Organisation for Economic Co-operation and Development) guidelines and fulfills the endorsed threshold values for different validation parameters (R2tr = 0.76, Q2LMO = 0.76, and R2ex = 0.76). The present QSAR analysis identified that anti-BRD-4 activity is associated with structural characters such as the presence of saturated carbocyclic rings, the occurrence of carbon atoms near the center of mass of a molecule, and a specific combination of planer or aromatic nitrogen with ring carbon, donor, and acceptor atoms. The outcomes of the present analysis are also supported by X-ray-resolved crystal structures of compounds with BRD-4. Thus, the QSAR model effectively captured salient as well as unreported hidden pharmacophoric features. Therefore, the present study successfully identified valuable novel pharmacophoric features, which could be beneficial for the future optimization of lead/hit compounds for anti-BRD-4 activity.
Collapse
|
33
|
Stevens JM, Li J, Simmons EM, Wisniewski SR, DiSomma S, Fraunhoffer KJ, Geng P, Hao B, Jackson EW. Advancing Base Metal Catalysis through Data Science: Insight and Predictive Models for Ni-Catalyzed Borylation through Supervised Machine Learning. Organometallics 2022. [DOI: 10.1021/acs.organomet.2c00089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Jason M. Stevens
- Chemical Process Development, Bristol-Myers Squibb, 556 Morris Avenue, Summit, New Jersey 07901, United States
| | - Jun Li
- Chemical Process Development, Bristol-Myers Squibb, 1 Squibb Drive, New Brunswick, New Jersey 08901, United States
| | - Eric M. Simmons
- Chemical Process Development, Bristol-Myers Squibb, 1 Squibb Drive, New Brunswick, New Jersey 08901, United States
| | - Steven R. Wisniewski
- Chemical Process Development, Bristol-Myers Squibb, 1 Squibb Drive, New Brunswick, New Jersey 08901, United States
| | - Stacey DiSomma
- Chemical Process Development, Bristol-Myers Squibb, 1 Squibb Drive, New Brunswick, New Jersey 08901, United States
| | - Kenneth J. Fraunhoffer
- Chemical Process Development, Bristol-Myers Squibb, 1 Squibb Drive, New Brunswick, New Jersey 08901, United States
| | - Peng Geng
- Chemical Process Development, Bristol-Myers Squibb, 1 Squibb Drive, New Brunswick, New Jersey 08901, United States
| | - Bo Hao
- Chemical Process Development, Bristol-Myers Squibb, 1 Squibb Drive, New Brunswick, New Jersey 08901, United States
| | - Erika W. Jackson
- Chemical Process Development, Bristol-Myers Squibb, 1 Squibb Drive, New Brunswick, New Jersey 08901, United States
| |
Collapse
|
34
|
Haas BC, Goetz AE, Bahamonde A, McWilliams JC, Sigman MS. Predicting relative efficiency of amide bond formation using multivariate linear regression. Proc Natl Acad Sci U S A 2022; 119:e2118451119. [PMID: 35412905 PMCID: PMC9169781 DOI: 10.1073/pnas.2118451119] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Accepted: 02/09/2022] [Indexed: 01/29/2023] Open
Abstract
Amides are ubiquitous in biologically active natural products and commercial drugs. The most common strategy for introducing this functional group is the coupling of a carboxylic acid with an amine, which requires the use of a coupling reagent to facilitate elimination of water. However, the optimal reaction conditions often appear rather arbitrary to the specific reaction. Herein, we report the development of statistical models correlating measured rates to physical organic descriptors to enable the prediction of reaction rates for untested carboxylic acid/amine pairs. The key to the success of this endeavor was the development of an end-to-end data science–based workflow to select a set of coupling partners that are appropriately distributed in chemical space to facilitate statistical model development. By using a parameterization, dimensionality reduction, and clustering protocol, a training set was identified. Reaction rates for a range of carboxylic acid and primary alkyl amine couplings utilizing carbonyldiimidazole (CDI) as the coupling reagent were measured. The collected rates span five orders of magnitude, confirming that the designed training set encompasses a wide range of chemical space necessary for effective model development. Regressing these rates with high-level density functional theory (DFT) descriptors allowed for identification of a statistical model wherein the molecular features of the carboxylic acid are primarily responsible for the observed rates. Finally, out-of-sample amide couplings are used to determine the limitations and effectiveness of the model.
Collapse
Affiliation(s)
- Brittany C. Haas
- Department of Chemistry, University of Utah, Salt Lake City, UT 84112
| | - Adam E. Goetz
- Chemical Research and Development, Groton Laboratories, Pfizer Worldwide Research and Development, Groton, CT 06340
| | - Ana Bahamonde
- Department of Chemistry, University of Utah, Salt Lake City, UT 84112
| | - J. Christopher McWilliams
- Chemical Research and Development, Groton Laboratories, Pfizer Worldwide Research and Development, Groton, CT 06340
| | - Matthew S. Sigman
- Department of Chemistry, University of Utah, Salt Lake City, UT 84112
| |
Collapse
|
35
|
Ghosh A, Mukerjee N, Sharma B, Pant A, Kishore Mohanta Y, Jawarkar RD, Bakal RL, Terefe EM, Batiha GES, Mostafa-Hedeab G, Aref Albezrah NK, Dey A, Baishya D. Target Specific Inhibition of Protein Tyrosine Kinase in Conjunction With Cancer and SARS-COV-2 by Olive Nutraceuticals. Front Pharmacol 2022; 12:812565. [PMID: 35356629 PMCID: PMC8959131 DOI: 10.3389/fphar.2021.812565] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 12/24/2021] [Indexed: 12/18/2022] Open
Abstract
The fact that viruses cause human cancer dates back to the early 1980s. By reprogramming cellular signaling pathways, viruses encoded protein that can regulate altered control of cell cycle events. Viruses can interact with a superfamily of membrane bound protein, receptor tyrosine kinase to modulate their activity in order to increase virus entrance into cells and promotion of viral replication within the host. Therefore, our study aimed at screening of inhibitors of tyrosine kinase using natural compounds from olive. Protein tyrosine kinase (PTK) is an important factor for cancer progression and can be linked to coronavirus. It is evident that over expression of Protein tyrosine kinase (PTK) enhance viral endocytosis and proliferation and the use of tyrosine kinase inhibitors reduced the period of infection period. Functional network studies were carried out using two major PTKs viz. Anaplastic lymphoma kinase (ALK) and B-lymphocytic kinase (BTK). They are associated with coronavirus in regulation of cell signaling proteins for cellular processes. We virtually screened for 161 library of natural compounds from olive found overexpressed in ALK and BTK in metastatic as well as virus host cells. We have employed both ligand and target-based approach for drug designing by high throughput screening using Multilinear regression model based QSAR and docking. The QSAR based virtual screening of 161 olive nutraceutical compounds has successfully identified certain new hit; Wedelosin, in which, the descriptor rsa (ratio of molecular surface area to the solvent accessible surface area) plays crucial role in deciding Wedelosin’s inhibitory potency. The best-docked olive nutraceuticals further investigated for the stability and effectivity of the BTK and ALK during in 150 ns molecular dynamics and simulation. Post simulation analysis and binding energy estimation in MMGBSA further revealed the intensive potential of the olive nutraceuticals in PTK inhibition. This study is therefore expected to widen the use of nutraceuticals from olive in cancer as well as SARS-CoV2 alternative therapy.
Collapse
Affiliation(s)
- Arabinda Ghosh
- Microbiology Division, Department of Botany, Gauhati University, Guwahati, India
| | - Nobendu Mukerjee
- Department of Microbiology, Ramakrishna Mission Vivekananda Centenary College, Kolkata, India
| | - Bhavdeep Sharma
- Department of Biotechnology, Thapar Institute of Engineering and Technology, Patiala, India
| | - Anushree Pant
- Department of Biotechnology, Thapar Institute of Engineering and Technology, Patiala, India
| | - Yugal Kishore Mohanta
- Department of Applied Biology, University of Science and Technology Meghalaya, Ri Bhoi, India
| | - Rahul D Jawarkar
- Department of Medicinal Chemistry, Dr. Rajendra Gode Institute of Pharmacy, Amravati, India
| | - Ravindrakumar L Bakal
- Department of Medicinal Chemistry, Dr. Rajendra Gode Institute of Pharmacy, Amravati, India
| | - Ermias Mergia Terefe
- Department of Pharmacology and Pharmacognosy, School of Pharmacy and Health Sciences, United states International University-Africa, Nairobi, Kenya
| | - Gaber El-Saber Batiha
- Department of Pharmacology and Therapeutics, Faculty of Veterinary Medicine, Damanhour University, Damanhour, Egypt
| | - Gomaa Mostafa-Hedeab
- Pharmacology Department & Health Research Unit-Medical College-Jouf University, Sakakah, Saudi Arabia.,Pharmacology Department, Faculty of Medicine, Beni-Suef University, Beni-Suef, Egypt
| | | | - Abhijit Dey
- Department of Life Sciences, Presidency University, Kolkata, India
| | - Debabrat Baishya
- Department of Bioengineering, Gauhati University, Guwahati, India
| |
Collapse
|
36
|
User-assisted methodology targeted for building structure interpretable QSPR models for boosting CO2 capture with ionic liquids. J Mol Liq 2022. [DOI: 10.1016/j.molliq.2022.118511] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
37
|
Exploring the Prominent and Concealed Inhibitory Features for Cytoplasmic Isoforms of Hsp90 Using QSAR Analysis. Pharmaceuticals (Basel) 2022; 15:ph15030303. [PMID: 35337101 PMCID: PMC8953649 DOI: 10.3390/ph15030303] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 02/19/2022] [Accepted: 02/23/2022] [Indexed: 11/24/2022] Open
Abstract
Cancer is a major life-threatening disease with a high mortality rate in many countries. Even though different therapies and options are available, patients generally prefer chemotherapy. However, serious side effects of anti-cancer drugs compel us to search for a safer drug. To achieve this target, Hsp90 (heat shock protein 90), which is responsible for stabilization of many oncoproteins in cancer cells, is a promising target for developing an anti-cancer drug. The QSAR (Quantitative Structure–Activity Relationship) could be useful to identify crucial pharmacophoric features to develop a Hsp90 inhibitor. Therefore, in the present work, a larger dataset encompassing 1141 diverse compounds was used to develop a multi-linear QSAR model with a balance of acceptable predictive ability (Predictive QSAR) and mechanistic interpretation (Mechanistic QSAR). The new developed six-parameter model satisfies the recommended values for a good number of validation parameters such as R2tr = 0.78, Q2LMO = 0.77, R2ex = 0.78, and CCCex = 0.88. The present analysis reveals that the Hsp90 inhibitory activity is correlated with different types of nitrogen atoms and other hidden structural features such as the presence of hydrophobic ring/aromatic carbon atoms within a specific distance from the center of mass of the molecule, etc. Thus, the model successfully identified a variety of reported as well as novel pharmacophoric features. The results of QSAR analysis are further vindicated by reported crystal structures of compounds with Hsp90.
Collapse
|
38
|
Nakarin F, Boonpalit K, Kinchagawat J, Wachiraphan P, Rungrotmongkol T, Nutanong S. Assisting Multitargeted Ligand Affinity Prediction of Receptor Tyrosine Kinases Associated Nonsmall Cell Lung Cancer Treatment with Multitasking Principal Neighborhood Aggregation. Molecules 2022; 27:molecules27041226. [PMID: 35209011 PMCID: PMC8878292 DOI: 10.3390/molecules27041226] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Revised: 01/30/2022] [Accepted: 01/31/2022] [Indexed: 11/16/2022] Open
Abstract
A multitargeted therapeutic approach with hybrid drugs is a promising strategy to enhance anticancer efficiency and overcome drug resistance in nonsmall cell lung cancer (NSCLC) treatment. Estimating affinities of small molecules against targets of interest typically proceeds as a preliminary action for recent drug discovery in the pharmaceutical industry. In this investigation, we employed machine learning models to provide a computationally affordable means for computer-aided screening to accelerate the discovery of potential drug compounds. In particular, we introduced a quantitative structure–activity-relationship (QSAR)-based multitask learning model to facilitate an in silico screening system of multitargeted drug development. Our method combines a recently developed graph-based neural network architecture, principal neighborhood aggregation (PNA), with a descriptor-based deep neural network supporting synergistic utilization of molecular graph and fingerprint features. The model was generated by more than ten-thousands affinity-reported ligands of seven crucial receptor tyrosine kinases in NSCLC from two public data sources. As a result, our multitask model demonstrated better performance than all other benchmark models, as well as achieving satisfying predictive ability regarding applicable QSAR criteria for most tasks within the model’s applicability. Since our model could potentially be a screening tool for practical use, we have provided a model implementation platform with a tutorial that is freely accessible hence, advising the first move in a long journey of cancer drug development.
Collapse
Affiliation(s)
- Fahsai Nakarin
- School of Information Science and Technology, Vidyasirimedhi Institute of Science and Technology (VISTEC), Rayong 21210, Thailand; (K.B.); (J.K.); (P.W.); (S.N.)
- Correspondence: ; Tel.: +66-33-014-444
| | - Kajjana Boonpalit
- School of Information Science and Technology, Vidyasirimedhi Institute of Science and Technology (VISTEC), Rayong 21210, Thailand; (K.B.); (J.K.); (P.W.); (S.N.)
| | - Jiramet Kinchagawat
- School of Information Science and Technology, Vidyasirimedhi Institute of Science and Technology (VISTEC), Rayong 21210, Thailand; (K.B.); (J.K.); (P.W.); (S.N.)
| | - Patcharapol Wachiraphan
- School of Information Science and Technology, Vidyasirimedhi Institute of Science and Technology (VISTEC), Rayong 21210, Thailand; (K.B.); (J.K.); (P.W.); (S.N.)
| | - Thanyada Rungrotmongkol
- Center of Excellence in Biocatalyst and Sustainable Biotechnology, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand;
- Program in Bioinformatics and Computational Biology, Graduate School, Chulalongkorn University, Bangkok 10330, Thailand
| | - Sarana Nutanong
- School of Information Science and Technology, Vidyasirimedhi Institute of Science and Technology (VISTEC), Rayong 21210, Thailand; (K.B.); (J.K.); (P.W.); (S.N.)
| |
Collapse
|
39
|
Chayawan, Selvestrel G, Baderna D, Toma C, Caballero Alfonso AY, Gamba A, Benfenati E. Skin sensitization quantitative QSAR models based on mechanistic structural alerts. Toxicology 2022; 468:153111. [DOI: 10.1016/j.tox.2022.153111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 01/05/2022] [Accepted: 01/26/2022] [Indexed: 10/19/2022]
|
40
|
Ozbuyukkaya G, Parker RS, Veser G. Determining robust reaction kinetics from limited data. AIChE J 2021. [DOI: 10.1002/aic.17538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Gizem Ozbuyukkaya
- Department of Chemical Engineering, Swanson School of Engineering, and Center for Energy University of Pittsburgh Pittsburgh Pennsylvania USA
| | - Robert S. Parker
- Department of Chemical Engineering, Swanson School of Engineering, and Center for Energy University of Pittsburgh Pittsburgh Pennsylvania USA
| | - Goetz Veser
- Department of Chemical Engineering, Swanson School of Engineering, and Center for Energy University of Pittsburgh Pittsburgh Pennsylvania USA
| |
Collapse
|
41
|
Sun Y, Yuan M, Liu X, Su M, Wang L, Zeng Y, Zang H, Nie L. A sample selection method specific to unknown test samples for calibration and validation sets based on spectra similarity. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2021; 258:119870. [PMID: 33957450 DOI: 10.1016/j.saa.2021.119870] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 02/10/2021] [Accepted: 04/21/2021] [Indexed: 06/12/2023]
Abstract
As is known to all, the construction of calibration and validation sets is of great importance for how to select representative samples into subsets so that the calibration model can be built, evaluated and predicted effectively for model development. In this study, a method was proposed for the calibration and validation sets constructed by selecting samples maximally similar to the test samples based on the spectra data. Both the Euclidean distance and Mahalanobis distance were attempted to estimate the spectra similarity. The method to select samples for calibration is more suitable and specific to unknown test samples in practical applications, thus improving the measurement accuracy. In addition, the optimization of calibration set size was carried out to avoid the influence of unnecessary samples. Two data sets of Salvia miltiorrhiza (S. miltiorrhiza) and corn by near infrared spectroscopy (NIR) were used to test the performance of the proposed method compared with two typical sample-selection algorithms, Kennard-Stone (KS) and sample set partitioning based on joint x-y distances (SPXY). The experimental results indicated that the proposed method could select a more targeted set of samples for the unknown test samples and had the superior predictive performance to the KS and SPXY methods.
Collapse
Affiliation(s)
- Yue Sun
- School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| | - Meng Yuan
- School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| | - Xiaoyan Liu
- School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| | - Mei Su
- School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| | - Linlin Wang
- School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
| | - Yingzi Zeng
- Shandong Wohua Pharmaceutical Technology Co., Ltd, Weifang 261205, China
| | - Hengchang Zang
- School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012, China; National Glycoengineering Research Center, Jinan 250012, China
| | - Lei Nie
- School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan 250012, China.
| |
Collapse
|
42
|
Gandhi A, Masand V, Zaki MEA, Al-Hussain SA, Ghorbal AB, Chapolikar A. QSAR analysis of sodium glucose co-transporter 2 (SGLT2) inhibitors for anti-hyperglycaemic lead development. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2021; 32:731-744. [PMID: 34494464 DOI: 10.1080/1062936x.2021.1971295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 08/18/2021] [Indexed: 06/13/2023]
Abstract
QSAR (Quantitative Structure Activity Relationship) modelling was performed on a dataset of 90 sodium-dependent glucose cotransporter 2 (SGLT2) inhibitors. The quantitative and explicative evaluations revealed some of the subtle and distinguished structural features that are responsible for the inhibitory potency of these compounds against SGLT2, such as less possible number of ring carbons at 8 Å from the lipophilic atoms in the molecule (fringClipo8A) and more possible value for the sum of the partial charges of the lipophilic atoms present within seven bonds from the donor atoms (lipo_don_7Bc). Multivariate GA-MLR (genetic algorithm-multi linear regression) and thorough validation methodology out-turned a statistically robust QSAR model with a very high predictability shown from various statistical parameters. A QSAR model with r2 = 0.83, F = 51.54, Q2LOO = 0.79, Q2LMO = 0.79, CCCcv = 0.88, Q2Fn = 0.76-0.81, r2ext = 0.77, CCCext = 0.85, and with RMSEtr < RMSEcv was proposed. This QSAR model will assist synthetic chemists in the development of the SGLT2 inhibitors as the antidiabetic leads.
Collapse
Affiliation(s)
- A Gandhi
- Department of Chemistry, Government College of Arts and Science, Aurangabad, Maharashtra, India
| | - V Masand
- Department of Chemistry, Vidya Bharati Mahavidyalaya, Amravati, Maharashtra, India
| | - M E A Zaki
- Department of Chemistry, College of Science, Al-Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - S A Al-Hussain
- Department of Chemistry, College of Science, Al-Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - A Ben Ghorbal
- Department of Mathematics and Statistics, College of Sciences, Al-Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia
| | - A Chapolikar
- Department of Chemistry, Government College of Arts and Science, Aurangabad, Maharashtra, India
| |
Collapse
|
43
|
Gandhi A, Masand V, Zaki MEA, Al-Hussain SA, Ghorbal AB, Chapolikar A. Quantitative Structure-Activity Relationship Evaluation of MDA-MB-231 Cell Anti-Proliferative Leads. Molecules 2021; 26:molecules26164795. [PMID: 34443383 PMCID: PMC8401583 DOI: 10.3390/molecules26164795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 07/30/2021] [Accepted: 08/03/2021] [Indexed: 11/23/2022] Open
Abstract
In the present endeavor, for the dataset of 219 in vitro MDA-MB-231 TNBC cell antagonists, a (QSAR) quantitative structure–activity relationships model has been carried out. The quantitative and explicative assessments were performed to identify inconspicuous yet pre-eminent structural features that govern the anti-tumor activity of these compounds. GA-MLR (genetic algorithm multi-linear regression) methodology was employed to build statistically robust and highly predictive multiple QSAR models, abiding by the OECD guidelines. Thoroughly validated QSAR models attained values for various statistical parameters well above the threshold values (i.e., R2 = 0.79, Q2LOO = 0.77, Q2LMO = 0.76–0.77, Q2-Fn = 0.72–0.76). Both de novo QSAR models have a sound balance of descriptive and statistical approaches. Decidedly, these QSAR models are serviceable in the development of MDA-MB-231 TNBC cell antagonists.
Collapse
Affiliation(s)
- Ajaykumar Gandhi
- Department of Chemistry, Government College of Arts and Science, Aurangabad 431 004, Maharashtra, India;
- Correspondence: (A.G.); (M.E.A.Z.)
| | - Vijay Masand
- Department of Chemistry, Vidya Bharati Mahavidyalaya, Amravati 444 602, Maharashtra, India;
| | - Magdi E. A. Zaki
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh 13318, Saudi Arabia;
- Correspondence: (A.G.); (M.E.A.Z.)
| | - Sami A. Al-Hussain
- Department of Chemistry, Faculty of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh 13318, Saudi Arabia;
| | - Anis Ben Ghorbal
- Department of Mathematics and Statistics, College of Sciences, Imam Mohammad Ibn Saud Islamic University, Riyadh 13318, Saudi Arabia;
| | - Archana Chapolikar
- Department of Chemistry, Government College of Arts and Science, Aurangabad 431 004, Maharashtra, India;
| |
Collapse
|
44
|
Topomer-CoMFA proposed as a tool to construct dual EGFR/HER-2 models. J Mol Model 2021; 27:239. [PMID: 34363097 DOI: 10.1007/s00894-021-04852-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 07/06/2021] [Indexed: 10/20/2022]
Abstract
Protein kinases (in this case, HER-2 and EGFR) are involved in cancer-related diseases. Some reports have shown unique CoMFA models using the sum of activities expressed as pIC50 (-log IC50), as the classical CoMFA technique would not be the best strategy to construct models for multitarget therapy considering that the molecular alignment will not be the same for different targets. An alternative for this problem is the use of Topomer-CoMFA, a variation of CoMFA, which does not require the alignment step in the generation of 3D models. In this study, we propose the combined use of the sum of activities and Topomer-CoMFA for the construction of a unique dual 3D model considering the inhibitory activities against EGFR and HER-2. For this, 88 compounds from the literature were divided into two groups: training (71) and test (17) sets. The biological activity of each compound, expressed as IC50 for EGFR and HER-2, was transformed into pIC50, summed, and used as the dependent variable in the Topomer-CoMFA analyses. The obtained model was considered statistically robust in the prediction of the dual activity of new compounds. Finally, based on the obtained model, we proposed structural modifications to some of the compounds used to improve the biological data. From the 3D model, we suggested new derivative compounds with improved biological activity for both targets. Therefore, the combination of the techniques proposed in this study proves to be a good strategy to construct better statistical models that can predict biological activities in multitarget systems.
Collapse
|
45
|
Ghosh K, Amin SA, Gayen S, Jha T. Unmasking of crucial structural fragments for coronavirus protease inhibitors and its implications in COVID-19 drug discovery. J Mol Struct 2021; 1237:130366. [PMID: 33814612 PMCID: PMC7997030 DOI: 10.1016/j.molstruc.2021.130366] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Revised: 03/19/2021] [Accepted: 03/20/2021] [Indexed: 12/19/2022]
Abstract
Fragment based drug discovery (FBDD) by the aid of different modelling techniques have been emerged as a key drug discovery tool in the area of pharmaceutical science and technology. The merits of employing these methods, in place of other conventional molecular modelling techniques, endorsed clear detection of the possible structural fragments present in diverse set of investigated compounds and can create alternate possibilities of lead optimization in drug discovery. In this work, two fragment identification tools namely SARpy and Laplacian-corrected Bayesian analysis were used for previous SARS-CoV PLpro and 3CLpro inhibitors. A robust and predictive SARpy based fragments identification was performed which have been validated further by Laplacian-corrected Bayesian model. These comprehensive approaches have advantages since fragments are straight forward to interpret. Moreover, distinguishing the key molecular features (with respect to ECFP_6 fingerprint) revealed good or bad influences for the SARS-CoV protease inhibitory activities. Furthermore, the identified fragments could be implemented in the medicinal chemistry endeavors of COVID-19 drug discovery.
Collapse
Affiliation(s)
- Kalyan Ghosh
- Laboratory of Drug Design and Discovery, Department of Pharmaceutical Sciences, Dr. Harisingh Gour University, Sagar, MP, India
| | - Sk Abdul Amin
- Natural Science Laboratory, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| | - Shovanlal Gayen
- Laboratory of Drug Design and Discovery, Department of Pharmaceutical Sciences, Dr. Harisingh Gour University, Sagar, MP, India
| | - Tarun Jha
- Natural Science Laboratory, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| |
Collapse
|
46
|
Mechanistic and Predictive QSAR Analysis of Diverse Molecules to Capture Salient and Hidden Pharmacophores for Anti-Thrombotic Activity. Int J Mol Sci 2021; 22:ijms22158352. [PMID: 34361118 PMCID: PMC8348508 DOI: 10.3390/ijms22158352] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 07/24/2021] [Accepted: 07/31/2021] [Indexed: 12/02/2022] Open
Abstract
Thrombosis is a life-threatening disease with a high mortality rate in many countries. Even though anti-thrombotic drugs are available, their serious side effects compel the search for safer drugs. In search of a safer anti-thrombotic drug, Quantitative Structure-Activity Relationship (QSAR) could be useful to identify crucial pharmacophoric features. The present work is based on a larger data set comprising 1121 diverse compounds to develop a QSAR model having a balance of acceptable predictive ability (Predictive QSAR) and mechanistic interpretation (Mechanistic QSAR). The developed six parametric model fulfils the recommended values for internal and external validation along with Y-randomization parameters such as R2tr = 0.831, Q2LMO = 0.828, R2ex = 0.783. The present analysis reveals that anti-thrombotic activity is found to be correlated with concealed structural traits such as positively charged ring carbon atoms, specific combination of aromatic Nitrogen and sp2-hybridized carbon atoms, etc. Thus, the model captured reported as well as novel pharmacophoric features. The results of QSAR analysis are further vindicated by reported crystal structures of compounds with factor Xa. The analysis led to the identification of useful novel pharmacophoric features, which could be used for future optimization of lead compounds.
Collapse
|
47
|
Enrique AV, Di Ianni ME, Goicoechea S, Lazarowski A, Valle-Dorado MG, Costa JJL, Rocha L, Girardi E, Talevi A. New anticonvulsant candidates prevent P-glycoprotein (P-gp) overexpression in a pharmacoresistant seizure model in mice. Epilepsy Behav 2021; 121:106451. [PMID: 31420290 DOI: 10.1016/j.yebeh.2019.106451] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/05/2019] [Revised: 07/17/2019] [Accepted: 07/19/2019] [Indexed: 01/04/2023]
Abstract
Despite the approval of a considerable number of last generation antiepileptic drugs (AEDs) (only in the last decade, six drugs have gained Food and Drug Administration approval), the global figures of seizure control have seemingly not improved, and available AED can still be regarded as symptomatic treatments. Fresh thinking in AEDs drug discovery, including the development of drugs with novel mechanisms of action, is required to achieve truly innovative antiepileptic medications. The transporter hypothesis proposes that inadequate penetration of AEDs across the blood-brain barrier, caused by increased expression of efflux transporters such as P-glycoprotein (P-gp), contributes to drug-resistant epilepsy. Neuroinflammation due to high levels of glutamate has been identified as one of the causes of P-gp upregulation, and several studies in animal models of epilepsy suggest that antiinflammatory drugs might prevent P-gp overexpression and, thus, avoid the development of refractory epilepsy. We have applied ligand-based in silico screening to select compounds that exert dual anticonvulsant and antiinflammatory effects. Five of the hits were tested in animal models of seizure, with protective effects. Later, two of them (sebacic acid (SA) and gamma-decanolactone) were submitted to the recently described MP23 model of drug-resistant seizures. All in all, SA displayed the best profile, showing activity in the maximal electroshock seizure (MES) and pentylenetetrazol (PTZ) seizure models, and reversing resistance to phenytoin (PHT) and decreasing the P-gp upregulation in the MP23 model. Furthermore, pretreatment with SA in the pilocarpine status epilepticus (SE) model resulted in decreased histamine release in comparison with nontreated animals. This is the first report of the use of the MP23 model to screen for novel anticonvulsant compounds that may avoid the development of P-gp-related drug resistance.
Collapse
Affiliation(s)
- Andrea Verónica Enrique
- Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Departamento de Ciencias Biologicas, Facultad de Ciencias Exactas, Universidad Nacional de La Plata (UNLP), 47 y115, La Plata B1900BJW, Argentina
| | - Mauricio Emiliano Di Ianni
- Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Departamento de Ciencias Biologicas, Facultad de Ciencias Exactas, Universidad Nacional de La Plata (UNLP), 47 y115, La Plata B1900BJW, Argentina
| | - Sofía Goicoechea
- Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Departamento de Ciencias Biologicas, Facultad de Ciencias Exactas, Universidad Nacional de La Plata (UNLP), 47 y115, La Plata B1900BJW, Argentina
| | - Alberto Lazarowski
- Instituto de Fisiopatología y Bioquímica Clínica (INFIBIOC), Facultad de Farmacia y Bioquímica (FFyB), Universidad de Buenos Aires (UBA,) Junín 956, C1113AAD CABA, Argentina
| | | | - Juan José López Costa
- Departamento de Biología Celular, Histología, Embriología y Genética, Facultad de Medicina, Universidad de Buenos Aires (UBA) / Instituto de Biología Celular y Neurociencia "Prof. E. De Robertis" (IBCN), CONICET-Universidad de Buenos Aires, Paraguay 2155, C1121ABG, Buenos Aires, Argentina
| | - Luisa Rocha
- Department of Pharmacobiology, Center for Research and Advanced Studies (CINVESTAV), Mexico City, Mexico
| | - Elena Girardi
- Departamento de Biología Celular, Histología, Embriología y Genética, Facultad de Medicina, Universidad de Buenos Aires (UBA) / Instituto de Biología Celular y Neurociencia "Prof. E. De Robertis" (IBCN), CONICET-Universidad de Buenos Aires, Paraguay 2155, C1121ABG, Buenos Aires, Argentina
| | - Alan Talevi
- Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Departamento de Ciencias Biologicas, Facultad de Ciencias Exactas, Universidad Nacional de La Plata (UNLP), 47 y115, La Plata B1900BJW, Argentina.
| |
Collapse
|
48
|
Morales JF, Chuguransky S, Alberca LN, Alice JI, Goicoechea S, Ruiz ME, Bellera CL, Talevi A. Positive Predictive Value Surfaces as a Complementary Tool to Assess the Performance of Virtual Screening Methods. Mini Rev Med Chem 2021; 20:1447-1460. [PMID: 32072906 DOI: 10.2174/1871525718666200219130229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 10/28/2019] [Accepted: 10/29/2019] [Indexed: 11/22/2022]
Abstract
BACKGROUND Since their introduction in the virtual screening field, Receiver Operating Characteristic (ROC) curve-derived metrics have been widely used for benchmarking of computational methods and algorithms intended for virtual screening applications. Whereas in classification problems, the ratio between sensitivity and specificity for a given score value is very informative, a practical concern in virtual screening campaigns is to predict the actual probability that a predicted hit will prove truly active when submitted to experimental testing (in other words, the Positive Predictive Value - PPV). Estimation of such probability is however, obstructed due to its dependency on the yield of actives of the screened library, which cannot be known a priori. OBJECTIVE To explore the use of PPV surfaces derived from simulated ranking experiments (retrospective virtual screening) as a complementary tool to ROC curves, for both benchmarking and optimization of score cutoff values. METHODS The utility of the proposed approach is assessed in retrospective virtual screening experiments with four datasets used to infer QSAR classifiers: inhibitors of Trypanosoma cruzi trypanothione synthetase; inhibitors of Trypanosoma brucei N-myristoyltransferase; inhibitors of GABA transaminase and anticonvulsant activity in the 6 Hz seizure model. RESULTS Besides illustrating the utility of PPV surfaces to compare the performance of machine learning models for virtual screening applications and to select an adequate score threshold, our results also suggest that ensemble learning provides models with better predictivity and more robust behavior. CONCLUSION PPV surfaces are valuable tools to assess virtual screening tools and choose score thresholds to be applied in prospective in silico screens. Ensemble learning approaches seem to consistently lead to improved predictivity and robustness.
Collapse
Affiliation(s)
- Juan F Morales
- Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina
| | - Sara Chuguransky
- Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina
| | - Lucas N Alberca
- Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina
| | - Juan I Alice
- Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina
| | - Sofía Goicoechea
- Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina
| | - María E Ruiz
- Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina
| | - Carolina L Bellera
- Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina
| | - Alan Talevi
- Laboratory of Bioactive Research and Development (LIDeB), Department of Biological Sciences, Faculty of Exact Sciences, University of La Plata (UNLP) - 47 & 115, La Plata (1900), Buenos Aires, Argentina
| |
Collapse
|
49
|
Pinacho-Castellanos SA, García-Jacas CR, Gilson MK, Brizuela CA. Alignment-Free Antimicrobial Peptide Predictors: Improving Performance by a Thorough Analysis of the Largest Available Data Set. J Chem Inf Model 2021; 61:3141-3157. [PMID: 34081438 DOI: 10.1021/acs.jcim.1c00251] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
In the last two decades, a large number of machine-learning-based predictors for the activities of antimicrobial peptides (AMPs) have been proposed. These predictors differ from one another in the learning method and in the training and testing data sets used. Unfortunately, the training data sets present several drawbacks, such as a low representativeness regarding the experimentally validated AMP space, and duplicated peptide sequences between negative and positive data sets. These limitations give a low confidence to most of the approaches to be used in prospective studies. To address these weaknesses, we propose novel modeling and assessing data sets from the largest experimentally validated nonredundant peptide data set reported to date. From these novel data sets, alignment-free quantitative sequence-activity models (AF-QSAMs) based on Random Forest are created to identify general AMPs and their antibacterial, antifungal, antiparasitic, and antiviral functional types. An applicability domain analysis is carried out to determine the reliability of the predictions obtained, which, to the best of our knowledge, is performed for the first time for AMP recognition. A benchmarking is undertaken between the models proposed and several models from the literature that are freely available in 13 programs (ClassAMP, iAMP-2L, ADAM, MLAMP, AMPScanner v2.0, AntiFP, AMPfun, PEPred-suite, AxPEP, CAMPR3, iAMPpred, APIN, and Meta-iAVP). The models proposed are those with the best performance in all of the endpoints modeled, while most of the methods from the literature have weak-to-random predictive agreements. The models proposed are also assessed through Y-scrambling and repeated k-fold cross-validation tests, demonstrating that the outcomes obtained by them are not given by chance. Three chemometric analyses also confirmed the relevance of the peptides descriptors used in the modeling. Therefore, it can be concluded that the models built by fixing the drawbacks existing in the literature contribute to identifying antibacterial, antifungal, antiparasitic, and antiviral peptides with high effectivity and reliability. Models are freely available via the AMPDiscover tool at https://biocom-ampdiscover.cicese.mx/.
Collapse
Affiliation(s)
- Sergio A Pinacho-Castellanos
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México.,Centro de Investigación y Desarrollo de Tecnología Digital (CITEDI), Instituto Politécnico Nacional (IPN), 22435 Tijuana, Baja California, México
| | - César R García-Jacas
- Cátedras CONACYT-Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Michael K Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California 92093, United States
| | - Carlos A Brizuela
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| |
Collapse
|
50
|
Kosugi Y, Mizuno K, Santos C, Sato S, Hosea N, Zientek M. Direct Comparison of the Prediction of the Unbound Brain-to-Plasma Partitioning Utilizing Machine Learning Approach and Mechanistic Neuropharmacokinetic Model. AAPS JOURNAL 2021; 23:72. [PMID: 34008121 PMCID: PMC8131289 DOI: 10.1208/s12248-021-00604-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Accepted: 04/29/2021] [Indexed: 11/30/2022]
Abstract
The mechanistic neuropharmacokinetic (neuroPK) model was established to predict unbound brain-to-plasma partitioning (Kp,uu,brain) by considering in vitro efflux activities of multiple drug resistance 1 (MDR1) and breast cancer resistance protein (BCRP). Herein, we directly compare this model to a computational machine learning approach utilizing physicochemical descriptors and efflux ratios of MDR1 and BCRP-expressing cells for predicting Kp,uu,brain in rats. Two different types of machine learning techniques, Gaussian processes (GP) and random forest regression (RF), were assessed by the time and cluster-split validation methods using 640 internal compounds. The predictivity of machine learning models based on only molecular descriptors in the time-split dataset performed worse than the cluster-split dataset, whereas the models incorporating MDR1 and BCRP efflux ratios showed similar predictivity between time and cluster-split datasets. The GP incorporating MDR1 and BCRP in the time-split dataset achieved the highest correlation (R2 = 0.602). These results suggested that incorporation of MDR1 and BCRP in machine learning is beneficial for robust and accurate prediction. Kp,uu,brain prediction utilizing the neuroPK model was significantly worse compared to machine learning approaches for the same dataset. We also investigated the predictivity of Kp,uu,brain using an external independent test set of 34 marketed drugs. Compared to machine learning models, the neuroPK model showed better predictive performance with R2 of 0.577. This work demonstrates that the machine learning model for Kp,uu,brain achieves maximum predictive performance within the chemical applicability domain, whereas the neuroPK model is applicable more widely beyond the chemical space covered in the training dataset.
Collapse
Affiliation(s)
- Yohei Kosugi
- Global DMPK, Takeda California Inc., San Diego, California, 92121, USA.
| | - Kunihiko Mizuno
- Global DMPK, Takeda California Inc., San Diego, California, 92121, USA
| | - Cipriano Santos
- Global DMPK, Takeda California Inc., San Diego, California, 92121, USA
| | - Sho Sato
- Global DMPK, Takeda Pharmaceutical Company Limited, 26-1 Muraoka-Higashi, 2-Chome, Fujisawa, Kanagawa, 251-8555, Japan
| | - Natalie Hosea
- Global DMPK, Takeda California Inc., San Diego, California, 92121, USA
| | - Michael Zientek
- Global DMPK, Takeda California Inc., San Diego, California, 92121, USA
| |
Collapse
|