1
|
Schaduangrat N, Anuwongcharoen N, Charoenkwan P, Shoombuatong W. DeepAR: a novel deep learning-based hybrid framework for the interpretable prediction of androgen receptor antagonists. J Cheminform 2023; 15:50. [PMID: 37149650 PMCID: PMC10163717 DOI: 10.1186/s13321-023-00721-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Accepted: 04/08/2023] [Indexed: 05/08/2023] Open
Abstract
Drug resistance represents a major obstacle to therapeutic innovations and is a prevalent feature in prostate cancer (PCa). Androgen receptors (ARs) are the hallmark therapeutic target for prostate cancer modulation and AR antagonists have achieved great success. However, rapid emergence of resistance contributing to PCa progression is the ultimate burden of their long-term usage. Hence, the discovery and development of AR antagonists with capability to combat the resistance, remains an avenue for further exploration. Therefore, this study proposes a novel deep learning (DL)-based hybrid framework, named DeepAR, to accurately and rapidly identify AR antagonists by using only the SMILES notation. Specifically, DeepAR is capable of extracting and learning the key information embedded in AR antagonists. Firstly, we established a benchmark dataset by collecting active and inactive compounds against AR from the ChEMBL database. Based on this dataset, we developed and optimized a collection of baseline models by using a comprehensive set of well-known molecular descriptors and machine learning algorithms. Then, these baseline models were utilized for creating probabilistic features. Finally, these probabilistic features were combined and used for the construction of a meta-model based on a one-dimensional convolutional neural network. Experimental results indicated that DeepAR is a more accurate and stable approach for identifying AR antagonists in terms of the independent test dataset, by achieving an accuracy of 0.911 and MCC of 0.823. In addition, our proposed framework is able to provide feature importance information by leveraging a popular computational approach, named SHapley Additive exPlanations (SHAP). In the meanwhile, the characterization and analysis of potential AR antagonist candidates were achieved through the SHAP waterfall plot and molecular docking. The analysis inferred that N-heterocyclic moieties, halogenated substituents, and a cyano functional group were significant determinants of potential AR antagonists. Lastly, we implemented an online web server by using DeepAR (at http://pmlabstack.pythonanywhere.com/DeepAR ). We anticipate that DeepAR could be a useful computational tool for community-wide facilitation of AR candidates from a large number of uncharacterized compounds.
Collapse
Affiliation(s)
- Nalini Schaduangrat
- Center for Research Innovation and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, 10700, Thailand
| | - Nuttapat Anuwongcharoen
- Department of Community Medical Technology, Faculty of Medical Technology, Mahidol University, Bangkok, 10700, Thailand
| | - Phasit Charoenkwan
- Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai, 50200, Thailand.
| | - Watshara Shoombuatong
- Center for Research Innovation and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, 10700, Thailand.
| |
Collapse
|
2
|
Parker J. Pathophysiological Effects of Contemporary Lifestyle on Evolutionary-Conserved Survival Mechanisms in Polycystic Ovary Syndrome. Life (Basel) 2023; 13:life13041056. [PMID: 37109585 PMCID: PMC10145572 DOI: 10.3390/life13041056] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 04/15/2023] [Accepted: 04/18/2023] [Indexed: 04/29/2023] Open
Abstract
Polycystic ovary syndrome (PCOS) is increasingly being characterized as an evolutionary mismatch disorder that presents with a complex mixture of metabolic and endocrine symptoms. The Evolutionary Model proposes that PCOS arises from a collection of inherited polymorphisms that have been consistently demonstrated in a variety of ethnic groups and races. In utero developmental programming of susceptible genomic variants are thought to predispose the offspring to develop PCOS. Postnatal exposure to lifestyle and environmental risk factors results in epigenetic activation of developmentally programmed genes and disturbance of the hallmarks of health. The resulting pathophysiological changes represent the consequences of poor-quality diet, sedentary behaviour, endocrine disrupting chemicals, stress, circadian disruption, and other lifestyle factors. Emerging evidence suggests that lifestyle-induced gastrointestinal dysbiosis plays a central role in the pathogenesis of PCOS. Lifestyle and environmental exposures initiate changes that result in disturbance of the gastrointestinal microbiome (dysbiosis), immune dysregulation (chronic inflammation), altered metabolism (insulin resistance), endocrine and reproductive imbalance (hyperandrogenism), and central nervous system dysfunction (neuroendocrine and autonomic nervous system). PCOS can be a progressive metabolic condition that leads to obesity, gestational diabetes, type two diabetes, metabolic-associated fatty liver disease, metabolic syndrome, cardiovascular disease, and cancer. This review explores the mechanisms that underpin the evolutionary mismatch between ancient survival pathways and contemporary lifestyle factors involved in the pathogenesis and pathophysiology of PCOS.
Collapse
Affiliation(s)
- Jim Parker
- School of Medicine, University of Wollongong, Wollongong, NSW 2522, Australia
| |
Collapse
|
3
|
Srisongkram T, Khamtang P, Weerapreeyakul N. Prediction of KRAS G12C inhibitors using conjoint fingerprint and machine learning-based QSAR models. J Mol Graph Model 2023; 122:108466. [PMID: 37058997 DOI: 10.1016/j.jmgm.2023.108466] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 03/19/2023] [Accepted: 03/29/2023] [Indexed: 04/16/2023]
Abstract
Kirsten rat sarcoma virus G12C (KRASG12C) is the major protein mutation associated with non-small cell lung cancer (NSCLC) severity. Inhibiting KRASG12C is therefore one of the key therapeutic strategies for NSCLC patients. In this paper, a cost-effective data driven drug design employing machine learning-based quantitative structure-activity relationship (QSAR) analysis was built for predicting ligand affinities against KRASG12C protein. A curated and non-redundant dataset of 1033 compounds with KRASG12C inhibitory activity (pIC50) was used to build and test the models. The PubChem fingerprint, Substructure fingerprint, Substructure fingerprint count, and the conjoint fingerprint-a combination of PubChem fingerprint and Substructure fingerprint count-were used to train the models. Using comprehensive validation methods and various machine learning algorithms, the results clearly showed that the XGBoost regression (XGBoost) achieved the highest performance in term of goodness of fit, predictivity, generalizability and model robustness (R2 = 0.81, Q2CV = 0.60, Q2Ext = 0.62, R2 - Q2Ext = 0.19, R2Y-Random = 0.31 ± 0.03, Q2Y-Random = -0.09 ± 0.04). The top 13 molecular fingerprints that correlated with the predicted pIC50 values were SubFPC274 (aromatic atoms), SubFPC307 (number of chiral-centers), PubChemFP37 (≥1 Chlorine), SubFPC18 (Number of alkylarylethers), SubFPC1 (number of primary carbons), SubFPC300 (number of 1,3-tautomerizables), PubChemFP621 (N-C:C:C:N structure), PubChemFP23 (≥1 Fluorine), SubFPC2 (number of secondary carbons), SubFPC295 (number of C-ONS bonds), PubChemFP199 (≥4 6-membered rings), PubChemFP180 (≥1 nitrogen-containing 6-membered ring), and SubFPC180 (number of tertiary amine). These molecular fingerprints were virtualized and validated using molecular docking experiments. In conclusion, this conjoint fingerprint and XGBoost-QSAR model demonstrated to be useful as a high-throughput screening tool for KRASG12C inhibitor identification and drug design.
Collapse
Affiliation(s)
- Tarapong Srisongkram
- Division of Pharmaceutical Chemistry, Faculty of Pharmaceutical Sciences, Khon Kaen University, 40002, Thailand.
| | | | - Natthida Weerapreeyakul
- Division of Pharmaceutical Chemistry, Faculty of Pharmaceutical Sciences, Khon Kaen University, 40002, Thailand
| |
Collapse
|
4
|
Rasti B. Quantitative Characterization of the Chemical Space Governed by Human Carbonic Anhydrases and selenium-containing derivatives of solfonamides. BRAZ J PHARM SCI 2022. [DOI: 10.1590/s2175-97902022e19704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
|
5
|
|
6
|
Bongers BJ, IJzerman AP, Van Westen GJP. Proteochemometrics - recent developments in bioactivity and selectivity modeling. DRUG DISCOVERY TODAY. TECHNOLOGIES 2019; 32-33:89-98. [PMID: 33386099 DOI: 10.1016/j.ddtec.2020.08.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 08/18/2020] [Accepted: 08/28/2020] [Indexed: 06/12/2023]
Abstract
Proteochemometrics is a machine learning based modeling approach relying on a combination of ligand and protein descriptors. With ongoing developments in machine learning and increases in public data the technique is more frequently applied in early drug discovery, typically in ligand-target binding prediction. Common applications include improvements to single target quantitative structure-activity relationship models, protein selectivity and promiscuity modeling, and large-scale deep learning approaches. The increase in predictive power using proteochemometrics is observed in multi-target bioactivity modeling, opening the door to more extensive studies covering whole protein families. On top of that, with deep learning fueling more complex and larger scale models, proteochemometrics allows faster and higher quality computational models supporting the design, make, test cycle.
Collapse
Affiliation(s)
- Brandon J Bongers
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, P.O. Box 9502, 2300 RA, Leiden, The Netherlands
| | - Adriaan P IJzerman
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, P.O. Box 9502, 2300 RA, Leiden, The Netherlands
| | - Gerard J P Van Westen
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, P.O. Box 9502, 2300 RA, Leiden, The Netherlands.
| |
Collapse
|
7
|
Rasti B, Mazraedoost S, Panahi H, Falahati M, Attar F. New insights into the selective inhibition of the β-carbonic anhydrases of pathogenic bacteria Burkholderia pseudomallei and Francisella tularensis: a proteochemometrics study. Mol Divers 2018; 23:263-273. [PMID: 30120657 DOI: 10.1007/s11030-018-9869-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Accepted: 08/09/2018] [Indexed: 10/28/2022]
Abstract
Nowadays, antibiotic resistance has turned into one of the most important worldwide health problems. Biological end point of critical enzymes induced by potent inhibitors is recently being considered as a highly effective and popular strategy to defeat antibiotic-resistant pathogens. For instance, the simple but critical β-carbonic anhydrase has recently been in the center of attention for anti-pathogen drug discoveries. However, no β-carbonic anhydrase selective inhibitor has yet been developed. Available β-carbonic anhydrase inhibitors are also highly potent with regard to human carbonic anhydrases, leading to severe inevitable side effects in case of usage. Therefore, developing novel inhibitors with high selectivity against pathogenic β-carbonic anhydrases is of great essence. Herein, for the first time, we have conducted a proteochemometric study to explore the structural and the chemical aspects of the interactions governed by bacterial β-carbonic anhydrases and their inhibitors. We have found valuable information which can lead to designing novel inhibitors with better selectivity for bacterial β-carbonic anhydrases.
Collapse
Affiliation(s)
- Behnam Rasti
- Department of Microbiology, Faculty of Basic Sciences, Lahijan Branch, Islamic Azad University (IAU), Lahijan, Guilan, Iran.
| | - Sargol Mazraedoost
- Department of Microbiology, Faculty of Basic Sciences, Lahijan Branch, Islamic Azad University (IAU), Lahijan, Guilan, Iran
| | - Hanieh Panahi
- Department of Mathematics and Statistics, Lahijan Branch, Islamic Azad University, Lahijan, Iran
| | - Mojtaba Falahati
- Department of Nanotechnology, Faculty of Advance Science and Technology, Pharmaceutical Sciences Branch, Islamic Azad University (IAUPS), Tehran, Iran
| | - Farnoosh Attar
- Department of Biology, Faculty of Food Industry and Agriculture, Standard Research Institute (SRI), Karaj, Iran
| |
Collapse
|
8
|
Shoombuatong W, Schaduangrat N, Nantasenamat C. Towards understanding aromatase inhibitory activity via QSAR modeling. EXCLI JOURNAL 2018; 17:688-708. [PMID: 30190660 PMCID: PMC6123608 DOI: 10.17179/excli2018-1417] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2018] [Accepted: 07/10/2018] [Indexed: 12/14/2022]
Abstract
Aromatase is a rate-limiting enzyme for estrogen biosynthesis that is overproduced in breast cancer tissue. To block the growth of breast tumors, aromatase inhibitors (AIs) are employed to bind and inhibit aromatase in order to lower the amount of estrogen produced in the body. Although a number of synthetic aromatase inhibitors have been released for clinical use in the treatment of hormone-receptor positive breast cancer, these inhibitors may lead to undesirable side effects (e.g. increased rash, diarrhea and vomiting; effects on the bone, brain and heart) and therefore, the search for novel AIs continues. Over the past decades, there has been an intense effort in employing medicinal chemistry and quantitative structure-activity relationship (QSAR) to shed light on the mechanistic basis of aromatase inhibition. To the best of our knowledge, this article constitutes the first comprehensive review of all QSAR studies of both steroidal and non-steroidal AIs that have been published in the field. Herein, we summarize the experimental setup of these studies as well as summarizing the key features that are pertinent for robust aromatase inhibition.
Collapse
Affiliation(s)
- Watshara Shoombuatong
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Nalini Schaduangrat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| |
Collapse
|
9
|
Rasti B, Heravi YE. Probing the chemical interaction space governed by 4-aminosubstituted benzenesulfonamides and carbonic anhydrase isoforms. Res Pharm Sci 2018; 13:192-204. [PMID: 29853929 PMCID: PMC5921400 DOI: 10.4103/1735-5362.228940] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Isoform diversity, critical physiological roles and involvement in major diseases/disorders such as glaucoma, epilepsy, Alzheimer's disease, obesity, and cancers have made carbonic anhydrase (CA), one of the most interesting case studies in the field of computer aided drug design. Since applying non-selective inhibitors can result in major side effects, there have been considerable efforts so far to achieve selective inhibitors for different isoforms of CA. Using proteochemometrics approach, the chemical interaction space governed by a group of 4-amino-substituted benzenesulfonamides and human CAs has been explored in the present study. Several validation methods have been utilized to assess the validity, robustness and predictivity power of the proposed proteochemometric model. Our model has offered major structural information that can be applied to design new selective inhibitors for distinct isoforms of CA. To prove the applicability of the proposed model, new compounds have been designed based on the offered discriminative structural features.
Collapse
Affiliation(s)
- Behnam Rasti
- Department of Microbiology, Faculty of Basic Sciences, Lahijan Branch, Islamic Azad University (IAU), Lahijan, Guilan, I.R. Iran
| | | |
Collapse
|
10
|
Rasti B, Shahangian SS. Proteochemometric modeling of the origin of thymidylate synthase inhibition. Chem Biol Drug Des 2018; 91:1007-1016. [PMID: 29251822 DOI: 10.1111/cbdd.13163] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Revised: 11/09/2017] [Accepted: 12/01/2017] [Indexed: 12/11/2022]
Affiliation(s)
- Behnam Rasti
- Department of Microbiology; Faculty of Basic Sciences; Lahijan Branch; Islamic Azad University (IAU); Lahijan Guilan Iran
| | | |
Collapse
|
11
|
Subramanian V, Ain QU, Henno H, Pietilä LO, Fuchs JE, Prusis P, Bender A, Wohlfahrt G. 3D proteochemometrics: using three-dimensional information of proteins and ligands to address aspects of the selectivity of serine proteases. MEDCHEMCOMM 2017; 8:1037-1045. [PMID: 30108817 PMCID: PMC6072133 DOI: 10.1039/c6md00701e] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Accepted: 03/14/2017] [Indexed: 11/21/2022]
Abstract
The high similarity between certain sub-pockets of serine proteases may lead to low selectivity of protease inhibitors. Therefore the application of proteochemometrics (PCM), which quantifies the relationship between protein/ligand descriptors and affinity for multiple ligands and targets simultaneously, is useful to understand and improve the selectivity profiles of potential inhibitors. In this study, protein field-based PCM that uses knowledge-based and WaterMap derived fields to describe proteins in combination with 2D (RDKit and MOE fingerprints) and 3D (4 point pharmacophoric fingerprints and GRIND) ligand descriptors was used to model the bioactivities of 24 homologous serine proteases and 5863 inhibitors in an integrated fashion. Of the multiple field-based PCM models generated based on different ligand descriptors, RDKit fingerprints showed the best performance in terms of external prediction with Rtest2 of 0.72 and RMSEP of 0.81. Further, visual interpretation of the models highlights sub-pocket specific regions that influence affinity and selectivity of serine protease inhibitors.
Collapse
Affiliation(s)
- Vigneshwari Subramanian
- Division of Pharmaceutical Chemistry and Technology , Faculty of Pharmacy , University of Helsinki , 00014 Helsinki , Finland
- Computer-Aided Drug Design , Orion Pharma , Orionintie 1 , 02101 Espoo , Finland .
| | - Qurrat Ul Ain
- Centre for Molecular Informatics , Department of Chemistry , Lensfield Road , CB2 1EW Cambridge , UK
| | - Helena Henno
- Computer-Aided Drug Design , Orion Pharma , Orionintie 1 , 02101 Espoo , Finland .
| | - Lars-Olof Pietilä
- Computer-Aided Drug Design , Orion Pharma , Orionintie 1 , 02101 Espoo , Finland .
| | - Julian E Fuchs
- Centre for Molecular Informatics , Department of Chemistry , Lensfield Road , CB2 1EW Cambridge , UK
- Institute of General , Inorganic and Theoretical Chemistry , University of Innsbruck , Innrain 82 , 6020 Innsbruck , Austria
| | - Peteris Prusis
- Computer-Aided Drug Design , Orion Pharma , Orionintie 1 , 02101 Espoo , Finland .
| | - Andreas Bender
- Centre for Molecular Informatics , Department of Chemistry , Lensfield Road , CB2 1EW Cambridge , UK
| | - Gerd Wohlfahrt
- Computer-Aided Drug Design , Orion Pharma , Orionintie 1 , 02101 Espoo , Finland .
| |
Collapse
|
12
|
Rasti B, Schaduangrat N, Shahangian SS, Nantasenamat C. Exploring the origin of phosphodiesterase inhibition via proteochemometric modeling. RSC Adv 2017. [DOI: 10.1039/c7ra02332d] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
A proteochemometric study of a set of phosphodiesterase 4B and 4D inhibitors sheds light on the origin of their inhibition and selectivities.
Collapse
Affiliation(s)
- Behnam Rasti
- Department of Microbiology
- Faculty of Basic Sciences
- Lahijan Branch
- Islamic Azad University (IAU)
- Lahijan
| | - Nalini Schaduangrat
- Center of Data Mining and Biomedical Informatics
- Faculty of Medical Technology
- Mahidol University
- Bangkok 10700
- Thailand
| | - S. Shirin Shahangian
- Department of Biology
- Faculty of Sciences
- University of Guilan
- Rasht 41938-33697
- Iran
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics
- Faculty of Medical Technology
- Mahidol University
- Bangkok 10700
- Thailand
| |
Collapse
|
13
|
Shoombuatong W, Prathipati P, Owasirikul W, Worachartcheewan A, Simeon S, Anuwongcharoen N, Wikberg JES, Nantasenamat C. Towards the Revival of Interpretable QSAR Models. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2017. [DOI: 10.1007/978-3-319-56850-8_1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|