1
|
Arora S, Satija S, Mittal A, Solanki S, Mohanty SK, Srivastava V, Sengupta D, Rout D, Arul Murugan N, Borkar RM, Ahuja G. Unlocking The Mysteries of DNA Adducts with Artificial Intelligence. Chembiochem 2024; 25:e202300577. [PMID: 37874183 DOI: 10.1002/cbic.202300577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 10/18/2023] [Accepted: 10/23/2023] [Indexed: 10/25/2023]
Abstract
Cellular genome is considered a dynamic blueprint of a cell since it encodes genetic information that gets temporally altered due to various endogenous and exogenous insults. Largely, the extent of genomic dynamicity is controlled by the trade-off between DNA repair processes and the genotoxic potential of the causative agent (genotoxins or potential carcinogens). A subset of genotoxins form DNA adducts by covalently binding to the cellular DNA, triggering structural or functional changes that lead to significant alterations in cellular processes via genetic (e. g., mutations) or non-genetic (e. g., epigenome) routes. Identification, quantification, and characterization of DNA adducts are indispensable for their comprehensive understanding and could expedite the ongoing efforts in predicting carcinogenicity and their mode of action. In this review, we elaborate on using Artificial Intelligence (AI)-based modeling in adducts biology and present multiple computational strategies to gain advancements in decoding DNA adducts. The proposed AI-based strategies encompass predictive modeling for adduct formation via metabolic activation, novel adducts' identification, prediction of biochemical routes for adduct formation, adducts' half-life predictions within biological ecosystems, and, establishing methods to predict the link between adducts chemistry and its location within the genomic DNA. In summary, we discuss some futuristic AI-based approaches in DNA adduct biology.
Collapse
Affiliation(s)
- Sakshi Arora
- Department of Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi) Okhla, Phase III, New Delhi, 110020, India
| | - Shiva Satija
- Department of Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi) Okhla, Phase III, New Delhi, 110020, India
| | - Aayushi Mittal
- Department of Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi) Okhla, Phase III, New Delhi, 110020, India
| | - Saveena Solanki
- Department of Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi) Okhla, Phase III, New Delhi, 110020, India
| | - Sanjay Kumar Mohanty
- Department of Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi) Okhla, Phase III, New Delhi, 110020, India
| | - Vaibhav Srivastava
- Division of Glycoscience, Department of Chemistry CBH School, Royal Institute of Technology (KTH) AlbaNova University Center, 10691, Stockholm, Sweden
| | - Debarka Sengupta
- Department of Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi) Okhla, Phase III, New Delhi, 110020, India
| | - Diptiranjan Rout
- Department of Transfusion Medicine National Cancer Institute, AIIMS, New Delhi, All India Institute of Medical Sciences, Ansari Nagar, New Delhi, 110608, India
| | - Natarajan Arul Murugan
- Department of Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi) Okhla, Phase III, New Delhi, 110020, India
| | - Roshan M Borkar
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research (NIPER)-Guwahati, Sila Katamur Halugurisuk P.O.: Changsari, Dist, Guwahati, Assam, 781101, India
| | - Gaurav Ahuja
- Department of Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi) Okhla, Phase III, New Delhi, 110020, India
| |
Collapse
|
2
|
Han R, Yoon H, Kim G, Lee H, Lee Y. Revolutionizing Medicinal Chemistry: The Application of Artificial Intelligence (AI) in Early Drug Discovery. Pharmaceuticals (Basel) 2023; 16:1259. [PMID: 37765069 PMCID: PMC10537003 DOI: 10.3390/ph16091259] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 08/24/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open
Abstract
Artificial intelligence (AI) has permeated various sectors, including the pharmaceutical industry and research, where it has been utilized to efficiently identify new chemical entities with desirable properties. The application of AI algorithms to drug discovery presents both remarkable opportunities and challenges. This review article focuses on the transformative role of AI in medicinal chemistry. We delve into the applications of machine learning and deep learning techniques in drug screening and design, discussing their potential to expedite the early drug discovery process. In particular, we provide a comprehensive overview of the use of AI algorithms in predicting protein structures, drug-target interactions, and molecular properties such as drug toxicity. While AI has accelerated the drug discovery process, data quality issues and technological constraints remain challenges. Nonetheless, new relationships and methods have been unveiled, demonstrating AI's expanding potential in predicting and understanding drug interactions and properties. For its full potential to be realized, interdisciplinary collaboration is essential. This review underscores AI's growing influence on the future trajectory of medicinal chemistry and stresses the importance of ongoing synergies between computational and domain experts.
Collapse
Affiliation(s)
| | | | | | | | - Yoonji Lee
- College of Pharmacy, Chung-Ang University, Seoul 06974, Republic of Korea
| |
Collapse
|
3
|
Mutagenic potential and structural alerts of phytotoxins. Food Chem Toxicol 2023; 173:113562. [PMID: 36563927 DOI: 10.1016/j.fct.2022.113562] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 12/09/2022] [Accepted: 12/14/2022] [Indexed: 12/25/2022]
Abstract
Toxic plant-produced chemicals, so-called phytotoxins, constitute a category of natural compounds belonging to a diversity of chemical classes. Some of them (e.g., alkaloids, terpenes, saponins) are associated with high toxic potency, while for many of others no toxicological data is available. In this study, the mutagenic potential of 1586 phytotoxins, as obtained from a publicly available database, was investigated applying different in silico approaches. (Q)SAR models (including statistical-based and rule-based systems) were used for the prediction of bacterial in vitro mutagenicity (Ames test) and the results from multiple tools were combined to assign consensus predicted values (i.e., positive, negative, inconclusive). The overall consensus outcome was then employed to investigate relationships between structural features of classes of phytotoxins and potential mutagenicity, allowing the identification of structural alerts raising a specific concern. The results highlighted that about 10% of the screened compounds were predicted to have mutagenic potential and the critical classes of concern, such as alkaloids, were further investigated in terms of subclasses (e.g., indole alkaloids, isoquinoline alkaloids), getting a deeper insight into the mutagenic potential of possible naturally occurring chemicals in plant materials and their structural alerts.
Collapse
|
4
|
Gajewicz-Skretna A, Wyrzykowska E, Gromelski M. Quantitative multi-species toxicity modeling: Does a multi-species, machine learning model provide better performance than a single-species model for the evaluation of acute aquatic toxicity by organic pollutants? THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 861:160590. [PMID: 36473653 DOI: 10.1016/j.scitotenv.2022.160590] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 11/25/2022] [Accepted: 11/26/2022] [Indexed: 06/17/2023]
Abstract
The toxicological profile of any chemical is defined by multiple endpoints and testing procedures, including representative test species from different trophic levels. While computer-aided methods play an increasingly important role in supporting ecotoxicology research and chemical hazard assessment, most of the recently developed machine learning models are directed towards a single, specific endpoint. To overcome this limitation and accelerate the process of identifying potentially hazardous environmental pollutants, we are introducing an effective approach for quantitative, multi-species modeling. The proposed approach is based on canonical correlation analysis that finds a pair(s) of uncorrelated, linear combinations of the original variables that best defines the overall variability within and between multiple biological responses and predictor variables. Its effectiveness was confirmed by the machine learning model for estimating acute toxicity of diverse organic pollutants in aquatic species from three trophic levels: algae (Pseudokirchneriella subcapitata), daphnia (Daphnia magna), and fish (Oryzias latipes). The multi-species model achieved a favorable predictive performance that were in line with predictive models derived for the aquatic organisms individually. The chemical bioavailability and reactivity parameters (n-octanol/water partition coefficient, chemical potential, and molecular size and volume) were important to accurately predict acute ecotoxicity to the three aquatic organisms. To facilitate the use of this approach, an open-source, Python-based script, named qMTM (quantitative Multi-species Toxicity Modeling) has been provided.
Collapse
Affiliation(s)
- Agnieszka Gajewicz-Skretna
- Laboratory of Environmental Chemoinformatics, Faculty of Chemistry, University of Gdansk, Wita Stwosza 63, 80-308 Gdansk, Poland.
| | - Ewelina Wyrzykowska
- Laboratory of Environmental Chemoinformatics, Faculty of Chemistry, University of Gdansk, Wita Stwosza 63, 80-308 Gdansk, Poland
| | - Maciej Gromelski
- Laboratory of Environmental Chemoinformatics, Faculty of Chemistry, University of Gdansk, Wita Stwosza 63, 80-308 Gdansk, Poland
| |
Collapse
|
5
|
Drewe J, Küsters E, Hammann F, Kreuter M, Boss P, Schöning V. Modeling Structure-Activity Relationship of AMPK Activation. Molecules 2021; 26:molecules26216508. [PMID: 34770917 PMCID: PMC8587902 DOI: 10.3390/molecules26216508] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Revised: 10/25/2021] [Accepted: 10/26/2021] [Indexed: 12/23/2022] Open
Abstract
The adenosine monophosphate activated protein kinase (AMPK) is critical in the regulation of important cellular functions such as lipid, glucose, and protein metabolism; mitochondrial biogenesis and autophagy; and cellular growth. In many diseases-such as metabolic syndrome, obesity, diabetes, and also cancer-activation of AMPK is beneficial. Therefore, there is growing interest in AMPK activators that act either by direct action on the enzyme itself or by indirect activation of upstream regulators. Many natural compounds have been described that activate AMPK indirectly. These compounds are usually contained in mixtures with a variety of structurally different other compounds, which in turn can also alter the activity of AMPK via one or more pathways. For these compounds, experiments are complicated, since the required pure substances are often not yet isolated and/or therefore not sufficiently available. Therefore, our goal was to develop a screening tool that could handle the profound heterogeneity in activation pathways of the AMPK. Since machine learning algorithms can model complex (unknown) relationships and patterns, some of these methods (random forest, support vector machines, stochastic gradient boosting, logistic regression, and deep neural network) were applied and validated using a database, comprising of 904 activating and 799 neutral or inhibiting compounds identified by extensive PubMed literature search and PubChem Bioassay database. All models showed unexpectedly high classification accuracy in training, but more importantly in predicting the unseen test data. These models are therefore suitable tools for rapid in silico screening of established substances or multicomponent mixtures and can be used to identify compounds of interest for further testing.
Collapse
Affiliation(s)
- Jürgen Drewe
- Medical Department, Max Zeller Söhne AG, CH-8590 Romanshorn, Switzerland;
- Correspondence:
| | | | - Felix Hammann
- Clinical Pharmacology and Toxicology, Department of General Internal Medicine, Inselspital University Hospital, CH-3012 Bern, Switzerland; (F.H.); (V.S.)
| | - Matthias Kreuter
- Medical Department, Max Zeller Söhne AG, CH-8590 Romanshorn, Switzerland;
| | - Philipp Boss
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association, D-13125 Berlin, Germany;
| | - Verena Schöning
- Clinical Pharmacology and Toxicology, Department of General Internal Medicine, Inselspital University Hospital, CH-3012 Bern, Switzerland; (F.H.); (V.S.)
| |
Collapse
|