1
|
Nguyen HD, Hoang LT, Vu GH. An in silico investigation of the toxicological effects and biological activities of 3-phenoxybenzoic acid and its metabolite products. Xenobiotica 2024:1-20. [PMID: 38833509 DOI: 10.1080/00498254.2024.2361457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 05/26/2024] [Indexed: 06/06/2024]
Abstract
We aimed to elucidate the toxic effects and biological activities of 3-phenoxybenzoic acid (3PBA) and its metabolite products. Numerous in silico methods were used to identify the toxic effects and biological activities of 3PBA, including PASS online, molecular docking, ADMETlab 2.0, ADMESWISS, MetaTox, and molecular dynamic simulation. Ten metabolite products were identified via Phase II reactions (O-glucuronidation, O-sulfation, and methylation). All of the investigated compounds were followed by Lipinski's rule, indicating that they were stimulants or inducers of hazardous processes. Because of their high gastrointestinal absorption and ability to reach the blood-brain barrier, the studied compounds' physicochemical and pharmacokinetic properties matched existing evidence of harmful effects, including hematemesis, reproductive dysfunction, allergic dermatitis, toxic respiration, and neurotoxicity. The studied compounds have been linked to the apoptotic pathway, the reproductivity system, neuroendocrine disruptors, phospholipid-translocating ATPase inhibitors, and JAK2 expression. An O-glucuronidation metabolite product demonstrated higher binding affinity and interaction with CYP2C9, CYP3A4, caspase 3, and caspase 8 than 3PBA and other metabolite products, whereas metabolite products from methylation were predominant and more toxic. Our in silico findings partly meet the 3Rs principle by minimizing animal testing before more study is needed to identify the detrimental effects of 3PBA on other organs (liver, kidneys). Future research directions may involve experimental validation of in silico predictions, elucidation of molecular mechanisms, and exploration of therapeutic interventions. These findings contribute to our understanding of the toxicological profile of 3PBA and its metabolites, which has implications for risk assessment and regulatory decisions.
Collapse
Affiliation(s)
- Hai Duc Nguyen
- Division of Microbiology, Tulane National Private Research Center, Tulane University, Covington, Louisiana, 70433, USA
| | - Linh Thuy Hoang
- College of Pharmacy, California Northstate University College of Pharmacy, CA, USA
| | - Giang Huong Vu
- Department of Public Heath, Hong Bang Health Center, Hai Phong, Vietnam
| |
Collapse
|
2
|
Senanayake RD, Daly CA, Hernandez R. Optimized Bags of Artificial Neural Networks Can Predict the Viability of Organisms Exposed to Nanoparticles. J Phys Chem A 2024; 128:2857-2870. [PMID: 38536900 DOI: 10.1021/acs.jpca.3c07462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
Prediction of organismal viability upon exposure to a nanoparticle in varying environments─as fully specified at the molecular scale─has emerged as a useful figure of merit in the design of engineered nanoparticles. We build on our earlier finding that a bag of artificial neural networks (ANNs) can provide such a prediction when such machines are trained with a relatively small data set (with ca. 200 examples). Therein, viabilities were predicted by consensus using the weighted means of the predictions from the bags. Here, we confirm the accuracy and precision of the prediction of nanoparticle viabilities using an optimized bag of ANNs over sets of data examples that had not previously been used in the training and validation process. We also introduce the viability strip, rather than a single value, as the prediction and construct it from the viability probability distribution of an ensemble of ANNs compatible with the data set. Specifically, the ensemble consists of the ANNs arising from subsets of the data set corresponding to different splittings between training and validation, and the different bags (k-folds). A k - 1 k machine uses a single partition (or bag) of k - 1 ANNs each trained on 1/k of the data to obtain a consensus prediction, and a k-bag machine quorum samples the k possible k - 1 k machines available for a given partition. We find that with increasing k in the k-bag or k - 1 k machines, the viability strips become more normally distributed and their predictions become more precise. Benchmark comparisons between ensembles of 4-bag machines and 3 4 fraction machines suggest that the 3 4 fraction machine has similar accuracy while overcoming some of the challenges arising from divergent ANNs in the 4-bag machines.
Collapse
Affiliation(s)
- Ravithree D Senanayake
- Department of Chemistry, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Clyde A Daly
- Department of Chemistry, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Rigoberto Hernandez
- Department of Chemistry, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Department of Chemical & Biomolecular Engineering and Department of Materials Science and Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| |
Collapse
|
3
|
Kleinstreuer N, Hartung T. Artificial intelligence (AI)-it's the end of the tox as we know it (and I feel fine). Arch Toxicol 2024; 98:735-754. [PMID: 38244040 PMCID: PMC10861653 DOI: 10.1007/s00204-023-03666-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 12/12/2023] [Indexed: 01/22/2024]
Abstract
The rapid progress of AI impacts diverse scientific disciplines, including toxicology, and has the potential to transform chemical safety evaluation. Toxicology has evolved from an empirical science focused on observing apical outcomes of chemical exposure, to a data-rich field ripe for AI integration. The volume, variety and velocity of toxicological data from legacy studies, literature, high-throughput assays, sensor technologies and omics approaches create opportunities but also complexities that AI can help address. In particular, machine learning is well suited to handle and integrate large, heterogeneous datasets that are both structured and unstructured-a key challenge in modern toxicology. AI methods like deep neural networks, large language models, and natural language processing have successfully predicted toxicity endpoints, analyzed high-throughput data, extracted facts from literature, and generated synthetic data. Beyond automating data capture, analysis, and prediction, AI techniques show promise for accelerating quantitative risk assessment by providing probabilistic outputs to capture uncertainties. AI also enables explanation methods to unravel mechanisms and increase trust in modeled predictions. However, issues like model interpretability, data biases, and transparency currently limit regulatory endorsement of AI. Multidisciplinary collaboration is needed to ensure development of interpretable, robust, and human-centered AI systems. Rather than just automating human tasks at scale, transformative AI can catalyze innovation in how evidence is gathered, data are generated, hypotheses are formed and tested, and tasks are performed to usher new paradigms in chemical safety assessment. Used judiciously, AI has immense potential to advance toxicology into a more predictive, mechanism-based, and evidence-integrated scientific discipline to better safeguard human and environmental wellbeing across diverse populations.
Collapse
Affiliation(s)
| | - Thomas Hartung
- Bloomberg School of Public Health, Doerenkamp-Zbinden Chair for Evidence-Based Toxicology, Center for Alternatives to Animal Testing (CAAT), Johns Hopkins University, Baltimore, MD, USA.
- CAAT-Europe, University of Konstanz, Constance, Germany.
| |
Collapse
|
4
|
Ouahabi S, Loukili EH, Elbouzidi A, Taibi M, Bouslamti M, Nafidi HA, Salamatullah AM, Saidi N, Bellaouchi R, Addi M, Ramdani M, Bourhia M, Hammouti B. Pharmacological Properties of Chemically Characterized Extracts from Mastic Tree: In Vitro and In Silico Assays. Life (Basel) 2023; 13:1393. [PMID: 37374175 DOI: 10.3390/life13061393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 06/05/2023] [Accepted: 06/08/2023] [Indexed: 06/29/2023] Open
Abstract
The mastic tree, scientifically known as Pistacia lentiscus, which belongs to the Anacardiaceae family, was used in this study. The aim of this research was to analyze the chemical composition of this plant and assess its antioxidant and antibacterial properties using both laboratory experiments and computer simulations through molecular docking, a method that predicts the binding strength of a small molecule to a protein. The soxhlet method (SE) was employed to extract substances from the leaves of P. lentiscus found in the eastern region of Morocco. Hexane and methanol were the solvents used for the extraction process. The n-hexane extract was subjected to gas chromatography-mass spectrometry (GC/MS) to identify its fatty acid content. The methanolic extract underwent high-performance liquid chromatography with a diode-array detector (HPLC-DAD) to determine the presence of phenolic compounds. Antioxidant activity was assessed using the DPPH spectrophotometric test. The findings revealed that the main components in the n-hexane extract were linoleic acid (40.97 ± 0.33%), oleic acid (23.69 ± 0.12%), and palmitic acid (22.83 ± 0.10%). Catechin (37.05 ± 0.15%) was identified as the predominant compound in the methanolic extract through HPLC analysis. The methanolic extract exhibited significant DPPH radical scavenging, with an IC50 value of 0.26 ± 0.14 mg/mL. The antibacterial activity was tested against Staphylococcus aureus, Listeria innocua, and Escherichia coli, while the antifungal activity was evaluated against Geotrichum candidum and Rhodotorula glutinis. The P. lentiscus extract demonstrated notable antimicrobial effects. Additionally, apart from molecular docking, other important factors, such as drug similarity, drug metabolism and distribution within the body, potential adverse effects, and impact on bodily systems, were considered for the substances derived from P. lentiscus. Scientific algorithms, such as Prediction of Activity Spectra for Substances (PASS), Absorption, Distribution, Metabolism, Excretion (ADME), and Pro-Tox II, were utilized for this assessment. The results obtained from this research support the traditional medicinal usage of P. lentiscus and suggest its potential for drug development.
Collapse
Affiliation(s)
- Safae Ouahabi
- Laboratory of Applied and Environmental Chemistry (LCAE), Faculty of Sciences, Mohammed First University, B.P. 717, Oujda 60000, Morocco
| | - El Hassania Loukili
- Laboratory of Applied and Environmental Chemistry (LCAE), Faculty of Sciences, Mohammed First University, B.P. 717, Oujda 60000, Morocco
| | - Amine Elbouzidi
- Laboratoire d'Amélioration des Productions Agricoles, Biotechnologie et Environnement (LAPABE), Faculté des Sciences, Université Mohammed Premier, Oujda 60000, Morocco
| | - Mohamed Taibi
- Laboratoire d'Amélioration des Productions Agricoles, Biotechnologie et Environnement (LAPABE), Faculté des Sciences, Université Mohammed Premier, Oujda 60000, Morocco
| | - Mohammed Bouslamti
- Laboratories of Natural Substances, Pharmacology, Environment, Modeling, Health and Quality of Life (SNAMOPEQ), Faculty of Sciences, Sidi Mohamed Ben Abdellah University, Fez 30000, Morocco
| | - Hiba-Allah Nafidi
- Department of Food Science, Faculty of Agricultural and Food Sciences, Laval University, Quebec City, QC G1V 0A6, Canada
| | - Ahmad Mohammad Salamatullah
- Department of Food Science & Nutrition, College of Food and Agricultural Sciences, King Saud University, P.O. Box 2460, Riyadh 11451, Saudi Arabia
| | - Nezha Saidi
- Laboratory of Applied and Environmental Chemistry (LCAE), Faculty of Sciences, Mohammed First University, B.P. 717, Oujda 60000, Morocco
| | - Reda Bellaouchi
- Laboratory of Bioresources, Biotechnology, Ethnopharmacology and Health, Faculty of Sciences, Mohammed First University, Boulevard Mohamed VI, B.P. 717, Oujda 60000, Morocco
| | - Mohamed Addi
- Laboratoire d'Amélioration des Productions Agricoles, Biotechnologie et Environnement (LAPABE), Faculté des Sciences, Université Mohammed Premier, Oujda 60000, Morocco
| | - Mohamed Ramdani
- Laboratory of Applied and Environmental Chemistry (LCAE), Faculty of Sciences, Mohammed First University, B.P. 717, Oujda 60000, Morocco
| | - Mohammed Bourhia
- Department of Chemistry and Biochemistry, Faculty of Medicine and Pharmacy, Ibn Zohr University, Laayoune 70000, Morocco
| | - Belkheir Hammouti
- Laboratory of Applied and Environmental Chemistry (LCAE), Faculty of Sciences, Mohammed First University, B.P. 717, Oujda 60000, Morocco
| |
Collapse
|
5
|
Shimizu E, Ishikawa T, Tanji M, Agata N, Nakayama S, Nakahara Y, Yokoiwa R, Sato S, Hanyuda A, Ogawa Y, Hirayama M, Tsubota K, Sato Y, Shimazaki J, Negishi K. Artificial intelligence to estimate the tear film breakup time and diagnose dry eye disease. Sci Rep 2023; 13:5822. [PMID: 37037877 PMCID: PMC10085985 DOI: 10.1038/s41598-023-33021-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 04/06/2023] [Indexed: 04/12/2023] Open
Abstract
The use of artificial intelligence (AI) in the diagnosis of dry eye disease (DED) remains limited due to the lack of standardized image formats and analysis models. To overcome these issues, we used the Smart Eye Camera (SEC), a video-recordable slit-lamp device, and collected videos of the anterior segment of the eye. This study aimed to evaluate the accuracy of the AI algorithm in estimating the tear film breakup time and apply this model for the diagnosis of DED according to the Asia Dry Eye Society (ADES) DED diagnostic criteria. Using the retrospectively corrected DED videos of 158 eyes from 79 patients, 22,172 frames were annotated by the DED specialist to label whether or not the frame had breakup. The AI algorithm was developed using the training dataset and machine learning. The DED criteria of the ADES was used to determine the diagnostic performance. The accuracy of tear film breakup time estimation was 0.789 (95% confidence interval (CI) 0.769-0.809), and the area under the receiver operating characteristic curve of this AI model was 0.877 (95% CI 0.861-0.893). The sensitivity and specificity of this AI model for the diagnosis of DED was 0.778 (95% CI 0.572-0.912) and 0.857 (95% CI 0.564-0.866), respectively. We successfully developed a novel AI-based diagnostic model for DED. Our diagnostic model has the potential to enable ophthalmology examination outside hospitals and clinics.
Collapse
Affiliation(s)
- Eisuke Shimizu
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan.
- OUI Inc., DF Building 510, 2-2-8 Minami-Aoyama, Minato-ku, Tokyo, 107-0062, Japan.
- Yokohama Keiai Eye Clinic, Courtley House 2F, 1-11-17 Wada, Hodogaya-ku, Kanagawa, 240-0065, Japan.
| | - Toshiki Ishikawa
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
- OUI Inc., DF Building 510, 2-2-8 Minami-Aoyama, Minato-ku, Tokyo, 107-0062, Japan
| | - Makoto Tanji
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
- OUI Inc., DF Building 510, 2-2-8 Minami-Aoyama, Minato-ku, Tokyo, 107-0062, Japan
| | - Naomichi Agata
- OUI Inc., DF Building 510, 2-2-8 Minami-Aoyama, Minato-ku, Tokyo, 107-0062, Japan
| | - Shintaro Nakayama
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
- OUI Inc., DF Building 510, 2-2-8 Minami-Aoyama, Minato-ku, Tokyo, 107-0062, Japan
| | - Yo Nakahara
- OUI Inc., DF Building 510, 2-2-8 Minami-Aoyama, Minato-ku, Tokyo, 107-0062, Japan
| | - Ryota Yokoiwa
- OUI Inc., DF Building 510, 2-2-8 Minami-Aoyama, Minato-ku, Tokyo, 107-0062, Japan
| | - Shinri Sato
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
- Yokohama Keiai Eye Clinic, Courtley House 2F, 1-11-17 Wada, Hodogaya-ku, Kanagawa, 240-0065, Japan
| | - Akiko Hanyuda
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
| | - Yoko Ogawa
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
| | - Masatoshi Hirayama
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
| | - Kazuo Tsubota
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
| | - Yasunori Sato
- Department of Preventive Medicine and Public Health, School of Medicine, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
| | - Jun Shimazaki
- Department of Ophthalmology, Tokyo Dental College Ichikawa General Hospital, 5-11-13 Sugano, Ichikawa-shi, Chiba, 272-8513, Japan
| | - Kazuno Negishi
- Department of Ophthalmology, Keio University School of Medicine, 35 Shinanomachi, Shinjuku-ku, Tokyo, 160-8582, Japan
| |
Collapse
|
6
|
Schietgat L, Cuissart B, De Grave K, Efthymiadis K, Bureau R, Crémilleux B, Ramon J, Lepailleur A. Automated detection of toxicophores and prediction of mutagenicity using PMCSFG algorithm. Mol Inform 2023; 42:e2200232. [PMID: 36529710 DOI: 10.1002/minf.202200232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/13/2022] [Accepted: 12/18/2022] [Indexed: 12/23/2022]
Abstract
Maximum common substructures (MCS) have received a lot of attention in the chemoinformatics community. They are typically used as a similarity measure between molecules, showing high predictive performance when used in classification tasks, while being easily explainable substructures. In the present work, we applied the Pairwise Maximum Common Subgraph Feature Generation (PMCSFG) algorithm to automatically detect toxicophores (structural alerts) and to compute fingerprints based on MCS. We present a comparison between our MCS-based fingerprints and 12 well-known chemical fingerprints when used as features in machine learning models. We provide an experimental evaluation and discuss the usefulness of the different methods on mutagenicity data. The features generated by the MCS method have a state-of-the-art performance when predicting mutagenicity, while they are more interpretable than the traditional chemical fingerprints.
Collapse
Affiliation(s)
- Leander Schietgat
- Artificial Intelligence Lab, Vrije Universiteit Brussel, Brussel, Belgium.,Department of Computer Science, KU Leuven, Leuven, Belgium
| | - Bertrand Cuissart
- Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen, UNICAEN, ENSICAEN, CNRS - UMR GREYC, Normandie Univ., Caen, France
| | | | | | - Ronan Bureau
- Centre d'Etudes et de Recherche sur le Médicament de Normandie, UNICAEN, CERMN, Normandie Univ., Caen, France
| | - Bruno Crémilleux
- Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen, UNICAEN, ENSICAEN, CNRS - UMR GREYC, Normandie Univ., Caen, France
| | - Jan Ramon
- INRIA Lille Nord Europe, Lille, France
| | - Alban Lepailleur
- Centre d'Etudes et de Recherche sur le Médicament de Normandie, UNICAEN, CERMN, Normandie Univ., Caen, France
| |
Collapse
|
7
|
Cavasotto CN, Scardino V. Machine Learning Toxicity Prediction: Latest Advances by Toxicity End Point. ACS OMEGA 2022; 7:47536-47546. [PMID: 36591139 PMCID: PMC9798519 DOI: 10.1021/acsomega.2c05693] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 11/28/2022] [Indexed: 05/29/2023]
Abstract
Machine learning (ML) models to predict the toxicity of small molecules have garnered great attention and have become widely used in recent years. Computational toxicity prediction is particularly advantageous in the early stages of drug discovery in order to filter out molecules with high probability of failing in clinical trials. This has been helped by the increase in the number of large toxicology databases available. However, being an area of recent application, a greater understanding of the scope and applicability of ML methods is still necessary. There are various kinds of toxic end points that have been predicted in silico. Acute oral toxicity, hepatotoxicity, cardiotoxicity, mutagenicity, and the 12 Tox21 data end points are among the most commonly investigated. Machine learning methods exhibit different performances on different data sets due to dissimilar complexity, class distributions, or chemical space covered, which makes it hard to compare the performance of algorithms over different toxic end points. The general pipeline to predict toxicity using ML has already been analyzed in various reviews. In this contribution, we focus on the recent progress in the area and the outstanding challenges, making a detailed description of the state-of-the-art models implemented for each toxic end point. The type of molecular representation, the algorithm, and the evaluation metric used in each research work are explained and analyzed. A detailed description of end points that are usually predicted, their clinical relevance, the available databases, and the challenges they bring to the field are also highlighted.
Collapse
Affiliation(s)
- Claudio N. Cavasotto
- Computational
Drug Design and Biomedical Informatics Laboratory, Instituto de Investigaciones
en Medicina Traslacional (IIMT), CONICET-Universidad
Austral, Pilar, B1629AHJ Buenos Aires, Argentina
- Austral
Institute for Applied Artificial Intelligence, Universidad Austral, Pilar, B1629AHJ Buenos Aires, Argentina
- Facultad
de Ciencias Biomédicas, Facultad de Ingenierá, Universidad Austral, Pilar, B1630FHB Buenos
Aires, Argentina
| | - Valeria Scardino
- Austral
Institute for Applied Artificial Intelligence, Universidad Austral, Pilar, B1629AHJ Buenos Aires, Argentina
- Meton
AI, Inc., Wilmington, Delaware 19801, United
States
| |
Collapse
|
8
|
Ruiz P, Loizou G. Editorial: Application of computational tools to health and environmental sciences, Volume II. Front Pharmacol 2022; 13:1102431. [DOI: 10.3389/fphar.2022.1102431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 11/22/2022] [Indexed: 12/04/2022] Open
|
9
|
Liu J, Guo W, Dong F, Aungst J, Fitzpatrick S, Patterson TA, Hong H. Machine learning models for rat multigeneration reproductive toxicity prediction. Front Pharmacol 2022; 13:1018226. [PMID: 36238576 PMCID: PMC9552001 DOI: 10.3389/fphar.2022.1018226] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 09/09/2022] [Indexed: 11/13/2022] Open
Abstract
Reproductive toxicity is one of the prominent endpoints in the risk assessment of environmental and industrial chemicals. Due to the complexity of the reproductive system, traditional reproductive toxicity testing in animals, especially guideline multigeneration reproductive toxicity studies, take a long time and are expensive. Therefore, machine learning, as a promising alternative approach, should be considered when evaluating the reproductive toxicity of chemicals. We curated rat multigeneration reproductive toxicity testing data of 275 chemicals from ToxRefDB (Toxicity Reference Database) and developed predictive models using seven machine learning algorithms (decision tree, decision forest, random forest, k-nearest neighbors, support vector machine, linear discriminant analysis, and logistic regression). A consensus model was built based on the seven individual models. An external validation set was curated from the COSMOS database and the literature. The performances of individual and consensus models were evaluated using 500 iterations of 5-fold cross-validations and the external validation data set. The balanced accuracy of the models ranged from 58% to 65% in the 5-fold cross-validations and 45%–61% in the external validations. Prediction confidence analysis was conducted to provide additional information for more appropriate applications of the developed models. The impact of our findings is in increasing confidence in machine learning models. We demonstrate the importance of using consensus models for harnessing the benefits of multiple machine learning models (i.e., using redundant systems to check validity of outcomes). While we continue to build upon the models to better characterize weak toxicants, there is current utility in saving resources by being able to screen out strong reproductive toxicants before investing in vivo testing. The modeling approach (machine learning models) is offered for assessing the rat multigeneration reproductive toxicity of chemicals. Our results suggest that machine learning may be a promising alternative approach to evaluate the potential reproductive toxicity of chemicals.
Collapse
Affiliation(s)
- Jie Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | - Wenjing Guo
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | - Fan Dong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | - Jason Aungst
- Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, College Park, MD, United States
| | - Suzanne Fitzpatrick
- Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, College Park, MD, United States
| | - Tucker A. Patterson
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | - Huixiao Hong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
- *Correspondence: Huixiao Hong,
| |
Collapse
|
10
|
Abstract
Machine learning and artificial intelligence approaches have revolutionized multiple disciplines, including toxicology. This review summarizes representative recent applications of machine learning and artificial intelligence approaches in different areas of toxicology, including physiologically based pharmacokinetic (PBPK) modeling, quantitative structure-activity relationship modeling for toxicity prediction, adverse outcome pathway analysis, high-throughput screening, toxicogenomics, big data and toxicological databases. By leveraging machine learning and artificial intelligence approaches, now it is possible to develop PBPK models for hundreds of chemicals efficiently, to create in silico models to predict toxicity for a large number of chemicals with similar accuracies compared to in vivo animal experiments, and to analyze a large amount of different types of data (toxicogenomics, high-content image data, etc.) to generate new insights into toxicity mechanisms rapidly, which was impossible by manual approaches in the past. To continue advancing the field of toxicological sciences, several challenges should be considered: (1) not all machine learning models are equally useful for a particular type of toxicology data, and thus it is important to test different methods to determine the optimal approach; (2) current toxicity prediction is mainly on bioactivity classification (yes/no), so additional studies are needed to predict the intensity of effect or dose-response relationship; (3) as more data become available, it is crucial to perform rigorous data quality check and develop infrastructure to store, share, analyze, evaluate, and manage big data; and (4) it is important to convert machine learning models to user-friendly interfaces to facilitate their applications by both computational and bench scientists.
Collapse
Affiliation(s)
- Zhoumeng Lin
- Department of Environmental and Global Health, College of Public Health and Health Professions, University of Florida, Gainesville, FL, 32610, USA.,Center for Environmental and Human Toxicology, University of Florida, FL, 32608, USA
| | - Wei-Chun Chou
- Department of Environmental and Global Health, College of Public Health and Health Professions, University of Florida, Gainesville, FL, 32610, USA.,Center for Environmental and Human Toxicology, University of Florida, FL, 32608, USA
| |
Collapse
|
11
|
Lin Z, Chou WC, Cheng YH, He C, Monteiro-Riviere NA, Riviere JE. Predicting Nanoparticle Delivery to Tumors Using Machine Learning and Artificial Intelligence Approaches. Int J Nanomedicine 2022; 17:1365-1379. [PMID: 35360005 PMCID: PMC8961007 DOI: 10.2147/ijn.s344208] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 03/10/2022] [Indexed: 12/12/2022] Open
Abstract
Background Low delivery efficiency of nanoparticles (NPs) to the tumor is a critical barrier in the field of cancer nanomedicine. Strategies on how to improve NP tumor delivery efficiency remain to be determined. Methods This study analyzed the roles of NP physicochemical properties, tumor models, and cancer types in NP tumor delivery efficiency using multiple machine learning and artificial intelligence methods, using data from a recently published Nano-Tumor Database that contains 376 datasets generated from a physiologically based pharmacokinetic (PBPK) model. Results The deep neural network model adequately predicted the delivery efficiency of different NPs to different tumors and it outperformed all other machine learning methods; including random forest, support vector machine, linear regression, and bagged model methods. The adjusted determination coefficients (R2) in the full training dataset were 0.92, 0.77, 0.77 and 0.76 for the maximum delivery efficiency (DEmax), delivery efficiency at 24 h (DE24), at 168 h (DE168), and at the last sampling time (DETlast). The corresponding R2 values in the test dataset were 0.70, 0.46, 0.33 and 0.63, respectively. Also, this study showed that cancer type was an important determinant for the deep neural network model in predicting the tumor delivery efficiency across all endpoints (19-29%). Among all physicochemical properties, the Zeta potential and core material played a greater role than other properties, such as the type, shape, and targeting strategy. Conclusion This study provides a quantitative model to improve the design of cancer nanomedicine with greater tumor delivery efficiency. These results help to improve our understanding of the causes of low NP tumor delivery efficiency. This study demonstrates the feasibility of integrating artificial intelligence with PBPK modeling approaches to study cancer nanomedicine.
Collapse
Affiliation(s)
- Zhoumeng Lin
- Department of Environmental and Global Health, College of Public Health and Health Professions, University of Florida, Gainesville, FL, USA
- Center for Environmental and Human Toxicology, University of Florida, Gainesville, FL, USA
- Institute of Computational Comparative Medicine, Kansas State University, Manhattan, KS, USA
- Department of Anatomy and Physiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS, USA
| | - Wei-Chun Chou
- Department of Environmental and Global Health, College of Public Health and Health Professions, University of Florida, Gainesville, FL, USA
- Center for Environmental and Human Toxicology, University of Florida, Gainesville, FL, USA
- Institute of Computational Comparative Medicine, Kansas State University, Manhattan, KS, USA
- Department of Anatomy and Physiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS, USA
| | - Yi-Hsien Cheng
- Institute of Computational Comparative Medicine, Kansas State University, Manhattan, KS, USA
- Department of Anatomy and Physiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS, USA
| | - Chunla He
- Department of Biostatistics, College of Public Health and Health Professions, University of Florida, Gainesville, FL, USA
| | - Nancy A Monteiro-Riviere
- Nanotechnology Innovation Center of Kansas State, Kansas State University, Manhattan, KS, USA
- Center for Chemical Toxicology Research and Pharmacokinetics, North Carolina State University, Raleigh, NC, USA
| | - Jim E Riviere
- Center for Chemical Toxicology Research and Pharmacokinetics, North Carolina State University, Raleigh, NC, USA
- 1Data Consortium, Kansas State University, Olathe, KS, USA
| |
Collapse
|
12
|
Xu F, Wan C, Zhao L, Liu S, Hong J, Xiang Y, You Q, Zhou L, Li Z, Gong S, Zhu Y, Chen C, Zhang L, Gong Y, Li L, Li C, Zhang X, Guo C, Lai K, Huang C, Ting D, Lin H, Jin C. Predicting Post-Therapeutic Visual Acuity and OCT Images in Patients With Central Serous Chorioretinopathy by Artificial Intelligence. Front Bioeng Biotechnol 2021; 9:649221. [PMID: 34888298 PMCID: PMC8650495 DOI: 10.3389/fbioe.2021.649221] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 10/28/2021] [Indexed: 12/02/2022] Open
Abstract
To predict visual acuity (VA) and post-therapeutic optical coherence tomography (OCT) images 1, 3, and 6 months after laser treatment in patients with central serous chorioretinopathy (CSC) by artificial intelligence (AI). Real-world clinical and imaging data were collected at Zhongshan Ophthalmic Center (ZOC) and Xiamen Eye Center (XEC). The data obtained from ZOC (416 eyes of 401 patients) were used as the training set; the data obtained from XEC (64 eyes of 60 patients) were used as the test set. Six different machine learning algorithms and a blending algorithm were used to predict VA, and a pix2pixHD method was adopted to predict post-therapeutic OCT images in patients after laser treatment. The data for VA predictions included clinical features obtained from electronic medical records (20 features) and measured features obtained from fundus fluorescein angiography, indocyanine green angiography, and OCT (145 features). The data for OCT predictions included 480 pairs of pre- and post-therapeutic OCT images. The VA and OCT images predicted by AI were compared with the ground truth. In the VA predictions of XEC dataset, the mean absolute errors (MAEs) were 0.074–0.098 logMAR (within four to five letters), and the root mean square errors were 0.096–0.127 logMAR (within five to seven letters) for the 1-, 3-, and 6-month predictions, respectively; in the post-therapeutic OCT predictions, only about 5.15% (5 of 97) of synthetic OCT images could be accurately identified as synthetic images. The MAEs of central macular thickness of synthetic OCT images were 30.15 ± 13.28 μm and 22.46 ± 9.71 μm for the 1- and 3-month predictions, respectively. This is the first study to apply AI to predict VA and post-therapeutic OCT of patients with CSC. This work establishes a reliable method of predicting prognosis 6 months in advance; the application of AI has the potential to help reduce patient anxiety and serve as a reference for ophthalmologists when choosing optimal laser treatments.
Collapse
Affiliation(s)
- Fabao Xu
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Cheng Wan
- College of Electronic Information Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Lanqin Zhao
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Shaopeng Liu
- School of Computer Science, Guangdong Polytechnic Normal University, Guangzhou, China
| | - Jiaming Hong
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Yifan Xiang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Qijing You
- College of Electronic Information Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Lijun Zhou
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Zhongwen Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Songjian Gong
- Xiamen Eye Center, Affiliated with Xiamen University, Xiamen, China
| | - Yi Zhu
- Department of Molecular and Cellular Pharmacology, University of Miami Miller School, Miami, FL, United States
| | - Chuan Chen
- Department of Molecular and Cellular Pharmacology, University of Miami Miller School, Miami, FL, United States
| | - Li Zhang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China.,Department of Ophthalmology, The Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Yajun Gong
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Longhui Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Cong Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Xiayin Zhang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Chong Guo
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Kunbei Lai
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Chuangxin Huang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Daniel Ting
- Singapore National Eye Center, Department of Ophthalmology, Singapore, Singapore
| | - Haotian Lin
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China.,Center of Precision Medicine, Sun Yat-sen University, Guangzhou, China
| | - Chenjin Jin
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
13
|
Green AJ, Mohlenkamp MJ, Das J, Chaudhari M, Truong L, Tanguay RL, Reif DM. Leveraging high-throughput screening data, deep neural networks, and conditional generative adversarial networks to advance predictive toxicology. PLoS Comput Biol 2021; 17:e1009135. [PMID: 34214078 PMCID: PMC8301607 DOI: 10.1371/journal.pcbi.1009135] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 07/23/2021] [Accepted: 05/31/2021] [Indexed: 12/01/2022] Open
Abstract
There are currently 85,000 chemicals registered with the Environmental Protection Agency (EPA) under the Toxic Substances Control Act, but only a small fraction have measured toxicological data. To address this gap, high-throughput screening (HTS) and computational methods are vital. As part of one such HTS effort, embryonic zebrafish were used to examine a suite of morphological and mortality endpoints at six concentrations from over 1,000 unique chemicals found in the ToxCast library (phase 1 and 2). We hypothesized that by using a conditional generative adversarial network (cGAN) or deep neural networks (DNN), and leveraging this large set of toxicity data we could efficiently predict toxic outcomes of untested chemicals. Utilizing a novel method in this space, we converted the 3D structural information into a weighted set of points while retaining all information about the structure. In vivo toxicity and chemical data were used to train two neural network generators. The first was a DNN (Go-ZT) while the second utilized cGAN architecture (GAN-ZT) to train generators to produce toxicity data. Our results showed that Go-ZT significantly outperformed the cGAN, support vector machine, random forest and multilayer perceptron models in cross-validation, and when tested against an external test dataset. By combining both Go-ZT and GAN-ZT, our consensus model improved the SE, SP, PPV, and Kappa, to 71.4%, 95.9%, 71.4% and 0.673, respectively, resulting in an area under the receiver operating characteristic (AUROC) of 0.837. Considering their potential use as prescreening tools, these models could provide in vivo toxicity predictions and insight into the hundreds of thousands of untested chemicals to prioritize compounds for HT testing.
Collapse
Affiliation(s)
- Adrian J. Green
- Department of Biological Sciences, and the Bioinformatics Research Center, NC State University, Raleigh, North Carolina, United States of America
| | - Martin J. Mohlenkamp
- Department of Mathematics, Ohio University, Athens, Ohio, United States of America
| | - Jhuma Das
- Marsico Lung Institute, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Meenal Chaudhari
- Department of Computational Science and Engineering, North Carolina A&T State University, Greensboro, North Carolina, United States of America
| | - Lisa Truong
- Department of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, United States of America
| | - Robyn L. Tanguay
- Department of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, United States of America
| | - David M. Reif
- Department of Biological Sciences, and the Bioinformatics Research Center, NC State University, Raleigh, North Carolina, United States of America
| |
Collapse
|
14
|
GPCR_LigandClassify.py; a rigorous machine learning classifier for GPCR targeting compounds. Sci Rep 2021; 11:9510. [PMID: 33947911 PMCID: PMC8097070 DOI: 10.1038/s41598-021-88939-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Accepted: 04/12/2021] [Indexed: 02/02/2023] Open
Abstract
The current study describes the construction of various ligand-based machine learning models to be used for drug-repurposing against the family of G-Protein Coupled Receptors (GPCRs). In building these models, we collected > 500,000 data points, encompassing experimentally measured molecular association data of > 160,000 unique ligands against > 250 GPCRs. These data points were retrieved from the GPCR-Ligand Association (GLASS) database. We have used diverse molecular featurization methods to describe the input molecules. Multiple supervised ML algorithms were developed, tested and compared for their accuracy, F scores, as well as for their Matthews' correlation coefficient scores (MCC). Our data suggest that combined with molecular fingerprinting, ensemble decision trees and gradient boosted trees ML algorithms are on the accuracy border of the rather sophisticated deep neural nets (DNNs)-based algorithms. On a test dataset, these models displayed an excellent performance, reaching a ~ 90% classification accuracy. Additionally, we showcase a few examples where our models were able to identify interesting connections between known drugs from the Drug-Bank database and members of the GPCR family of receptors. Our findings are in excellent agreement with previously reported experimental observations in the literature. We hope the models presented in this paper synergize with the currently ongoing interest of applying machine learning modeling in the field of drug repurposing and computational drug discovery in general.
Collapse
|
15
|
Hastings J, Glauer M, Memariani A, Neuhaus F, Mossakowski T. Learning chemistry: exploring the suitability of machine learning for the task of structure-based chemical ontology classification. J Cheminform 2021; 13:23. [PMID: 33726837 PMCID: PMC7962259 DOI: 10.1186/s13321-021-00500-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Accepted: 02/26/2021] [Indexed: 12/22/2022] Open
Abstract
Chemical data is increasingly openly available in databases such as PubChem, which contains approximately 110 million compound entries as of February 2021. With the availability of data at such scale, the burden has shifted to organisation, analysis and interpretation. Chemical ontologies provide structured classifications of chemical entities that can be used for navigation and filtering of the large chemical space. ChEBI is a prominent example of a chemical ontology, widely used in life science contexts. However, ChEBI is manually maintained and as such cannot easily scale to the full scope of public chemical data. There is a need for tools that are able to automatically classify chemical data into chemical ontologies, which can be framed as a hierarchical multi-class classification problem. In this paper we evaluate machine learning approaches for this task, comparing different learning frameworks including logistic regression, decision trees and long short-term memory artificial neural networks, and different encoding approaches for the chemical structures, including cheminformatics fingerprints and character-based encoding from chemical line notation representations. We find that classical learning approaches such as logistic regression perform well with sets of relatively specific, disjoint chemical classes, while the neural network is able to handle larger sets of overlapping classes but needs more examples per class to learn from, and is not able to make a class prediction for every molecule. Future work will explore hybrid and ensemble approaches, as well as alternative network architectures including neuro-symbolic approaches.
Collapse
Affiliation(s)
- Janna Hastings
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| | - Martin Glauer
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| | - Adel Memariani
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| | - Fabian Neuhaus
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| | - Till Mossakowski
- Department of Computer Science, Otto-von-Guericke University of Magdeburg, Magdeburg, Germany
| |
Collapse
|
16
|
Xu F, Xiang Y, Wan C, You Q, Zhou L, Li C, Gong S, Gong Y, Li L, Li Z, Zhang L, Zhang X, Guo C, Lai K, Huang C, Zhao H, Jin C, Lin H. Predicting subretinal fluid absorption with machine learning in patients with central serous chorioretinopathy. ANNALS OF TRANSLATIONAL MEDICINE 2021; 9:242. [PMID: 33708869 PMCID: PMC7940879 DOI: 10.21037/atm-20-1519] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Background Machine learning was used to predict subretinal fluid absorption (SFA) at 1, 3 and 6 months after laser treatment in patients with central serous chorioretinopathy (CSC). Methods The clinical and imaging data from 480 eyes of 461 patients with CSC were collected at Zhongshan Ophthalmic Center (ZOC) and Xiamen Eye Center (XEC). The data included clinical features from electronic medical records and measured features from fundus fluorescein angiography (FFA), indocyanine green angiography (ICGA), optical coherence tomography angiography (OCTA), and optical coherence tomography (OCT). A ZOC dataset was used for training and internal validation. An XEC dataset was used for external validation. Six machine learning algorithms and a blending algorithm were trained to predict SFA in patients with CSC after laser treatment. The SFA results predicted by machine learning were compared with the actual patient prognoses. Based on the initial detailed investigation, we constructed a simplified model using fewer clinical features and OCT features for convenient application. Results During the internal validation, random forest performed best in SFA prediction, with accuracies of 0.651±0.068, 0.753±0.065 and 0.818±0.058 at 1, 3 and 6 months, respectively. In the external validation, XGBoost performed best at SFA prediction with accuracies of 0.734, 0.727, and 0.900 at 1, 3 and 6 months, respectively. The simplified model showed a comparable level of predictive power. Conclusions Machine learning can achieve high accuracy in long-term SFA predictions and identify the features relevant to CSC patients’ prognoses. Our study provides an individualized reference for ophthalmologists to treat and create a follow-up schedule for CSC patients.
Collapse
Affiliation(s)
- Fabao Xu
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Yifan Xiang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Cheng Wan
- Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Qijing You
- Nanjing University of Aeronautics and Astronautics, Nanjing, China
| | - Lijun Zhou
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Cong Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Songjian Gong
- Xiamen Eye Center, Affiliated to Xiamen University, Xiamen, China
| | - Yajun Gong
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Longhui Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Zhongwen Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Li Zhang
- Department of Ophthalmology, The Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Xiayin Zhang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Chong Guo
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Kunbei Lai
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Chuangxin Huang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Hongkun Zhao
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Chenjin Jin
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China
| | - Haotian Lin
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-Sen University, Guangzhou, China.,Center of Precision Medicine, Sun Yat-Sen University, Guangzhou, China
| |
Collapse
|
17
|
Schmidt F. Computational Toxicology. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11534-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|
18
|
Wang MWH, Goodman JM, Allen TEH. Machine Learning in Predictive Toxicology: Recent Applications and Future Directions for Classification Models. Chem Res Toxicol 2020; 34:217-239. [PMID: 33356168 DOI: 10.1021/acs.chemrestox.0c00316] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
In recent times, machine learning has become increasingly prominent in predictive toxicology as it has shifted from in vivo studies toward in silico studies. Currently, in vitro methods together with other computational methods such as quantitative structure-activity relationship modeling and absorption, distribution, metabolism, and excretion calculations are being used. An overview of machine learning and its applications in predictive toxicology is presented here, including support vector machines (SVMs), random forest (RF) and decision trees (DTs), neural networks, regression models, naïve Bayes, k-nearest neighbors, and ensemble learning. The recent successes of these machine learning methods in predictive toxicology are summarized, and a comparison of some models used in predictive toxicology is presented. In predictive toxicology, SVMs, RF, and DTs are the dominant machine learning methods due to the characteristics of the data available. Lastly, this review describes the current challenges facing the use of machine learning in predictive toxicology and offers insights into the possible areas of improvement in the field.
Collapse
Affiliation(s)
- Marcus W H Wang
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Jonathan M Goodman
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Timothy E H Allen
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom.,MRC Toxicology Unit, University of Cambridge, Hodgkin Building, Lancaster Road, Leicester LE1 7HB, United Kingdom
| |
Collapse
|
19
|
Development of In Vitro Corneal Models: Opportunity for Pharmacological Testing. Methods Protoc 2020; 3:mps3040074. [PMID: 33147693 PMCID: PMC7711486 DOI: 10.3390/mps3040074] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Accepted: 10/30/2020] [Indexed: 12/12/2022] Open
Abstract
The human eye is a specialized organ with a complex anatomy and physiology, because it is characterized by different cell types with specific physiological functions. Given the complexity of the eye, ocular tissues are finely organized and orchestrated. In the last few years, many in vitro models have been developed in order to meet the 3Rs principle (Replacement, Reduction and Refinement) for eye toxicity testing. This procedure is highly necessary to ensure that the risks associated with ophthalmic products meet appropriate safety criteria. In vitro preclinical testing is now a well-established practice of significant importance for evaluating the efficacy and safety of cosmetic, pharmaceutical, and nutraceutical products. Along with in vitro testing, also computational procedures, herein described, for evaluating the pharmacological profile of potential ocular drug candidates including their toxicity, are in rapid expansion. In this review, the ocular cell types and functionality are described, providing an overview about the scientific challenge for the development of three-dimensional (3D) in vitro models.
Collapse
|
20
|
Papadiamantis AG, Jänes J, Voyiatzis E, Sikk L, Burk J, Burk P, Tsoumanis A, Ha MK, Yoon TH, Valsami-Jones E, Lynch I, Melagraki G, Tämm K, Afantitis A. Predicting Cytotoxicity of Metal Oxide Nanoparticles using Isalos Analytics Platform. NANOMATERIALS (BASEL, SWITZERLAND) 2020; 10:E2017. [PMID: 33066094 PMCID: PMC7601995 DOI: 10.3390/nano10102017] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2020] [Revised: 10/03/2020] [Accepted: 10/07/2020] [Indexed: 02/07/2023]
Abstract
A literature curated dataset containing 24 distinct metal oxide (MexOy) nanoparticles (NPs), including 15 physicochemical, structural and assay-related descriptors, was enriched with 62 atomistic computational descriptors and exploited to produce a robust and validated in silico model for prediction of NP cytotoxicity. The model can be used to predict the cytotoxicity (cell viability) of MexOy NPs based on the colorimetric lactate dehydrogenase (LDH) assay and the luminometric adenosine triphosphate (ATP) assay, both of which quantify irreversible cell membrane damage. Out of the 77 total descriptors used, 7 were identified as being significant for induction of cytotoxicity by MexOy NPs. These were NP core size, hydrodynamic size, assay type, exposure dose, the energy of the MexOy conduction band (EC), the coordination number of the metal atoms on the NP surface (Avg. C.N. Me atoms surface) and the average force vector surface normal component of all metal atoms (v⟂ Me atoms surface). The significance and effect of these descriptors is discussed to demonstrate their direct correlation with cytotoxicity. The produced model has been made publicly available by the Horizon 2020 (H2020) NanoSolveIT project and will be added to the project's Integrated Approach to Testing and Assessment (IATA).
Collapse
Affiliation(s)
- Anastasios G. Papadiamantis
- NovaMechanics Ltd., Nicosia 1065, Cyprus; (A.G.P.); (E.V.); (A.T.)
- School of Geography, Earth and Environmental Sciences, University of Birmingham, Birmingham B15 2TT, UK; (E.V.-J.); (I.L.)
| | - Jaak Jänes
- Institute of Chemistry, University of Tartu, 50411 Tartu, Estonia; (J.J.); (L.S.); (J.B.); (P.B.)
| | | | - Lauri Sikk
- Institute of Chemistry, University of Tartu, 50411 Tartu, Estonia; (J.J.); (L.S.); (J.B.); (P.B.)
| | - Jaanus Burk
- Institute of Chemistry, University of Tartu, 50411 Tartu, Estonia; (J.J.); (L.S.); (J.B.); (P.B.)
| | - Peeter Burk
- Institute of Chemistry, University of Tartu, 50411 Tartu, Estonia; (J.J.); (L.S.); (J.B.); (P.B.)
| | | | - My Kieu Ha
- Department of Chemistry, College of Natural Sciences, Hanyang University, Seoul 04763, Korea; (M.K.H.); (T.H.Y.)
| | - Tae Hyun Yoon
- Department of Chemistry, College of Natural Sciences, Hanyang University, Seoul 04763, Korea; (M.K.H.); (T.H.Y.)
- Institute of Next Generation Material Design, Hanyang University, Seoul 04763, Korea
| | - Eugenia Valsami-Jones
- School of Geography, Earth and Environmental Sciences, University of Birmingham, Birmingham B15 2TT, UK; (E.V.-J.); (I.L.)
| | - Iseult Lynch
- School of Geography, Earth and Environmental Sciences, University of Birmingham, Birmingham B15 2TT, UK; (E.V.-J.); (I.L.)
| | - Georgia Melagraki
- Division of Physical Sciences and Applications, Hellenic Military Academy, 16672 Vari, Greece;
| | - Kaido Tämm
- Institute of Chemistry, University of Tartu, 50411 Tartu, Estonia; (J.J.); (L.S.); (J.B.); (P.B.)
| | | |
Collapse
|
21
|
Jenkinson S, Schmidt F, Rosenbrier Ribeiro L, Delaunois A, Valentin JP. A practical guide to secondary pharmacology in drug discovery. J Pharmacol Toxicol Methods 2020; 105:106869. [PMID: 32302774 DOI: 10.1016/j.vascn.2020.106869] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 03/21/2020] [Accepted: 04/03/2020] [Indexed: 01/29/2023]
Abstract
Secondary pharmacological profiling is increasingly applied in pharmaceutical drug discovery to address unwanted pharmacological side effects of drug candidates before entering the clinic. Regulators, drug makers and patients share a demand for deep characterization of secondary pharmacology effects of novel drugs and their metabolites. The scope of such profiling has therefore expanded substantially in the past two decades, leading to the implementation of broad in silico profiling methods and focused in vitro off-target screening panels, to identify liabilities, but also opportunities, as early as possible. The pharmaceutical industry applies such panels at all stages of drug discovery routinely up to early development. Nevertheless, target composition, screening technologies, assay formats, interpretation and scheduling of panels can vary significantly between companies in the absence of dedicated guidelines. To contribute towards best practices in secondary pharmacology profiling, this review aims to summarize the state-of-the art in this field. Considerations are discussed with respect to panel design, screening strategy, implementation and interpretation of the data, including regulatory perspectives. The cascaded, or integrated, use of in silico and off-target profiling allows to exploit synergies for comprehensive safety assessment of drug candidates.
Collapse
Affiliation(s)
- Stephen Jenkinson
- Drug Safety Research and Development, Pfizer Inc., La Jolla, CA 92121, United States of America.
| | - Friedemann Schmidt
- Sanofi, R&D Preclinical Safety, Industriepark Höchst, 65926 Frankfurt/Main, Germany
| | - Lyn Rosenbrier Ribeiro
- Medicines Discovery Catapult, Block 35, Mereside, Alderley Park, Alderley Edge, SK10 4TG, United Kingdom
| | - Annie Delaunois
- UCB BioPharma SRL, Early Solutions, Development Science, Non-Clinical Safety, 1420 Braine L'Alleud, Walloon Region, Belgium
| | - Jean-Pierre Valentin
- UCB BioPharma SRL, Early Solutions, Development Science, Non-Clinical Safety, 1420 Braine L'Alleud, Walloon Region, Belgium
| |
Collapse
|
22
|
Xu T, Ngan DK, Ye L, Xia M, Xie HQ, Zhao B, Simeonov A, Huang R. Predictive Models for Human Organ Toxicity Based on In Vitro Bioactivity Data and Chemical Structure. Chem Res Toxicol 2020; 33:731-741. [PMID: 32077278 PMCID: PMC10926239 DOI: 10.1021/acs.chemrestox.9b00305] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Traditional toxicity testing reliant on animal models is costly and low throughput, posing a significant challenge with the increasing numbers of chemicals that humans are exposed to in the environment. The purpose of this investigation was to build optimal prediction models for various human in vivo/organ-level toxicity end points (extracted from ChemIDPlus) using chemical structure and Tox21 in vitro quantitative high-throughput screening (qHTS) bioactivity assay data. Several supervised machine learning algorithms were applied to model 14 human toxicity end points pertaining to vascular, kidney, ureter and bladder, and liver organ systems. Three metrics were used to evaluate model performance: area under the receiver operating characteristic curve (AUC-ROC), balanced accuracy (BA), and Matthews correlation coefficient (MCC). The top four models, with AUC-ROC values >0.8, were derived for endocrine (0.90 ± 0.00), musculoskeletal (0.88 ± 0.02), peripheral nerve and sensation (0.85 ± 0.01), and brain and coverings (0.83 ± 0.02) toxicities, whereas the best model AUC-ROC values were >0.7 for the remaining 10 toxicities. Model performance was found to be dependent on the specific data set, model type, and feature selection method used. In addition, chemical structure and assay data showed different levels of contribution to the prediction of different toxicity end points. Although in vitro assay data, when combined with chemical structure, slightly improved the predictive accuracy for most end points (11 out of 14), a noteworthy finding was the near equal success of the structure-only models, which do not require Tox21 qHTS screening data, and the relatively poor performance of assay-only models. Thus, the top-performing structure-only models from this study could be applied for hazard screening of large sets of chemicals for potential human toxicity, whereas the largest assay contributions to models (i.e., cellular targets) could be used, along with the top-contributing structural features, to provide insight into toxicity mechanisms.
Collapse
Affiliation(s)
- Tuan Xu
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Deborah K. Ngan
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Lin Ye
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Menghang Xia
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Heidi Q. Xie
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center of Eco-Environment Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Bin Zhao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center of Eco-Environment Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Anton Simeonov
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Ruili Huang
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| |
Collapse
|
23
|
Matsuzaka Y, Hosaka T, Ogaito A, Yoshinari K, Uesawa Y. Prediction Model of Aryl Hydrocarbon Receptor Activation by a Novel QSAR Approach, DeepSnap-Deep Learning. Molecules 2020; 25:molecules25061317. [PMID: 32183141 PMCID: PMC7144728 DOI: 10.3390/molecules25061317] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 03/05/2020] [Accepted: 03/09/2020] [Indexed: 12/31/2022] Open
Abstract
The aryl hydrocarbon receptor (AhR) is a ligand-dependent transcription factor that senses environmental exogenous and endogenous ligands or xenobiotic chemicals. In particular, exposure of the liver to environmental metabolism-disrupting chemicals contributes to the development and propagation of steatosis and hepatotoxicity. However, the mechanisms for AhR-induced hepatotoxicity and tumor propagation in the liver remain to be revealed, due to the wide variety of AhR ligands. Recently, quantitative structure–activity relationship (QSAR) analysis using deep neural network (DNN) has shown superior performance for the prediction of chemical compounds. Therefore, this study proposes a novel QSAR analysis using deep learning (DL), called the DeepSnap–DL method, to construct prediction models of chemical activation of AhR. Compared with conventional machine learning (ML) techniques, such as the random forest, XGBoost, LightGBM, and CatBoost, the proposed method achieves high-performance prediction of AhR activation. Thus, the DeepSnap–DL method may be considered a useful tool for achieving high-throughput in silico evaluation of AhR-induced hepatotoxicity.
Collapse
Affiliation(s)
- Yasunari Matsuzaka
- Department of Medical Molecular Informatics, Meiji Pharmaceutical University, 204-8588 Tokyo, Japan;
| | - Takuomi Hosaka
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka 422-8529, Japan; (T.H.); (A.O.); (K.Y.)
| | - Anna Ogaito
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka 422-8529, Japan; (T.H.); (A.O.); (K.Y.)
| | - Kouichi Yoshinari
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka 422-8529, Japan; (T.H.); (A.O.); (K.Y.)
| | - Yoshihiro Uesawa
- Department of Medical Molecular Informatics, Meiji Pharmaceutical University, 204-8588 Tokyo, Japan;
- Correspondence:
| |
Collapse
|
24
|
Talevi A, Morales JF, Hather G, Podichetty JT, Kim S, Bloomingdale PC, Kim S, Burton J, Brown JD, Winterstein AG, Schmidt S, White JK, Conrado DJ. Machine Learning in Drug Discovery and Development Part 1: A Primer. CPT-PHARMACOMETRICS & SYSTEMS PHARMACOLOGY 2020; 9:129-142. [PMID: 31905263 PMCID: PMC7080529 DOI: 10.1002/psp4.12491] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Accepted: 12/10/2019] [Indexed: 01/13/2023]
Abstract
Artificial intelligence, in particular machine learning (ML), has emerged as a key promising pillar to overcome the high failure rate in drug development. Here, we present a primer on the ML algorithms most commonly used in drug discovery and development. We also list possible data sources, describe good practices for ML model development and validation, and share a reproducible example. A companion article will summarize applications of ML in drug discovery, drug development, and postapproval phase.
Collapse
Affiliation(s)
- Alan Talevi
- Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Faculty of Exact Sciences, National University of La Plata (UNLP), Buenos Aires, Argentina
| | - Juan Francisco Morales
- Laboratorio de Investigación y Desarrollo de Bioactivos (LIDeB), Faculty of Exact Sciences, National University of La Plata (UNLP), Buenos Aires, Argentina
| | - Gregory Hather
- Statistical & Quantitative Sciences, Takeda Pharmaceuticals, Cambridge, Massachusetts, USA
| | | | - Sarah Kim
- Center for Pharmacometrics and Systems Pharmacology, Department of Pharmaceutics, College of Pharmacy, University of Florida, Orlando, Florida, USA
| | - Peter C Bloomingdale
- Quantitative Pharmacology and Pharmacometrics, Merck & Co. Inc, Kenilworth, New Jersey, USA
| | | | - Jackson Burton
- Quantitative Medicine, Critical Path Institute, Tucson, Arizona, USA
| | - Joshua D Brown
- Center for Drug Evaluation and Safety, Department of Pharmaceutical Outcomes and Policy, College of Pharmacy, University of Florida, Gainesville, Florida, USA
| | - Almut G Winterstein
- Center for Drug Evaluation and Safety, Department of Pharmaceutical Outcomes and Policy, College of Pharmacy, University of Florida, Gainesville, Florida, USA
| | - Stephan Schmidt
- Center for Pharmacometrics and Systems Pharmacology, Department of Pharmaceutics, College of Pharmacy, University of Florida, Orlando, Florida, USA
| | - Jensen Kael White
- Quantitative Medicine, Critical Path Institute, Tucson, Arizona, USA
| | | |
Collapse
|
25
|
Pereira VR, Pereira DR, de Melo Tavares Vieira KC, Ribas VP, Constantino CJL, Antunes PA, Favareto APA. Sperm quality of rats exposed to difenoconazole using classical parameters and surface-enhanced Raman scattering: classification performance by machine learning methods. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2019; 26:35253-35265. [PMID: 31701422 DOI: 10.1007/s11356-019-06407-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Accepted: 09/03/2019] [Indexed: 05/23/2023]
Abstract
Difenoconazole is a fungicide extensively used in agriculture. The aim of this study was to evaluate the effects of difenoconazole fungicide on the sperm quality of rats. Wistar rats were divided into four groups: control and exposed to 5 (D5), 10 (D10), or 50 mg-1 kg bw-1day (D50) of difenoconazole for 30 days, by gavage. Classical sperm parameters and surface-enhanced Raman scattering (SERS) were performed. Progressive motility, acrosomal integrity, and percentage of morphologically normal spermatozoa were reduced in the D10 and D50 groups in comparison with the control group. Sperm viability was reduced only in the D50 group. Sperm number in the testis and caput/corpus epididymis and daily sperm production were reduced in the three exposed groups. SERS measurements showed changes in the spectra of spermatozoa from D50 group, suggesting DNA damage. In addition, machine learning (ML) methods were used to evaluate the performance of three classification algorithms (artificial neural network-ANN, K-nearest neighbors-K-NN, and support vector machine-SVM) in the identification task of the groups exposed to difenoconazole. The results obtained by ML algorithms were very promising with accuracy ≥ 90% and validated the hypothesis of the exposure to difenoconazole reduces sperm quality. In conclusion, exposure of rats to different doses of the fungicide difenoconazole may impair sperm quality, with a recognizable classification pattern of exposure groups.
Collapse
Affiliation(s)
- Viviane Ribas Pereira
- Graduate Program in Environment and Regional Development, University of Western São Paulo - UNOESTE, Presidente Prudente, SP, CEP. 19.067-175, Brazil
| | - Danillo Roberto Pereira
- Graduate Program in Environment and Regional Development, University of Western São Paulo - UNOESTE, Presidente Prudente, SP, CEP. 19.067-175, Brazil
| | - Kátia Cristina de Melo Tavares Vieira
- Graduate Program in Environment and Regional Development, University of Western São Paulo - UNOESTE, Presidente Prudente, SP, CEP. 19.067-175, Brazil
| | - Vitor Pereira Ribas
- College of Science, Letters and Education from Presidente Prudente - FACLEPP, University of Western São Paulo - UNOESTE, Presidente Prudente, SP, CEP. 19.067-175, Brazil
| | - Carlos José Leopoldo Constantino
- School of Technology and Applied Sciences, São Paulo State University (UNESP), Campus Presidente Prudente, Presidente Prudente, SP, Brazil
| | - Patrícia Alexandra Antunes
- College of Science, Letters and Education from Presidente Prudente - FACLEPP, University of Western São Paulo - UNOESTE, Presidente Prudente, SP, CEP. 19.067-175, Brazil
| | - Ana Paula Alves Favareto
- Graduate Program in Environment and Regional Development, University of Western São Paulo - UNOESTE, Presidente Prudente, SP, CEP. 19.067-175, Brazil.
| |
Collapse
|
26
|
Barbosa Silva Cavalcanti A, Costa Barros RP, Costa VCDO, Sobral da Silva M, Fechine Tavares J, Scotti L, Scotti MT. Computer-Aided Chemotaxonomy and Bioprospecting Study of Diterpenes of the Lamiaceae Family. Molecules 2019; 24:molecules24213908. [PMID: 31671588 PMCID: PMC6864738 DOI: 10.3390/molecules24213908] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Revised: 10/25/2019] [Accepted: 10/27/2019] [Indexed: 01/12/2023] Open
Abstract
Lamiaceae is one of the largest families of angiosperms and is classified into 12 subfamilies that are composed of 295 genera and 7775 species. It presents a variety of secondary metabolites such as diterpenes that are commonly found in their species, and some of them are known to be chemotaxonomic markers. The aim of this work was to construct a database of diterpenes and to use it to perform a chemotaxonomic analysis among the subfamilies of Lamiaceae, using molecular descriptors and self-organizing maps (SOMs). The 4115 different diterpenes corresponding to 6386 botanical occurrences, which are distributed in eight subfamilies, 66 genera, 639 different species and 4880 geographical locations, were added to SistematX. Molecular descriptors of diterpenes and their respective botanical occurrences were used to generate the SOMs. In all obtained maps, a match rate higher than 80% was observed, demonstrating a separation of the Lamiaceae subfamilies, corroborating with the morphological and molecular data proposed by Li et al. Therefore, through this chemotaxonomic study, we can predict the localization of a diterpene in a subfamily and assist in the search for secondary metabolites with specific structural characteristics, such as compounds with potential biological activity.
Collapse
Affiliation(s)
- Andreza Barbosa Silva Cavalcanti
- Post-Graduate Program in Natural Synthetic Bioactive Products, Federal University of Paraiba, João Pessoa 58051-900, Paraíba, Brazil.
| | - Renata Priscila Costa Barros
- Post-Graduate Program in Natural Synthetic Bioactive Products, Federal University of Paraiba, João Pessoa 58051-900, Paraíba, Brazil.
| | - Vicente Carlos de Oliveira Costa
- Post-Graduate Program in Natural Synthetic Bioactive Products, Federal University of Paraiba, João Pessoa 58051-900, Paraíba, Brazil.
| | - Marcelo Sobral da Silva
- Post-Graduate Program in Natural Synthetic Bioactive Products, Federal University of Paraiba, João Pessoa 58051-900, Paraíba, Brazil.
| | - Josean Fechine Tavares
- Post-Graduate Program in Natural Synthetic Bioactive Products, Federal University of Paraiba, João Pessoa 58051-900, Paraíba, Brazil.
| | - Luciana Scotti
- Post-Graduate Program in Natural Synthetic Bioactive Products, Federal University of Paraiba, João Pessoa 58051-900, Paraíba, Brazil.
| | - Marcus Tullius Scotti
- Post-Graduate Program in Natural Synthetic Bioactive Products, Federal University of Paraiba, João Pessoa 58051-900, Paraíba, Brazil.
| |
Collapse
|
27
|
Sosnin S, Karlov D, Tetko IV, Fedorov MV. Comparative Study of Multitask Toxicity Modeling on a Broad Chemical Space. J Chem Inf Model 2019; 59:1062-1072. [PMID: 30589269 DOI: 10.1021/acs.jcim.8b00685] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Acute toxicity is one of the most challenging properties to predict purely with computational methods due to its direct relationship to biological interactions. Moreover, toxicity can be represented by different end points: it can be measured for different species using different types of administration, etc., and it is questionable if the knowledge transfer between end points is possible. We performed a comparative study of prediction multitask toxicity for a broad chemical space using different descriptors and modeling algorithms and applied multitask learning for a large toxicity data set extracted from the Registry of Toxic Effects of Chemical Substances (RTECS). We demonstrated that multitask modeling provides significant improvement over single-output models and other machine learning methods. Our research reveals that multitask learning can be very useful to improve the quality of acute toxicity modeling and raises a discussion about the usage of multitask approaches for regulation purposes. Our MultiTox models are freely available in OCHEM platform ( ochem.eu/multitox ) under CC-BY-NC license.
Collapse
Affiliation(s)
- Sergey Sosnin
- Skolkovo Institute of Science and Technology , Skolkovo Innovation Center , Moscow 143026 , Russia
| | - Dmitry Karlov
- Skolkovo Institute of Science and Technology , Skolkovo Innovation Center , Moscow 143026 , Russia
| | - Igor V Tetko
- Helmholtz Zentrum München-Research Center for Environmental Health (GmbH) , Institute of Structural Biology and BIGCHEM GmbH , Ingolstädter Landstraße 1 , D-85764 Neuherberg , Germany
| | - Maxim V Fedorov
- Skolkovo Institute of Science and Technology , Skolkovo Innovation Center , Moscow 143026 , Russia.,University of Strathclyde , Department of Physics , John Anderson Building, 107 Rottenrow East , Glasgow , U.K. G40NG
| |
Collapse
|
28
|
Chierici M, Giulini M, Bussola N, Jurman G, Furlanello C. Machine learning models for predicting endocrine disruption potential of environmental chemicals. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART C, ENVIRONMENTAL CARCINOGENESIS & ECOTOXICOLOGY REVIEWS 2019; 36:237-251. [PMID: 30628533 DOI: 10.1080/10590501.2018.1537155] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
We introduce here ML4Tox, a framework offering Deep Learning and Support Vector Machine models to predict agonist, antagonist, and binding activities of chemical compounds, in this case for the estrogen receptor ligand-binding domain. The ML4Tox models have been developed with a 10 × 5-fold cross-validation schema on the training portion of the CERAPP ToxCast dataset, formed by 1677 chemicals, each described by 777 molecular features. On the CERAPP "All Literature" evaluation set (agonist: 6319 compounds; antagonist 6539; binding 7283), ML4Tox significantly improved sensitivity over published results on all three tasks, with agonist: 0.78 vs 0.56; antagonist: 0.69 vs 0.11; binding: 0.66 vs 0.26.
Collapse
Affiliation(s)
| | | | - Nicole Bussola
- a Fondazione Bruno Kessler , Trento , Italy
- b Centre for Integrative Biology, University of Trento , Trento , Italy
| | | | | |
Collapse
|
29
|
Khan PM, Roy K. Current approaches for choosing feature selection and learning algorithms in quantitative structure-activity relationships (QSAR). Expert Opin Drug Discov 2018; 13:1075-1089. [PMID: 30372648 DOI: 10.1080/17460441.2018.1542428] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
INTRODUCTION Quantitative structure-activity/property relationships (QSAR/QSPR) are statistical models which quantitatively correlate quantitative chemical structure information (described as molecular descriptors) to the response end points (biological activity, property, toxicity, etc.). Important strategies for QSAR model development and validation include dataset curation, variable selection, and dataset division, selection of modeling algorithms and appropriate measures of model validation. Areas covered: Different feature selection methods and various linear and nonlinear learning algorithms are employed to address the complexity of data sets for selection of appropriate features important for the responses being modeled, to reduce overfitting of the models, and to derive interpretable models. This review provides an overview of various feature selection methods as well as different statistical learning algorithms for QSAR modeling at an elementary level for nonexpert readers. Expert opinion: Novel sets of descriptors are being continuously introduced to this field; therefore, to handle this issue, there is a need to improve new tools for feature selection, which can lead to development of statistically meaningful models, usable by nonexperts in the fields. While handling data sets of limited size, special techniques like double cross-validation and consensus modeling might be more meaningful in order to remove the possibility of bias in descriptor selection.
Collapse
Affiliation(s)
- Pathan Mohsin Khan
- a Department of Pharmacoinformatics , National Institute of Pharmaceutical Educational and Research (NIPER) , Kolkata , India
| | - Kunal Roy
- b Drug Theoretics and Cheminformatics Laboratory, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology , Jadavpur University , Kolkata , India
| |
Collapse
|
30
|
Grenet I, Yin Y, Comet JP. G-Networks to Predict the Outcome of Sensing of Toxicity. SENSORS 2018; 18:s18103483. [PMID: 30332807 PMCID: PMC6210391 DOI: 10.3390/s18103483] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Revised: 10/05/2018] [Accepted: 10/12/2018] [Indexed: 01/09/2023]
Abstract
G-Networks and their simplified version known as the Random Neural Network have often been used to classify data. In this paper, we present a use of the Random Neural Network to the early detection of potential of toxicity chemical compounds through the prediction of their bioactivity from the compounds' physico-chemical structure, and propose that it be automated using machine learning (ML) techniques. Specifically the Random Neural Network is shown to be an effective analytical tool to this effect, and the approach is illustrated and compared with several ML techniques.
Collapse
Affiliation(s)
- Ingrid Grenet
- University Côte d'Azur, I3S laboratory, UMR CNRS 7271, CS 40121, 06903 Sophia Antipolis CEDEX, France.
| | - Yonghua Yin
- Intelligent Systems and Networks Group, Department of Electrical and Electronic Engineering, Imperial College, London SW7 2AZ, UK.
| | - Jean-Paul Comet
- University Côte d'Azur, I3S laboratory, UMR CNRS 7271, CS 40121, 06903 Sophia Antipolis CEDEX, France.
| |
Collapse
|