1
|
Guan R, Cai R, Guo B, Wang Y, Zhao C. A Data-Driven Computational Framework for Assessing the Risk of Placental Exposure to Environmental Chemicals. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58:7770-7781. [PMID: 38665120 DOI: 10.1021/acs.est.4c00475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
A computational framework based on placental gene networks was proposed in this work to improve the accuracy of the placental exposure risk assessment of environmental compounds. The framework quantitatively characterizes the ability of compounds to cross the placental barrier by systematically considering the interaction and pathway-level information on multiple placental transporters. As a result, probability scores were generated for 307 compounds crossing the placental barrier based on this framework. These scores were then used to categorize the compounds into different levels of transplacental transport range, creating a gradient partition. These probability scores not only facilitated a more intuitive understanding of a compound's ability to cross the placental barrier but also provided valuable information for predicting potential placental disruptors. Compounds with probability scores greater than 90% were considered to have significant transplacental transport potential, whereas those with probability scores less than 80% were classified as unlikely to cross the placental barrier. Furthermore, external validation set results showed that the probability score could accurately predict the compounds known to cross the placental barrier. In conclusion, the computational framework proposed in this study enhances the intuitive understanding of the ability of compounds to cross the placental barrier and opens up new avenues for assessing the placental exposure risk of compounds.
Collapse
Affiliation(s)
- Ruining Guan
- School of Pharmacy, Lanzhou University, Lanzhou 730000, China
| | - Ruitong Cai
- School of Pharmacy, Lanzhou University, Lanzhou 730000, China
| | - Binbin Guo
- School of Pharmacy, Lanzhou University, Lanzhou 730000, China
| | - Yawei Wang
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| | - Chunyan Zhao
- School of Pharmacy, Lanzhou University, Lanzhou 730000, China
- Hubei Key Laboratory of Environmental and Health Effects of Persistent Toxic Substances, School of Environment and Health, Jianghan University, Wuhan 430056, China
| |
Collapse
|
2
|
Kleinstreuer N, Hartung T. Artificial intelligence (AI)-it's the end of the tox as we know it (and I feel fine). Arch Toxicol 2024; 98:735-754. [PMID: 38244040 PMCID: PMC10861653 DOI: 10.1007/s00204-023-03666-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 12/12/2023] [Indexed: 01/22/2024]
Abstract
The rapid progress of AI impacts diverse scientific disciplines, including toxicology, and has the potential to transform chemical safety evaluation. Toxicology has evolved from an empirical science focused on observing apical outcomes of chemical exposure, to a data-rich field ripe for AI integration. The volume, variety and velocity of toxicological data from legacy studies, literature, high-throughput assays, sensor technologies and omics approaches create opportunities but also complexities that AI can help address. In particular, machine learning is well suited to handle and integrate large, heterogeneous datasets that are both structured and unstructured-a key challenge in modern toxicology. AI methods like deep neural networks, large language models, and natural language processing have successfully predicted toxicity endpoints, analyzed high-throughput data, extracted facts from literature, and generated synthetic data. Beyond automating data capture, analysis, and prediction, AI techniques show promise for accelerating quantitative risk assessment by providing probabilistic outputs to capture uncertainties. AI also enables explanation methods to unravel mechanisms and increase trust in modeled predictions. However, issues like model interpretability, data biases, and transparency currently limit regulatory endorsement of AI. Multidisciplinary collaboration is needed to ensure development of interpretable, robust, and human-centered AI systems. Rather than just automating human tasks at scale, transformative AI can catalyze innovation in how evidence is gathered, data are generated, hypotheses are formed and tested, and tasks are performed to usher new paradigms in chemical safety assessment. Used judiciously, AI has immense potential to advance toxicology into a more predictive, mechanism-based, and evidence-integrated scientific discipline to better safeguard human and environmental wellbeing across diverse populations.
Collapse
Affiliation(s)
| | - Thomas Hartung
- Bloomberg School of Public Health, Doerenkamp-Zbinden Chair for Evidence-Based Toxicology, Center for Alternatives to Animal Testing (CAAT), Johns Hopkins University, Baltimore, MD, USA.
- CAAT-Europe, University of Konstanz, Constance, Germany.
| |
Collapse
|
3
|
He C, Yu W, Yang M, Li Z, Yu J, Zhong D, Deng S, Song Z, Cheng S. Qi Fu Yin ameliorates neuroinflammation through inhibiting RAGE and TLR4/NF-κB pathway in AD model rats. Aging (Albany NY) 2023; 15:13239-13264. [PMID: 38006400 DOI: 10.18632/aging.205238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 10/23/2023] [Indexed: 11/27/2023]
Abstract
The purpose of this study is to investigate the therapeutic effect of Qi Fu Yin (QFY) on Alzheimer's disease (AD) both computationally and experimentally. Network pharmacology analysis and molecular docking were conducted to identify potential targets and signaling pathways involved in QFY treating AD. Streptozotocin-induced AD rat model was used to verify important targets and predicted pathways. The components of QFY were identified using liquid chromatography-tandem mass spectrometry. The results indicate that the potential targets of QFY are highly enriched for anti-inflammatory pathways. Molecular docking analysis revealed stable structures formed between QFY's active compounds, including stigmasterol, β-sitosterol, and isorhamnetin, and the identified targets. In vivo, QFY improved cognitive memory in AD rats and reduced the mRNA expression levels of toll-like receptor 4 (TLR4), the receptor for advanced glycation end products (AGER), and the inflammatory factors interleukin-1β (IL-1β) and tumor necrosis factor-α (TNF-α) in the brains of AD rats. Furthermore, QFY effectively reduced nuclear translocation of nuclear factor-kappa B (NF-κB) and inhibited NF-κB and microglia activation. In conclusion, QFY can ameliorate neuroinflammation in AD model rats, partly via the inhibition of TLR4 and RAGE/NF-κB pathway and microglia activation, thereby enhancing learning and memory in AD model rats.
Collapse
Affiliation(s)
- Chunxiang He
- School of Integrated Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
- Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of Cardio-Cerebral Diseases, College of Integrated Traditional Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
| | - Wenjing Yu
- School of Integrated Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
- Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of Cardio-Cerebral Diseases, College of Integrated Traditional Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
| | - Miao Yang
- School of Integrated Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
- Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of Cardio-Cerebral Diseases, College of Integrated Traditional Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
| | - Ze Li
- School of Integrated Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
- Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of Cardio-Cerebral Diseases, College of Integrated Traditional Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
| | - Jingping Yu
- Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of Cardio-Cerebral Diseases, College of Integrated Traditional Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
- Baoshan College of Traditional Chinese Medicine, Baoshan, Yunnan 678000, China
| | - Dayuan Zhong
- Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of Cardio-Cerebral Diseases, College of Integrated Traditional Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
- Guangdong Provincial Hospital of Integrated Traditional Chinese and Western Medicine, Foshan, Guangdong 528000, China
| | - Sisi Deng
- School of Integrated Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
- Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of Cardio-Cerebral Diseases, College of Integrated Traditional Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
| | - Zhenyan Song
- School of Integrated Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
- Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of Cardio-Cerebral Diseases, College of Integrated Traditional Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
| | - Shaowu Cheng
- School of Integrated Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
- Key Laboratory of Hunan Province for Integrated Traditional Chinese and Western Medicine on Prevention and Treatment of Cardio-Cerebral Diseases, College of Integrated Traditional Chinese and Western Medicine, Hunan University of Chinese Medicine, Changsha, Hunan 410208, China
| |
Collapse
|
4
|
Guo W, Liu J, Dong F, Song M, Li Z, Khan MKH, Patterson TA, Hong H. Review of machine learning and deep learning models for toxicity prediction. Exp Biol Med (Maywood) 2023; 248:1952-1973. [PMID: 38057999 PMCID: PMC10798180 DOI: 10.1177/15353702231209421] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023] Open
Abstract
The ever-increasing number of chemicals has raised public concerns due to their adverse effects on human health and the environment. To protect public health and the environment, it is critical to assess the toxicity of these chemicals. Traditional in vitro and in vivo toxicity assays are complicated, costly, and time-consuming and may face ethical issues. These constraints raise the need for alternative methods for assessing the toxicity of chemicals. Recently, due to the advancement of machine learning algorithms and the increase in computational power, many toxicity prediction models have been developed using various machine learning and deep learning algorithms such as support vector machine, random forest, k-nearest neighbors, ensemble learning, and deep neural network. This review summarizes the machine learning- and deep learning-based toxicity prediction models developed in recent years. Support vector machine and random forest are the most popular machine learning algorithms, and hepatotoxicity, cardiotoxicity, and carcinogenicity are the frequently modeled toxicity endpoints in predictive toxicology. It is known that datasets impact model performance. The quality of datasets used in the development of toxicity prediction models using machine learning and deep learning is vital to the performance of the developed models. The different toxicity assignments for the same chemicals among different datasets of the same type of toxicity have been observed, indicating benchmarking datasets is needed for developing reliable toxicity prediction models using machine learning and deep learning algorithms. This review provides insights into current machine learning models in predictive toxicology, which are expected to promote the development and application of toxicity prediction models in the future.
Collapse
Affiliation(s)
- Wenjing Guo
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Jie Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Fan Dong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Meng Song
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Zoe Li
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Md Kamrul Hasan Khan
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Tucker A Patterson
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Huixiao Hong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| |
Collapse
|
5
|
Duh-Leong C, Maffini MV, Kassotis CD, Vandenberg LN, Trasande L. The regulation of endocrine-disrupting chemicals to minimize their impact on health. Nat Rev Endocrinol 2023; 19:600-614. [PMID: 37553404 DOI: 10.1038/s41574-023-00872-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/29/2023] [Indexed: 08/10/2023]
Abstract
Endocrine-disrupting chemicals (EDCs) are substances generated by human industrial activities that are detrimental to human health through their effects on the endocrine system. The global societal and economic burden posed by EDCs is substantial. Poorly defined or unenforced policies can increase human exposure to EDCs, thereby contributing to human disease, disability and economic damage. Researchers have shown that policies and interventions implemented at both individual and government levels have the potential to reduce exposure to EDCs. This Review describes a set of evidence-based policy actions to manage, minimize or even eliminate the widespread use of these chemicals and better protect human health and society. A number of specific challenges exist: defining, identifying and prioritizing EDCs; considering the non-linear or non-monotonic properties of EDCs; accounting for EDC exposure effects that are latent and do not appear until later in life; and updating testing paradigms to reflect 'real-world' mixtures of chemicals and cumulative exposure. A sound strategy also requires partnering with health-care providers to integrate strategies to prevent EDC exposure in clinical care. Critical next steps include addressing EDCs within global policy frameworks by integrating EDC exposure prevention into emerging climate policy.
Collapse
Affiliation(s)
- Carol Duh-Leong
- Department of Pediatrics, New York University Grossman School of Medicine, New York, NY, USA
| | | | - Christopher D Kassotis
- Institute of Environmental Health Sciences and Department of Pharmacology, Wayne State University, Detroit, MI, USA
| | - Laura N Vandenberg
- Department of Environmental Health Sciences, University of Massachusetts - Amherst, Amherst, MA, USA
| | - Leonardo Trasande
- Department of Pediatrics, New York University Grossman School of Medicine, New York, NY, USA.
- Department of Environmental Medicine, New York University Grossman School of Medicine, New York, NY, USA.
- Department of Population Health, New York University Grossman School of Medicine, New York, NY, USA.
- New York University Wagner Graduate School of Public Service, New York, NY, USA.
| |
Collapse
|
6
|
Béquignon OM, Gómez-Tamayo JC, Lenselink EB, Wink S, Hiemstra S, Lam CC, Gadaleta D, Roncaglioni A, Norinder U, Water BVD, Pastor M, van Westen GJP. Collaborative SAR Modeling and Prospective In Vitro Validation of Oxidative Stress Activation in Human HepG2 Cells. J Chem Inf Model 2023; 63:5433-5445. [PMID: 37616385 PMCID: PMC10498489 DOI: 10.1021/acs.jcim.3c00220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Indexed: 08/26/2023]
Abstract
Oxidative stress is the consequence of an abnormal increase of reactive oxygen species (ROS). ROS are generated mainly during the metabolism in both normal and pathological conditions as well as from exposure to xenobiotics. Xenobiotics can, on the one hand, disrupt molecular machinery involved in redox processes and, on the other hand, reduce the effectiveness of the antioxidant activity. Such dysregulation may lead to oxidative damage when combined with oxidative stress overpassing the cell capacity to detoxify ROS. In this work, a green fluorescent protein (GFP)-tagged nuclear factor erythroid 2-related factor 2 (NRF2)-regulated sulfiredoxin reporter (Srxn1-GFP) was used to measure the antioxidant response of HepG2 cells to a large series of drug and drug-like compounds (2230 compounds). These compounds were then classified as positive or negative depending on cellular response and distributed among different modeling groups to establish structure-activity relationship (SAR) models. A selection of models was used to prospectively predict oxidative stress induced by a new set of compounds subsequently experimentally tested to validate the model predictions. Altogether, this exercise exemplifies the different challenges of developing SAR models of a phenotypic cellular readout, model combination, chemical space selection, and results interpretation.
Collapse
Affiliation(s)
- Olivier
J. M. Béquignon
- Leiden
Academic Centre for Drug Research, Leiden
University, Wassenaarseweg 76, 2333 AL Leiden, The Netherlands
| | - Jose C. Gómez-Tamayo
- Research
Programme on Biomedical Informatics (GRIB), Department of Medicine
and Life Sciences, Hospital del Mar Medical Research Institute, Universitat Pompeu Fabra, Carrer del Dr. Aiguader 88, 08002 Barcelona, Spain
| | - Eelke B. Lenselink
- Leiden
Academic Centre for Drug Research, Leiden
University, Wassenaarseweg 76, 2333 AL Leiden, The Netherlands
| | - Steven Wink
- Leiden
Academic Centre for Drug Research, Leiden
University, Wassenaarseweg 76, 2333 AL Leiden, The Netherlands
| | - Steven Hiemstra
- Leiden
Academic Centre for Drug Research, Leiden
University, Wassenaarseweg 76, 2333 AL Leiden, The Netherlands
| | - Chi Chung Lam
- Leiden
Academic Centre for Drug Research, Leiden
University, Wassenaarseweg 76, 2333 AL Leiden, The Netherlands
| | - Domenico Gadaleta
- Laboratory
of Environmental Chemistry and Toxicology, Department of Environmental
Health Sciences, IRCCS—Istituto di
Ricerche Farmacologiche Mario Negri, Via la Masa 19, 20156 Milano, Italy
| | - Alessandra Roncaglioni
- Laboratory
of Environmental Chemistry and Toxicology, Department of Environmental
Health Sciences, IRCCS—Istituto di
Ricerche Farmacologiche Mario Negri, Via la Masa 19, 20156 Milano, Italy
| | - Ulf Norinder
- MTM
Research Centre, School of Science and Technology, Örebro University, SE-70182 Örebro, Sweden
| | - Bob van de Water
- Leiden
Academic Centre for Drug Research, Leiden
University, Wassenaarseweg 76, 2333 AL Leiden, The Netherlands
| | - Manuel Pastor
- Research
Programme on Biomedical Informatics (GRIB), Department of Medicine
and Life Sciences, Hospital del Mar Medical Research Institute, Universitat Pompeu Fabra, Carrer del Dr. Aiguader 88, 08002 Barcelona, Spain
| | - Gerard J. P. van Westen
- Leiden
Academic Centre for Drug Research, Leiden
University, Wassenaarseweg 76, 2333 AL Leiden, The Netherlands
| |
Collapse
|
7
|
Hao Y, Romano JD, Moore JH. Knowledge graph aids comprehensive explanation of drug and chemical toxicity. CPT Pharmacometrics Syst Pharmacol 2023; 12:1072-1079. [PMID: 37475158 PMCID: PMC10431039 DOI: 10.1002/psp4.12975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 04/04/2023] [Accepted: 04/06/2023] [Indexed: 07/22/2023] Open
Abstract
In computational toxicology, prediction of complex endpoints has always been challenging, as they often involve multiple distinct mechanisms. State-of-the-art models are either limited by low accuracy, or lack of interpretability due to their black-box nature. Here, we introduce AIDTox, an interpretable deep learning model which incorporates curated knowledge of chemical-gene connections, gene-pathway annotations, and pathway hierarchy. AIDTox accurately predicts cytotoxicity outcomes in HepG2 and HEK293 cells. It also provides comprehensive explanations of cytotoxicity covering multiple aspects of drug activity, including target interaction, metabolism, and elimination. In summary, AIDTox provides a computational framework for unveiling cellular mechanisms for complex toxicity endpoints.
Collapse
Affiliation(s)
- Yun Hao
- Genomics and Computational Biology (GCB) Graduate ProgramUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Joseph D. Romano
- Institute for Biomedical InformaticsUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
- Center of Excellence in Environmental ToxicologyUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Jason H. Moore
- Department of Computational BiomedicineCedars‐Sinai Medical CenterLos AngelesCaliforniaUSA
| |
Collapse
|
8
|
Wu S, Daston G, Rose J, Blackburn K, Fisher J, Reis A, Selman B, Naciff J. Identifying chemicals based on receptor binding/bioactivation/mechanistic explanation associated with potential to elicit hepatotoxicity and to support structure activity relationship-based read-across. Curr Res Toxicol 2023; 5:100108. [PMID: 37363741 PMCID: PMC10285556 DOI: 10.1016/j.crtox.2023.100108] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 06/02/2023] [Accepted: 06/06/2023] [Indexed: 06/28/2023] Open
Abstract
The liver is the most common target organ in toxicology studies. The development of chemical structural alerts for identifying hepatotoxicity will play an important role in in silico model prediction and help strengthen the identification of analogs used in structure activity relationship (SAR)- based read-across. The aim of the current study is development of an SAR-based expert-system decision tree for screening of hepatotoxicants across a wide range of chemistry space and proposed modes of action for clustering of chemicals using defined core chemical categories based on receptor-binding or bioactivation. The decision tree is based on ∼ 1180 different chemicals that were reviewed for hepatotoxicity information. Knowledge of chemical receptor binding, metabolism and mechanistic information were used to group these chemicals into 16 different categories and 102 subcategories: four categories describe binders to 9 different receptors, 11 categories are associated with possible reactive metabolites (RMs) and there is one miscellaneous category. Each chemical subcategory has been associated with possible modes of action (MOAs) or similar key structural features. This decision tree can help to screen potential liver toxicants associated with core structural alerts of receptor binding and/or RMs and be used as a component of weight of evidence decisions based on SAR read-across, and to fill data gaps.
Collapse
|
9
|
Huang L, Zhang Z, Xing H, Luo Y, Yang J, Sui X, Wang Y. Risk assessment based on dose-responsive and time-responsive genes to build PLS-DA models for exogenously induced lung injury. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2023; 256:114891. [PMID: 37054470 DOI: 10.1016/j.ecoenv.2023.114891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Revised: 02/28/2023] [Accepted: 04/08/2023] [Indexed: 06/19/2023]
Abstract
Xenobiotics can easily harm human lungs owing to the openness of the respiratory system. Identifying pulmonary toxicity remains challenging owing to several reasons: 1) no biomarkers for pulmonary toxicity are available that might help to detect lung injury; 2) traditional animal experiments are time-consuming; 3) traditional detection methods solely focus on poisoning accidents; 4) analytical chemistry methods hardly achieve universal detection. An in vitro testing system able to identify the pulmonary toxicity of contaminants from food, the environment, and drugs is urgently needed. Compounds are virtually infinite, whereas toxicological mechanisms are countable. Therefore, universal methods to identify and predict the risks of contaminants can be designed based on these well-known toxicity mechanisms. In this study, we established a dataset based on transcriptome sequencing of A549 cells upon treatment with different compounds. The representativeness of our dataset was analyzed using bioinformatics methods. Artificial intelligence methods, namely partial least squares discriminant analysis (PLS-DA) models, were employed for toxicity prediction and toxicant identification. The developed model predicted the pulmonary toxicity of compounds with a 92 % accuracy. These models were submitted to an external validation using highly heterogeneous compounds, which supported the accuracy and robustness of our developed methodology. This assay exhibits universal potential applications for water quality monitoring, crop pollution detection, food and drug safety evaluation, as well as chemical warfare agent detection.
Collapse
Affiliation(s)
- Lijuan Huang
- State Key Laboratory of Toxicology and Medical Countermeasures, Institute of Pharmacology and Toxicology, Academy of Military Medical Sciences, Beijing, China
| | - Zinan Zhang
- State Key Laboratory of Toxicology and Medical Countermeasures, Institute of Pharmacology and Toxicology, Academy of Military Medical Sciences, Beijing, China
| | - Huanchun Xing
- State Key Laboratory of Toxicology and Medical Countermeasures, Institute of Pharmacology and Toxicology, Academy of Military Medical Sciences, Beijing, China
| | - Yuan Luo
- State Key Laboratory of Toxicology and Medical Countermeasures, Institute of Pharmacology and Toxicology, Academy of Military Medical Sciences, Beijing, China
| | - Jun Yang
- State Key Laboratory of Toxicology and Medical Countermeasures, Institute of Pharmacology and Toxicology, Academy of Military Medical Sciences, Beijing, China
| | - Xin Sui
- State Key Laboratory of Toxicology and Medical Countermeasures, Institute of Pharmacology and Toxicology, Academy of Military Medical Sciences, Beijing, China.
| | - Yongan Wang
- State Key Laboratory of Toxicology and Medical Countermeasures, Institute of Pharmacology and Toxicology, Academy of Military Medical Sciences, Beijing, China.
| |
Collapse
|
10
|
Kim C, Jeong J, Choi J. Effects of Class Imbalance and Data Scarcity on the Performance of Binary Classification Machine Learning Models Developed Based on ToxCast/Tox21 Assay Data. Chem Res Toxicol 2022; 35:2219-2226. [PMID: 36475638 DOI: 10.1021/acs.chemrestox.2c00189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The development of toxicity classification models using the ToxCast database has been extensively studied. Machine learning approaches are effective in identifying the bioactivity of untested chemicals. However, ToxCast assays differ in the amount of data and degree of class imbalance (CI). Therefore, the resampling algorithm employed should vary depending on the data distribution to achieve optimal classification performance. In this study, the effects of CI and data scarcity (DS) on the performance of binary classification models were investigated using ToxCast bioassay data. An assay matrix based on CI and DS was prepared for 335 assays with biologically intended target information, and 28 CI assays and 3 DS assays were selected. Thirty models established by combining five molecular fingerprints (i.e., Morgan, MACCS, RDKit, Pattern, and Layered) and six algorithms [i.e., gradient boosting tree, random forest (RF), multi-layered perceptron, k-nearest neighbor, logistic regression, and naive Bayes] were trained using the selected assay data set. Of the 30 trained models, MACCS-RF showed the best performance and thus was selected for analyses of the effects of CI and DS. Results showed that recall and F1 were significantly lower when training with the CI assays than with the DS assays. In addition, hyperparameter tuning of the RF algorithm significantly improved F1 on CI assays. This study provided a basis for developing a toxicity classification model with improved performance by evaluating the effects of data set characteristics. This study also emphasized the importance of using appropriate evaluation metrics and tuning hyperparameters in model development.
Collapse
Affiliation(s)
- Changhun Kim
- Chemical Bigdata Research Center, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea.,School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea
| | - Jaeseong Jeong
- Chemical Bigdata Research Center, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea.,School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea
| | - Jinhee Choi
- Chemical Bigdata Research Center, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea.,School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea
| |
Collapse
|
11
|
Luijten M, Sprong RC, Rorije E, van der Ven LTM. Prioritization of chemicals in food for risk assessment by integrating exposure estimates and new approach methodologies: A next generation risk assessment case study. FRONTIERS IN TOXICOLOGY 2022; 4:933197. [PMID: 36199824 PMCID: PMC9527283 DOI: 10.3389/ftox.2022.933197] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 08/02/2022] [Indexed: 11/15/2022] Open
Abstract
Next generation risk assessment is defined as a knowledge-driven system that allows for cost-efficient assessment of human health risk related to chemical exposure, without animal experimentation. One of the key features of next generation risk assessment is to facilitate prioritization of chemical substances that need a more extensive toxicological evaluation, in order to address the need to assess an increasing number of substances. In this case study focusing on chemicals in food, we explored how exposure data combined with the Threshold of Toxicological Concern (TTC) concept could be used to prioritize chemicals, both for existing substances and new substances entering the market. Using a database of existing chemicals relevant for dietary exposure we calculated exposure estimates, followed by application of the TTC concept to identify substances of higher concern. Subsequently, a selected set of these priority substances was screened for toxicological potential using high-throughput screening (HTS) approaches. Remarkably, this approach resulted in alerts for a selection of substances that are already on the market and represent relevant exposure in consumers. Taken together, the case study provides proof-of-principle for the approach taken to identify substances of concern, and this approach can therefore be considered a supportive element to a next generation risk assessment strategy.
Collapse
Affiliation(s)
- Mirjam Luijten
- Centre for Health Protection, Bilthoven, Netherlands
- *Correspondence: Mirjam Luijten,
| | - R. Corinne Sprong
- Centre for Nutrition, Prevention and Health Services, Bilthoven, Netherlands
| | - Emiel Rorije
- Centre for Safety of Substances and Products, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands
| | | |
Collapse
|
12
|
Hao Y, Romano JD, Moore JH. Knowledge-guided deep learning models of drug toxicity improve interpretation. PATTERNS (NEW YORK, N.Y.) 2022; 3:100565. [PMID: 36124309 PMCID: PMC9481960 DOI: 10.1016/j.patter.2022.100565] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 05/16/2022] [Accepted: 07/12/2022] [Indexed: 12/04/2022]
Abstract
In drug development, a major reason for attrition is the lack of understanding of cellular mechanisms governing drug toxicity. The black-box nature of conventional classification models has limited their utility in identifying toxicity pathways. Here we developed DTox (deep learning for toxicology), an interpretation framework for knowledge-guided neural networks, which can predict compound response to toxicity assays and infer toxicity pathways of individual compounds. We demonstrate that DTox can achieve the same level of predictive performance as conventional models with a significant improvement in interpretability. Using DTox, we were able to rediscover mechanisms of transcription activation by three nuclear receptors, recapitulate cellular activities induced by aromatase inhibitors and pregnane X receptor (PXR) agonists, and differentiate distinctive mechanisms leading to HepG2 cytotoxicity. Virtual screening by DTox revealed that compounds with predicted cytotoxicity are at higher risk for clinical hepatic phenotypes. In summary, DTox provides a framework for deciphering cellular mechanisms of toxicity in silico.
Collapse
Affiliation(s)
- Yun Hao
- Genomics and Computational Biology (GCB) Graduate Program, University of Pennsylvania, Philadelphia, PA, USA
| | - Joseph D. Romano
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, USA
- Center of Excellence in Environmental Toxicology, University of Pennsylvania, Philadelphia, PA, USA
| | - Jason H. Moore
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| |
Collapse
|
13
|
Filer DL, Hoffman K, Sargis RM, Trasande L, Kassotis CD. On the Utility of ToxCast-Based Predictive Models to Evaluate Potential Metabolic Disruption by Environmental Chemicals. ENVIRONMENTAL HEALTH PERSPECTIVES 2022; 130:57005. [PMID: 35533074 PMCID: PMC9084331 DOI: 10.1289/ehp6779] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Revised: 03/19/2022] [Accepted: 04/06/2022] [Indexed: 05/26/2023]
Abstract
BACKGROUND Research suggests environmental contaminants can impact metabolic health; however, high costs prohibit in vivo screening of putative metabolic disruptors. High-throughput screening programs, such as ToxCast, hold promise to reduce testing gaps and prioritize higher-order (in vivo) testing. OBJECTIVES We sought to a) examine the concordance of in vitro testing in 3T3-L1 cells to a targeted literature review for 38 semivolatile environmental chemicals, and b) assess the predictive utility of various expert models using ToxCast data against the set of 38 reference chemicals. METHODS Using a set of 38 chemicals with previously published results in 3T3-L1 cells, we performed a metabolism-targeted literature review to determine consensus activity determinations. To assess ToxCast predictive utility, we used two published ToxPi models: a) the 8-Slice model published by Janesick et al. (2016) and b) the 5-Slice model published by Auerbach et al. (2016). We examined the performance of the two models against the Janesick in vitro results and our own 38-chemical reference set. We further evaluated the predictive performance of various modifications to these models using cytotoxicity filtering approaches and validated our best-performing model with new chemical testing in 3T3-L1 cells. RESULTS The literature review revealed relevant publications for 30 out of the 38 chemicals (the remaining 8 chemicals were only examined in our previous 3T3-L1 testing). We observed a balanced accuracy (average of sensitivity and specificity) of 0.86 comparing our previous in vitro results to the literature-derived calls. ToxPi models provided balanced accuracies ranging from 0.55 to 0.88, depending on the model specifications and reference set. Validation chemical testing correctly predicted 29 of 30 chemicals as per 3T3-L1 testing, suggesting good adipogenic prediction performance for our best adapted model. DISCUSSION Using the most recent ToxCast data and an updated ToxPi model, we found ToxCast performed similarly to that of our own 3T3-L1 testing in predicting consensus calls. Furthermore, we provide the full ranked list of largely untested chemicals with ToxPi scores that predict adipogenic activity and that require further investigation. https://doi.org/10.1289/EHP6779.
Collapse
Affiliation(s)
- Dayne L. Filer
- Department of Genetics, School of Medicine, and Renaissance Computing Institute, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Kate Hoffman
- Nicholas School of the Environment, Duke University, Durham, North Carolina, USA
| | - Robert M. Sargis
- Department of Medicine, University of Illinois at Chicago, Chicago, Illinois, USA
| | - Leonardo Trasande
- Department of Pediatrics, New York University (NYU) School of Medicine, New York, New York, USA
- Department of Environmental Medicine, New York University (NYU) School of Medicine, New York, New York, USA
- Department of Population Health, New York University (NYU) School of Medicine, New York, New York, USA
- NYU College of Global Public Health, New York University, New York, New York, USA
| | - Christopher D. Kassotis
- Institute of Environmental Health Sciences and Department of Pharmacology, School of Medicine, Wayne State University, Detroit, Michigan, USA
| |
Collapse
|
14
|
Xu T, Xu M, Zhu W, Chen CZ, Zhang Q, Zheng W, Huang R. Efficient Identification of Anti-SARS-CoV-2 Compounds Using Chemical Structure- and Biological Activity-Based Modeling. J Med Chem 2022; 65:4590-4599. [PMID: 35275639 PMCID: PMC8936051 DOI: 10.1021/acs.jmedchem.1c01372] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Indexed: 12/12/2022]
Abstract
Identification of anti-SARS-CoV-2 compounds through traditional high-throughput screening (HTS) assays is limited by high costs and low hit rates. To address these challenges, we developed machine learning models to identify compounds acting via inhibition of the entry of SARS-CoV-2 into human host cells or the SARS-CoV-2 3-chymotrypsin-like (3CL) protease. The optimal classification models achieved good performance with area under the receiver operating characteristic curve (AUC-ROC) values of >0.78. Experimental validation showed that the best performing models increased the assay hit rate by 2.1-fold for viral entry inhibitors and 10.4-fold for 3CL protease inhibitors compared to those of the original drug repurposing screens. Twenty-two compounds showed potent (<5 μM) antiviral activities in a SARS-CoV-2 live virus assay. In conclusion, machine learning models can be developed and used as a complementary approach to HTS to expand compound screening capacities and improve the speed and efficiency of anti-SARS-CoV-2 drug discovery.
Collapse
Affiliation(s)
- Tuan Xu
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, Maryland 20850, United States
| | - Miao Xu
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, Maryland 20850, United States
| | - Wei Zhu
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, Maryland 20850, United States
| | - Catherine Z Chen
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, Maryland 20850, United States
| | - Qi Zhang
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, Maryland 20850, United States
| | - Wei Zheng
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, Maryland 20850, United States
| | - Ruili Huang
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, Maryland 20850, United States
| |
Collapse
|
15
|
Liu J, Guo W, Sakkiah S, Ji Z, Yavas G, Zou W, Chen M, Tong W, Patterson TA, Hong H. Machine Learning Models for Predicting Liver Toxicity. Methods Mol Biol 2022; 2425:393-415. [PMID: 35188640 DOI: 10.1007/978-1-0716-1960-5_15] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Liver toxicity is a major adverse drug reaction that accounts for drug failure in clinical trials and withdrawal from the market. Therefore, predicting potential liver toxicity at an early stage in drug discovery is crucial to reduce costs and the potential for drug failure. However, current in vivo animal toxicity testing is very expensive and time consuming. As an alternative approach, various machine learning models have been developed to predict potential liver toxicity in humans. This chapter reviews current advances in the development and application of machine learning models for prediction of potential liver toxicity in humans and discusses possible improvements to liver toxicity prediction.
Collapse
Affiliation(s)
- Jie Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Wenjing Guo
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Sugunadevi Sakkiah
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Zuowei Ji
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Gokhan Yavas
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Wen Zou
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Minjun Chen
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Weida Tong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Tucker A Patterson
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Huixiao Hong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA.
| |
Collapse
|
16
|
Ellison C, Hewitt M, Przybylak K. In Silico Models for Hepatotoxicity. Methods Mol Biol 2022; 2425:355-392. [PMID: 35188639 DOI: 10.1007/978-1-0716-1960-5_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In this chapter, we review the state of the art of predicting human hepatotoxicity using in silico techniques. There has been significant progress in this area over the past 20 years but there are still some challenges ahead. Principally, these challenges are our partial understanding of a very complex biochemical system and our ability to emulate that in a predictive capacity. Here, we provide an overview of the published modeling approaches in this area to date and discuss their design, strengths and weaknesses. It is interesting to note the diversity in modeling approaches, whether they be statistical algorithms or evidenced-based approaches including structural alerts and pharmacophore models. Irrespective of modeling approach, it appears a common theme of access to appropriate, relevant, and high-quality data is a limitation to all and is likely to continue to be the focus of future research.
Collapse
Affiliation(s)
- Claire Ellison
- Human and Natural Sciences Directorate, School of Science, Engineering and Environment, University of Salford, Manchester, UK
| | - Mark Hewitt
- School of Pharmacy, Faculty of Science and Engineering, University of Wolverhampton, Wolverhampton, UK.
| | | |
Collapse
|
17
|
Joint Decision-Making Model Based on Consensus Modeling Technology for the Prediction of Drug-Induced Liver Injury. J CHEM-NY 2021. [DOI: 10.1155/2021/2293871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Drug-induced liver injury (DILI) is the major cause of clinical trial failure and postmarketing withdrawals of approved drugs. It is very expensive and time-consuming to evaluate hepatotoxicity using animal or cell-based experiments in the early stage of drug development. In this study, an in silico model based on the joint decision-making strategy was developed for DILI assessment using a relatively large dataset of 2608 compounds. Five consensus models were developed with PaDEL descriptors and PubChem, Substructure, Estate, and Klekota–Roth fingerprints, respectively. Submodels for each consensus model were obtained through joint optimization. The parameters and features of each submodel were optimized jointly based on the hybrid quantum particle swarm optimization (HQPSO) algorithm. The application domain (AD) based on the frequency-weighted and distance (FWD)-based method and Tanimoto similarity index showed the wide AD of the qualified consensus models. A joint decision-making model was integrated by the qualified consensus models, and the overwhelming majority principle was used to improve the performance of consensus models. The application scope narrowing caused by the overwhelming majority principle was successfully solved by joint decision-making. The proposed model successfully predicted 99.2% of the compounds in the test set, with an accuracy of 80.0%, a sensitivity of 83.9, and a specificity of 73.3%. For an external validation set containing 390 compounds collected from DILIrank, 98.2% of the compounds were successfully predicted with an accuracy of 79.9%, a sensitivity of 97.1%, and a specificity of 66.0%. Furthermore, 25 privileged substructures responsible for DILI were identified from Substructure, PubChem, and Klekota–Roth fingerprints. These privileged substructures can be regarded as structural alerts in hepatotoxicity evaluation. Compared with the main published studies, our method exhibits certain advantage in data size, transparency, and standardization of the modeling process and accuracy and credibility of prediction results. It is a promising tool for virtual screening in the early stage of drug development.
Collapse
|
18
|
Hao Y, Moore JH. TargetTox: A Feature Selection Pipeline for Identifying Predictive Targets Associated with Drug Toxicity. J Chem Inf Model 2021; 61:5386-5394. [PMID: 34757743 DOI: 10.1021/acs.jcim.1c00733] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In silico assessment of drug toxicity is becoming a critical step in drug development. Conventional ligand-based models are limited by low accuracy and lack of interpretability. Further, they often fail to explain cellular mechanisms underlying structure-toxicity associations. We addressed these limitations by incorporating target profile as an intermediate connecting structure to toxicity. To accommodate for high-dimensional feature space, we developed a pipeline named TargetTox that can identity a subset of predictive features. We implemented TargetTox to study 569 targets and 815 adverse events. The features identified by TargetTox comprise less than 10% of the original feature space; nevertheless, they accurately predicted binding outcomes for 377 targets and toxicity outcomes for 36 adverse events. We demonstrated that predictive targets tend to be differentially expressed in the tissue of toxicity. We also rediscovered key cellular functions associated with cardiotoxicity from the predictive targets, as well as markers of skin and liver diseases. Furthermore, we found evidence supporting diagnostic and therapeutic applications of some predictive targets in hepatotoxicity and nephrotoxicity. Our findings highlighted the critical role of predictive targets in cellular mechanisms leading to toxicity. In general, our study improved the interpretability of toxicity prediction without sacrificing accuracy. Our novel pipeline may benefit future studies of high-dimensional data sets.
Collapse
Affiliation(s)
- Yun Hao
- Genomics and Computational Biology (GCB) Graduate Program, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Jason H Moore
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
19
|
Bassan A, Alves VM, Amberg A, Anger LT, Auerbach S, Beilke L, Bender A, Cronin MT, Cross KP, Hsieh JH, Greene N, Kemper R, Kim MT, Mumtaz M, Noeske T, Pavan M, Pletz J, Russo DP, Sabnis Y, Schaefer M, Szabo DT, Valentin JP, Wichard J, Williams D, Woolley D, Zwickl C, Myatt GJ. In silico approaches in organ toxicity hazard assessment: current status and future needs in predicting liver toxicity. COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2021; 20:100187. [PMID: 35340402 PMCID: PMC8955833 DOI: 10.1016/j.comtox.2021.100187] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/15/2023]
Abstract
Hepatotoxicity is one of the most frequently observed adverse effects resulting from exposure to a xenobiotic. For example, in pharmaceutical research and development it is one of the major reasons for drug withdrawals, clinical failures, and discontinuation of drug candidates. The development of faster and cheaper methods to assess hepatotoxicity that are both more sustainable and more informative is critically needed. The biological mechanisms and processes underpinning hepatotoxicity are summarized and experimental approaches to support the prediction of hepatotoxicity are described, including toxicokinetic considerations. The paper describes the increasingly important role of in silico approaches and highlights challenges to the adoption of these methods including the lack of a commonly agreed upon protocol for performing such an assessment and the need for in silico solutions that take dose into consideration. A proposed framework for the integration of in silico and experimental information is provided along with a case study describing how computational methods have been used to successfully respond to a regulatory question concerning non-genotoxic impurities in chemically synthesized pharmaceuticals.
Collapse
Affiliation(s)
- Arianna Bassan
- Innovatune srl, Via Giulio Zanon 130/D, 35129 Padova, Italy
| | - Vinicius M. Alves
- The National Institute of Environmental Health Sciences, Division of the National Toxicology, Program, Research Triangle Park, NC 27709, USA
| | - Alexander Amberg
- Sanofi, R&D Preclinical Safety Frankfurt, Industriepark Hoechst, D-65926 Frankfurt am Main, Germany
| | | | - Scott Auerbach
- The National Institute of Environmental Health Sciences, Division of the National Toxicology, Program, Research Triangle Park, NC 27709, USA
| | - Lisa Beilke
- Toxicology Solutions Inc., San Diego, CA, USA
| | - Andreas Bender
- AI and Data Analytics, Clinical Pharmacology and Safety Sciences, R&D, AstraZeneca, Cambridge, UK
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW
| | - Mark T.D. Cronin
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool, L3 3AF, UK
| | | | - Jui-Hua Hsieh
- The National Institute of Environmental Health Sciences, Division of the National Toxicology, Program, Research Triangle Park, NC 27709, USA
| | - Nigel Greene
- Data Science and AI, DSM, IMED Biotech Unit, AstraZeneca, Boston, USA
| | - Raymond Kemper
- Nuvalent, One Broadway, 14th floor, Cambridge, MA, 02142, USA
| | - Marlene T. Kim
- US Food and Drug Administration, Center for Drug Evaluation and Research, Silver Spring, MD, 20993, USA
| | - Moiz Mumtaz
- Office of the Associate Director for Science (OADS), Agency for Toxic Substances and Disease, Registry, US Department of Health and Human Services, Atlanta, GA, USA
| | - Tobias Noeske
- Imaging and Data Analytics, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | - Manuela Pavan
- Innovatune srl, Via Giulio Zanon 130/D, 35129 Padova, Italy
| | - Julia Pletz
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool, L3 3AF, UK
| | - Daniel P. Russo
- Department of Chemistry, Rutgers University, Camden, NJ 08102, USA
- The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA
| | - Yogesh Sabnis
- UCB Biopharma SRL, Chemin du Foriest – B-1420 Braine-l’Alleud, Belgium
| | - Markus Schaefer
- Sanofi, R&D Preclinical Safety Frankfurt, Industriepark Hoechst, D-65926 Frankfurt am Main, Germany
| | | | | | - Joerg Wichard
- Bayer AG, Genetic Toxicology, Müllerstr. 178, 13353 Berlin, Germany
| | - Dominic Williams
- Functional & Mechanistic Safety, Clinical Pharmacology & Safety Sciences, AstraZeneca, Darwin Building 310, Cambridge Science Park, Milton Rd, Cambridge CB4 0FZ, UK
| | - David Woolley
- ForthTox Limited, PO Box 13550, Linlithgow, EH49 7YU, UK
| | - Craig Zwickl
- Transendix LLC, 1407 Moores Manor, Indianapolis, IN 46229, USA
| | - Glenn J. Myatt
- Instem, 1393 Dublin Road, Columbus, OH 43215. USA
- Corresponding author. (G.J. Myatt)
| |
Collapse
|
20
|
Sakhteman A, Failli M, Kublbeck J, Levonen AL, Fortino V. A toxicogenomic data space for system-level understanding and prediction of EDC-induced toxicity. ENVIRONMENT INTERNATIONAL 2021; 156:106751. [PMID: 34271427 DOI: 10.1016/j.envint.2021.106751] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 06/30/2021] [Accepted: 07/01/2021] [Indexed: 06/13/2023]
Abstract
Endocrine disrupting compounds (EDCs) are a persistent threat to humans and wildlife due to their ability to interfere with endocrine signaling pathways. Inspired by previous work to improve chemical hazard identification through the use of toxicogenomics data, we developed a genomic-oriented data space for profiling the molecular activity of EDCs in an in silico manner, and for creating predictive models that identify and prioritize EDCs. Predictive models of EDCs, derived from gene expression data from rats (in vivo and in vitro primary hepatocytes) and humans (in vitro primary hepatocytes and HepG2), achieve testing accuracy greater than 90%. Negative test sets indicate that known safer chemicals are not predicted as EDCs. The rat in vivo-based classifiers achieve accuracy greater than 75% when tested for invitro to in vivoextrapolation. This study reveals key metabolic pathways and genes affected by EDCs together with a set of predictive models that utilize these pathways to prioritize EDCs in dose/time dependent manner and to predict EDCevokedmetabolic diseases.
Collapse
Affiliation(s)
- A Sakhteman
- Institute of Biomedicine, University of Eastern Finland, Kuopio 70210, Finland
| | - M Failli
- Department of Chemical, Materials and Industrial Engineering, University of Naples, 'Federico II', Naples 80125, Italy
| | - J Kublbeck
- A.I. Virtanen Institute for Molecular Sciences, University of Eastern Finland, Kuopio 70210, Finland; School of Pharmacy, University of Eastern Finland, Kuopio 70210, Finland
| | - A L Levonen
- A.I. Virtanen Institute for Molecular Sciences, University of Eastern Finland, Kuopio 70210, Finland
| | - V Fortino
- Institute of Biomedicine, University of Eastern Finland, Kuopio 70210, Finland.
| |
Collapse
|
21
|
Gallego V, Naveiro R, Roca C, Ríos Insua D, Campillo NE. AI in drug development: a multidisciplinary perspective. Mol Divers 2021; 25:1461-1479. [PMID: 34251580 PMCID: PMC8342381 DOI: 10.1007/s11030-021-10266-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Accepted: 06/29/2021] [Indexed: 01/09/2023]
Abstract
The introduction of a new drug to the commercial market follows a complex and long process that typically spans over several years and entails large monetary costs due to a high attrition rate. Because of this, there is an urgent need to improve this process using innovative technologies such as artificial intelligence (AI). Different AI tools are being applied to support all four steps of the drug development process (basic research for drug discovery; pre-clinical phase; clinical phase; and postmarketing). Some of the main tasks where AI has proven useful include identifying molecular targets, searching for hit and lead compounds, synthesising drug-like compounds and predicting ADME-Tox. This review, on the one hand, brings in a mathematical vision of some of the key AI methods used in drug development closer to medicinal chemists and, on the other hand, brings the drug development process and the use of different models closer to mathematicians. Emphasis is placed on two aspects not mentioned in similar surveys, namely, Bayesian approaches and their applications to molecular modelling and the eventual final use of the methods to actually support decisions. Promoting a perfect synergy.
Collapse
Affiliation(s)
- Víctor Gallego
- Institute of Mathematical Sciences (ICMAT-CSIC), Nicolás Cabrera 13-15, 28049, Madrid, Spain
| | - Roi Naveiro
- Institute of Mathematical Sciences (ICMAT-CSIC), Nicolás Cabrera 13-15, 28049, Madrid, Spain
| | - Carlos Roca
- AItenea Biotech S.L. Parque Científico de Madrid, Faraday, 7, 28049, Madrid, Spain
| | - David Ríos Insua
- ICMAT-CSIC and Dept. of Statistics and OR, U. Compl. Madrid, Madrid, Spain
| | - Nuria E Campillo
- CIB-Margarita Salas (CSIC), Ramiro de Maeztu, 9, 28040, Madrid, Spain.
| |
Collapse
|
22
|
Shimizu Y, Sasaki T, Takeshita JI, Watanabe M, Shizu R, Hosaka T, Yoshinari K. Identification of average molecular weight (AMW) as a useful chemical descriptor to discriminate liver injury-inducing drugs. PLoS One 2021; 16:e0253855. [PMID: 34170966 PMCID: PMC8232420 DOI: 10.1371/journal.pone.0253855] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Accepted: 06/14/2021] [Indexed: 11/19/2022] Open
Abstract
Drug-induced liver injury (DILI) is one of major causes of discontinuing drug development and withdrawing drugs from the market. In this study, we investigated chemical properties associated with DILI using in silico methods, to identify a physicochemical property useful for DILI screening at the early stages of drug development. Total of 652 drugs, including 432 DILI-positive drugs (DILI drugs) and 220 DILI-negative drugs (no-DILI drugs) were selected from Liver Toxicity Knowledge Base of US Food and Drug Administration. Decision tree models were constructed using 2,473 descriptors as explanatory variables. In the final model, the descriptor AMW, representing average molecular weight, was found to be at the first node and showed the highest importance value. With AMW alone, 276 DILI drugs (64%) and 156 no-DILI drugs (71%) were correctly classified. Discrimination with AMW was then performed using therapeutic category information. The performance of discrimination depended on the category and significantly high performance (>0.8 balanced accuracy) was obtained in some categories. Taken together, the present results suggest AMW as a novel descriptor useful for detecting drugs with DILI risk. The information presented may be valuable for the safety assessment of drug candidates at the early stage of drug development.
Collapse
Affiliation(s)
- Yuki Shimizu
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Takamitsu Sasaki
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Jun-ichi Takeshita
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
- Research Institute of Science for Safety and Sustainability, National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba, Japan
| | - Michiko Watanabe
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Ryota Shizu
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Takuomi Hosaka
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Kouichi Yoshinari
- Laboratory of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
- * E-mail:
| |
Collapse
|
23
|
Zhao L, Russo DP, Wang W, Aleksunes LM, Zhu H. Mechanism-Driven Read-Across of Chemical Hepatotoxicants Based on Chemical Structures and Biological Data. Toxicol Sci 2021; 174:178-188. [PMID: 32073637 DOI: 10.1093/toxsci/kfaa005] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Hepatotoxicity is a leading cause of attrition in the drug development process. Traditional preclinical and clinical studies to evaluate hepatotoxicity liabilities are expensive and time consuming. With the advent of critical advancements in high-throughput screening, there has been a rapid accumulation of in vitro toxicity data available to inform the risk assessment of new pharmaceuticals and chemicals. To this end, we curated and merged all available in vivo hepatotoxicity data obtained from the literature and public resources, which yielded a comprehensive database of 4089 compounds that includes hepatotoxicity classifications. After dividing the original database of chemicals into modeling and test sets, PubChem assay data were automatically extracted using an in-house data mining tool and clustered based on relationships between structural fragments and cellular responses in in vitro assays. The resultant PubChem assay clusters were further investigated. During the cross-validation procedure, the biological data obtained from several assay clusters exhibited high predictivity of hepatotoxicity and these assays were selected to evaluate the test set compounds. The read-across results indicated that if a new compound contained specific identified chemical fragments (ie, Molecular Initiating Event) and showed active responses in the relevant selected PubChem assays, there was potential for the chemical to be hepatotoxic in vivo. Furthermore, several mechanisms that might contribute to toxicity were derived from the modeling results including alterations in nuclear receptor signaling and inhibition of DNA repair. This modeling strategy can be further applied to the investigation of other complex chemical toxicity phenomena (eg, developmental and reproductive toxicities) as well as drug efficacy.
Collapse
Affiliation(s)
- Linlin Zhao
- The Rutgers Center for Computational and Integrative Biology, Camden, New Jersey
| | - Daniel P Russo
- The Rutgers Center for Computational and Integrative Biology, Camden, New Jersey
| | - Wenyi Wang
- The Rutgers Center for Computational and Integrative Biology, Camden, New Jersey
| | - Lauren M Aleksunes
- Department of Pharmacology and Toxicology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, New Jersey
| | - Hao Zhu
- The Rutgers Center for Computational and Integrative Biology, Camden, New Jersey.,Department of Chemistry, Rutgers University, Camden, New Jersey
| |
Collapse
|
24
|
Ring C, Sipes NS, Hsieh JH, Carberry C, Koval LE, Klaren WD, Harris MA, Auerbach SS, Rager JE. Predictive modeling of biological responses in the rat liver using in vitro Tox21 bioactivity: Benefits from high-throughput toxicokinetics. COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2021; 18:100166. [PMID: 34013136 PMCID: PMC8130852 DOI: 10.1016/j.comtox.2021.100166] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Computational methods are needed to more efficiently leverage data from in vitro cell-based models to predict what occurs within whole body systems after chemical insults. This study set out to test the hypothesis that in vitro high-throughput screening (HTS) data can more effectively predict in vivo biological responses when chemical disposition and toxicokinetic (TK) modeling are employed. In vitro HTS data from the Tox21 consortium were analyzed in concert with chemical disposition modeling to derive nominal, aqueous, and intracellular estimates of concentrations eliciting 50% maximal activity. In vivo biological responses were captured using rat liver transcriptomic data from the DrugMatrix and TG-Gates databases and evaluated for pathway enrichment. In vivo dosing data were translated to equivalent body concentrations using HTTK modeling. Random forest models were then trained and tested to predict in vivo pathway-level activity across 221 chemicals using in vitro bioactivity data and physicochemical properties as predictor variables, incorporating methods to address imbalanced training data resulting from high instances of inactivity. Model performance was quantified using the area under the receiver operator characteristic curve (AUC-ROC) and compared across pathways for different combinations of predictor variables. All models that included toxicokinetics were found to outperform those that excluded toxicokinetics. Biological interpretation of the model features revealed that rather than a direct mapping of in vitro assays to in vivo pathways, unexpected combinations of multiple in vitro assays predicted in vivo pathway-level activities. To demonstrate the utility of these findings, the highest-performing model was leveraged to make new predictions of in vivo biological responses across all biological pathways for remaining chemicals tested in Tox21 with adequate data coverage (n = 6617). These results demonstrate that, when chemical disposition and toxicokinetics are carefully considered, in vitro HT screening data can be used to effectively predict in vivo biological responses to chemicals.
Collapse
Affiliation(s)
- Caroline Ring
- ToxStrategies, Inc., Austin, TX 78751, United States
| | - Nisha S. Sipes
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, United States
| | - Jui-Hua Hsieh
- Kelly Government Solutions, Durham, NC 27709, United States
| | - Celeste Carberry
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
| | - Lauren E. Koval
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
| | - William D. Klaren
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX 77840, United States
| | | | - Scott S. Auerbach
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, United States
| | - Julia E. Rager
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
- Curriculum in Toxicology and Environmental Medicine, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, United States
| |
Collapse
|
25
|
Seal S, Yang H, Vollmers L, Bender A. Comparison of Cellular Morphological Descriptors and Molecular Fingerprints for the Prediction of Cytotoxicity- and Proliferation-Related Assays. Chem Res Toxicol 2021; 34:422-437. [PMID: 33522793 DOI: 10.1021/acs.chemrestox.0c00303] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Cell morphology features, such as those from the Cell Painting assay, can be generated at relatively low costs and represent versatile biological descriptors of a system and thereby compound response. In this study, we explored cell morphology descriptors and molecular fingerprints, separately and in combination, for the prediction of cytotoxicity- and proliferation-related in vitro assay endpoints. We selected 135 compounds from the MoleculeNet ToxCast benchmark data set which were annotated with Cell Painting readouts, where the relatively small size of the data set is due to the overlap of required annotations. We trained Random Forest classification models using nested cross-validation and Cell Painting descriptors, Morgan and ErG fingerprints, and their combinations. While using leave-one-cluster-out cross-validation (with clusters based on physicochemical descriptors), models using Cell Painting descriptors achieved higher average performance over all assays (Balanced Accuracy of 0.65, Matthews Correlation Coefficient of 0.28, and AUC-ROC of 0.71) compared to models using ErG fingerprints (BA 0.55, MCC 0.09, and AUC-ROC 0.60) and Morgan fingerprints alone (BA 0.54, MCC 0.06, and AUC-ROC 0.56). While using random shuffle splits, the combination of Cell Painting descriptors with ErG and Morgan fingerprints further improved balanced accuracy on average by 8.9% (in 9 out of 12 assays) and 23.4% (in 8 out of 12 assays) compared to using only ErG and Morgan fingerprints, respectively. Regarding feature importance, Cell Painting descriptors related to nuclei texture, granularity of cells, and cytoplasm as well as cell neighbors and radial distributions were identified to be most contributing, which is plausible given the endpoint considered. We conclude that cell morphological descriptors contain complementary information to molecular fingerprints which can be used to improve the performance of predictive cytotoxicity models, in particular in areas of novel structural space.
Collapse
Affiliation(s)
- Srijit Seal
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Hongbin Yang
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Luis Vollmers
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
26
|
Liu A, Walter M, Wright P, Bartosik A, Dolciami D, Elbasir A, Yang H, Bender A. Prediction and mechanistic analysis of drug-induced liver injury (DILI) based on chemical structure. Biol Direct 2021; 16:6. [PMID: 33461600 PMCID: PMC7814730 DOI: 10.1186/s13062-020-00285-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 12/01/2020] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Drug-induced liver injury (DILI) is a major safety concern characterized by a complex and diverse pathogenesis. In order to identify DILI early in drug development, a better understanding of the injury and models with better predictivity are urgently needed. One approach in this regard are in silico models which aim at predicting the risk of DILI based on the compound structure. However, these models do not yet show sufficient predictive performance or interpretability to be useful for decision making by themselves, the former partially stemming from the underlying problem of labeling the in vivo DILI risk of compounds in a meaningful way for generating machine learning models. RESULTS As part of the Critical Assessment of Massive Data Analysis (CAMDA) "CMap Drug Safety Challenge" 2019 ( http://camda2019.bioinf.jku.at ), chemical structure-based models were generated using the binarized DILIrank annotations. Support Vector Machine (SVM) and Random Forest (RF) classifiers showed comparable performance to previously published models with a mean balanced accuracy over models generated using 5-fold LOCO-CV inside a 10-fold training scheme of 0.759 ± 0.027 when predicting an external test set. In the models which used predicted protein targets as compound descriptors, we identified the most information-rich proteins which agreed with the mechanisms of action and toxicity of nonsteroidal anti-inflammatory drugs (NSAIDs), one of the most important drug classes causing DILI, stress response via TP53 and biotransformation. In addition, we identified multiple proteins involved in xenobiotic metabolism which could be novel DILI-related off-targets, such as CLK1 and DYRK2. Moreover, we derived potential structural alerts for DILI with high precision, including furan and hydrazine derivatives; however, all derived alerts were present in approved drugs and were over specific indicating the need to consider quantitative variables such as dose. CONCLUSION Using chemical structure-based descriptors such as structural fingerprints and predicted protein targets, DILI prediction models were built with a predictive performance comparable to previous literature. In addition, we derived insights on proteins and pathways statistically (and potentially causally) linked to DILI from these models and inferred new structural alerts related to this adverse endpoint.
Collapse
Affiliation(s)
- Anika Liu
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK.
| | - Moritz Walter
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Peter Wright
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Aleksandra Bartosik
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Daniela Dolciami
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
- Department of Pharmaceutical Sciences, University of Perugia, Via del Liceo 1, 06123, Perugia, Italy
| | - Abdurrahman Elbasir
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
- ICT Department, College of Science and Engineering, Hamad Bin Khalifa University, Doha, Qatar
| | - Hongbin Yang
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Andreas Bender
- Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK.
| |
Collapse
|
27
|
Béquignon OJ, Pawar G, van de Water B, Cronin MT, van Westen GJ. Computational Approaches for Drug-Induced Liver Injury (DILI) Prediction: State of the Art and Challenges. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11535-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
|
28
|
Wang MWH, Goodman JM, Allen TEH. Machine Learning in Predictive Toxicology: Recent Applications and Future Directions for Classification Models. Chem Res Toxicol 2020; 34:217-239. [PMID: 33356168 DOI: 10.1021/acs.chemrestox.0c00316] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
In recent times, machine learning has become increasingly prominent in predictive toxicology as it has shifted from in vivo studies toward in silico studies. Currently, in vitro methods together with other computational methods such as quantitative structure-activity relationship modeling and absorption, distribution, metabolism, and excretion calculations are being used. An overview of machine learning and its applications in predictive toxicology is presented here, including support vector machines (SVMs), random forest (RF) and decision trees (DTs), neural networks, regression models, naïve Bayes, k-nearest neighbors, and ensemble learning. The recent successes of these machine learning methods in predictive toxicology are summarized, and a comparison of some models used in predictive toxicology is presented. In predictive toxicology, SVMs, RF, and DTs are the dominant machine learning methods due to the characteristics of the data available. Lastly, this review describes the current challenges facing the use of machine learning in predictive toxicology and offers insights into the possible areas of improvement in the field.
Collapse
Affiliation(s)
- Marcus W H Wang
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Jonathan M Goodman
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Timothy E H Allen
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom.,MRC Toxicology Unit, University of Cambridge, Hodgkin Building, Lancaster Road, Leicester LE1 7HB, United Kingdom
| |
Collapse
|
29
|
Dai W, Tang T, Dai Z, Shi D, Mo L, Zhang Y. Probing the Mechanism of Hepatotoxicity of Hexabromocyclododecanes through Toxicological Network Analysis. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2020; 54:15235-15245. [PMID: 33190479 DOI: 10.1021/acs.est.0c03998] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The prediction and mechanism analysis of hepatotoxicity of contaminants, because of their various phenotypes and complex mechanisms, is still a key problem in environmental research. We applied a toxicological network analysis method to predict the hepatotoxicity of three hexabromocyclododecane (HBCD) diastereoisomers (α-HBCD, β-HBCD, and γ-HBCD) and explore their potential mechanisms. First, we collected the hepatotoxicity related genes and found that those genes were significantly localized in the human interactome. Therefore, these genes form a disease module of hepatotoxicity. We also collected targets of α-, β-, and γ-HBCD and found that their targets overlap with the hepatotoxicity disease module. Then, we trained a model to predict hepatotoxicity of three HBCD diastereoisomers based on the relationship between the hepatotoxicity disease module and targets of compounds. We found that 593 genes were significantly located in the hepatotoxicity disease module (Z = 11.9, p < 0.001) involved in oxidative stress, cellular immunity, and proliferation, and the accuracy of hepatotoxicity prediction of HBCD was 0.7095 ± 0.0193 and the recall score was 0.8355 ± 0.0352. HBCD mainly affects the core disease module genes to mediate the adenosine monophosphate-activated kinase, p38MAPK, PI3K/Akt, and TNFα pathways to regulate the immune reaction and inflammation. HBCD also induces the secretion of IL6 and STAT3 to lead hepatotoxicity by regulating NR3C1. This approach is transferable to other toxicity research studies of environmental pollutants.
Collapse
Affiliation(s)
- Weina Dai
- Chongqing Research Center for Pharmaceutical Engineering, College of Pharmacy, Chongqing Medical University, Chongqing 400016, China
| | - Tiantian Tang
- Chongqing Research Center for Pharmaceutical Engineering, College of Pharmacy, Chongqing Medical University, Chongqing 400016, China
| | - Zhenghua Dai
- Chongqing Research Center for Pharmaceutical Engineering, College of Pharmacy, Chongqing Medical University, Chongqing 400016, China
- Chongqing Academy of Metrology and Quality Inspection, Chongqing 401123, China
| | - Da Shi
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Lingyun Mo
- The Guangxi Key Laboratory of Theory and Technology for Environmental Pollution Control, College of Environmental Science and Engineering, Guilin University of Technology, Guilin 541004, China
- Technical Innovation Center for Mine Geological Environment Restoration Engineering in Shishan Area of South China, Ministry of Natural Resources, Nanning 530028, China
| | - Yonghong Zhang
- Chongqing Research Center for Pharmaceutical Engineering, College of Pharmacy, Chongqing Medical University, Chongqing 400016, China
| |
Collapse
|
30
|
Chavan S, Scherbak N, Engwall M, Repsilber D. Predicting Chemical-Induced Liver Toxicity Using High-Content Imaging Phenotypes and Chemical Descriptors: A Random Forest Approach. Chem Res Toxicol 2020; 33:2261-2275. [DOI: 10.1021/acs.chemrestox.9b00459] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- Swapnil Chavan
- School of Science and Technology, Örebro University, 70112 Örebro, Sweden
| | - Nikolai Scherbak
- School of Science and Technology, Örebro University, 70112 Örebro, Sweden
| | - Magnus Engwall
- School of Science and Technology, Örebro University, 70112 Örebro, Sweden
| | - Dirk Repsilber
- School of Medical Sciences, Örebro University, 70185 Örebro, Sweden
| |
Collapse
|
31
|
Patlewicz G. Navigating the Minefield of Computational Toxicology and Informatics: Looking Back and Charting a New Horizon. FRONTIERS IN TOXICOLOGY 2020; 2:2. [PMID: 35296116 PMCID: PMC8915910 DOI: 10.3389/ftox.2020.00002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 05/20/2020] [Indexed: 01/07/2023] Open
|
32
|
Sakkiah S, Leggett C, Pan B, Guo W, Valerio LG, Hong H. Development of a Nicotinic Acetylcholine Receptor nAChR α7 Binding Activity Prediction Model. J Chem Inf Model 2020; 60:2396-2404. [PMID: 32159345 DOI: 10.1021/acs.jcim.0c00139] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Despite the well-known adverse health effects associated with tobacco use, addiction to nicotine found in tobacco products causes difficulty in quitting among users. Nicotinic acetylcholine receptors (nAChRs) are the physiological targets of nicotine and facilitate addiction to tobacco products. The nAChR-α7 subtype plays an important role in addiction; therefore, predicting the binding activity of tobacco constituents to nAChR-α7 is an important component for assessing addictive potential of tobacco constituents. We developed an α7 binding activity prediction model based on a large training data set of 843 chemicals with human α7 binding activity data extracted from PubChem and ChEMBL. The model was tested using 1215 chemicals with rat α7 binding activity data from the same databases. Based on the competitive docking results, the docking scores were partitioned to the key residues that play important roles in the receptor-ligand binding. A decision forest was used to train the human α7 binding activity prediction model based on the partition of docking scores. Five-fold cross validations were conducted to estimate the performance of the decision forest models. The developed model was used to predict the potential human α7 binding activity for 5275 tobacco constituents. The human α7 binding activity data for 84 of the 5275 tobacco constituents were experimentally measured to confirm and empirically validate the prediction results. The prediction accuracy, sensitivity, and specificity were 64.3, 40.0, and 81.6%, respectively. The developed prediction model of human α7 may be a useful tool for high-throughput screening of potential addictive tobacco constituents.
Collapse
Affiliation(s)
- Sugunadevi Sakkiah
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| | - Carmine Leggett
- Division of Nonclinical Science, Office of Science, Center for Tobacco Products, U.S. Food and Drug Administration, 11785 Beltsville Drive, Calverton, Maryland 20705, United States
| | - Bohu Pan
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| | - Wenjing Guo
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| | - Luis G Valerio
- Division of Nonclinical Science, Office of Science, Center for Tobacco Products, U.S. Food and Drug Administration, 11785 Beltsville Drive, Calverton, Maryland 20705, United States
| | - Huixiao Hong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| |
Collapse
|
33
|
Morger A, Mathea M, Achenbach JH, Wolf A, Buesen R, Schleifer KJ, Landsiedel R, Volkamer A. KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development. J Cheminform 2020; 12:24. [PMID: 33431007 PMCID: PMC7157991 DOI: 10.1186/s13321-020-00422-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 03/09/2020] [Indexed: 02/07/2023] Open
Abstract
Risk assessment of newly synthesised chemicals is a prerequisite for regulatory approval. In this context, in silico methods have great potential to reduce time, cost, and ultimately animal testing as they make use of the ever-growing amount of available toxicity data. Here, KnowTox is presented, a novel pipeline that combines three different in silico toxicology approaches to allow for confident prediction of potentially toxic effects of query compounds, i.e. machine learning models for 88 endpoints, alerts for 919 toxic substructures, and computational support for read-across. It is mainly based on the ToxCast dataset, containing after preprocessing a sparse matrix of 7912 compounds tested against 985 endpoints. When applying machine learning models, applicability and reliability of predictions for new chemicals are of utmost importance. Therefore, first, the conformal prediction technique was deployed, comprising an additional calibration step and per definition creating internally valid predictors at a given significance level. Second, to further improve validity and information efficiency, two adaptations are suggested, exemplified at the androgen receptor antagonism endpoint. An absolute increase in validity of 23% on the in-house dataset of 534 compounds could be achieved by introducing KNNRegressor normalisation. This increase in validity comes at the cost of efficiency, which could again be improved by 20% for the initial ToxCast model by balancing the dataset during model training. Finally, the value of the developed pipeline for risk assessment is discussed using two in-house triazole molecules. Compared to a single toxicity prediction method, complementing the outputs of different approaches can have a higher impact on guiding toxicity testing and de-selecting most likely harmful development-candidate compounds early in the development process.
Collapse
Affiliation(s)
- Andrea Morger
- In Silico Toxicology and Structural Bioinformatics, Institute of Physiology, Charité Universitätsmedizin Berlin, Charitéplatz 1, Berlin, Germany
| | | | | | | | | | | | | | - Andrea Volkamer
- In Silico Toxicology and Structural Bioinformatics, Institute of Physiology, Charité Universitätsmedizin Berlin, Charitéplatz 1, Berlin, Germany.
| |
Collapse
|
34
|
Jeong J, Choi J. Development of AOP relevant to microplastics based on toxicity mechanisms of chemical additives using ToxCast™ and deep learning models combined approach. ENVIRONMENT INTERNATIONAL 2020; 137:105557. [PMID: 32078872 DOI: 10.1016/j.envint.2020.105557] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Revised: 01/21/2020] [Accepted: 02/05/2020] [Indexed: 06/10/2023]
Abstract
Various additives are used in plastic products to improve the properties and the durability of the plastics. Their possible elution from the plastics when plastics are fragmented into micro- and nano-size in the environment is suspected to one of the major contributors to environmental and human toxicity of microplastics. In this context, to better understand the hazardous effect of microplastics, the toxicity of chemical additives was investigated. Fifty most common chemicals presented in plastics were selected as target additives. Their toxicity was systematically identified using apical and molecular toxicity databases, such as ChemIDplus and ToxCast™. Among the vast ToxCast assays, those having intended gene targets were selected for identification of the mechanism of toxicity of plastic additives. Deep learning artificial neural network models were further developed based on the ToxCast assays for the chemicals not tested in the ToxCast program. Using both the ToxCast database and deep learning models, active chemicals on each ToxCast assays were identified. Through correlation analysis between molecular targets from ToxCast and mammalian toxicity results from ChemIDplus, we identified the fifteen most relevant mechanisms of toxicity for the understanding mechanism of toxicity of plastic additives. They are neurotoxicity, inflammation, lipid metabolism, and cancer pathways. Based on these, along with, previously conducted systemic review on the mechanism of toxicity of microplastics, here we have proposed potential adverse outcome pathways (AOPs) relevant to microplastics pollution. This study also suggests in vivo and in vitro toxicity database and deep learning model combined approach is appropriate to provide insight into the toxicity mechanism of the broad range of environmental chemicals, such as plastic additives.
Collapse
Affiliation(s)
- Jaeseong Jeong
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea
| | - Jinhee Choi
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea.
| |
Collapse
|
35
|
Xu T, Ngan DK, Ye L, Xia M, Xie HQ, Zhao B, Simeonov A, Huang R. Predictive Models for Human Organ Toxicity Based on In Vitro Bioactivity Data and Chemical Structure. Chem Res Toxicol 2020; 33:731-741. [PMID: 32077278 PMCID: PMC10926239 DOI: 10.1021/acs.chemrestox.9b00305] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Traditional toxicity testing reliant on animal models is costly and low throughput, posing a significant challenge with the increasing numbers of chemicals that humans are exposed to in the environment. The purpose of this investigation was to build optimal prediction models for various human in vivo/organ-level toxicity end points (extracted from ChemIDPlus) using chemical structure and Tox21 in vitro quantitative high-throughput screening (qHTS) bioactivity assay data. Several supervised machine learning algorithms were applied to model 14 human toxicity end points pertaining to vascular, kidney, ureter and bladder, and liver organ systems. Three metrics were used to evaluate model performance: area under the receiver operating characteristic curve (AUC-ROC), balanced accuracy (BA), and Matthews correlation coefficient (MCC). The top four models, with AUC-ROC values >0.8, were derived for endocrine (0.90 ± 0.00), musculoskeletal (0.88 ± 0.02), peripheral nerve and sensation (0.85 ± 0.01), and brain and coverings (0.83 ± 0.02) toxicities, whereas the best model AUC-ROC values were >0.7 for the remaining 10 toxicities. Model performance was found to be dependent on the specific data set, model type, and feature selection method used. In addition, chemical structure and assay data showed different levels of contribution to the prediction of different toxicity end points. Although in vitro assay data, when combined with chemical structure, slightly improved the predictive accuracy for most end points (11 out of 14), a noteworthy finding was the near equal success of the structure-only models, which do not require Tox21 qHTS screening data, and the relatively poor performance of assay-only models. Thus, the top-performing structure-only models from this study could be applied for hazard screening of large sets of chemicals for potential human toxicity, whereas the largest assay contributions to models (i.e., cellular targets) could be used, along with the top-contributing structural features, to provide insight into toxicity mechanisms.
Collapse
Affiliation(s)
- Tuan Xu
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Deborah K. Ngan
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Lin Ye
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Menghang Xia
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Heidi Q. Xie
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center of Eco-Environment Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Bin Zhao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center of Eco-Environment Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Anton Simeonov
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Ruili Huang
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| |
Collapse
|
36
|
van der Ven LTM, Rorije E, Sprong RC, Zink D, Derr R, Hendriks G, Loo LH, Luijten M. A Case Study with Triazole Fungicides to Explore Practical Application of Next-Generation Hazard Assessment Methods for Human Health. Chem Res Toxicol 2020; 33:834-848. [PMID: 32041405 DOI: 10.1021/acs.chemrestox.9b00484] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The ongoing developments in chemical risk assessment have led to new concepts building on integration of sophisticated nonanimal models for hazard characterization. Here we explore a pragmatic approach for implementing such concepts, using a case study of three triazole fungicides, namely, flusilazole, propiconazole, and cyproconazole. The strategy applied starts with evaluating the overall level of concern by comparing exposure estimates to toxicological potential, followed by a combination of in silico tools and literature-derived high-throughput screening assays and computational elaborations to obtain insight into potential toxicological mechanisms and targets in the organism. Additionally, some targeted in vitro tests were evaluated for their utility to confirm suspected mechanisms of toxicity and to generate points of departure. Toxicological mechanisms instead of the current "end point-by-end point" approach should guide the selection of methods and assays that constitute a toolbox for next-generation risk assessment. Comparison of the obtained in silico and in vitro results with data from traditional in vivo testing revealed that, overall, nonanimal methods for hazard identification can produce adequate qualitative hazard information for risk assessment. Follow-up studies are needed to further refine the proposed approach, including the composition of the toolbox, toxicokinetics models, and models for exposure assessment.
Collapse
|
37
|
Wittwehr C, Blomstedt P, Gosling JP, Peltola T, Raffael B, Richarz AN, Sienkiewicz M, Whaley P, Worth A, Whelan M. Artificial Intelligence for chemical risk assessment. ACTA ACUST UNITED AC 2020; 13:100114. [PMID: 32140631 PMCID: PMC7043333 DOI: 10.1016/j.comtox.2019.100114] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Revised: 09/10/2019] [Accepted: 11/25/2019] [Indexed: 02/03/2023]
Abstract
As the basis for managing the risks of chemical exposure, the Chemical Risk Assessment (CRA) process can impact a substantial part of the economy, the health of hundreds of millions of people, and the condition of the environment. However, the number of properly assessed chemicals falls short of societal needs due to a lack of experts for evaluation, interference of third party interests, and the sheer volume of potentially relevant information on the chemicals from disparate sources. In order to explore ways in which computational methods may help overcome this discrepancy between the number of chemical risk assessments required on the one hand and the number and adequateness of assessments actually being conducted on the other, the European Commission's Joint Research Centre organised a workshop on Artificial Intelligence for Chemical Risk Assessment (AI4CRA). The workshop identified a number of areas where Artificial Intelligence could potentially increase the number and quality of regulatory risk management decisions based on CRA, involving process simulation, supporting evaluation, identifying problems, facilitating collaboration, finding experts, evidence gathering, systematic review, knowledge discovery, and building cognitive models. Although these are interconnected, they are organised and discussed under two main themes: scientific-technical process and social aspects and the decision making process.
Collapse
Affiliation(s)
| | | | | | | | - Barbara Raffael
- European Commission, Joint Research Centre (JRC), Ispra, Italy
| | | | | | - Paul Whaley
- Lancaster Environment Centre, University Lancaster, UK.,The Evidence-based Toxicology Collaboration at Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Andrew Worth
- European Commission, Joint Research Centre (JRC), Ispra, Italy
| | - Maurice Whelan
- European Commission, Joint Research Centre (JRC), Ispra, Italy
| |
Collapse
|
38
|
Vo AH, Van Vleet TR, Gupta RR, Liguori MJ, Rao MS. An Overview of Machine Learning and Big Data for Drug Toxicity Evaluation. Chem Res Toxicol 2019; 33:20-37. [DOI: 10.1021/acs.chemrestox.9b00227] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Andy H. Vo
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Terry R. Van Vleet
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Rishi R. Gupta
- Information Research, Research and Development, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Michael J. Liguori
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Mohan S. Rao
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| |
Collapse
|
39
|
Mitkov J, Kondeva-Burdina M, Zlatkov A. Synthesis and preliminary hepatotoxicity evaluation of new caffeine-8-(2-thio)-propanoic hydrazid-hydrazone derivatives. PHARMACIA 2019. [DOI: 10.3897/pharmacia.66.e37263] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
New series of caffeine-8-(2-thio)-propanoic hydrazid-hydrazone derivatives were designed and synthesized. The targed compounds were obtained in yields of 51 to 96% and their structures were elucidated by FTIR,1H NMR,13C NMR, MS and microanalyses. All of the compounds were found to be “drug-like” as they fulfill the criteria of drug-likeness, which includes the MDDR-like rule. The tested compounds were subjected toin silicoprediction of substrate/metabolite specificity and Drug Induced Liver Injury (DILI). The prediction for indicated that the evaluated compounds would most probably act as CYP1A2 substrates. The performedin vitrostudies didn’t reveal statistically significant hepatotoxicity of the tested compounds, probably due to the pro-oxidant effects expressed on sub-cellular (isolated rat liver microsomes) level. The obtained experimental results confirmed the predicted low hepatotoxicity for the tested structures. Based on these results the compounds may be considered as promising structures for design of future molecules with low hepatotoxicity.
Collapse
|
40
|
Watford S, Edwards S, Angrish M, Judson RS, Paul Friedman K. Progress in data interoperability to support computational toxicology and chemical safety evaluation. Toxicol Appl Pharmacol 2019; 380:114707. [PMID: 31404555 PMCID: PMC7705611 DOI: 10.1016/j.taap.2019.114707] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 07/29/2019] [Accepted: 08/06/2019] [Indexed: 12/20/2022]
Abstract
New approach methodologies (NAMs) in chemical safety evaluation are being explored to address the current public health implications of human environmental exposures to chemicals with limited or no data for assessment. For over a decade since a push toward "Toxicity Testing in the 21st Century," the field has focused on massive data generation efforts to inform computational approaches for preliminary hazard identification, adverse outcome pathways that link molecular initiating events and key events to apical outcomes, and high-throughput approaches to risk-based ratios of bioactivity and exposure to inform relative priority and safety assessment. Projects like the interagency Tox21 program and the US EPA ToxCast program have generated dose-response information on thousands of chemicals, identified and aggregated information from legacy systems, and created tools for access and analysis. The resulting information has been used to develop computational models as viable options for regulatory applications. This progress has introduced challenges in data management that are new, but not unique, to toxicology. Some of the key questions require critical thinking and solutions to promote semantic interoperability, including: (1) identification of bioactivity information from NAMs that might be related to a biological process; (2) identification of legacy hazard information that might be related to a key event or apical outcomes of interest; and, (3) integration of these NAM and traditional data for computational modeling and prediction of complex apical outcomes such as carcinogenesis. This work reviews a number of toxicology-related efforts specifically related to bioactivity and toxicological data interoperability based on the goals established by Findable, Accessible, Interoperable, and Reusable (FAIR) Data Principles. These efforts are essential to enable better integration of NAM and traditional toxicology information to support data-driven toxicology applications.
Collapse
Affiliation(s)
- Sean Watford
- Booz Allen Hamilton, Rockville, MD 20852, USA; National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - Stephen Edwards
- Research Triangle Institute International, Research Triangle Park, NC 27709, USA
| | - Michelle Angrish
- National Center for Environmental Assessment, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - Richard S Judson
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA
| | - Katie Paul Friedman
- National Center for Computational Toxicology, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, USA.
| |
Collapse
|
41
|
Watford S, Ly Pham L, Wignall J, Shin R, Martin MT, Friedman KP. ToxRefDB version 2.0: Improved utility for predictive and retrospective toxicology analyses. Reprod Toxicol 2019; 89:145-158. [PMID: 31340180 PMCID: PMC6944327 DOI: 10.1016/j.reprotox.2019.07.012] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Revised: 05/31/2019] [Accepted: 07/12/2019] [Indexed: 02/08/2023]
Abstract
The Toxicity Reference Database (ToxRefDB) structures information from over 5000 in vivo toxicity studies, conducted largely to guidelines or specifications from the US Environmental Protection Agency and the National Toxicology Program, into a public resource for training and validation of predictive models. Herein, ToxRefDB version 2.0 (ToxRefDBv2) development is described. Endpoints were annotated (e.g. required, not required) according to guidelines for subacute, subchronic, chronic, developmental, and multigenerational reproductive designs, distinguishing negative responses from untested. Quantitative data were extracted, and dose-response modeling for nearly 28,000 datasets from nearly 400 endpoints using Benchmark Dose (BMD) Modeling Software were generated and stored. Implementation of controlled vocabulary improved data quality; standardization to guideline requirements and cross-referencing with United Medical Language System (UMLS) connects ToxRefDBv2 observations to vocabularies linked to UMLS, including PubMed medical subject headings. ToxRefDBv2 allows for increased connections to other resources and has greatly enhanced quantitative and qualitative utility for predictive toxicology.
Collapse
Affiliation(s)
- Sean Watford
- ORAU, Contractor to U.S. Environmental Protection Agency through the National Student Services Contract, United States; National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency, United States
| | - Ly Ly Pham
- ORAU, Contractor to U.S. Environmental Protection Agency through the National Student Services Contract, United States; ORISE Postdoctoral Research Participant, United States
| | | | | | - Matthew T Martin
- ORAU, Contractor to U.S. Environmental Protection Agency through the National Student Services Contract, United States; Currently at Drug Safety Research and Development, Global Investigative Toxicology, Pfizer, Groton, CT, United States
| | - Katie Paul Friedman
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency, United States.
| |
Collapse
|
42
|
Mahmoud SY, Svensson F, Zoufir A, Módos D, Afzal AM, Bender A. Understanding Conditional Associations between ToxCast in Vitro Readouts and the Hepatotoxicity of Compounds Using Rule-Based Methods. Chem Res Toxicol 2019; 33:137-153. [DOI: 10.1021/acs.chemrestox.8b00382] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Samar Y. Mahmoud
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, United Kingdom
| | - Fredrik Svensson
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, United Kingdom
| | - Azedine Zoufir
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, United Kingdom
| | - Dezső Módos
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, United Kingdom
| | - Avid M. Afzal
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, United Kingdom
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, United Kingdom
| |
Collapse
|
43
|
Banerjee P, Eckert AO, Schrey AK, Preissner R. ProTox-II: a webserver for the prediction of toxicity of chemicals. Nucleic Acids Res 2019; 46:W257-W263. [PMID: 29718510 PMCID: PMC6031011 DOI: 10.1093/nar/gky318] [Citation(s) in RCA: 1031] [Impact Index Per Article: 206.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 04/26/2018] [Indexed: 01/06/2023] Open
Abstract
Advancement in the field of computational research has made it possible for the in silico methods to offer significant benefits to both regulatory needs and requirements for risk assessments, and pharmaceutical industry to assess the safety profile of a chemical. Here, we present ProTox-II that incorporates molecular similarity, pharmacophores, fragment propensities and machine-learning models for the prediction of various toxicity endpoints; such as acute toxicity, hepatotoxicity, cytotoxicity, carcinogenicity, mutagenicity, immunotoxicity, adverse outcomes pathways (Tox21) and toxicity targets. The predictive models are built on data from both in vitro assays (e.g. Tox21 assays, Ames bacterial mutation assays, hepG2 cytotoxicity assays, Immunotoxicity assays) and in vivo cases (e.g. carcinogenicity, hepatotoxicity). The models have been validated on independent external sets and have shown strong performance. ProTox-II provides a freely available webserver for in silico toxicity prediction for toxicologists, regulatory agencies, computational and medicinal chemists, and all users without login at http://tox.charite.de/protox_II. The webserver takes a two-dimensional chemical structure as an input and reports the possible toxicity profile of the chemical for 33 models with confidence scores, and an overall toxicity radar chart along with three most similar compounds with known acute toxicity.
Collapse
Affiliation(s)
- Priyanka Banerjee
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany
| | - Andreas O Eckert
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany
| | - Anna K Schrey
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany
| | - Robert Preissner
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany.,BB3R - Berlin Brandenburg 3R Graduate School, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
44
|
Thomas RS, Bahadori T, Buckley TJ, Cowden J, Deisenroth C, Dionisio KL, Frithsen JB, Grulke CM, Gwinn MR, Harrill JA, Higuchi M, Houck KA, Hughes MF, Hunter ES, Isaacs KK, Judson RS, Knudsen TB, Lambert JC, Linnenbrink M, Martin TM, Newton SR, Padilla S, Patlewicz G, Paul-Friedman K, Phillips KA, Richard AM, Sams R, Shafer TJ, Setzer RW, Shah I, Simmons JE, Simmons SO, Singh A, Sobus JR, Strynar M, Swank A, Tornero-Valez R, Ulrich EM, Villeneuve DL, Wambaugh JF, Wetmore BA, Williams AJ. The Next Generation Blueprint of Computational Toxicology at the U.S. Environmental Protection Agency. Toxicol Sci 2019; 169:317-332. [PMID: 30835285 PMCID: PMC6542711 DOI: 10.1093/toxsci/kfz058] [Citation(s) in RCA: 195] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The U.S. Environmental Protection Agency (EPA) is faced with the challenge of efficiently and credibly evaluating chemical safety often with limited or no available toxicity data. The expanding number of chemicals found in commerce and the environment, coupled with time and resource requirements for traditional toxicity testing and exposure characterization, continue to underscore the need for new approaches. In 2005, EPA charted a new course to address this challenge by embracing computational toxicology (CompTox) and investing in the technologies and capabilities to push the field forward. The return on this investment has been demonstrated through results and applications across a range of human and environmental health problems, as well as initial application to regulatory decision-making within programs such as the EPA's Endocrine Disruptor Screening Program. The CompTox initiative at EPA is more than a decade old. This manuscript presents a blueprint to guide the strategic and operational direction over the next 5 years. The primary goal is to obtain broader acceptance of the CompTox approaches for application to higher tier regulatory decisions, such as chemical assessments. To achieve this goal, the blueprint expands and refines the use of high-throughput and computational modeling approaches to transform the components in chemical risk assessment, while systematically addressing key challenges that have hindered progress. In addition, the blueprint outlines additional investments in cross-cutting efforts to characterize uncertainty and variability, develop software and information technology tools, provide outreach and training, and establish scientific confidence for application to different public health and environmental regulatory decisions.
Collapse
Affiliation(s)
- Russell S. Thomas
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Tina Bahadori
- National Center for Environmental Assessment, Office of Research and Development, US Environmental Protection Agency
| | - Timothy J. Buckley
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - John Cowden
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Chad Deisenroth
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Kathie L. Dionisio
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Jeffrey B. Frithsen
- Chemical Safety for Sustainability National Research Program, Office of Research and Development, US Environmental Protection Agency
| | - Christopher M. Grulke
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Maureen R. Gwinn
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Joshua A. Harrill
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Mark Higuchi
- National Health and Environmental Effects Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Keith A. Houck
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Michael F. Hughes
- National Health and Environmental Effects Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - E. Sidney Hunter
- National Health and Environmental Effects Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Kristin K. Isaacs
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Richard S. Judson
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Thomas B. Knudsen
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Jason C. Lambert
- National Center for Environmental Assessment, Office of Research and Development, US Environmental Protection Agency
| | - Monica Linnenbrink
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Todd M. Martin
- National Risk Management Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Seth R. Newton
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Stephanie Padilla
- National Health and Environmental Effects Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Grace Patlewicz
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Katie Paul-Friedman
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Katherine A. Phillips
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Ann M. Richard
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Reeder Sams
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Timothy J. Shafer
- National Health and Environmental Effects Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - R. Woodrow Setzer
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Imran Shah
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Jane E. Simmons
- National Health and Environmental Effects Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Steven O. Simmons
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Amar Singh
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Jon R. Sobus
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Mark Strynar
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Adam Swank
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Rogelio Tornero-Valez
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Elin M. Ulrich
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Daniel L Villeneuve
- National Health and Environmental Effects Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - John F. Wambaugh
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| | - Barbara A. Wetmore
- National Exposure Research Laboratory, Office of Research and Development, US Environmental Protection Agency
| | - Antony J. Williams
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency
| |
Collapse
|
45
|
Honda GS, Pearce RG, Pham LL, Setzer RW, Wetmore BA, Sipes NS, Gilbert J, Franz B, Thomas RS, Wambaugh JF. Using the concordance of in vitro and in vivo data to evaluate extrapolation assumptions. PLoS One 2019; 14:e0217564. [PMID: 31136631 PMCID: PMC6538186 DOI: 10.1371/journal.pone.0217564] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2019] [Accepted: 05/14/2019] [Indexed: 12/16/2022] Open
Abstract
Linking in vitro bioactivity and in vivo toxicity on a dose basis enables the use of high-throughput in vitro assays as an alternative to traditional animal studies. In this study, we evaluated assumptions in the use of a high-throughput, physiologically based toxicokinetic (PBTK) model to relate in vitro bioactivity and rat in vivo toxicity data. The fraction unbound in plasma (fup) and intrinsic hepatic clearance (Clint) were measured for rats (for 67 and 77 chemicals, respectively), combined with fup and Clint literature data for 97 chemicals, and incorporated in the PBTK model. Of these chemicals, 84 had corresponding in vitro ToxCast bioactivity data and in vivo toxicity data. For each possible comparison of in vitro and in vivo endpoint, the concordance between the in vivo and in vitro data was evaluated by a regression analysis. For a base set of assumptions, the PBTK results were more frequently better associated than either the results from a “random” model parameterization or direct comparison of the “untransformed” values of AC50 and dose (performed best in 51%, 28%, and 21% of cases, respectively). We also investigated several assumptions in the application of PBTK for IVIVE, including clearance and internal dose selection. One of the better assumptions sets–restrictive clearance and comparing free in vivo venous plasma concentration with free in vitro concentration–outperformed the random and untransformed results in 71% of the in vitro-in vivo endpoint comparisons. These results demonstrate that applying PBTK improves our ability to observe the association between in vitro bioactivity and in vivo toxicity data in general. This suggests that potency values from in vitro screening should be transformed using in vitro-in vivo extrapolation (IVIVE) to build potentially better machine learning and other statistical models for predicting in vivo toxicity in humans.
Collapse
Affiliation(s)
- Gregory S. Honda
- National Center for Computational Toxicology, U.S. EPA, Research Triangle Park, North Carolina, United States of America
- Oak Ridge Institute for Science and Education, Oak Ridge, Tennessee, United States of America
| | - Robert G. Pearce
- National Center for Computational Toxicology, U.S. EPA, Research Triangle Park, North Carolina, United States of America
- Oak Ridge Institute for Science and Education, Oak Ridge, Tennessee, United States of America
| | - Ly L. Pham
- National Center for Computational Toxicology, U.S. EPA, Research Triangle Park, North Carolina, United States of America
- Oak Ridge Institute for Science and Education, Oak Ridge, Tennessee, United States of America
| | - R. W. Setzer
- National Center for Computational Toxicology, U.S. EPA, Research Triangle Park, North Carolina, United States of America
| | - Barbara A. Wetmore
- National Exposure Research Laboratory, U.S. EPA, Research Triangle Park, North Carolina, United States of America
| | - Nisha S. Sipes
- Division of the National Toxicology Program, NIEHS, Research Triangle Park, North Carolina, United States of America
| | - Jon Gilbert
- Cyprotex, Watertown, MA, United States of America
| | - Briana Franz
- Cyprotex, Watertown, MA, United States of America
| | - Russell S. Thomas
- National Center for Computational Toxicology, U.S. EPA, Research Triangle Park, North Carolina, United States of America
| | - John F. Wambaugh
- National Center for Computational Toxicology, U.S. EPA, Research Triangle Park, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
46
|
|
47
|
Improving the Utility of the Tox21 Dataset by Deep Metadata Annotations and Constructing Reusable Benchmarked Chemical Reference Signatures. Molecules 2019; 24:molecules24081604. [PMID: 31018579 PMCID: PMC6515292 DOI: 10.3390/molecules24081604] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 04/16/2019] [Accepted: 04/19/2019] [Indexed: 02/03/2023] Open
Abstract
The Toxicology in the 21st Century (Tox21) project seeks to develop and test methods for high-throughput examination of the effect certain chemical compounds have on biological systems. Although primary and toxicity assay data were readily available for multiple reporter gene modified cell lines, extensive annotation and curation was required to improve these datasets with respect to how FAIR (Findable, Accessible, Interoperable, and Reusable) they are. In this study, we fully annotated the Tox21 published data with relevant and accepted controlled vocabularies. After removing unreliable data points, we aggregated the results and created three sets of signatures reflecting activity in the reporter gene assays, cytotoxicity, and selective reporter gene activity, respectively. We benchmarked these signatures using the chemical structures of the tested compounds and obtained generally high receiver operating characteristic (ROC) scores, suggesting good quality and utility of these signatures and the underlying data. We analyzed the results to identify promiscuous individual compounds and chemotypes for the three signature categories and interpreted the results to illustrate the utility and re-usability of the datasets. With this study, we aimed to demonstrate the importance of data standards in reporting screening results and high-quality annotations to enable re-use and interpretation of these data. To improve the data with respect to all FAIR criteria, all assay annotations, cleaned and aggregate datasets, and signatures were made available as standardized dataset packages (Aggregated Tox21 bioactivity data, 2019).
Collapse
|
48
|
Sun Y, Shi S, Li Y, Wang Q. Development of quantitative structure-activity relationship models to predict potential nephrotoxic ingredients in traditional Chinese medicines. Food Chem Toxicol 2019; 128:163-170. [PMID: 30954639 DOI: 10.1016/j.fct.2019.03.056] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2019] [Revised: 03/26/2019] [Accepted: 03/31/2019] [Indexed: 12/13/2022]
Abstract
The broad use of traditional Chinese medicines (TCMs) and the accompanied incidences of kidney injury have attracted considerable interest in investigating the responsible toxic ingredients. It is challenging to evaluate toxicity of TCMs since they contain complex mixtures of phytochemicals. Quantitative structure-activity relationship (QSAR) is an efficient tool to predict toxicity and QSAR study on TCMs-induced nephrotoxicity remains lacked. We developed QSAR models using three datasets of 609 compounds: natural products, drugs, and mixed (contained both kinds of data) datasets. Each dataset was used for modelling by utilizing artificial neural networks (ANN) and support vector machines (SVM) algorithms separately. Both internal and external validations were performed on each model. Six QSAR models were developed and yielded reliable performance in the internal validation. For external validation, 30 ingredients in the TCMs were predicted well by the natural product models (accuracy: ANN 96.7%, SVM 93.3%). The mixed models (accuracy: ANN 76.7%, SVM 66.7%) showed a better performance than the drug models (accuracy: ANN 50%, SVM 53.3%). Particularly, natural product models produced the most reliable results. It has the application not only on screening the nephrotoxic ingredients in TCMs, but it is also helpful at prioritizing the subsequent toxicity testing of natural products.
Collapse
Affiliation(s)
- Yuqing Sun
- Department of Toxicology, School of Public Health, Peking University, Beijing, 100191, China; Key Laboratory of State Administration of Traditional Chinese Medicine for Compatibility Toxicology, Beijing, 100191, China
| | - Shaoze Shi
- Department of Toxicology, School of Public Health, Peking University, Beijing, 100191, China; Key Laboratory of State Administration of Traditional Chinese Medicine for Compatibility Toxicology, Beijing, 100191, China
| | - Yaqiu Li
- Department of Toxicology, School of Public Health, Peking University, Beijing, 100191, China; Key Laboratory of State Administration of Traditional Chinese Medicine for Compatibility Toxicology, Beijing, 100191, China
| | - Qi Wang
- Department of Toxicology, School of Public Health, Peking University, Beijing, 100191, China; Key Laboratory of State Administration of Traditional Chinese Medicine for Compatibility Toxicology, Beijing, 100191, China; Beijing Key Laboratory of Toxicological Research and Risk Assessment for Food Safety, Beijing, 100191, China.
| |
Collapse
|
49
|
Grenet I, Merlo K, Comet JP, Tertiaux R, Rouquié D, Dayan F. Stacked Generalization with Applicability Domain Outperforms Simple QSAR on in Vitro Toxicological Data. J Chem Inf Model 2019; 59:1486-1496. [PMID: 30735402 DOI: 10.1021/acs.jcim.8b00553] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
The development of in silico tools able to predict bioactivity and toxicity of chemical substances is a powerful solution envisioned to assess toxicity as early as possible. To enable the development of such tools, the ToxCast program has generated and made publicly available in vitro bioactivity data for thousands of compounds. The goal of the present study is to characterize and explore the data from ToxCast in terms of Machine Learning capability. For this, a large scale analysis on the entire database has been performed to build models to predict bioactivities measured in in vitro assays. Simple classical QSAR algorithms (ANN, SVM, LDA, random forest, and Bayesian) were first applied on the data, and the results of these algorithms suggested that they do not seem to be well-suited for data sets with a high proportion of inactive compounds. The study then showed for the first time that the use of an ensemble method named "Stacked generalization" could improve the model performance on this type of data. Indeed, for 61% of 483 models, the Stacked method led to models with higher performance. Moreover, the combination of this ensemble method with an applicability domain filter allows one to assess the reliability of the predictions for further compound prioritization. In particular we showed that for 50% of the models, the ROC score is better if we do not consider the compounds that are not within the applicability domain.
Collapse
Affiliation(s)
- Ingrid Grenet
- University Côte d'Azur, I3S Laboratory , UMR CNRS 7271, CS 40121, 06903 Sophia Antipolis Cedex, France.,Bayer SAS , 06903 Sophia Antipolis Cedex, France
| | - Kevin Merlo
- Dassault Systèmes SE , 06906 Sophia Antipolis, Biot , France
| | - Jean-Paul Comet
- University Côte d'Azur, I3S Laboratory , UMR CNRS 7271, CS 40121, 06903 Sophia Antipolis Cedex, France
| | - Romain Tertiaux
- Dassault Systèmes SE , 06906 Sophia Antipolis, Biot , France
| | | | - Frédéric Dayan
- Dassault Systèmes SE , 06906 Sophia Antipolis, Biot , France
| |
Collapse
|
50
|
Ambe K, Ishihara K, Ochibe T, Ohya K, Tamura S, Inoue K, Yoshida M, Tohkin M. In Silico Prediction of Chemical-Induced Hepatocellular Hypertrophy Using Molecular Descriptors. Toxicol Sci 2019; 162:667-675. [PMID: 29309657 DOI: 10.1093/toxsci/kfx287] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In silico prediction for toxicity of chemicals is required to reduce cost, time, and animal testing. However, predicting hepatocellular hypertrophy, which often affects the derivation of the No-Observed-Adverse-Effect Level in repeated dose toxicity studies, is difficult because pathological findings are diverse, mechanisms are largely unknown, and a wide variety of chemical structures exists. Therefore, a method for predicting the hepatocellular hypertrophy of diverse chemicals without complete understanding of their mechanisms is necessary. In this study, we developed predictive classification models of hepatocellular hypertrophy using machine learning-specifically, deep learning, random forest, and support vector machine. We extracted hepatocellular hypertrophy data on rats from 2 toxicological databases, our original database developed from risk assessment reports such as pesticides, and the Hazard Evaluation Support System Integrated Platform. Then, we constructed prediction models based on molecular descriptors and evaluated their performance using independent test chemicals datasets, which differed from the training chemicals datasets. Further, we defined the applicability domain (AD), which generally limits the application for chemicals, as structurally similar to the training chemicals dataset. The best model was found to be the support vector machine model using the Hazard Evaluation Support System Integrated Platform dataset, which was trained with 251 chemicals and predicted 214 test chemicals inside the applicability domain. It afforded a prediction accuracy of 0.76, sensitivity of 0.90, and area under the curve of 0.81. These in silico predictive classification models could be reliable tools for hepatocellular hypertrophy assessments and can facilitate the development of in silico models for toxicity prediction.
Collapse
Affiliation(s)
- Kaori Ambe
- Department of Regulatory Science, Graduate School of Pharmaceutical Sciences, Nagoya City University, Nagoya 467-8603, Japan
| | - Kana Ishihara
- Department of Regulatory Science, Graduate School of Pharmaceutical Sciences, Nagoya City University, Nagoya 467-8603, Japan
| | - Tatsuya Ochibe
- Department of Regulatory Science, Graduate School of Pharmaceutical Sciences, Nagoya City University, Nagoya 467-8603, Japan
| | - Kazuyuki Ohya
- Department of Regulatory Science, Graduate School of Pharmaceutical Sciences, Nagoya City University, Nagoya 467-8603, Japan
| | - Sorami Tamura
- Department of Regulatory Science, Graduate School of Pharmaceutical Sciences, Nagoya City University, Nagoya 467-8603, Japan
| | - Kaoru Inoue
- Division of Pathology, National Institute of Health Sciences, Kawasaki 1210-9501, Japan
| | - Midori Yoshida
- Food Safety Commission, Cabinet Office, Tokyo 107-6122, Japan
| | - Masahiro Tohkin
- Department of Regulatory Science, Graduate School of Pharmaceutical Sciences, Nagoya City University, Nagoya 467-8603, Japan
| |
Collapse
|