1
|
Karamertzanis PG, Patlewicz G, Sannicola M, Paul-Friedman K, Shah I. Systematic Approaches for the Encoding of Chemical Groups: A Case Study. Chem Res Toxicol 2024; 37:600-619. [PMID: 38498310 DOI: 10.1021/acs.chemrestox.3c00411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]
Abstract
Regulatory authorities aim to organize substances into groups to facilitate prioritization within hazard and risk assessment processes. Often, such chemical groupings are not explicitly defined by structural rules or physicochemical property information. This is largely due to how these groupings are developed, namely, a manual expert curation process, which in turn makes updating and refining groupings, as new substances are evaluated, a practical challenge. Herein, machine learning methods were leveraged to build models that could preliminarily assign substances to predefined groups. A set of 86 groupings containing 2,184 substances as published on the European Chemicals Agency (ECHA) website were mapped to the U.S. Environmental Protection Agency (EPA) Distributed Toxicity Structure Database (DSSTox) content to extract chemical and structural information. Substances were represented using Morgan fingerprints, and two machine learning approaches were used to classify test substances into 56 groups containing at least 10 substances with a structural representation in the data set: k-nearest neighbor (kNN) and random forest (RF), that led to mean 5-fold cross-validation test accuracies (average F1 scores) of 0.781 and 0.853, respectively. With a 9% improvement, the RF classifier was significantly more accurate than KNN (p-value = 0.001). The approach offers promise as a means of the initial profiling of new substances into predefined groups to facilitate prioritization efforts and streamline the assessment of new substances when earlier groupings are available. The algorithm to fit and use these models has been made available in the accompanying repository, thereby enabling both use of the produced models and refitting of these models, as new groupings become available by regulatory authorities or industry.
Collapse
Affiliation(s)
- Panagiotis G Karamertzanis
- Computational Assessment and Alternative Methods, European Chemicals Agency (ECHA), Telakkakatu 6, Helsinki 00150, Finland
| | - Grace Patlewicz
- Center for Computational Toxicology and Exposure (CCTE), US EPA, 109 TW Alexander Dr, Research Triangle Park, North Carolina 27711, United States
| | - Marta Sannicola
- Computational Assessment and Alternative Methods, European Chemicals Agency (ECHA), Telakkakatu 6, Helsinki 00150, Finland
| | - Katie Paul-Friedman
- Center for Computational Toxicology and Exposure (CCTE), US EPA, 109 TW Alexander Dr, Research Triangle Park, North Carolina 27711, United States
| | - Imran Shah
- Center for Computational Toxicology and Exposure (CCTE), US EPA, 109 TW Alexander Dr, Research Triangle Park, North Carolina 27711, United States
| |
Collapse
|
2
|
Guo W, Liu J, Dong F, Song M, Li Z, Khan MKH, Patterson TA, Hong H. Review of machine learning and deep learning models for toxicity prediction. Exp Biol Med (Maywood) 2023; 248:1952-1973. [PMID: 38057999 PMCID: PMC10798180 DOI: 10.1177/15353702231209421] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023] Open
Abstract
The ever-increasing number of chemicals has raised public concerns due to their adverse effects on human health and the environment. To protect public health and the environment, it is critical to assess the toxicity of these chemicals. Traditional in vitro and in vivo toxicity assays are complicated, costly, and time-consuming and may face ethical issues. These constraints raise the need for alternative methods for assessing the toxicity of chemicals. Recently, due to the advancement of machine learning algorithms and the increase in computational power, many toxicity prediction models have been developed using various machine learning and deep learning algorithms such as support vector machine, random forest, k-nearest neighbors, ensemble learning, and deep neural network. This review summarizes the machine learning- and deep learning-based toxicity prediction models developed in recent years. Support vector machine and random forest are the most popular machine learning algorithms, and hepatotoxicity, cardiotoxicity, and carcinogenicity are the frequently modeled toxicity endpoints in predictive toxicology. It is known that datasets impact model performance. The quality of datasets used in the development of toxicity prediction models using machine learning and deep learning is vital to the performance of the developed models. The different toxicity assignments for the same chemicals among different datasets of the same type of toxicity have been observed, indicating benchmarking datasets is needed for developing reliable toxicity prediction models using machine learning and deep learning algorithms. This review provides insights into current machine learning models in predictive toxicology, which are expected to promote the development and application of toxicity prediction models in the future.
Collapse
Affiliation(s)
- Wenjing Guo
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Jie Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Fan Dong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Meng Song
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Zoe Li
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Md Kamrul Hasan Khan
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Tucker A Patterson
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Huixiao Hong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| |
Collapse
|
3
|
Ruan T, Li P, Wang H, Li T, Jiang G. Identification and Prioritization of Environmental Organic Pollutants: From an Analytical and Toxicological Perspective. Chem Rev 2023; 123:10584-10640. [PMID: 37531601 DOI: 10.1021/acs.chemrev.3c00056] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/04/2023]
Abstract
Exposure to environmental organic pollutants has triggered significant ecological impacts and adverse health outcomes, which have been received substantial and increasing attention. The contribution of unidentified chemical components is considered as the most significant knowledge gap in understanding the combined effects of pollutant mixtures. To address this issue, remarkable analytical breakthroughs have recently been made. In this review, the basic principles on recognition of environmental organic pollutants are overviewed. Complementary analytical methodologies (i.e., quantitative structure-activity relationship prediction, mass spectrometric nontarget screening, and effect-directed analysis) and experimental platforms are briefly described. The stages of technique development and/or essential parts of the analytical workflow for each of the methodologies are then reviewed. Finally, plausible technique paths and applications of the future nontarget screening methods, interdisciplinary techniques for achieving toxicant identification, and burgeoning strategies on risk assessment of chemical cocktails are discussed.
Collapse
Affiliation(s)
- Ting Ruan
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Pengyang Li
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Haotian Wang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tingyu Li
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guibin Jiang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
4
|
Hu Y, Ren Q, Liu X, Gao L, Xiao L, Yu W. In Silico Prediction of Human Organ Toxicity via Artificial Intelligence Methods. Chem Res Toxicol 2023. [PMID: 37300507 DOI: 10.1021/acs.chemrestox.2c00411] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Unpredicted human organ level toxicity remains one of the major reasons for drug clinical failure. There is a critical need for cost-efficient strategies in the early stages of drug development for human toxicity assessment. At present, artificial intelligence methods are popularly regarded as a promising solution in chemical toxicology. Thus, we provided comprehensive in silico prediction models for eight significant human organ level toxicity end points using machine learning, deep learning, and transfer learning algorithms. In this work, our results showed that the graph-based deep learning approach was generally better than the conventional machine learning models, and good performances were observed for most of the human organ level toxicity end points in this study. In addition, we found that the transfer learning algorithm could improve model performance for skin sensitization end point using source domain of in vivo acute toxicity data and in vitro data of the Tox21 project. It can be concluded that our models can provide useful guidance for the rapid identification of the compounds with human organ level toxicity for drug discovery.
Collapse
Affiliation(s)
- Yuxuan Hu
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing 210009, China
| | - Qiuhan Ren
- School of Science, China Pharmaceutical University, Nanjing 211198, China
| | - Xintong Liu
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing 210009, China
| | - Liming Gao
- School of Science, China Pharmaceutical University, Nanjing 211198, China
| | - Lecheng Xiao
- School of Pharmacy, China Pharmaceutical University, Nanjing 211198, China
| | - Wenying Yu
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing 210009, China
| |
Collapse
|
5
|
Payton A, Roell KR, Rebuli ME, Valdar W, Jaspers I, Rager JE. Navigating the bridge between wet and dry lab toxicology research to address current challenges with high-dimensional data. FRONTIERS IN TOXICOLOGY 2023; 5:1171175. [PMID: 37304253 PMCID: PMC10250703 DOI: 10.3389/ftox.2023.1171175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 05/19/2023] [Indexed: 06/13/2023] Open
Abstract
Toxicology research has rapidly evolved, leveraging increasingly advanced technologies in high-throughput approaches to yield important information on toxicological mechanisms and health outcomes. Data produced through toxicology studies are consequently becoming larger, often producing high-dimensional data. These types of data hold promise for imparting new knowledge, yet inherently have complexities causing them to be a rate-limiting element for researchers, particularly those that are housed in "wet lab" settings (i.e., researchers that use liquids to analyze various chemicals and biomarkers as opposed to more computationally focused, "dry lab" researchers). These types of challenges represent topics of ongoing conversation amongst our team and researchers in the field. The aim of this perspective is to i) summarize hurdles in analyzing high-dimensional data in toxicology that require improved training and translation for wet lab researchers, ii) highlight example methods that have aided in translating data analysis techniques to wet lab researchers; and iii) describe challenges that remain to be effectively addressed, to date, in toxicology research. Specific aspects include methodologies that could be introduced to wet lab researchers, including data pre-processing, machine learning, and data reduction. Current challenges discussed include model interpretability, study biases, and data analysis training. Example efforts implemented to translate these data analysis techniques are also mentioned, including online data analysis resources and hands-on workshops. Questions are also posed to continue conversation in the toxicology community. Contents of this perspective represent timely issues broadly occurring in the fields of bioinformatics and toxicology that require ongoing dialogue between wet and dry lab researchers.
Collapse
Affiliation(s)
- Alexis Payton
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Center for Environmental Medicine, Asthma and Lung Biology, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Kyle R. Roell
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Meghan E. Rebuli
- Center for Environmental Medicine, Asthma and Lung Biology, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Curriculum in Toxicology and Environmental Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Pediatrics, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - William Valdar
- Department of Genetics, University of North Carolina, Chapel Hill, NC, United States
| | - Ilona Jaspers
- Center for Environmental Medicine, Asthma and Lung Biology, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Curriculum in Toxicology and Environmental Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Department of Pediatrics, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Julia E. Rager
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Center for Environmental Medicine, Asthma and Lung Biology, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- The Institute for Environmental Health Solutions, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- Curriculum in Toxicology and Environmental Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| |
Collapse
|
6
|
Patlewicz G, Paul-Friedman K, Houck K, Zhang L, Huang R, Xia M, Brown J, Simmons SO. Evaluating the utility of a high throughput thiol-containing fluorescent probe to screen for reactivity: A case study with the Tox21 library. COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2023; 26:10.1016/j.comtox.2023.100271. [PMID: 37388277 PMCID: PMC10304587 DOI: 10.1016/j.comtox.2023.100271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/01/2023]
Abstract
High-throughput screening (HTS) assays for bioactivity in the Tox21 program aim to evaluate an array of different biological targets and pathways, but a significant barrier to interpretation of these data is the lack of high-throughput screening (HTS) assays intended to identify non-specific reactive chemicals. This is an important aspect for prioritising chemicals to test in specific assays, identifying promiscuous chemicals based on their reactivity, as well as addressing hazards such as skin sensitisation which are not necessarily initiated by a receptor-mediated effect but act through a non-specific mechanism. Herein, a fluorescence-based HTS assay that allows the identification of thiol-reactive compounds was used to screen 7,872 unique chemicals in the Tox21 10K chemical library. Active chemicals were compared with profiling outcomes using structural alerts encoding electrophilic information. Random Forest classification models based on chemical fingerprints were developed to predict assay outcomes and evaluated through 10-fold stratified cross validation (CV). The mean CV Balanced Accuracy of the validation set was 0.648. The model developed shows promise as a tool to screen untested chemicals for their potential electrophilic reactivity based solely on chemical structural features.
Collapse
Affiliation(s)
- Grace Patlewicz
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| | - Katie Paul-Friedman
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| | - Keith Houck
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| | - Li Zhang
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, Bethesda, MD 20892, USA
| | - Ruili Huang
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, Bethesda, MD 20892, USA
| | - Menghang Xia
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, Bethesda, MD 20892, USA
| | - Jason Brown
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| | - Steven O. Simmons
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| |
Collapse
|
7
|
Sharma B, Chenthamarakshan V, Dhurandhar A, Pereira S, Hendler JA, Dordick JS, Das P. Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations. Sci Rep 2023; 13:4908. [PMID: 36966203 PMCID: PMC10039880 DOI: 10.1038/s41598-023-31169-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 03/07/2023] [Indexed: 03/27/2023] Open
Abstract
Explainable machine learning for molecular toxicity prediction is a promising approach for efficient drug development and chemical safety. A predictive ML model of toxicity can reduce experimental cost and time while mitigating ethical concerns by significantly reducing animal and clinical testing. Herein, we use a deep learning framework for simultaneously modeling in vitro, in vivo, and clinical toxicity data. Two different molecular input representations are used; Morgan fingerprints and pre-trained SMILES embeddings. A multi-task deep learning model accurately predicts toxicity for all endpoints, including clinical, as indicated by the area under the Receiver Operator Characteristic curve and balanced accuracy. In particular, pre-trained molecular SMILES embeddings as input to the multi-task model improved clinical toxicity predictions compared to existing models in MoleculeNet benchmark. Additionally, our multitask approach is comprehensive in the sense that it is comparable to state-of-the-art approaches for specific endpoints in in vitro, in vivo and clinical platforms. Through both the multi-task model and transfer learning, we were able to indicate the minimal need of in vivo data for clinical toxicity predictions. To provide confidence and explain the model's predictions, we adapt a post-hoc contrastive explanation method that returns pertinent positive and negative features, which correspond well to known mutagenic and reactive toxicophores, such as unsubstituted bonded heteroatoms, aromatic amines, and Michael receptors. Furthermore, toxicophore recovery by pertinent feature analysis captures more of the in vitro (53%) and in vivo (56%), rather than of the clinical (8%), endpoints, and indeed uncovers a preference in known toxicophore data towards in vitro and in vivo experimental data. To our knowledge, this is the first contrastive explanation, using both present and absent substructures, for predictions of clinical and in vivo molecular toxicity.
Collapse
Affiliation(s)
| | | | | | - Shiranee Pereira
- ICARE, International Center for Alternatives in Research and Education, Chennai, India
| | | | | | - Payel Das
- IBM Research, Yorktown Heights, NY, USA.
| |
Collapse
|
8
|
Using chemical and biological data to predict drug toxicity. SLAS DISCOVERY : ADVANCING LIFE SCIENCES R & D 2023; 28:53-64. [PMID: 36639032 DOI: 10.1016/j.slasd.2022.12.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 12/19/2022] [Accepted: 12/31/2022] [Indexed: 01/12/2023]
Abstract
Various sources of information can be used to better understand and predict compound activity and safety-related endpoints, including biological data such as gene expression and cell morphology. In this review, we first introduce types of chemical, in vitro and in vivo information that can be used to describe compounds and adverse effects. We then explore how compound descriptors based on chemical structure or biological perturbation response can be used to predict safety-related endpoints, and how especially biological data can help us to better understand adverse effects mechanistically. Overall, the described applications demonstrate how large-scale biological information presents new opportunities to anticipate and understand the biological effects of compounds, and how this can support predictive toxicology and drug discovery projects.
Collapse
|
9
|
Yamada T, Kawamura T, Tsujii S, Miura M, Ohata H, Katsutani N, Matsumoto M, Hirose A. Formation and evaluation of mechanism-based chemical categories for regulatory read-across assessment of repeated-dose toxicity: A case of hemolytic anemia. Regul Toxicol Pharmacol 2022; 136:105275. [DOI: 10.1016/j.yrtph.2022.105275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 09/20/2022] [Accepted: 10/07/2022] [Indexed: 11/05/2022]
|
10
|
Pereira M, Macmillan DS, Willett C, Seidle T. REACHing for solutions: Essential revisions to the EU chemicals regulation to modernise safety assessment. Regul Toxicol Pharmacol 2022; 136:105278. [DOI: 10.1016/j.yrtph.2022.105278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 10/05/2022] [Accepted: 10/13/2022] [Indexed: 11/13/2022]
|
11
|
El-Masri H, Paul Friedman K, Isaacs K, Wetmore BA. Advances in computational methods along the exposure to toxicological response paradigm. Toxicol Appl Pharmacol 2022; 450:116141. [PMID: 35777528 PMCID: PMC9619339 DOI: 10.1016/j.taap.2022.116141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 05/27/2022] [Accepted: 06/23/2022] [Indexed: 10/17/2022]
Abstract
Human health risk assessment is a function of chemical toxicity, bioavailability to reach target biological tissues, and potential environmental exposure. These factors are complicated by many physiological, biochemical, physical and lifestyle factors. Furthermore, chemical health risk assessment is challenging in view of the large, and continually increasing, number of chemicals found in the environment. These challenges highlight the need to prioritize resources for the efficient and timely assessment of those environmental chemicals that pose greatest health risks. Computational methods, either predictive or investigative, are designed to assist in this prioritization in view of the lack of cost prohibitive in vivo experimental data. Computational methods provide specific and focused toxicity information using in vitro high throughput screening (HTS) assays. Information from the HTS assays can be converted to in vivo estimates of chemical levels in blood or target tissue, which in turn are converted to in vivo dose estimates that can be compared to exposure levels of the screened chemicals. This manuscript provides a review for the landscape of computational methods developed and used at the U.S. Environmental Protection Agency (EPA) highlighting their potentials and challenges.
Collapse
Affiliation(s)
- Hisham El-Masri
- Center for Computational Toxicology and Exposure, Office of Research and Development, U. S. Environmental Protection Agency, Research Triangle Park, NC, USA.
| | - Katie Paul Friedman
- Center for Computational Toxicology and Exposure, Office of Research and Development, U. S. Environmental Protection Agency, Research Triangle Park, NC, USA
| | - Kristin Isaacs
- Center for Computational Toxicology and Exposure, Office of Research and Development, U. S. Environmental Protection Agency, Research Triangle Park, NC, USA
| | - Barbara A Wetmore
- Center for Computational Toxicology and Exposure, Office of Research and Development, U. S. Environmental Protection Agency, Research Triangle Park, NC, USA
| |
Collapse
|
12
|
Jeong J, Choi J. Artificial Intelligence-Based Toxicity Prediction of Environmental Chemicals: Future Directions for Chemical Management Applications. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022; 56:7532-7543. [PMID: 35666838 DOI: 10.1021/acs.est.1c07413] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Recently, research on the development of artificial intelligence (AI)-based computational toxicology models that predict toxicity without the use of animal testing has emerged because of the rapid development of computer technology. Various computational toxicology techniques that predict toxicity based on the structure of chemical substances are gaining attention, including the quantitative structure-activity relationship. To understand the recent development of these models, we analyzed the databases, molecular descriptors, fingerprints, and algorithms considered in recent studies. Based on a selection of 96 papers published since 2014, we found that AI models have been developed to predict approximately 30 different toxicity end points using more than 20 toxicity databases. For model development, molecular access system and extended-connectivity fingerprints are the most commonly used molecular descriptors. The most used algorithm among the machine learning techniques is the random forest, while the most used algorithm among the deep learning techniques is a deep neural network. The use of AI technology in the development of toxicity prediction models is a new concept that will aid in achieving a scientific accord and meet regulatory applications. The comprehensive overview provided in this study will provide a useful guide for the further development and application of toxicity prediction models.
Collapse
Affiliation(s)
- Jaeseong Jeong
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, South Korea
| | - Jinhee Choi
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, South Korea
| |
Collapse
|
13
|
Liu J, Guo W, Sakkiah S, Ji Z, Yavas G, Zou W, Chen M, Tong W, Patterson TA, Hong H. Machine Learning Models for Predicting Liver Toxicity. Methods Mol Biol 2022; 2425:393-415. [PMID: 35188640 DOI: 10.1007/978-1-0716-1960-5_15] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Liver toxicity is a major adverse drug reaction that accounts for drug failure in clinical trials and withdrawal from the market. Therefore, predicting potential liver toxicity at an early stage in drug discovery is crucial to reduce costs and the potential for drug failure. However, current in vivo animal toxicity testing is very expensive and time consuming. As an alternative approach, various machine learning models have been developed to predict potential liver toxicity in humans. This chapter reviews current advances in the development and application of machine learning models for prediction of potential liver toxicity in humans and discusses possible improvements to liver toxicity prediction.
Collapse
Affiliation(s)
- Jie Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Wenjing Guo
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Sugunadevi Sakkiah
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Zuowei Ji
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Gokhan Yavas
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Wen Zou
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Minjun Chen
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Weida Tong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Tucker A Patterson
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Huixiao Hong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA.
| |
Collapse
|
14
|
Bassan A, Alves VM, Amberg A, Anger LT, Auerbach S, Beilke L, Bender A, Cronin MT, Cross KP, Hsieh JH, Greene N, Kemper R, Kim MT, Mumtaz M, Noeske T, Pavan M, Pletz J, Russo DP, Sabnis Y, Schaefer M, Szabo DT, Valentin JP, Wichard J, Williams D, Woolley D, Zwickl C, Myatt GJ. In silico approaches in organ toxicity hazard assessment: current status and future needs in predicting liver toxicity. COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2021; 20:100187. [PMID: 35340402 PMCID: PMC8955833 DOI: 10.1016/j.comtox.2021.100187] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/15/2023]
Abstract
Hepatotoxicity is one of the most frequently observed adverse effects resulting from exposure to a xenobiotic. For example, in pharmaceutical research and development it is one of the major reasons for drug withdrawals, clinical failures, and discontinuation of drug candidates. The development of faster and cheaper methods to assess hepatotoxicity that are both more sustainable and more informative is critically needed. The biological mechanisms and processes underpinning hepatotoxicity are summarized and experimental approaches to support the prediction of hepatotoxicity are described, including toxicokinetic considerations. The paper describes the increasingly important role of in silico approaches and highlights challenges to the adoption of these methods including the lack of a commonly agreed upon protocol for performing such an assessment and the need for in silico solutions that take dose into consideration. A proposed framework for the integration of in silico and experimental information is provided along with a case study describing how computational methods have been used to successfully respond to a regulatory question concerning non-genotoxic impurities in chemically synthesized pharmaceuticals.
Collapse
Affiliation(s)
- Arianna Bassan
- Innovatune srl, Via Giulio Zanon 130/D, 35129 Padova, Italy
| | - Vinicius M. Alves
- The National Institute of Environmental Health Sciences, Division of the National Toxicology, Program, Research Triangle Park, NC 27709, USA
| | - Alexander Amberg
- Sanofi, R&D Preclinical Safety Frankfurt, Industriepark Hoechst, D-65926 Frankfurt am Main, Germany
| | | | - Scott Auerbach
- The National Institute of Environmental Health Sciences, Division of the National Toxicology, Program, Research Triangle Park, NC 27709, USA
| | - Lisa Beilke
- Toxicology Solutions Inc., San Diego, CA, USA
| | - Andreas Bender
- AI and Data Analytics, Clinical Pharmacology and Safety Sciences, R&D, AstraZeneca, Cambridge, UK
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW
| | - Mark T.D. Cronin
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool, L3 3AF, UK
| | | | - Jui-Hua Hsieh
- The National Institute of Environmental Health Sciences, Division of the National Toxicology, Program, Research Triangle Park, NC 27709, USA
| | - Nigel Greene
- Data Science and AI, DSM, IMED Biotech Unit, AstraZeneca, Boston, USA
| | - Raymond Kemper
- Nuvalent, One Broadway, 14th floor, Cambridge, MA, 02142, USA
| | - Marlene T. Kim
- US Food and Drug Administration, Center for Drug Evaluation and Research, Silver Spring, MD, 20993, USA
| | - Moiz Mumtaz
- Office of the Associate Director for Science (OADS), Agency for Toxic Substances and Disease, Registry, US Department of Health and Human Services, Atlanta, GA, USA
| | - Tobias Noeske
- Imaging and Data Analytics, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | - Manuela Pavan
- Innovatune srl, Via Giulio Zanon 130/D, 35129 Padova, Italy
| | - Julia Pletz
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University, Liverpool, L3 3AF, UK
| | - Daniel P. Russo
- Department of Chemistry, Rutgers University, Camden, NJ 08102, USA
- The Rutgers Center for Computational and Integrative Biology, Camden, NJ 08102, USA
| | - Yogesh Sabnis
- UCB Biopharma SRL, Chemin du Foriest – B-1420 Braine-l’Alleud, Belgium
| | - Markus Schaefer
- Sanofi, R&D Preclinical Safety Frankfurt, Industriepark Hoechst, D-65926 Frankfurt am Main, Germany
| | | | | | - Joerg Wichard
- Bayer AG, Genetic Toxicology, Müllerstr. 178, 13353 Berlin, Germany
| | - Dominic Williams
- Functional & Mechanistic Safety, Clinical Pharmacology & Safety Sciences, AstraZeneca, Darwin Building 310, Cambridge Science Park, Milton Rd, Cambridge CB4 0FZ, UK
| | - David Woolley
- ForthTox Limited, PO Box 13550, Linlithgow, EH49 7YU, UK
| | - Craig Zwickl
- Transendix LLC, 1407 Moores Manor, Indianapolis, IN 46229, USA
| | - Glenn J. Myatt
- Instem, 1393 Dublin Road, Columbus, OH 43215. USA
- Corresponding author. (G.J. Myatt)
| |
Collapse
|
15
|
Wang Y, Wang B, Jiang J, Guo J, Lai J, Lian XY, Wu J. Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants. ACS OMEGA 2021; 6:26545-26555. [PMID: 34661009 PMCID: PMC8515573 DOI: 10.1021/acsomega.1c03842] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 09/14/2021] [Indexed: 05/17/2023]
Abstract
Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem: the training data sets are usually biased (imbalanced positive and negative samples), which would result in model training difficulty and unsatisfactory prediction accuracy. Multitask networks obtained significantly better predictive accuracies than single-task methods, and capsule neural networks showed excellent performance in sparse data sets in previous studies. In this study, we developed a new multitask framework based on a capsule neural network (multitask CapsNet) to measure 12 different toxic effects simultaneously. We found that multitask CapsNet excelled in toxicity prediction and outperformed many other computational approaches using the multitask strategy. Only after training on biased data sets did multitask CapsNet achieve significantly improved prediction accuracy on the Tox21 Data Challenge, which gave the largest ratio of highest accuracy (8/12) among compared models. Our model gave a prediction accuracy of 96.6% for the target NR.PPAR.gamma, whose ratio of negative to positive samples was up to 36:1. These results suggested that multitask CapsNet could overcome the bias problems and would provide a novel, accurate, and efficient approach for predicting the toxicities of compounds.
Collapse
Affiliation(s)
- Yiwei Wang
- School
of Preclinical Medicine, Southwest Medical
University, Luzhou 646000, China
| | - Binyou Wang
- School
of Pharmacy, Southwest Medical University, Luzhou 646000, China
| | - Jie Jiang
- School
of Preclinical Medicine, Southwest Medical
University, Luzhou 646000, China
| | - Jianmin Guo
- School
of Preclinical Medicine, Southwest Medical
University, Luzhou 646000, China
| | - Jia Lai
- School
of Pharmacy, Southwest Medical University, Luzhou 646000, China
| | - Xiao-Yuan Lian
- School
of Pharmacy, Zhejiang University, Hangzhou 310011, China
| | - Jianming Wu
- Key
Laboratory of Medical Electrophysiology, Ministry of Education of
China, Medical Key Laboratory for Drug Discovery and Druggability
Evaluation of Sichuan Province, Luzhou Key
Laboratory of Activity Screening and Druggability Evaluation for Chinese
Materia Medica, Luzhou 646000, China
| |
Collapse
|
16
|
Garcia de Lomana M, Morger A, Norinder U, Buesen R, Landsiedel R, Volkamer A, Kirchmair J, Mathea M. ChemBioSim: Enhancing Conformal Prediction of In Vivo Toxicity by Use of Predicted Bioactivities. J Chem Inf Model 2021; 61:3255-3272. [PMID: 34153183 PMCID: PMC8317154 DOI: 10.1021/acs.jcim.1c00451] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Indexed: 02/07/2023]
Abstract
Computational methods such as machine learning approaches have a strong track record of success in predicting the outcomes of in vitro assays. In contrast, their ability to predict in vivo endpoints is more limited due to the high number of parameters and processes that may influence the outcome. Recent studies have shown that the combination of chemical and biological data can yield better models for in vivo endpoints. The ChemBioSim approach presented in this work aims to enhance the performance of conformal prediction models for in vivo endpoints by combining chemical information with (predicted) bioactivity assay outcomes. Three in vivo toxicological endpoints, capturing genotoxic (MNT), hepatic (DILI), and cardiological (DICC) issues, were selected for this study due to their high relevance for the registration and authorization of new compounds. Since the sparsity of available biological assay data is challenging for predictive modeling, predicted bioactivity descriptors were introduced instead. Thus, a machine learning model for each of the 373 collected biological assays was trained and applied on the compounds of the in vivo toxicity data sets. Besides the chemical descriptors (molecular fingerprints and physicochemical properties), these predicted bioactivities served as descriptors for the models of the three in vivo endpoints. For this study, a workflow based on a conformal prediction framework (a method for confidence estimation) built on random forest models was developed. Furthermore, the most relevant chemical and bioactivity descriptors for each in vivo endpoint were preselected with lasso models. The incorporation of bioactivity descriptors increased the mean F1 scores of the MNT model from 0.61 to 0.70 and for the DICC model from 0.72 to 0.82 while the mean efficiencies increased by roughly 0.10 for both endpoints. In contrast, for the DILI endpoint, no significant improvement in model performance was observed. Besides pure performance improvements, an analysis of the most important bioactivity features allowed detection of novel and less intuitive relationships between the predicted biological assay outcomes used as descriptors and the in vivo endpoints. This study presents how the prediction of in vivo toxicity endpoints can be improved by the incorporation of biological information-which is not necessarily captured by chemical descriptors-in an automated workflow without the need for adding experimental workload for the generation of bioactivity descriptors as predicted outcomes of bioactivity assays were utilized. All bioactivity CP models for deriving the predicted bioactivities, as well as the in vivo toxicity CP models, can be freely downloaded from https://doi.org/10.5281/zenodo.4761225.
Collapse
Affiliation(s)
- Marina Garcia de Lomana
- BASF
SE, Ludwigshafen am Rhein 67063, Germany
- Department
of Pharmaceutical Sciences, Faculty of Life Sciences, University of Vienna, Vienna 1090, Austria
| | - Andrea Morger
- In Silico
Toxicology and Structural Bioinformatics, Institute of Physiology, Charité Universitätsmedizin Berlin, Charitéplatz
1, Berlin 10117, Germany
| | - Ulf Norinder
- MTM
Research Centre, School of Science and Technology, Örebro University, Örebro SE-70182, Sweden
| | | | | | - Andrea Volkamer
- In Silico
Toxicology and Structural Bioinformatics, Institute of Physiology, Charité Universitätsmedizin Berlin, Charitéplatz
1, Berlin 10117, Germany
| | - Johannes Kirchmair
- Department
of Pharmaceutical Sciences, Faculty of Life Sciences, University of Vienna, Vienna 1090, Austria
| | | |
Collapse
|
17
|
Kim H, Kim E, Lee I, Bae B, Park M, Nam H. Artificial Intelligence in Drug Discovery: A Comprehensive Review of Data-driven and Machine Learning Approaches. BIOTECHNOL BIOPROC E 2021; 25:895-930. [PMID: 33437151 PMCID: PMC7790479 DOI: 10.1007/s12257-020-0049-y] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 05/27/2020] [Accepted: 06/03/2020] [Indexed: 02/07/2023]
Abstract
As expenditure on drug development increases exponentially, the overall drug discovery process requires a sustainable revolution. Since artificial intelligence (AI) is leading the fourth industrial revolution, AI can be considered as a viable solution for unstable drug research and development. Generally, AI is applied to fields with sufficient data such as computer vision and natural language processing, but there are many efforts to revolutionize the existing drug discovery process by applying AI. This review provides a comprehensive, organized summary of the recent research trends in AI-guided drug discovery process including target identification, hit identification, ADMET prediction, lead optimization, and drug repositioning. The main data sources in each field are also summarized in this review. In addition, an in-depth analysis of the remaining challenges and limitations will be provided, and proposals for promising future directions in each of the aforementioned areas.
Collapse
Affiliation(s)
- Hyunho Kim
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Eunyoung Kim
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Ingoo Lee
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Bongsung Bae
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Minsu Park
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| | - Hojung Nam
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Gwangju, 61005 Korea
| |
Collapse
|
18
|
Wang MWH, Goodman JM, Allen TEH. Machine Learning in Predictive Toxicology: Recent Applications and Future Directions for Classification Models. Chem Res Toxicol 2020; 34:217-239. [PMID: 33356168 DOI: 10.1021/acs.chemrestox.0c00316] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
In recent times, machine learning has become increasingly prominent in predictive toxicology as it has shifted from in vivo studies toward in silico studies. Currently, in vitro methods together with other computational methods such as quantitative structure-activity relationship modeling and absorption, distribution, metabolism, and excretion calculations are being used. An overview of machine learning and its applications in predictive toxicology is presented here, including support vector machines (SVMs), random forest (RF) and decision trees (DTs), neural networks, regression models, naïve Bayes, k-nearest neighbors, and ensemble learning. The recent successes of these machine learning methods in predictive toxicology are summarized, and a comparison of some models used in predictive toxicology is presented. In predictive toxicology, SVMs, RF, and DTs are the dominant machine learning methods due to the characteristics of the data available. Lastly, this review describes the current challenges facing the use of machine learning in predictive toxicology and offers insights into the possible areas of improvement in the field.
Collapse
Affiliation(s)
- Marcus W H Wang
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Jonathan M Goodman
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Timothy E H Allen
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom.,MRC Toxicology Unit, University of Cambridge, Hodgkin Building, Lancaster Road, Leicester LE1 7HB, United Kingdom
| |
Collapse
|
19
|
Luijten M, Corvi R, Mehta J, Corvaro M, Delrue N, Felter S, Haas B, Hewitt NJ, Hilton G, Holmes T, Jacobs MN, Jacobs A, Lamplmair F, Lewis D, Madia F, Manou I, Melching-Kollmuss S, Schorsch F, Schütte K, Sewell F, Strupp C, van der Laan JW, Wolf DC, Wolterink G, Woutersen R, Zvonar Z, Heusinkveld H, Braakhuis H. A comprehensive view on mechanistic approaches for cancer risk assessment of non-genotoxic agrochemicals. Regul Toxicol Pharmacol 2020; 118:104789. [PMID: 33035627 DOI: 10.1016/j.yrtph.2020.104789] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 09/14/2020] [Accepted: 10/04/2020] [Indexed: 11/28/2022]
Abstract
Currently the only methods for non-genotoxic carcinogenic hazard assessment accepted by most regulatory authorities are lifetime carcinogenicity studies. However, these involve the use of large numbers of animals and the relevance of their predictive power and results has been scientifically challenged. With increased availability of innovative test methods and enhanced understanding of carcinogenic processes, it is believed that tumour formation can now be better predicted using mechanistic information. A workshop organised by the European Partnership on Alternative Approaches to Animal Testing brought together experts to discuss an alternative, mechanism-based approach for cancer risk assessment of agrochemicals. Data from a toolbox of test methods for detecting modes of action (MOAs) underlying non-genotoxic carcinogenicity are combined with information from subchronic toxicity studies in a weight-of-evidence approach to identify carcinogenic potential of a test substance. The workshop included interactive sessions to discuss the approach using case studies. These showed that fine-tuning is needed, to build confidence in the proposed approach, to ensure scientific correctness, and to address different regulatory needs. This novel approach was considered realistic, and its regulatory acceptance and implementation can be facilitated in the coming years through continued dialogue between all stakeholders and building confidence in alternative approaches.
Collapse
Affiliation(s)
- Mirjam Luijten
- National Institute for Public Health and the Environment (RIVM), Centre for Health Protection, Bilthoven, the Netherlands.
| | - Raffaella Corvi
- European Commission, Joint Research Centre (JRC), Ispra, Italy
| | | | | | - Nathalie Delrue
- Organisation for Economic Cooperation and Development (OECD), Paris, France
| | | | - Bodo Haas
- Federal Institute for Drugs and Medical Devices, Kurt-Georg-Kiesinger-Allee 3, 53175 Bonn, Germany
| | | | - Gina Hilton
- PETA International Science Consortium Ltd, London, UK
| | | | - Miriam N Jacobs
- Centre for Radiation, Chemical and Environmental Hazards (CRCE), Public Health England, UK
| | | | - Franz Lamplmair
- European Commission, DG Internal Market, Industry, Entrepreneurship and SMEs, Brussels, Belgium
| | | | - Federica Madia
- European Commission, Joint Research Centre (JRC), Ispra, Italy
| | - Irene Manou
- EPAA Industry Secretariat, Brussels, Belgium
| | | | | | | | - Fiona Sewell
- National Centre for the Replacement, Refinement and Reduction of Animals in Research (NC3Rs), London, UK
| | | | | | - Douglas C Wolf
- Syngenta Crop Protection, LLC, Greensboro, North Carolina, USA
| | - Gerrit Wolterink
- National Institute for Public Health and the Environment (RIVM), Centre for Nutrition, Prevention and Health Services, Bilthoven, the Netherlands
| | - Ruud Woutersen
- TNO Innovation for Life, Zeist; Wageningen University and Research, Wageningen, the Netherlands
| | | | - Harm Heusinkveld
- National Institute for Public Health and the Environment (RIVM), Centre for Health Protection, Bilthoven, the Netherlands
| | - Hedwig Braakhuis
- National Institute for Public Health and the Environment (RIVM), Centre for Health Protection, Bilthoven, the Netherlands
| |
Collapse
|
20
|
The EU-ToxRisk method documentation, data processing and chemical testing pipeline for the regulatory use of new approach methods. Arch Toxicol 2020; 94:2435-2461. [PMID: 32632539 PMCID: PMC7367925 DOI: 10.1007/s00204-020-02802-6] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 06/03/2020] [Indexed: 12/17/2022]
Abstract
Hazard assessment, based on new approach methods (NAM), requires the use of batteries of assays, where individual tests may be contributed by different laboratories. A unified strategy for such collaborative testing is presented. It details all procedures required to allow test information to be usable for integrated hazard assessment, strategic project decisions and/or for regulatory purposes. The EU-ToxRisk project developed a strategy to provide regulatorily valid data, and exemplified this using a panel of > 20 assays (with > 50 individual endpoints), each exposed to 19 well-known test compounds (e.g. rotenone, colchicine, mercury, paracetamol, rifampicine, paraquat, taxol). Examples of strategy implementation are provided for all aspects required to ensure data validity: (i) documentation of test methods in a publicly accessible database; (ii) deposition of standard operating procedures (SOP) at the European Union DB-ALM repository; (iii) test readiness scoring accoding to defined criteria; (iv) disclosure of the pipeline for data processing; (v) link of uncertainty measures and metadata to the data; (vi) definition of test chemicals, their handling and their behavior in test media; (vii) specification of the test purpose and overall evaluation plans. Moreover, data generation was exemplified by providing results from 25 reporter assays. A complete evaluation of the entire test battery will be described elsewhere. A major learning from the retrospective analysis of this large testing project was the need for thorough definitions of the above strategy aspects, ideally in form of a study pre-registration, to allow adequate interpretation of the data and to ensure overall scientific/toxicological validity.
Collapse
|
21
|
Morger A, Mathea M, Achenbach JH, Wolf A, Buesen R, Schleifer KJ, Landsiedel R, Volkamer A. KnowTox: pipeline and case study for confident prediction of potential toxic effects of compounds in early phases of development. J Cheminform 2020; 12:24. [PMID: 33431007 PMCID: PMC7157991 DOI: 10.1186/s13321-020-00422-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 03/09/2020] [Indexed: 02/07/2023] Open
Abstract
Risk assessment of newly synthesised chemicals is a prerequisite for regulatory approval. In this context, in silico methods have great potential to reduce time, cost, and ultimately animal testing as they make use of the ever-growing amount of available toxicity data. Here, KnowTox is presented, a novel pipeline that combines three different in silico toxicology approaches to allow for confident prediction of potentially toxic effects of query compounds, i.e. machine learning models for 88 endpoints, alerts for 919 toxic substructures, and computational support for read-across. It is mainly based on the ToxCast dataset, containing after preprocessing a sparse matrix of 7912 compounds tested against 985 endpoints. When applying machine learning models, applicability and reliability of predictions for new chemicals are of utmost importance. Therefore, first, the conformal prediction technique was deployed, comprising an additional calibration step and per definition creating internally valid predictors at a given significance level. Second, to further improve validity and information efficiency, two adaptations are suggested, exemplified at the androgen receptor antagonism endpoint. An absolute increase in validity of 23% on the in-house dataset of 534 compounds could be achieved by introducing KNNRegressor normalisation. This increase in validity comes at the cost of efficiency, which could again be improved by 20% for the initial ToxCast model by balancing the dataset during model training. Finally, the value of the developed pipeline for risk assessment is discussed using two in-house triazole molecules. Compared to a single toxicity prediction method, complementing the outputs of different approaches can have a higher impact on guiding toxicity testing and de-selecting most likely harmful development-candidate compounds early in the development process.
Collapse
Affiliation(s)
- Andrea Morger
- In Silico Toxicology and Structural Bioinformatics, Institute of Physiology, Charité Universitätsmedizin Berlin, Charitéplatz 1, Berlin, Germany
| | | | | | | | | | | | | | - Andrea Volkamer
- In Silico Toxicology and Structural Bioinformatics, Institute of Physiology, Charité Universitätsmedizin Berlin, Charitéplatz 1, Berlin, Germany.
| |
Collapse
|
22
|
A map of the amine–carboxylic acid coupling system. Nature 2020; 580:71-75. [DOI: 10.1038/s41586-020-2142-y] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Accepted: 01/17/2020] [Indexed: 12/18/2022]
|
23
|
Xu T, Ngan DK, Ye L, Xia M, Xie HQ, Zhao B, Simeonov A, Huang R. Predictive Models for Human Organ Toxicity Based on In Vitro Bioactivity Data and Chemical Structure. Chem Res Toxicol 2020; 33:731-741. [PMID: 32077278 PMCID: PMC10926239 DOI: 10.1021/acs.chemrestox.9b00305] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Traditional toxicity testing reliant on animal models is costly and low throughput, posing a significant challenge with the increasing numbers of chemicals that humans are exposed to in the environment. The purpose of this investigation was to build optimal prediction models for various human in vivo/organ-level toxicity end points (extracted from ChemIDPlus) using chemical structure and Tox21 in vitro quantitative high-throughput screening (qHTS) bioactivity assay data. Several supervised machine learning algorithms were applied to model 14 human toxicity end points pertaining to vascular, kidney, ureter and bladder, and liver organ systems. Three metrics were used to evaluate model performance: area under the receiver operating characteristic curve (AUC-ROC), balanced accuracy (BA), and Matthews correlation coefficient (MCC). The top four models, with AUC-ROC values >0.8, were derived for endocrine (0.90 ± 0.00), musculoskeletal (0.88 ± 0.02), peripheral nerve and sensation (0.85 ± 0.01), and brain and coverings (0.83 ± 0.02) toxicities, whereas the best model AUC-ROC values were >0.7 for the remaining 10 toxicities. Model performance was found to be dependent on the specific data set, model type, and feature selection method used. In addition, chemical structure and assay data showed different levels of contribution to the prediction of different toxicity end points. Although in vitro assay data, when combined with chemical structure, slightly improved the predictive accuracy for most end points (11 out of 14), a noteworthy finding was the near equal success of the structure-only models, which do not require Tox21 qHTS screening data, and the relatively poor performance of assay-only models. Thus, the top-performing structure-only models from this study could be applied for hazard screening of large sets of chemicals for potential human toxicity, whereas the largest assay contributions to models (i.e., cellular targets) could be used, along with the top-contributing structural features, to provide insight into toxicity mechanisms.
Collapse
Affiliation(s)
- Tuan Xu
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Deborah K. Ngan
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Lin Ye
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Menghang Xia
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Heidi Q. Xie
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center of Eco-Environment Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Bin Zhao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center of Eco-Environment Sciences, Chinese Academy of Sciences, Beijing 100085, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Anton Simeonov
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| | - Ruili Huang
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD 20850, USA
| |
Collapse
|
24
|
Tetko IV, Tropsha A. Joint Virtual Special Issue on Computational Toxicology. J Chem Inf Model 2020; 60:1069-1071. [PMID: 32101004 DOI: 10.1021/acs.jcim.0c00140] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Affiliation(s)
- Igor V Tetko
- Institute of Structural Biology, Helmholtz Zentrum Munchen Deutsches Forschungszentrum fur Umwelt und Gesundheit, Munich 27599, Germany
| | - Alexander Tropsha
- UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| |
Collapse
|
25
|
Su R, Wu H, Liu X, Wei L. Predicting drug-induced hepatotoxicity based on biological feature maps and diverse classification strategies. Brief Bioinform 2019; 22:428-437. [PMID: 31838506 DOI: 10.1093/bib/bbz165] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2019] [Revised: 11/20/2019] [Indexed: 12/15/2022] Open
Abstract
Identifying hepatotoxicity as early as possible is significant in drug development. In this study, we developed a drug-induced hepatotoxicity prediction model taking account of both the biological context and the computational efficacy based on toxicogenomics data. Specifically, we proposed a novel gene selection algorithm considering gene's participation, named BioCB, to choose the discriminative genes and make more efficient prediction. Then instead of using the raw gene expression levels to characterize each drug, we developed a two-dimensional biological process feature pattern map to represent each drug. Then we employed two strategies to handle the maps and identify the hepatotoxicity, the direct use of maps, named Two-dim branch, and vectorization of maps, named One-dim branch. The two strategies subsequently used the deep convolutional neural networks and LightGBM as predictors, respectively. Additionally, we here for the first time proposed a stacked vectorized gene matrix, which was more predictive than the raw gene matrix. Results validated on both in vivo and in vitro data from two public data sets, the TG-GATES and DrugMatrix, show that the proposed One-dim branch outperforms the deep framework, the Two-dim branch, and has achieved high accuracy and efficiency. The implementation of the proposed method is available at https://github.com/RanSuLab/Hepatotoxicity.
Collapse
Affiliation(s)
- Ran Su
- School of Computer Software, College of Intelligence and Computing, Tianjin University, China
| | - Huichen Wu
- School of Computer Software, College of Intelligence and Computing, Tianjin University, China
| | - Xinyi Liu
- School of Computer Software, College of Intelligence and Computing, Tianjin University, China
| | - Leyi Wei
- School of Software, Shandong University, China
| |
Collapse
|
26
|
Grenet I, Comet JP, Schorsch F, Ryan N, Wichard J, Rouquié D. Chemical in vitro bioactivity profiles are not informative about the long-term in vivo endocrine mediated toxicity. ACTA ACUST UNITED AC 2019. [DOI: 10.1016/j.comtox.2019.100098] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
27
|
Vo AH, Van Vleet TR, Gupta RR, Liguori MJ, Rao MS. An Overview of Machine Learning and Big Data for Drug Toxicity Evaluation. Chem Res Toxicol 2019; 33:20-37. [DOI: 10.1021/acs.chemrestox.9b00227] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Andy H. Vo
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Terry R. Van Vleet
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Rishi R. Gupta
- Information Research, Research and Development, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Michael J. Liguori
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Mohan S. Rao
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| |
Collapse
|
28
|
Watford S, Ly Pham L, Wignall J, Shin R, Martin MT, Friedman KP. ToxRefDB version 2.0: Improved utility for predictive and retrospective toxicology analyses. Reprod Toxicol 2019; 89:145-158. [PMID: 31340180 PMCID: PMC6944327 DOI: 10.1016/j.reprotox.2019.07.012] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Revised: 05/31/2019] [Accepted: 07/12/2019] [Indexed: 02/08/2023]
Abstract
The Toxicity Reference Database (ToxRefDB) structures information from over 5000 in vivo toxicity studies, conducted largely to guidelines or specifications from the US Environmental Protection Agency and the National Toxicology Program, into a public resource for training and validation of predictive models. Herein, ToxRefDB version 2.0 (ToxRefDBv2) development is described. Endpoints were annotated (e.g. required, not required) according to guidelines for subacute, subchronic, chronic, developmental, and multigenerational reproductive designs, distinguishing negative responses from untested. Quantitative data were extracted, and dose-response modeling for nearly 28,000 datasets from nearly 400 endpoints using Benchmark Dose (BMD) Modeling Software were generated and stored. Implementation of controlled vocabulary improved data quality; standardization to guideline requirements and cross-referencing with United Medical Language System (UMLS) connects ToxRefDBv2 observations to vocabularies linked to UMLS, including PubMed medical subject headings. ToxRefDBv2 allows for increased connections to other resources and has greatly enhanced quantitative and qualitative utility for predictive toxicology.
Collapse
Affiliation(s)
- Sean Watford
- ORAU, Contractor to U.S. Environmental Protection Agency through the National Student Services Contract, United States; National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency, United States
| | - Ly Ly Pham
- ORAU, Contractor to U.S. Environmental Protection Agency through the National Student Services Contract, United States; ORISE Postdoctoral Research Participant, United States
| | | | | | - Matthew T Martin
- ORAU, Contractor to U.S. Environmental Protection Agency through the National Student Services Contract, United States; Currently at Drug Safety Research and Development, Global Investigative Toxicology, Pfizer, Groton, CT, United States
| | - Katie Paul Friedman
- National Center for Computational Toxicology, Office of Research and Development, US Environmental Protection Agency, United States.
| |
Collapse
|
29
|
Incorporating new approach methodologies in toxicity testing and exposure assessment for tiered risk assessment using the RISK21 approach: Case studies on food contact chemicals. Food Chem Toxicol 2019; 134:110819. [PMID: 31545997 PMCID: PMC7429715 DOI: 10.1016/j.fct.2019.110819] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Revised: 08/30/2019] [Accepted: 09/13/2019] [Indexed: 12/23/2022]
Abstract
Programs including the ToxCast project have generated large amounts of in vitro high‒throughput screening (HTS) data, and best approaches for the interpretation and use of HTS data, including for chemical safety assessment, remain to be evaluated. To fill this gap, we conducted case studies of two indirect food additive chemicals where ToxCast data were compared with in vivo toxicity data using the RISK21 approach. Two food contact substances, sodium (2-pyridylthio)-N-oxide and dibutyltin dichloride, were selected, and available exposure data, toxicity data, and model predictions were compiled and assessed. Oral equivalent doses for the ToxCast bioactivity data were determined by in-vitro in-vivo extrapolation (IVIVE). For sodium (2-pyridylthio)-N-oxide, bioactive concentrations in ToxCast assays corresponded to low-and no-observed adverse effect levels in animal studies. For dibutyltin dichloride, the ToxCast bioactive concentrations were below the dose range that demonstrated toxicity in animals; however, this was confounded by the lack of toxicokinetic data, necessitating the use of conservative toxicokinetic parameter estimates for IVIVE calculations. This study highlights the potential utility of the RISK21 approach for interpretation of the ToxCast HTS data, as well as the challenges involved in integrating in vitro HTS data into safety assessments.
Collapse
|
30
|
Mansouri K, Cariello NF, Korotcov A, Tkachenko V, Grulke CM, Sprankle CS, Allen D, Casey WM, Kleinstreuer NC, Williams AJ. Open-source QSAR models for pKa prediction using multiple machine learning approaches. J Cheminform 2019; 11:60. [PMID: 33430972 PMCID: PMC6749653 DOI: 10.1186/s13321-019-0384-1] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 09/03/2019] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND The logarithmic acid dissociation constant pKa reflects the ionization of a chemical, which affects lipophilicity, solubility, protein binding, and ability to pass through the plasma membrane. Thus, pKa affects chemical absorption, distribution, metabolism, excretion, and toxicity properties. Multiple proprietary software packages exist for the prediction of pKa, but to the best of our knowledge no free and open-source programs exist for this purpose. Using a freely available data set and three machine learning approaches, we developed open-source models for pKa prediction. METHODS The experimental strongest acidic and strongest basic pKa values in water for 7912 chemicals were obtained from DataWarrior, a freely available software package. Chemical structures were curated and standardized for quantitative structure-activity relationship (QSAR) modeling using KNIME, and a subset comprising 79% of the initial set was used for modeling. To evaluate different approaches to modeling, several datasets were constructed based on different processing of chemical structures with acidic and/or basic pKas. Continuous molecular descriptors, binary fingerprints, and fragment counts were generated using PaDEL, and pKa prediction models were created using three machine learning methods, (1) support vector machines (SVM) combined with k-nearest neighbors (kNN), (2) extreme gradient boosting (XGB) and (3) deep neural networks (DNN). RESULTS The three methods delivered comparable performances on the training and test sets with a root-mean-squared error (RMSE) around 1.5 and a coefficient of determination (R2) around 0.80. Two commercial pKa predictors from ACD/Labs and ChemAxon were used to benchmark the three best models developed in this work, and performance of our models compared favorably to the commercial products. CONCLUSIONS This work provides multiple QSAR models to predict the strongest acidic and strongest basic pKas of chemicals, built using publicly available data, and provided as free and open-source software on GitHub.
Collapse
Affiliation(s)
- Kamel Mansouri
- Integrated Laboratory Systems, Inc., P.O. Box 13501, Research Triangle Park, NC 27709 USA
| | - Neal F. Cariello
- Integrated Laboratory Systems, Inc., P.O. Box 13501, Research Triangle Park, NC 27709 USA
| | - Alexandru Korotcov
- Science Data Software LLC, 14914 Bradwill Court, Rockville, MD 20850 USA
| | - Valery Tkachenko
- Science Data Software LLC, 14914 Bradwill Court, Rockville, MD 20850 USA
| | - Chris M. Grulke
- National Center for Computational Toxicology, U.S. Environmental Protection Agency, 109 T.W. Alexander Dr., Mail Code D143-02, Research Triangle Park, NC 27709 USA
| | - Catherine S. Sprankle
- Integrated Laboratory Systems, Inc., P.O. Box 13501, Research Triangle Park, NC 27709 USA
| | - David Allen
- Integrated Laboratory Systems, Inc., P.O. Box 13501, Research Triangle Park, NC 27709 USA
| | - Warren M. Casey
- National Institute of Environmental Health Sciences, P.O. Box 12233, Mail Stop K2-16, Research Triangle Park, NC 27709 USA
| | - Nicole C. Kleinstreuer
- National Institute of Environmental Health Sciences, P.O. Box 12233, Mail Stop K2-16, Research Triangle Park, NC 27709 USA
| | - Antony J. Williams
- National Center for Computational Toxicology, U.S. Environmental Protection Agency, 109 T.W. Alexander Dr., Mail Code D143-02, Research Triangle Park, NC 27709 USA
| |
Collapse
|
31
|
Gellatly N, Sewell F. Regulatory acceptance of in silico approaches for the safety assessment of cosmetic-related substances. ACTA ACUST UNITED AC 2019. [DOI: 10.1016/j.comtox.2019.03.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
|
32
|
Guth BD, Grobler AF, Frazier KS, Greiter-Wilke A, Herzyk D, Hough TA, Khan AA, Markert M, Smith JD, Svenson KL, Wells S, Pugsley MK. Drug safety Africa: An overview of safety pharmacology & toxicology in South Africa. J Pharmacol Toxicol Methods 2019; 98:106579. [PMID: 31085319 DOI: 10.1016/j.vascn.2019.106579] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Revised: 04/25/2019] [Accepted: 05/01/2019] [Indexed: 12/22/2022]
Abstract
This meeting report is based on presentations given at the first Drug Safety Africa Meeting in Potchefstroom, South Africa from November 20-22, 2018 at the North-West University campus. There were 134 attendees (including 26 speakers and 34 students) from the pharmaceutical industry, academia, regulatory agencies as well as 6 exhibitors. These meeting proceedings are designed to inform the content that was presented in terms of Safety Pharmacology (SP) and Toxicology methods and models that are used by the pharmaceutical industry to characterize the safety profile of novel small chemical or biological molecules. The first part of this report includes an overview of the core battery studies defined by cardiovascular, central nervous system (CNS) and respiratory studies. Approaches to evaluating drug effects on the renal and gastrointestinal systems and murine phenotyping were also discussed. Subsequently, toxicological approaches were presented including standard strategies and options for early identification and characterization of risks associated with a novel therapeutic, the types of toxicology studies conducted and relevance to risk assessment supporting first-in-human (FIH) clinical trials and target organ toxicity. Biopharmaceutical development and principles of immunotoxicology were discussed as well as emerging technologies. An additional poster session was held that included 18 posters on advanced studies and topics by South African researchers, postgraduate students and postdoctoral fellows.
Collapse
Affiliation(s)
- Brian D Guth
- Boehringer Ingelheim GmbH & Co KG, Biberach an der Riss, Germany; North-West University, Potchefstroom, South Africa.
| | | | | | | | - Danuta Herzyk
- Merck Sharp & Dohme Corp., A subsidiary of Merck & Co., Inc., Kenilworth, NJ, USA
| | - Tertius A Hough
- Mary Lyon Centre and Mammalian Genetics Unit, Medical Research Council, Harwell, UK
| | | | - Michael Markert
- Boehringer Ingelheim GmbH & Co KG, Biberach an der Riss, Germany
| | - James D Smith
- Boehringer-Ingelheim Pharmaceuticals Inc., Ridgefield, CT, USA
| | | | - Sara Wells
- Mary Lyon Centre and Mammalian Genetics Unit, Medical Research Council, Harwell, UK
| | | |
Collapse
|
33
|
Sun Y, Shi S, Li Y, Wang Q. Development of quantitative structure-activity relationship models to predict potential nephrotoxic ingredients in traditional Chinese medicines. Food Chem Toxicol 2019; 128:163-170. [PMID: 30954639 DOI: 10.1016/j.fct.2019.03.056] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2019] [Revised: 03/26/2019] [Accepted: 03/31/2019] [Indexed: 12/13/2022]
Abstract
The broad use of traditional Chinese medicines (TCMs) and the accompanied incidences of kidney injury have attracted considerable interest in investigating the responsible toxic ingredients. It is challenging to evaluate toxicity of TCMs since they contain complex mixtures of phytochemicals. Quantitative structure-activity relationship (QSAR) is an efficient tool to predict toxicity and QSAR study on TCMs-induced nephrotoxicity remains lacked. We developed QSAR models using three datasets of 609 compounds: natural products, drugs, and mixed (contained both kinds of data) datasets. Each dataset was used for modelling by utilizing artificial neural networks (ANN) and support vector machines (SVM) algorithms separately. Both internal and external validations were performed on each model. Six QSAR models were developed and yielded reliable performance in the internal validation. For external validation, 30 ingredients in the TCMs were predicted well by the natural product models (accuracy: ANN 96.7%, SVM 93.3%). The mixed models (accuracy: ANN 76.7%, SVM 66.7%) showed a better performance than the drug models (accuracy: ANN 50%, SVM 53.3%). Particularly, natural product models produced the most reliable results. It has the application not only on screening the nephrotoxic ingredients in TCMs, but it is also helpful at prioritizing the subsequent toxicity testing of natural products.
Collapse
Affiliation(s)
- Yuqing Sun
- Department of Toxicology, School of Public Health, Peking University, Beijing, 100191, China; Key Laboratory of State Administration of Traditional Chinese Medicine for Compatibility Toxicology, Beijing, 100191, China
| | - Shaoze Shi
- Department of Toxicology, School of Public Health, Peking University, Beijing, 100191, China; Key Laboratory of State Administration of Traditional Chinese Medicine for Compatibility Toxicology, Beijing, 100191, China
| | - Yaqiu Li
- Department of Toxicology, School of Public Health, Peking University, Beijing, 100191, China; Key Laboratory of State Administration of Traditional Chinese Medicine for Compatibility Toxicology, Beijing, 100191, China
| | - Qi Wang
- Department of Toxicology, School of Public Health, Peking University, Beijing, 100191, China; Key Laboratory of State Administration of Traditional Chinese Medicine for Compatibility Toxicology, Beijing, 100191, China; Beijing Key Laboratory of Toxicological Research and Risk Assessment for Food Safety, Beijing, 100191, China.
| |
Collapse
|
34
|
Grenet I, Merlo K, Comet JP, Tertiaux R, Rouquié D, Dayan F. Stacked Generalization with Applicability Domain Outperforms Simple QSAR on in Vitro Toxicological Data. J Chem Inf Model 2019; 59:1486-1496. [PMID: 30735402 DOI: 10.1021/acs.jcim.8b00553] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
The development of in silico tools able to predict bioactivity and toxicity of chemical substances is a powerful solution envisioned to assess toxicity as early as possible. To enable the development of such tools, the ToxCast program has generated and made publicly available in vitro bioactivity data for thousands of compounds. The goal of the present study is to characterize and explore the data from ToxCast in terms of Machine Learning capability. For this, a large scale analysis on the entire database has been performed to build models to predict bioactivities measured in in vitro assays. Simple classical QSAR algorithms (ANN, SVM, LDA, random forest, and Bayesian) were first applied on the data, and the results of these algorithms suggested that they do not seem to be well-suited for data sets with a high proportion of inactive compounds. The study then showed for the first time that the use of an ensemble method named "Stacked generalization" could improve the model performance on this type of data. Indeed, for 61% of 483 models, the Stacked method led to models with higher performance. Moreover, the combination of this ensemble method with an applicability domain filter allows one to assess the reliability of the predictions for further compound prioritization. In particular we showed that for 50% of the models, the ROC score is better if we do not consider the compounds that are not within the applicability domain.
Collapse
Affiliation(s)
- Ingrid Grenet
- University Côte d'Azur, I3S Laboratory , UMR CNRS 7271, CS 40121, 06903 Sophia Antipolis Cedex, France.,Bayer SAS , 06903 Sophia Antipolis Cedex, France
| | - Kevin Merlo
- Dassault Systèmes SE , 06906 Sophia Antipolis, Biot , France
| | - Jean-Paul Comet
- University Côte d'Azur, I3S Laboratory , UMR CNRS 7271, CS 40121, 06903 Sophia Antipolis Cedex, France
| | - Romain Tertiaux
- Dassault Systèmes SE , 06906 Sophia Antipolis, Biot , France
| | | | - Frédéric Dayan
- Dassault Systèmes SE , 06906 Sophia Antipolis, Biot , France
| |
Collapse
|
35
|
Dreier DA, Denslow ND, Martyniuk CJ. Computational in Vitro Toxicology Uncovers Chemical Structures Impairing Mitochondrial Membrane Potential. J Chem Inf Model 2019; 59:702-712. [PMID: 30645939 DOI: 10.1021/acs.jcim.8b00433] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Technological advances in molecular biology have enabled high-throughput screening (HTS) of large chemical libraries. These approaches have provided valuable toxicity data for many physiological responses, including mitochondrial dysfunction. While several quantitative structure-activity relationship (QSAR) models have been developed for mitochondrial dysfunction, there remains a need to identify specific chemical features associated with this response. Thus, the objective of this study was to identify chemical structures associated with altered mitochondrial membrane potential (MMP). To achieve this, we developed computational models to examine the relationship between specific chemotypes (e.g., ToxPrints) and bioactivity in ToxCast/Tox21 HTS assays for altered MMP. The analysis revealed that the "bond:COH_alcohol_aromatic", "bond:COH_alcohol_aromatic_phenol", and "ring:aromatic_benzene" ToxPrints had the highest average correlations (phi coefficient) with ToxCast/Tox21 assay component endpoints for decreased MMP. These structures also constituted a "core" group of ToxPrints for decreased MMP in a force-directed network model and were the most important chemotypes in a random forest (RF) classification model for the "TOX21_MMP_ratio_down" assay component endpoint. Based on multiple lines of evidence, these structures, which are present in numerous chemicals (e.g., aromatic hydrocarbons, pesticides, and industrial chemicals) are likely involved in mitochondrial dysfunction. Because of the hierarchical structure of ToxPrints, these chemotypes were highly convergent and, when excluded from training data, had limited effects on the classification performance as related structures compensated for predictor loss. These results highlight the flexibility of the RF algorithm and ToxPrints for QSAR modeling, which is useful to identify chemicals affecting mitochondrial function.
Collapse
Affiliation(s)
- David A Dreier
- Center for Environmental & Human Toxicology, Department of Physiological Sciences, College of Veterinary Medicine , University of Florida , Gainesville , Florida 32611 , United States
| | - Nancy D Denslow
- Center for Environmental & Human Toxicology, Department of Physiological Sciences, College of Veterinary Medicine , University of Florida , Gainesville , Florida 32611 , United States
| | - Christopher J Martyniuk
- Center for Environmental & Human Toxicology, Department of Physiological Sciences, College of Veterinary Medicine , University of Florida , Gainesville , Florida 32611 , United States
| |
Collapse
|
36
|
Klaren WD, Ring C, Harris MA, Thompson CM, Borghoff S, Sipes NS, Hsieh JH, Auerbach SS, Rager JE. Identifying Attributes That Influence In Vitro-to-In Vivo Concordance by Comparing In Vitro Tox21 Bioactivity Versus In Vivo DrugMatrix Transcriptomic Responses Across 130 Chemicals. Toxicol Sci 2019; 167:157-171. [PMID: 30202884 PMCID: PMC6317427 DOI: 10.1093/toxsci/kfy220] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Recent efforts aimed at integrating in vitro high-throughput screening (HTS) data into chemical toxicity assessments are necessitating increased understanding of concordance between chemical-induced responses observed in vitro versus in vivo. This investigation set out to (1) measure concordance between in vitro HTS data and transcriptomic responses observed in vivo, focusing on the liver, and (2) identify attributes that can influence concordance. Signal response profiles from 130 substances were compared between in vitro data produced through Tox21 and liver transcriptomic data through DrugMatrix, collected from rats exposed to a chemical for ≤5 days. A global in vitro-to-in vivo comparative analysis based on pathway-level responses resulted in an overall average percent agreement of 79%, ranging on a per-chemical basis between 41% and 100%. Whereas concordance amongst inactive chemicals was high (89%), concordance amongst chemicals showing in vitro activity was only 13%, suggesting that follow-up in vivo and/or orthogonal in vitro assays would improve interpretations of in vitro activity. Attributes identified to influence concordance included experimental design attributes (eg, cell type), target pathways, and physicochemical properties (eg, logP). The attribute that most consistently increased concordance was dose applicability, evaluated by filtering for experimental doses administered to rats that were within 10-fold of those related to likely bioactivity, derived using Tox21 data and high-throughput toxicokinetic modeling. Together, findings suggest that in vitro screening approaches to predict in vivo toxicity are viable particularly when certain attributes are considered, including whether activity versus inactivity is observed, experimental design, chemical properties, and dose applicability.
Collapse
Affiliation(s)
- William D Klaren
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77840
| | | | | | | | | | - Nisha S Sipes
- National Toxicology Program, National Institutes of Health, Research Triangle Park, North Carolina 27709and
| | - Jui-Hua Hsieh
- Kelly Government Solutions, Durham, North Carolina 27709
| | - Scott S Auerbach
- National Toxicology Program, National Institutes of Health, Research Triangle Park, North Carolina 27709and
| | | |
Collapse
|
37
|
Pastor M, Quintana J, Sanz F. Development of an Infrastructure for the Prediction of Biological Endpoints in Industrial Environments. Lessons Learned at the eTOX Project. Front Pharmacol 2018; 9:1147. [PMID: 30364191 PMCID: PMC6193068 DOI: 10.3389/fphar.2018.01147] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 09/21/2018] [Indexed: 11/13/2022] Open
Abstract
In silico methods are increasingly being used for assessing the chemical safety of substances, as a part of integrated approaches involving in vitro and in vivo experiments. A paradigmatic example of these strategies is the eTOX project http://www.etoxproject.eu, funded by the European Innovative Medicines Initiative (IMI), which aimed at producing high quality predictions of in vivo toxicity of drug candidates and resulted in generating about 200 models for diverse endpoints of toxicological interest. In an industry-oriented project like eTOX, apart from the predictive quality, the models need to meet other quality parameters related to the procedures for their generation and their intended use. For example, when the models are used for predicting the properties of drug candidates, the prediction system must guarantee the complete confidentiality of the compound structures. The interface of the system must be designed to provide non-expert users all the information required to choose the models and appropriately interpret the results. Moreover, procedures like installation, maintenance, documentation, validation and versioning, which are common in software development, must be also implemented for the models and for the prediction platform in which they are implemented. In this article we describe our experience in the eTOX project and the lessons learned after 7 years of close collaboration between industrial and academic partners. We believe that some of the solutions found and the tools developed could be useful for supporting similar initiatives in the future.
Collapse
Affiliation(s)
| | | | - Ferran Sanz
- *Correspondence: Manuel Pastor, Ferran Sanz,
| |
Collapse
|
38
|
Stuard SB, Heinonen T. Relevance and Application of Read-Across - Mini Review of European Consensus Platform for Alternatives and Scandinavian Society for Cell Toxicology 2017 Workshop Session. Basic Clin Pharmacol Toxicol 2018. [PMID: 29524304 DOI: 10.1111/bcpt.13006] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Structure activity relationship (SAR)-based read-across is an effective approach for addressing data gaps in human health risk assessment for 'data-poor' chemicals. In read-across, available data on chemical structural analogues are used to predict the toxicity potential of the data-poor chemical. This approach has long been recognized by regulatory agencies and used by industry to evaluate the hazards of chemicals for which there are limited direct data. Construction of a scientifically robust SAR-based read-across hazard assessment is a complex and iterative process involving multiple considerations in each step. Traditional in vivo data generated using regulatory guideline compliant study designs typically forms the basis for read-across assessments. Recently, however, new data streams have been explored and incorporated to enhance read-across predictivity. These in vitro and omics data streams may be used in different ways involving identification of hazards or to provide insight into modes of action for observed toxicological responses. These 'new approach methods' can also enable comparison of biological responses across analogues or a category of structurally related chemicals in order to establish a pattern of biological similarity in addition chemical similarity and/or to help address potency differences across a category. The purpose of this workshop session was to inform on practical considerations in conducting SAR-based read-across assessments and to review recent activities related to application of new approach methods to the practice of read-across.
Collapse
|