1
|
Sui S, Zhou N, Liu H, Watson P, Yang X. Recognizing high-priority disinfection byproducts based on experimental and predicted endocrine disrupting data: Virtual screening and in vitro study. CHEMOSPHERE 2024; 358:142239. [PMID: 38705414 DOI: 10.1016/j.chemosphere.2024.142239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 04/25/2024] [Accepted: 05/02/2024] [Indexed: 05/07/2024]
Abstract
So far, about 130 disinfection by-products (DBPs) and several DBPs-groups have had their potential endocrine-disrupting effects tested on some endocrine endpoints. However, it is still not clear which specific DBPs, DBPs-groups/subgroups may be the most toxic substances or groups/subgroups for any given endocrine endpoint. In this study, we attempt to address this issue. First, a list of relevant DBPs was updated, and 1187 DBPs belonging to 4 main-groups (aliphatic, aromatic, alicyclic, heterocyclic) and 84 subgroups were described. Then, the high-priority endocrine endpoints, DBPs-groups/subgroups, and specific DBPs were determined from 18 endpoints, 4 main-groups, 84 subgroups, and 1187 specific DBPs by a virtual-screening method. The results demonstrate that most of DBPs could not disturb the endocrine endpoints in question because the proportion of active compounds associated with the endocrine endpoints ranged from 0 (human thyroid receptor beta) to 32% (human transthyretin (hTTR)). All the endpoints with a proportion of active compounds greater than 10% belonged to the thyroid system, highlighting that the potential disrupting effects of DBPs on the thyroid system should be given more attention. The aromatic and alicyclic DBPs may have higher priority than that of aliphatic and heterocyclic DBPs by considering the activity rate and potential for disrupting effects. There were 2 (halophenols and estrogen DBPs), 12, and 24 subgroups that belonged to high, moderate, and low priority classes, respectively. For individual DBPs, there were 23 (2%), 193 (16%), and 971 (82%) DBPs belonging to the high, moderate, and low priority groups, respectively. Lastly, the hTTR binding affinity of 4 DBPs was determined by an in vitro assay and all the tested DBPs exhibited dose-dependent binding potency with hTTR, which was consistent with the predicted result. Thus, more efforts should be performed to reveal the potential endocrine disruption of those high research-priority main-groups, subgroups, and individual DBPs.
Collapse
Affiliation(s)
- Shuxin Sui
- Jiangsu Key Laboratory of Chemical Pollution Control and Resources Reuse, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Nan Zhou
- Jiangsu Key Laboratory of Chemical Pollution Control and Resources Reuse, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Huihui Liu
- Jiangsu Key Laboratory of Chemical Pollution Control and Resources Reuse, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Peter Watson
- Los Alamos National Laboratory, Los Alamos, 87545, New Mexico, United States
| | - Xianhai Yang
- Jiangsu Key Laboratory of Chemical Pollution Control and Resources Reuse, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China.
| |
Collapse
|
2
|
Helmke P, Füzi B, Ecker GF. Bioactivity descriptors for in vivo toxicity prediction: now and the future. Expert Opin Drug Metab Toxicol 2024:1-3. [PMID: 38530269 DOI: 10.1080/17425255.2024.2334308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 03/20/2024] [Indexed: 03/27/2024]
Affiliation(s)
- Palle Helmke
- University of Vienna, Department of Pharmaceutical Sciences, Josef-Holaubek Platz 2, Wien, Austria
| | - Barbara Füzi
- University of Vienna, Department of Pharmaceutical Sciences, Josef-Holaubek Platz 2, Wien, Austria
| | - Gerhard F Ecker
- University of Vienna, Department of Pharmaceutical Sciences, Josef-Holaubek Platz 2, Wien, Austria
| |
Collapse
|
3
|
Ji S. SSC: The novel self-stack ensemble model for thyroid disease prediction. PLoS One 2024; 19:e0295501. [PMID: 38170718 PMCID: PMC10763970 DOI: 10.1371/journal.pone.0295501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 11/22/2023] [Indexed: 01/05/2024] Open
Abstract
Thyroid disease presents a significant health risk, lowering the quality of life and increasing treatment costs. The diagnosis of thyroid disease can be challenging, especially for inexperienced practitioners. Machine learning has been established as one of the methods for disease diagnosis based on previous studies. This research introduces a novel and more effective technique for predicting thyroid disease by utilizing machine learning methodologies, surpassing the performance of previous studies in this field. This study utilizes the UCI thyroid disease dataset, which consists of 9172 samples and 30 features, and exhibits a highly imbalanced target class distribution. However, machine learning algorithms trained on imbalanced thyroid disease data face challenges in reliably detecting minority data and disease. To address this issue, re-sampling is employed, which modifies the ratio between target classes to balance the data. In this study, the down-sampling approach is utilized to achieve a balanced distribution of target classes. A novel RF-based self-stacking classifier is presented in this research for efficient thyroid disease detection. The proposed approach demonstrates the ability to diagnose primary hypothyroidism, increased binding protein, compensated hypothyroidism, and concurrent non-thyroidal illness with an accuracy of 99.5%. The recommended model exhibits state-of-the-art performance, achieving 100% macro precision, 100% macro recall, and 100% macro F1-score. A thorough comparative assessment is conducted to demonstrate the viability of the proposed approach, including several machine learning classifiers, deep neural networks, and ensemble voting classifiers. The results of K-fold cross-validation provide further support for the efficacy of the proposed self-stacking classifier.
Collapse
Affiliation(s)
- Shengjun Ji
- School of information, Xi’an University of Finance and Economics, Xi’an, China
| |
Collapse
|
4
|
Arturi K, Hollender J. Machine Learning-Based Hazard-Driven Prioritization of Features in Nontarget Screening of Environmental High-Resolution Mass Spectrometry Data. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:18067-18079. [PMID: 37279189 PMCID: PMC10666537 DOI: 10.1021/acs.est.3c00304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 05/15/2023] [Accepted: 05/15/2023] [Indexed: 06/08/2023]
Abstract
Nontarget high-resolution mass spectrometry screening (NTS HRMS/MS) can detect thousands of organic substances in environmental samples. However, new strategies are needed to focus time-intensive identification efforts on features with the highest potential to cause adverse effects instead of the most abundant ones. To address this challenge, we developed MLinvitroTox, a machine learning framework that uses molecular fingerprints derived from fragmentation spectra (MS2) for a rapid classification of thousands of unidentified HRMS/MS features as toxic/nontoxic based on nearly 400 target-specific and over 100 cytotoxic endpoints from ToxCast/Tox21. Model development results demonstrated that using customized molecular fingerprints and models, over a quarter of toxic endpoints and the majority of the associated mechanistic targets could be accurately predicted with sensitivities exceeding 0.95. Notably, SIRIUS molecular fingerprints and xboost (Extreme Gradient Boosting) models with SMOTE (Synthetic Minority Oversampling Technique) for handling data imbalance were a universally successful and robust modeling configuration. Validation of MLinvitroTox on MassBank spectra showed that toxicity could be predicted from molecular fingerprints derived from MS2 with an average balanced accuracy of 0.75. By applying MLinvitroTox to environmental HRMS/MS data, we confirmed the experimental results obtained with target analysis and narrowed the analytical focus from tens of thousands of detected signals to 783 features linked to potential toxicity, including 109 spectral matches and 30 compounds with confirmed toxic activity.
Collapse
Affiliation(s)
- Katarzyna Arturi
- Department
of Environmental Chemistry, Swiss Federal
Institute of Aquatic Science and Technology (Eawag), Ueberlandstrasse 133, 8600 Dübendorf, Switzerland
| | - Juliane Hollender
- Department
of Environmental Chemistry, Swiss Federal
Institute of Aquatic Science and Technology (Eawag), Ueberlandstrasse 133, 8600 Dübendorf, Switzerland
- Institute
of Biogeochemistry and Pollution Dynamics, Eidgenössische Technische Hochschule Zürich (ETH Zurich), Rämistrasse 101, 8092 Zürich, Switzerland
| |
Collapse
|
5
|
Sahoo AK, Baskaran SP, Chivukula N, Kumar K, Samal A. Analysis of structure-activity and structure-mechanism relationships among thyroid stimulating hormone receptor binding chemicals by leveraging the ToxCast library. RSC Adv 2023; 13:23461-23471. [PMID: 37546222 PMCID: PMC10401517 DOI: 10.1039/d3ra04452a] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 07/31/2023] [Indexed: 08/08/2023] Open
Abstract
The thyroid stimulating hormone receptor (TSHR) is crucial in thyroid hormone production in humans, and dysregulation in TSHR activation can lead to adverse health effects such as hypothyroidism and Graves' disease. Further, animal studies have shown that binding of endocrine disrupting chemicals (EDCs) with TSHR can lead to developmental toxicity. Hence, several such chemicals have been screened for their adverse physiological effects in human cell lines via high-throughput assays in the ToxCast project. The invaluable data generated by the ToxCast project has enabled the development of toxicity predictors, but they can be limited in their predictive ability due to the heterogeneity in structure-activity relationships among chemicals. Here, we systematically investigated the heterogeneity in structure-activity as well as structure-mechanism relationships among the TSHR binding chemicals from ToxCast. By employing a structure-activity similarity (SAS) map, we identified 79 activity cliffs among 509 chemicals in TSHR agonist dataset and 69 activity cliffs among 650 chemicals in the TSHR antagonist dataset. Further, by using the matched molecular pair (MMP) approach, we find that the resultant activity cliffs (MMP-cliffs) are a subset of activity cliffs identified via the SAS map approach. Subsequently, by leveraging ToxCast mechanism of action (MOA) annotations for chemicals common to both TSHR agonist and TSHR antagonist datasets, we identified 3 chemical pairs as strong MOA-cliffs and 19 chemical pairs as weak MOA-cliffs. In conclusion, the insights from this systematic investigation of the TSHR binding chemicals are likely to inform ongoing efforts towards development of better predictive toxicity models for characterization of the chemical exposome.
Collapse
Affiliation(s)
- Ajaya Kumar Sahoo
- The Institute of Mathematical Sciences (IMSc) Chennai 600113 India
- Homi Bhabha National Institute (HBNI) Mumbai 400094 India
| | - Shanmuga Priya Baskaran
- The Institute of Mathematical Sciences (IMSc) Chennai 600113 India
- Homi Bhabha National Institute (HBNI) Mumbai 400094 India
| | - Nikhil Chivukula
- The Institute of Mathematical Sciences (IMSc) Chennai 600113 India
- Homi Bhabha National Institute (HBNI) Mumbai 400094 India
| | - Kishan Kumar
- The Institute of Mathematical Sciences (IMSc) Chennai 600113 India
| | - Areejit Samal
- The Institute of Mathematical Sciences (IMSc) Chennai 600113 India
- Homi Bhabha National Institute (HBNI) Mumbai 400094 India
| |
Collapse
|
6
|
Melching-Kollmuss S, Bothe K, Charlton A, Gangadharan B, Ghaffari R, Jacobi S, Marty S, Marxfeld HA, McInnes EF, Sauer UG, Sheets LP, Strupp C, Tinwell H, Wiemann C, Botham PA, van Ravenzwaay B. Towards a science-based testing strategy to identify maternal thyroid hormone imbalance and neurodevelopmental effects in the progeny - Part IV: the ECETOC and CLE Proposal for a Thyroid Function-Related Neurodevelopmental Toxicity Testing and Assessment Scheme (Thyroid-NDT-TAS). Crit Rev Toxicol 2023; 53:339-371. [PMID: 37554099 DOI: 10.1080/10408444.2023.2231033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 06/22/2023] [Accepted: 06/22/2023] [Indexed: 08/10/2023]
Abstract
Following the European Commission Endocrine Disruptor Criteria, substances shall be considered as having endocrine disrupting properties if they (a) elicit adverse effects, (b) have endocrine activity, and (c) the two are linked by an endocrine mode-of-action (MoA) unless the MoA is not relevant for humans. A comprehensive, structured approach to assess whether substances meet the Endocrine Disruptor Criteria for the thyroid modality (EDC-T) is currently unavailable. Here, the European Centre for Ecotoxicology and Toxicology of Chemicals Thyroxine Task Force and CropLife Europe propose a Thyroid Function-Related Neurodevelopmental Toxicity Testing and Assessment Scheme (Thyroid-NDT-TAS). In Tier 0, before entering the Thyroid-NDT-TAS, all available in vivo, in vitro and in silico data are submitted to weight-of-evidence (WoE) evaluations to determine whether the substance of interest poses a concern for thyroid disruption. If so, Tier 1 of the Thyroid-NDT-TAS includes an initial MoA and human relevance assessment (structured by the key events of possibly relevant adverse outcome pathways) and the generation of supportive in vitro/in silico data, if relevant. Only if Tier 1 is inconclusive, Tier 2 involves higher-tier testing to generate further thyroid- and/or neurodevelopment-related data. Tier 3 includes the final MoA and human relevance assessment and an overarching WoE evaluation to draw a conclusion on whether, or not, the substance meets the EDC-T. The Thyroid-NDT-TAS is based on the state-of-the-science, and it has been developed to minimise animal testing. To make human safety assessments more accurate, it is recommended to apply the Thyroid-NDT-TAS during future regulatory assessments.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Ursula G Sauer
- Scientific Consultancy - Animal Welfare, Neubiberg, Germany
| | | | | | | | | | | | | |
Collapse
|
7
|
Liu W, Wang Z, Chen J, Tang W, Wang H. Machine Learning Model for Screening Thyroid Stimulating Hormone Receptor Agonists Based on Updated Datasets and Improved Applicability Domain Metrics. Chem Res Toxicol 2023. [PMID: 37209109 DOI: 10.1021/acs.chemrestox.3c00074] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Machine learning (ML) models for screening endocrine-disrupting chemicals (EDCs), such as thyroid stimulating hormone receptor (TSHR) agonists, are essential for sound management of chemicals. Previous models for screening TSHR agonists were built on imbalanced datasets and lacked applicability domain (AD) characterization essential for regulatory application. Herein, an updated TSHR agonist dataset was built, for which the ratio of active to inactive compounds greatly increased to 1:2.6, and chemical spaces of structure-activity landscapes (SALs) were enhanced. Resulting models based on 7 molecular representations and 4 ML algorithms were proven to outperform previous ones. Weighted similarity density (ρs) and weighted inconsistency of activities (IA) were proposed to characterize the SALs, and a state-of-the-art AD characterization methodology ADSAL{ρs, IA} was established. An optimal classifier developed with PubChem fingerprints and the random forest algorithm, coupled with ADSAL{ρs ≥ 0.15, IA ≤ 0.65}, exhibited good performance on the validation set with the area under the receiver operating characteristic curve being 0.984 and balanced accuracy being 0.941 and identified 90 TSHR agonist classes that could not be found previously. The classifier together with the ADSAL{ρs, IA} may serve as efficient tools for screening EDCs, and the AD characterization methodology may be applied to other ML models.
Collapse
Affiliation(s)
- Wenjia Liu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Zhongyu Wang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Jingwen Chen
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Weihao Tang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Haobo Wang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| |
Collapse
|
8
|
Sapounidou M, Norinder U, Andersson PL. Predicting Endocrine Disruption Using Conformal Prediction - A Prioritization Strategy to Identify Hazardous Chemicals with Confidence. Chem Res Toxicol 2022; 36:53-65. [PMID: 36534483 PMCID: PMC9846826 DOI: 10.1021/acs.chemrestox.2c00267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Receptor-mediated molecular initiating events (MIEs) and their relevance in endocrine activity (EA) have been highlighted in literature. More than 15 receptors have been associated with neurodevelopmental adversity and metabolic disruption. MIEs describe chemical interactions with defined biological outcomes, a relationship that could be described with quantitative structure-activity relationship (QSAR) models. QSAR uncertainty can be assessed using the conformal prediction (CP) framework, which provides similarity (i.e., nonconformity) scores relative to the defined classes per prediction. CP calibration can indirectly mitigate data imbalance during model development, and the nonconformity scores serve as intrinsic measures of chemical applicability domain assessment during screening. The focus of this work was to propose an in silico predictive strategy for EA. First, 23 QSAR models for MIEs associated with EA were developed using high-throughput data for 14 receptors. To handle the data imbalance, five protocols were compared, and CP provided the most balanced class definition. Second, the developed QSAR models were applied to a large data set (∼55,000 chemicals), comprising chemicals representative of potential risk for human exposure. Using CP, it was possible to assess the uncertainty of the screening results and identify model strengths and out of domain chemicals. Last, two clustering methods, t-distributed stochastic neighbor embedding and Tanimoto similarity, were used to identify compounds with potential EA using known endocrine disruptors as reference. The cluster overlap between methods produced 23 chemicals with suspected or demonstrated EA potential. The presented models could be utilized for first-tier screening and identification of compounds with potential biological activity across the studied MIEs.
Collapse
Affiliation(s)
| | - Ulf Norinder
- Department
of Computer and Systems Sciences, Stockholm
University, Box 7003, 164
07 Kista, Sweden,MTM
Research
Centre, School of Science and Technology, Örebro University, 701 82 Örebro, Sweden,Department
of Pharmaceutical Biosciences, Uppsala University, Box 591, 75 124 Uppsala, Sweden
| | | |
Collapse
|
9
|
Foster MJ, Patlewicz G, Shah I, Haggard DE, Judson RS, Paul Friedman K. Evaluating structure-based activity in a high-throughput assay for steroid biosynthesis. COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2022; 24:1-23. [PMID: 37841081 PMCID: PMC10569244 DOI: 10.1016/j.comtox.2022.100245] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
Data from a high-throughput human adrenocortical carcinoma assay (HT-H295R) for steroid hormone biosynthesis are available for >2000 chemicals in single concentration and 654 chemicals in multi-concentration (mc). Previously, a metric describing the effect size of a chemical on the biosynthesis of 11 hormones was derived using mc data referred to as the maximum mean Mahalanobis distance (maxmMd). However, mc HT-H295R assay data remain unavailable for many chemicals. This work leverages existing HT-H295R assay data by constructing structure-activity relationships to make predictions for data-poor chemicals, including: (1) identification of individual structural descriptors, known as ToxPrint chemotypes, associated with increased odds of affecting estrogen or androgen synthesis; (2) a random forest (RF) classifier using physicochemical property descriptors to predict HT-H295R maxmMd binary (positive or negative) outcomes; and, (3) a local approach to predict maxmMd binary outcomes using nearest neighbors (NNs) based on two types of chemical fingerprints (chemotype or Morgan). Individual chemotypes demonstrated high specificity (85-98%) for modulators of estrogen and androgen synthesis but with low sensitivity. The best RF model for maxmMd classification included 13 predicted physicochemical descriptors, yielding a balanced accuracy (BA) of 71% with only modest improvement when hundreds of structural features were added. The best two NN models for binary maxmMd prediction demonstrated BAs of 85 and 81% using chemotype and Morgan fingerprints, respectively. Using an external test set of 6302 chemicals (lacking HT-H295R data), 1241 were identified as putative estrogen and androgen modulators. Combined results across the three classification models (global RF model and two local NN models) predict that 1033 of the 6302 chemicals would be more likely to affect HT-H295R bioactivity. Together, these in silico approaches can efficiently prioritize thousands of untested chemicals for screening to further evaluate their effects on steroid biosynthesis.
Collapse
Affiliation(s)
- M J Foster
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
- National Student Services Contractor, Oak Ridge Associated Universities
| | - G Patlewicz
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| | - I Shah
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| | - D E Haggard
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| | - R S Judson
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| | - K Paul Friedman
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| |
Collapse
|
10
|
Jeong J, Kim D, Choi J. Application of ToxCast/Tox21 data for toxicity mechanism-based evaluation and prioritization of environmental chemicals: Perspective and limitations. Toxicol In Vitro 2022; 84:105451. [PMID: 35921976 DOI: 10.1016/j.tiv.2022.105451] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 07/28/2022] [Indexed: 01/28/2023]
Abstract
In response to the need to minimize the use of experimental animals, new approach methodologies (NAMs) using advanced technology have emerged in the 21st century. ToxCast/Tox21 aims to evaluate the adverse effects of chemicals quickly and efficiently using a high-throughput screening and to transform the paradigm of toxicity assessment into mechanism-based toxicity prediction. The ToxCast/Tox21 database, which contains extensive data from over 1400 assays with numerous biological targets and activity data for over 9000 chemicals, can be used for various purposes in the field of chemical prioritization and toxicity prediction. In this study, an overview of the database was explored to aid mechanism-based chemical prioritization and toxicity prediction. Implications for the utilization of the ToxCast/Tox21 database in chemical prioritization and toxicity prediction were derived. The research trends in ToxCast/Tox21 assay data were reviewed in the context of toxicity mechanism identification, chemical priority, environmental monitoring, assay development, and toxicity prediction. Finally, the potential applications and limitations of using ToxCast/Tox21 assay data in chemical risk assessment were discussed. The analysis of the toxicity mechanism-based assays of ToxCast/Tox21 will help in chemical prioritization and regulatory applications without the use of laboratory animals.
Collapse
Affiliation(s)
- Jaeseong Jeong
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea
| | - Donghyeon Kim
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea
| | - Jinhee Choi
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea.
| |
Collapse
|
11
|
Chaganti R, Rustam F, De La Torre Díez I, Mazón JLV, Rodríguez CL, Ashraf I. Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques. Cancers (Basel) 2022; 14:cancers14163914. [PMID: 36010907 PMCID: PMC9405591 DOI: 10.3390/cancers14163914] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 08/09/2022] [Accepted: 08/11/2022] [Indexed: 11/21/2022] Open
Abstract
Simple Summary The study presents a thyroid disease prediction approach which utilizes random forest-based features to obtain high accuracy. The approach can obtain a 0.99 accuracy to predict ten thyroid diseases. Abstract Thyroid disease prediction has emerged as an important task recently. Despite existing approaches for its diagnosis, often the target is binary classification, the used datasets are small-sized and results are not validated either. Predominantly, existing approaches focus on model optimization and the feature engineering part is less investigated. To overcome these limitations, this study presents an approach that investigates feature engineering for machine learning and deep learning models. Forward feature selection, backward feature elimination, bidirectional feature elimination, and machine learning-based feature selection using extra tree classifiers are adopted. The proposed approach can predict Hashimoto’s thyroiditis (primary hypothyroid), binding protein (increased binding protein), autoimmune thyroiditis (compensated hypothyroid), and non-thyroidal syndrome (NTIS) (concurrent non-thyroidal illness). Extensive experiments show that the extra tree classifier-based selected feature yields the best results with 0.99 accuracy and an F1 score when used with the random forest classifier. Results suggest that the machine learning models are a better choice for thyroid disease detection regarding the provided accuracy and the computational complexity. K-fold cross-validation and performance comparison with existing studies corroborate the superior performance of the proposed approach.
Collapse
Affiliation(s)
| | - Furqan Rustam
- Department of Software Engineering, School of System Sciences, University of Management and Technology, Lahore 54770, Pakistan
| | - Isabel De La Torre Díez
- Department of Signal Theory and Communications and Telematic Engineering, University of Valladolid, Paseo de Belén 15, 47011 Valladolid, Spain
- Correspondence: (I.D.L.T.D.); (I.A.)
| | - Juan Luis Vidal Mazón
- Higher Polytechnic School, Universidad Europea del Atlántico, Parque Científico y Tecnológico de Cantabria, Isabel Torres 21, 39011 Santander, Spain
- Project Department, Universidade Internacional do Cuanza, Cuito EN250, Bié, Angola
- Department of Project Management, Universidad Internacional Iberoamericana, Arecibo, PR 00613, USA
| | - Carmen Lili Rodríguez
- Higher Polytechnic School, Universidad Europea del Atlántico, Parque Científico y Tecnológico de Cantabria, Isabel Torres 21, 39011 Santander, Spain
- Department of Project Management, Universidad Internacional Iberoamericana, Campeche 24560, Mexico
| | - Imran Ashraf
- Department of Information and Communication Engineering, Yeungnam University, Gyeongsan 38541, Korea
- Correspondence: (I.D.L.T.D.); (I.A.)
| |
Collapse
|
12
|
Dracheva E, Norinder U, Rydén P, Engelhardt J, Weiss JM, Andersson PL. In Silico Identification of Potential Thyroid Hormone System Disruptors among Chemicals in Human Serum and Chemicals with a High Exposure Index. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022; 56:8363-8372. [PMID: 35561338 PMCID: PMC9228062 DOI: 10.1021/acs.est.1c07762] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 04/27/2022] [Accepted: 04/29/2022] [Indexed: 05/30/2023]
Abstract
Data on toxic effects are at large missing the prevailing understanding of the risks of industrial chemicals. Thyroid hormone (TH) system disruption includes interferences of the life cycle of the thyroid hormones and may occur in various organs. In the current study, high-throughput screening data available for 14 putative molecular initiating events of adverse outcome pathways, related to disruption of the TH system, were used to develop 19 in silico models for identification of potential thyroid hormone system-disrupting chemicals. The conformal prediction framework with the underlying Random Forest was used as a wrapper for the models allowing for setting the desired confidence level and controlling the error rate of predictions. The trained models were then applied to two different databases: (i) an in-house database comprising xenobiotics identified in human blood and ii) currently used chemicals registered in the Swedish Product Register, which have been predicted to have a high exposure index to consumers. The application of these models showed that among currently used chemicals, fewer were overall predicted as active compared to chemicals identified in human blood. Chemicals of specific concern for TH disruption were identified from both databases based on their predicted activity.
Collapse
Affiliation(s)
- Elena Dracheva
- Department
of Chemistry, Umeå University, KB.E6, Linnaeus väg 6, SE-901 87 Umeå, Sweden
| | - Ulf Norinder
- Department
of Computer and Systems Sciences, Stockholm
University, Box 7003, SE-164 07 Kista, Sweden
| | - Patrik Rydén
- Department
of Mathematics and Mathematical Statistics, Umeå University, MIT.E.351, SE-901 87 Umeå, Sweden
| | - Josefin Engelhardt
- Department
of Environmental Science, Stockholm University, SE-11418 Stockholm, Sweden
| | - Jana M. Weiss
- Department
of Environmental Science, Stockholm University, SE-11418 Stockholm, Sweden
| | - Patrik L. Andersson
- Department
of Chemistry, Umeå University, KB.E6, Linnaeus väg 6, SE-901 87 Umeå, Sweden
| |
Collapse
|
13
|
Chen P, Wang R, Chen G, An B, Liu M, Wang Q, Tao Y. Thyroid endocrine disruption and hepatotoxicity induced by bisphenol AF: Integrated zebrafish embryotoxicity test and deep learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 822:153639. [PMID: 35131240 DOI: 10.1016/j.scitotenv.2022.153639] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Revised: 01/28/2022] [Accepted: 01/29/2022] [Indexed: 06/14/2023]
Abstract
Bisphenol AF (BPAF) is an emerging contaminant prevalent in the environment as one of main substitutes of bisphenol A (BPA). It was found that BPAF exhibited estrogenic effects in zebrafish larvae in our previous study, while little is known about its effects on the thyroid and liver. A 7 d zebrafish embryotoxicity test was conducted to study the potential thyroid disruption and hepatotoxicity of BPAF. BPAF decreased levels of thyroid hormones and deiodinases but increased expressions of transthyretin at 12.5 and 125 μg/L after 7 d exposure, indicating that both the metabolism and transport of thyroid hormones were perturbed. The thyroid hormone receptor (TR) levels decreased significantly upon exposure to ≥12.5 μg/L BPAF, implying that BPAF acts as a TR antagonist, which coincided well with the prediction from the Direct Message Passing Neural Network. The liver impairment (mainly cell necrosis of hepatocytes) and apoptosis were triggered by 125 μg/L and ≥12.5 μg/L BPAF respectively, accompanied by the increased activities of caspase 3 and caspase 9. Thus BPAF might not be a safe alternative to BPA given the thyroid and liver toxicity. DMPNN appears useful to screen for thyroid disrupting activity from molecular structures.
Collapse
Affiliation(s)
- Pengyu Chen
- College of Oceanography, Hohai University, Nanjing 210024, China
| | - Ruihan Wang
- College of Chemistry, Sichuan University, Chengdu, Sichuan 610064, China
| | - Geng Chen
- School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, UCAS, Hangzhou 330106, China
| | - Baihui An
- College of Oceanography, Hohai University, Nanjing 210024, China
| | - Ming Liu
- College of Oceanography, Hohai University, Nanjing 210024, China
| | - Qiang Wang
- Agro-Environmental Protection Institute, Ministry of Agriculture and Rural Affairs, Tianjin 300191, China
| | - Yuqiang Tao
- College of Oceanography, Hohai University, Nanjing 210024, China.
| |
Collapse
|
14
|
Gadaleta D, d'Alessandro L, Marzo M, Benfenati E, Roncaglioni A. Quantitative Structure-Activity Relationship Modeling of the Amplex Ultrared Assay to Predict Thyroperoxidase Inhibitory Activity. Front Pharmacol 2021; 12:713037. [PMID: 34456728 PMCID: PMC8387701 DOI: 10.3389/fphar.2021.713037] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 07/30/2021] [Indexed: 11/13/2022] Open
Abstract
The thyroid system plays a major role in the regulation of several physiological processes. The dysregulation of the thyroid system caused by the interference of xenobiotics and contaminants may bring to pathologies like hyper- and hypothyroidism and it has been recently correlated with adverse outcomes leading to cancer, obesity, diabetes and neurodevelopmental disorders. Thyroid disruption can occur at several levels. For example, the inhibition of thyroperoxidase (TPO) enzyme, which catalyses the synthesis of thyroid hormones, may cause dysfunctions related to hypothyroidism. The inhibition of the TPO enzyme can occur as a consequence of prolonged exposure to chemical compounds, for this reason it is of utmost importance to identify alternative methods to evaluate the large amount of pollutants and other chemicals that may pose a potential hazard to the human health. In this work, quantitative structure-activity relationship (QSAR) models to predict the TPO inhibitory potential of chemicals are presented. Models are developed by means of several machine learning and data selection approaches, and are based on data obtained in vitro with the Amplex UltraRed-thyroperoxidase (AUR-TPO) assay. Balancing methods and feature selection are applied during model development. Models are rigorously evaluated through internal and external validation. Based on validation results, two models based on Balanced Random Forest (BRF) and K-Nearest Neighbours (KNN) algorithms were selected for a further validation phase, that leads predictive performance (BA = 0.76-0.78 on external data) that is comparable with the reported experimental variability of the AUR-TPO assay (BA ∼0.70). Finally, a consensus between the two models was proposed (BA = 0.82). Based on the predictive performance, these models can be considered suitable for toxicity screening of environmental chemicals.
Collapse
Affiliation(s)
- Domenico Gadaleta
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Luca d'Alessandro
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Marco Marzo
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Emilio Benfenati
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Alessandra Roncaglioni
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| |
Collapse
|
15
|
Garcia de Lomana M, Morger A, Norinder U, Buesen R, Landsiedel R, Volkamer A, Kirchmair J, Mathea M. ChemBioSim: Enhancing Conformal Prediction of In Vivo Toxicity by Use of Predicted Bioactivities. J Chem Inf Model 2021; 61:3255-3272. [PMID: 34153183 PMCID: PMC8317154 DOI: 10.1021/acs.jcim.1c00451] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Indexed: 02/07/2023]
Abstract
Computational methods such as machine learning approaches have a strong track record of success in predicting the outcomes of in vitro assays. In contrast, their ability to predict in vivo endpoints is more limited due to the high number of parameters and processes that may influence the outcome. Recent studies have shown that the combination of chemical and biological data can yield better models for in vivo endpoints. The ChemBioSim approach presented in this work aims to enhance the performance of conformal prediction models for in vivo endpoints by combining chemical information with (predicted) bioactivity assay outcomes. Three in vivo toxicological endpoints, capturing genotoxic (MNT), hepatic (DILI), and cardiological (DICC) issues, were selected for this study due to their high relevance for the registration and authorization of new compounds. Since the sparsity of available biological assay data is challenging for predictive modeling, predicted bioactivity descriptors were introduced instead. Thus, a machine learning model for each of the 373 collected biological assays was trained and applied on the compounds of the in vivo toxicity data sets. Besides the chemical descriptors (molecular fingerprints and physicochemical properties), these predicted bioactivities served as descriptors for the models of the three in vivo endpoints. For this study, a workflow based on a conformal prediction framework (a method for confidence estimation) built on random forest models was developed. Furthermore, the most relevant chemical and bioactivity descriptors for each in vivo endpoint were preselected with lasso models. The incorporation of bioactivity descriptors increased the mean F1 scores of the MNT model from 0.61 to 0.70 and for the DICC model from 0.72 to 0.82 while the mean efficiencies increased by roughly 0.10 for both endpoints. In contrast, for the DILI endpoint, no significant improvement in model performance was observed. Besides pure performance improvements, an analysis of the most important bioactivity features allowed detection of novel and less intuitive relationships between the predicted biological assay outcomes used as descriptors and the in vivo endpoints. This study presents how the prediction of in vivo toxicity endpoints can be improved by the incorporation of biological information-which is not necessarily captured by chemical descriptors-in an automated workflow without the need for adding experimental workload for the generation of bioactivity descriptors as predicted outcomes of bioactivity assays were utilized. All bioactivity CP models for deriving the predicted bioactivities, as well as the in vivo toxicity CP models, can be freely downloaded from https://doi.org/10.5281/zenodo.4761225.
Collapse
Affiliation(s)
- Marina Garcia de Lomana
- BASF
SE, Ludwigshafen am Rhein 67063, Germany
- Department
of Pharmaceutical Sciences, Faculty of Life Sciences, University of Vienna, Vienna 1090, Austria
| | - Andrea Morger
- In Silico
Toxicology and Structural Bioinformatics, Institute of Physiology, Charité Universitätsmedizin Berlin, Charitéplatz
1, Berlin 10117, Germany
| | - Ulf Norinder
- MTM
Research Centre, School of Science and Technology, Örebro University, Örebro SE-70182, Sweden
| | | | | | - Andrea Volkamer
- In Silico
Toxicology and Structural Bioinformatics, Institute of Physiology, Charité Universitätsmedizin Berlin, Charitéplatz
1, Berlin 10117, Germany
| | - Johannes Kirchmair
- Department
of Pharmaceutical Sciences, Faculty of Life Sciences, University of Vienna, Vienna 1090, Austria
| | | |
Collapse
|
16
|
Kleinstreuer NC, Tetko IV, Tong W. Introduction to Special Issue: Computational Toxicology. Chem Res Toxicol 2021; 34:171-175. [PMID: 33583184 DOI: 10.1021/acs.chemrestox.1c00032] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|