1
|
Kruger L, Ngan DK, Xu T, Zhang L, Xia M, Simeonov A, Huang R. Evaluating the Utility of the MSTI Assay in Predicting Compound Promiscuity and Cytotoxicity. Chem Res Toxicol 2024; 37:1691-1697. [PMID: 39255953 DOI: 10.1021/acs.chemrestox.4c00243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
Nonspecific reactive chemicals often interfere with the interpretation of high-throughput assay results because of their promiscuity and/or cytotoxicity. Using a high-throughput assay to identify such compounds is necessary to efficiently rule out potential assay artifacts. The MSTI, (E)-2-(4-mercaptostyryl)-1,3,3-trimethyl-3H-indol-1-ium, assay uses a thiol-containing fluorescent probe to screen for electrophile reactivity and could potentially be used to determine nonspecific reactive compounds. The Tox21 10K compound library was previously screened against a panel of ∼80 cell-based and biochemical assays, including the biochemical MSTI assay. In this study, we compared the MSTI assay activity of the Tox21 10K compounds with their promiscuity and cytotoxicity as reflected by their activities across the Tox21 assay panel to determine: (1) if this assay is predictive of a compound's promiscuity and cytotoxicity and (2) what chemical features create inconsistent results between the MSTI assay activity and promiscuity/cytotoxicity (false negatives and false positives). We found that the MSTI assay can predict a chemical's promiscuity/cytotoxicity with a 0.55 sensitivity and 0.97 specificity. Out of 3,407 unique compounds evaluated, we identified 92 false positive and 227 false negative results. Several structural features such as carboxamides and alkyl halides were found to be apparent in 53% (p = 2.4 × 10-07) and 19% (p = 4.3 × 10-06) of the false positives and negatives, respectively. The results of this analysis will help identify the potential challenges of this high-throughput assay and allow researchers to identify if a compound will be cytotoxic or promiscuous in an efficient manner.
Collapse
Affiliation(s)
- Laken Kruger
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Deborah K Ngan
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Tuan Xu
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Li Zhang
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Menghang Xia
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Anton Simeonov
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| | - Ruili Huang
- Division of Pre-clinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, United States
| |
Collapse
|
2
|
Arturi K, Hollender J. Machine Learning-Based Hazard-Driven Prioritization of Features in Nontarget Screening of Environmental High-Resolution Mass Spectrometry Data. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:18067-18079. [PMID: 37279189 PMCID: PMC10666537 DOI: 10.1021/acs.est.3c00304] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 05/15/2023] [Accepted: 05/15/2023] [Indexed: 06/08/2023]
Abstract
Nontarget high-resolution mass spectrometry screening (NTS HRMS/MS) can detect thousands of organic substances in environmental samples. However, new strategies are needed to focus time-intensive identification efforts on features with the highest potential to cause adverse effects instead of the most abundant ones. To address this challenge, we developed MLinvitroTox, a machine learning framework that uses molecular fingerprints derived from fragmentation spectra (MS2) for a rapid classification of thousands of unidentified HRMS/MS features as toxic/nontoxic based on nearly 400 target-specific and over 100 cytotoxic endpoints from ToxCast/Tox21. Model development results demonstrated that using customized molecular fingerprints and models, over a quarter of toxic endpoints and the majority of the associated mechanistic targets could be accurately predicted with sensitivities exceeding 0.95. Notably, SIRIUS molecular fingerprints and xboost (Extreme Gradient Boosting) models with SMOTE (Synthetic Minority Oversampling Technique) for handling data imbalance were a universally successful and robust modeling configuration. Validation of MLinvitroTox on MassBank spectra showed that toxicity could be predicted from molecular fingerprints derived from MS2 with an average balanced accuracy of 0.75. By applying MLinvitroTox to environmental HRMS/MS data, we confirmed the experimental results obtained with target analysis and narrowed the analytical focus from tens of thousands of detected signals to 783 features linked to potential toxicity, including 109 spectral matches and 30 compounds with confirmed toxic activity.
Collapse
Affiliation(s)
- Katarzyna Arturi
- Department
of Environmental Chemistry, Swiss Federal
Institute of Aquatic Science and Technology (Eawag), Ueberlandstrasse 133, 8600 Dübendorf, Switzerland
| | - Juliane Hollender
- Department
of Environmental Chemistry, Swiss Federal
Institute of Aquatic Science and Technology (Eawag), Ueberlandstrasse 133, 8600 Dübendorf, Switzerland
- Institute
of Biogeochemistry and Pollution Dynamics, Eidgenössische Technische Hochschule Zürich (ETH Zurich), Rämistrasse 101, 8092 Zürich, Switzerland
| |
Collapse
|
3
|
Patlewicz G, Paul-Friedman K, Houck K, Zhang L, Huang R, Xia M, Brown J, Simmons SO. Evaluating the utility of a high throughput thiol-containing fluorescent probe to screen for reactivity: A case study with the Tox21 library. COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2023; 26:10.1016/j.comtox.2023.100271. [PMID: 37388277 PMCID: PMC10304587 DOI: 10.1016/j.comtox.2023.100271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/01/2023]
Abstract
High-throughput screening (HTS) assays for bioactivity in the Tox21 program aim to evaluate an array of different biological targets and pathways, but a significant barrier to interpretation of these data is the lack of high-throughput screening (HTS) assays intended to identify non-specific reactive chemicals. This is an important aspect for prioritising chemicals to test in specific assays, identifying promiscuous chemicals based on their reactivity, as well as addressing hazards such as skin sensitisation which are not necessarily initiated by a receptor-mediated effect but act through a non-specific mechanism. Herein, a fluorescence-based HTS assay that allows the identification of thiol-reactive compounds was used to screen 7,872 unique chemicals in the Tox21 10K chemical library. Active chemicals were compared with profiling outcomes using structural alerts encoding electrophilic information. Random Forest classification models based on chemical fingerprints were developed to predict assay outcomes and evaluated through 10-fold stratified cross validation (CV). The mean CV Balanced Accuracy of the validation set was 0.648. The model developed shows promise as a tool to screen untested chemicals for their potential electrophilic reactivity based solely on chemical structural features.
Collapse
Affiliation(s)
- Grace Patlewicz
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| | - Katie Paul-Friedman
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| | - Keith Houck
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| | - Li Zhang
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, Bethesda, MD 20892, USA
| | - Ruili Huang
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, Bethesda, MD 20892, USA
| | - Menghang Xia
- National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, Bethesda, MD 20892, USA
| | - Jason Brown
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| | - Steven O. Simmons
- Center for Computational Toxicology & Exposure (CCTE), U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, 27709, USA
| |
Collapse
|
4
|
Patlewicz G, Shah I. Towards systematic read-across using Generalised Read-Across (GenRA). COMPUTATIONAL TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2023; 25:1-15. [PMID: 37693774 PMCID: PMC10483627 DOI: 10.1016/j.comtox.2022.100258] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Read-across continues to be a popular data gap filling technique within category and analogue approaches. One of the main issues hindering read-across acceptance is the notion of addressing and reducing uncertainties. Frameworks and formats have been created to help facilitate read-across development, evaluation, and residual uncertainties. However, read-across remains an expert-driven approach with each assessment decided on its own merits with no objective means of evaluating performance or quantifying uncertainties. Here, the underlying motivation of creating an algorithmic approach to read-across, namely the Generalised Read-Across (GenRA) approach, is described. The overall objectives of the approach were to quantify performance and uncertainty. Progress made in quantifying the impact of each similarity context commonly relied upon as part of read-across assessment are discussed. The framework underpinning the approach, the software tools developed to date and how GenRA can be used to make and interpret predictions as part of a screening level hazard assessment decision context are illustrated. Future directions and some of the overarching issues still needed in this field and the extent to which GenRA might facilitate those needs are discussed.
Collapse
Affiliation(s)
- Grace Patlewicz
- Center for Computational Toxicology and Exposure (CCTE), Office of Research and Development, US Environmental Protection Agency, 109 TW Alexander Dr, Research Triangle Park, NC 27711, USA
| | - Imran Shah
- Center for Computational Toxicology and Exposure (CCTE), Office of Research and Development, US Environmental Protection Agency, 109 TW Alexander Dr, Research Triangle Park, NC 27711, USA
| |
Collapse
|