1
|
Feshuk M, Kolaczkowski L, Dunham K, Davidson-Fritz SE, Carstens KE, Brown J, Judson RS, Paul Friedman K. The ToxCast pipeline: updates to curve-fitting approaches and database structure. Front Toxicol 2023; 5:1275980. [PMID: 37808181 PMCID: PMC10552852 DOI: 10.3389/ftox.2023.1275980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 09/08/2023] [Indexed: 10/10/2023] Open
Abstract
Introduction: The US Environmental Protection Agency Toxicity Forecaster (ToxCast) program makes in vitro medium- and high-throughput screening assay data publicly available for prioritization and hazard characterization of thousands of chemicals. The assays employ a variety of technologies to evaluate the effects of chemical exposure on diverse biological targets, from distinct proteins to more complex cellular processes like mitochondrial toxicity, nuclear receptor signaling, immune responses, and developmental toxicity. The ToxCast data pipeline (tcpl) is an open-source R package that stores, manages, curve-fits, and visualizes ToxCast data and populates the linked MySQL Database, invitrodb. Methods: Herein we describe major updates to tcpl and invitrodb to accommodate a new curve-fitting approach. The original tcpl curve-fitting models (constant, Hill, and gain-loss models) have been expanded to include Polynomial 1 (Linear), Polynomial 2 (Quadratic), Power, Exponential 2, Exponential 3, Exponential 4, and Exponential 5 based on BMDExpress and encoded by the R package dependency, tcplfit2. Inclusion of these models impacted invitrodb (beta version v4.0) and tcpl v3 in several ways: (1) long-format storage of generic modeling parameters to permit additional curve-fitting models; (2) updated logic for winning model selection; (3) continuous hit calling logic; and (4) removal of redundant endpoints as a result of bidirectional fitting. Results and discussion: Overall, the hit call and potency estimates were largely consistent between invitrodb v3.5 and 4.0. Tcpl and invitrodb provide a standard for consistent and reproducible curve-fitting and data management for diverse, targeted in vitro assay data with readily available documentation, thus enabling sharing and use of these data in myriad toxicology applications. The software and database updates described herein promote comparability across multiple tiers of data within the US Environmental Protection Agency CompTox Blueprint.
Collapse
Affiliation(s)
- M. Feshuk
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, United States
| | - L. Kolaczkowski
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, United States
- National Student Services Contractor, Oak Ridge Associated Universities, Oak Ridge, TN, United States
| | - K. Dunham
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, United States
- National Student Services Contractor, Oak Ridge Associated Universities, Oak Ridge, TN, United States
| | - S. E. Davidson-Fritz
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, United States
| | - K. E. Carstens
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, United States
| | - J. Brown
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, United States
| | - R. S. Judson
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, United States
| | - K. Paul Friedman
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, Durham, NC, United States
| |
Collapse
|
2
|
Foster MJ, Patlewicz G, Shah I, Haggard DE, Judson RS, Paul Friedman K. Evaluating structure-based activity in a high-throughput assay for steroid biosynthesis. Comput Toxicol 2022; 24:1-23. [PMID: 37841081 PMCID: PMC10569244 DOI: 10.1016/j.comtox.2022.100245] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
Data from a high-throughput human adrenocortical carcinoma assay (HT-H295R) for steroid hormone biosynthesis are available for >2000 chemicals in single concentration and 654 chemicals in multi-concentration (mc). Previously, a metric describing the effect size of a chemical on the biosynthesis of 11 hormones was derived using mc data referred to as the maximum mean Mahalanobis distance (maxmMd). However, mc HT-H295R assay data remain unavailable for many chemicals. This work leverages existing HT-H295R assay data by constructing structure-activity relationships to make predictions for data-poor chemicals, including: (1) identification of individual structural descriptors, known as ToxPrint chemotypes, associated with increased odds of affecting estrogen or androgen synthesis; (2) a random forest (RF) classifier using physicochemical property descriptors to predict HT-H295R maxmMd binary (positive or negative) outcomes; and, (3) a local approach to predict maxmMd binary outcomes using nearest neighbors (NNs) based on two types of chemical fingerprints (chemotype or Morgan). Individual chemotypes demonstrated high specificity (85-98%) for modulators of estrogen and androgen synthesis but with low sensitivity. The best RF model for maxmMd classification included 13 predicted physicochemical descriptors, yielding a balanced accuracy (BA) of 71% with only modest improvement when hundreds of structural features were added. The best two NN models for binary maxmMd prediction demonstrated BAs of 85 and 81% using chemotype and Morgan fingerprints, respectively. Using an external test set of 6302 chemicals (lacking HT-H295R data), 1241 were identified as putative estrogen and androgen modulators. Combined results across the three classification models (global RF model and two local NN models) predict that 1033 of the 6302 chemicals would be more likely to affect HT-H295R bioactivity. Together, these in silico approaches can efficiently prioritize thousands of untested chemicals for screening to further evaluate their effects on steroid biosynthesis.
Collapse
Affiliation(s)
- M J Foster
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
- National Student Services Contractor, Oak Ridge Associated Universities
| | - G Patlewicz
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| | - I Shah
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| | - D E Haggard
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| | - R S Judson
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| | - K Paul Friedman
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, 27711, USA
| |
Collapse
|
3
|
Rosenberg S, Watt E, Judson R, Simmons S, Paul Friedman K, Dybdahl M, Nikolov N, Wedebye E. QSAR models for thyroperoxidase inhibition and screening of U.S. and EU chemical inventories. ACTA ACUST UNITED AC 2017. [DOI: 10.1016/j.comtox.2017.07.006] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
4
|
Abstract
Details on the 28-year treatment history of a patient with an endocrine-responsive breast cancer are provided. She was originally diagnosed as having a T1N0M0 cancer after a modified radical mastectomy at age 41. Fifteen years later, in 1998, she presented with hemoptysis and pleuritic chest pain: a 10 cm right atrial tumor and estrogen receptor (ER) positive endobronchial and adjacent lung parenchyma adenocarcinoma were documented. Epithelial markers normalized as she manifested a partial response (PR) lasting 3 years with tamoxifen treatment. From 2001 to 2007 she benefitted from exemestane treatment. Upon progression in the previous lung area and left adrenal, exemestane withdrawal led to transient decrease in markers. Six months later (in July 2008), with growth in her adrenal tumor, laparoscopic adrenalectomy was performed: in addition to ER positivity, the tumor showed Her2 overexpression and amplification. She has subsequently had some control of disease with fulvestrant, letrozole + trastuzumab, and subsequently letrozole + lapatinib. In addition to the chronicity of disease, this history illustrates the expanding range of treatments available for endocrine-responsive breast cancer commensurate to our greater understanding of tumor biology.
Collapse
Affiliation(s)
| | | | - F Muggia
- Correspondence to: F Muggia. E-mail:
| |
Collapse
|