1
|
Chen C, Huang Z, Zou X, Li S, Zhang D, Wang SL. Prediction of molecular-specific mutagenic alerts and related mechanisms of chemicals by a convolutional neural network (CNN) model based on SMILES split. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 917:170435. [PMID: 38286298 DOI: 10.1016/j.scitotenv.2024.170435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Revised: 01/20/2024] [Accepted: 01/23/2024] [Indexed: 01/31/2024]
Abstract
Structural alerts (SAs) are essential to identify chemicals for toxicity evaluation and health risk assessment. We constructed a novel SMILES split-based deep learning model (SSDL) that was trained and verified with 5850 chemicals from the ISSSTY database and 384 external test chemicals from published papers. The training accuracy was above 0.90 and the evaluation metrics (precision, recall and F1-score) all reached 0.78 or above on both internal and external test chemicals. In this model, the molecular-specific fragment importance of chemicals was first quantified independently. Then, the SA identification method based on the importance of these fragments was statistically analyzed and verified with the ISSSTY test and external test chemicals containing one of 28 typical SAs, and most of the performances were better than that of expert rules. Furthermore, a mutagenicity mechanism prediction method was developed using 237 chemicals with four known mutagenic mechanisms based on molecular similarity calibrated by the SSDL method and fragment importance, which significantly improved accuracy in three mechanisms and had comparable accuracy in the other one compared to traditional methods. Overall, the SSDL model quantifying fragment toxicity within molecules would be a novel potentially powerful tool in the determination and visualization of molecular-specific SAs and the prediction of mutagenicity mechanisms for environmental or industrial compounds and drugs.
Collapse
Affiliation(s)
- Chao Chen
- Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, 101 Longmian Avenue, Nanjing 211166, PR China
| | - Zhengliang Huang
- Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, 101 Longmian Avenue, Nanjing 211166, PR China; School of Public Health, Hubei University of Medicine, Shiyan 442000, PR China
| | - Xuyan Zou
- Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, 101 Longmian Avenue, Nanjing 211166, PR China
| | - Sheng Li
- Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, 101 Longmian Avenue, Nanjing 211166, PR China
| | - Di Zhang
- Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, 101 Longmian Avenue, Nanjing 211166, PR China
| | - Shou-Lin Wang
- Key Laboratory of Modern Toxicology of Ministry of Education, Center for Global Health, School of Public Health, Nanjing Medical University, 101 Longmian Avenue, Nanjing 211166, PR China; State Key Lab of Reproductive Medicine and Offspring Health, Institute of Toxicology, Nanjing Medical University, 101 Longmian Avenue, Nanjing 211166, PR China.
| |
Collapse
|
2
|
Lin MS, Varunjikar MS, Lie KK, Søfteland L, Dellafiora L, Ørnsrud R, Sanden M, Berntssen MHG, Dorne JLCM, Bafna V, Rasinger JD. Multi-tissue proteogenomic analysis for mechanistic toxicology studies in non-model species. ENVIRONMENT INTERNATIONAL 2023; 182:108309. [PMID: 37980879 DOI: 10.1016/j.envint.2023.108309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 08/15/2023] [Accepted: 11/04/2023] [Indexed: 11/21/2023]
Abstract
New approach methodologies (NAM), including omics and in vitro approaches, are contributing to the implementation of 3R (reduction, refinement and replacement) strategies in regulatory science and risk assessment. In this study, we present an integrative transcriptomics and proteomics analysis workflow for the validation and revision of complex fish genomes and demonstrate how proteogenomics expression matrices can be used to support multi-level omics data integration in non-model species in vivo and in vitro. Using Atlantic salmon as an example, we constructed proteogenomic databases from publicly available transcriptomic data and in-house generated RNA-Seq and LC-MS/MS data. Our analysis identified ∼80,000 peptides, providing direct evidence of translation for over 40,000 RefSeq structures. The data also highlighted 183 co-located peptide groups that supported a single transcript each, and in each case, either corrected a previous annotation, supported Ensembl annotations not present in RefSeq, or identified novel previously unannotated genes. Proteogenomics data-derived expression matrices revealed distinct profiles for the different tissue types analyzed. Focusing on proteins involved in defense against xenobiotics, we detected distinct expression patterns across different salmon tissues and observed homology in the expression of chemical defense proteins between in vivo and in vitro liver systems. Our study demonstrates the potential of proteogenomic analyses in extending our understanding of complex fish genomes and provides an advanced bioinformatic toolkit to support the further development of NAMs and their application in regulatory science and (eco)toxicological studies of non-model species.
Collapse
Affiliation(s)
- M S Lin
- Bioinformatics and Systems Biology Program, UC San Diego, San Diego, CA, United States.
| | | | - K K Lie
- Institute of Marine Research, Bergen, Norway.
| | - L Søfteland
- Institute of Marine Research, Bergen, Norway.
| | - L Dellafiora
- Department of Food and Drug, University of Parma, Parco Area delle Scienze 27/A, 43124 Parma, Italy.
| | - R Ørnsrud
- Institute of Marine Research, Bergen, Norway.
| | - M Sanden
- Institute of Marine Research, Bergen, Norway.
| | | | - J L C M Dorne
- European Food Safety Authority, Methodological and Scientific Support Unit, Via Carlo Magno 1A, 43121 Parma, Italy.
| | - V Bafna
- Computer Science & Engineering and HDSI, UC San Diego, San Diego, CA, United States.
| | | |
Collapse
|
3
|
He R, Wu X, Mu H, Chen L, Hu H, Wang J, Ren H, Wu B. Priority control sequence of 34 typical pollutants in effluents of Chinese wastewater treatment plants. WATER RESEARCH 2023; 243:120338. [PMID: 37473511 DOI: 10.1016/j.watres.2023.120338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 06/14/2023] [Accepted: 07/10/2023] [Indexed: 07/22/2023]
Abstract
The identification of the priority control sequence of pollutants in effluents of wastewater treatment plants (WWTPs) has important implications for the management of water quality. This study chose 34 typical pollutants based on their representativeness and detection rates in municipal wastewater. The occurrence frequency and concentration of these pollutants in 168 Chinese WWTP effluents were measured at the national level. The data on in vitro toxicity (67 assays) and in vivo toxicity (216 species) for target pollutants were obtained from the public toxicity database and our experimental data. An environmental health prioritization index (EHPi) method was proposed to integrate the occurrence frequency, concentration, removal rate, and in vitro and in vivo toxicity to determine the priority control sequence of target pollutants. Ethynyl estradiol, 17β-estradiol, estrone, diclofenac, and atrazine were the top 5 pollutants identified by the EHPi score. Several pollutants with high EHPi scores showed spatial differences. Besides the EHPi method which was from the single pollutant perspective, the combined toxicity of pollutants (300 pairs of binary combinations) was also measured based on in vitro toxicity assays to evaluate the key pollutants from the pollutant-pollutant interacting perspective. The pollutants (such as ofloxacin and acetaminophen) that could have significant synergetic effects with many other pollutants are worthy of prior attention. This study shed new light on the identification of the priority control sequence of pollutants in WWTP effluents. The results provide meaningful data for the effective management and control of wastewater water quality.
Collapse
Affiliation(s)
- Ruonan He
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Xingyue Wu
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Hongxin Mu
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Ling Chen
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Haidong Hu
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Jinfeng Wang
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Hongqiang Ren
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China
| | - Bing Wu
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China.
| |
Collapse
|
4
|
Merel S. Critical assessment of the Kendrick mass defect analysis as an innovative approach to process high resolution mass spectrometry data for environmental applications. CHEMOSPHERE 2023; 313:137443. [PMID: 36464021 DOI: 10.1016/j.chemosphere.2022.137443] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 11/23/2022] [Accepted: 11/28/2022] [Indexed: 06/17/2023]
Abstract
The growing application of high resolution mass spectrometry (HRMS) over the last decades has dramatically improved our knowledge about the occurrence of environmental contaminants. However, most of the compounds detected remain unknown and the large volume of data generated requires specific processing approaches. Therefore, this study presents the concepts of mass defect (MD), Kendrick mass (KM) and Kendrick mass defect (KMD) to the expert and non-expert reader along with relevant examples of applications in environmental HRMS data processing. A preliminary bibliometric overview indicates that the potential benefits of KMD analysis are rather overlooked in environmental science. In practice, a simple calculation allows transforming a mass from the IUPAC system (normalized so that the mass of 12C is exactly 12) to its corresponding KM normalized on a specific moiety such as CH2 (the mass of CH2 is exactly 14). Then, plotting the KMD according to the nominal KM allows revealing groups of compounds that differ only by their number of CH2 moieties. For instance, data processing using KM and KMD was proven particularly useful to characterize natural organic matter in a sample, to reveal the occurrence of polymers as well as poly/perfluorinated alkylated substances (PFASs), and to search for transformation products (TPs) of a given chemical.
Collapse
Affiliation(s)
- Sylvain Merel
- INRAE, UR RiverLy, 5 Rue de la Doua, F-69625, Villeurbanne, France.
| |
Collapse
|
5
|
Bampidis V, Azimonti G, Bastos MDL, Christensen H, Dusemund B, Fašmon Durjava M, Kouba M, López‐Alonso M, López Puente S, Marcon F, Mayo B, Pechová A, Petkova M, Ramos F, Sanz Y, Villa RE, Woutersen R, Finizio A, Teodorovic I, Aquilina G, Bories G, Gropp J, Nebbia C, Tarrés‐Call J, Innocenti M. Safety and efficacy of a feed additive consisting of ethoxyquin (6-ethoxy-1,2-dihydro-2,2,4-trimethylquinoline) for all animal species (FEFANA asbl). EFSA J 2022; 20:e07166. [PMID: 35281649 PMCID: PMC8892239 DOI: 10.2903/j.efsa.2022.7166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Ethoxyquin is synthetised from p-phenetidine, a possible mutagen, which remains in the additive as an impurity at concentrations of < 2.5 mg/kg additive. Ethoxyquin is considered safe for all animal species at the proposed inclusion level of 50 mg/kg complete feed. However, owing the presence of p-phenetidine, no safe level of the additive in feed for long-living and reproductive animals could be identified. The FEEDAP Panel derived a health-based guidance value of 0.006 mg ethoxyquin dimer (EQDM)/kg bw per day and applied it to the sum of ethoxyquin and its transformation products. A maximum total concentration of 50 mg ethoxyquin/kg complete feed for all animal species, except dairy ruminants, would not pose a risk for the consumer. However, in the absence of data on p-phenetidine residues in tissues and products of animal origin, no conclusion on the safety for the consumer could be drawn. The conclusions on consumer safety assume that the maximum total concentration of 50 mg EQ/kg feed is expressed as the sum of EQ, EQDM, EQI and DHEQ. Exposure of the unprotected user to p-phenetidine via inhalation should be minimised. No safety concerns for groundwater are expected. It is not possible to conclude on the safety of EQ for the terrestrial compartment. A risk for the aquatic compartment cannot be excluded when ethoxyquin is used in terrestrial animals. Unacceptable risk is not expected for freshwater sediment-dwelling organisms. A risk of secondary poisoning via the terrestrial food chain is not expected, whereas a risk via the aquatic food chain cannot be excluded. No concerns for aquatic organisms are expected for ethoxyquin used in fish farmed in land-based system, a risk cannot be excluded for marine sediment dwelling organisms when ethoxyquin is used in sea-cages. Ethoxyquin is considered efficacious in the range 25-50 mg/kg complete feed.
Collapse
|