1
|
Canchola A, Tran LN, Woo W, Tian L, Lin YH, Chou WC. Advancing non-target analysis of emerging environmental contaminants with machine learning: Current status and future implications. ENVIRONMENT INTERNATIONAL 2025; 198:109404. [PMID: 40139034 DOI: 10.1016/j.envint.2025.109404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Revised: 03/03/2025] [Accepted: 03/18/2025] [Indexed: 03/29/2025]
Abstract
Emerging environmental contaminants (EECs) such as pharmaceuticals, pesticides, and industrial chemicals pose significant challenges for detection and identification due to their structural diversity and lack of analytical standards. Traditional targeted screening methods often fail to detect these compounds, making non-target analysis (NTA) using high-resolution mass spectrometry (HRMS) essential for identifying unknown or suspected contaminants. However, interpreting the vast datasets generated by HRMS is complex and requires advanced data processing techniques. Recent advancements in machine learning (ML) models offer great potential for enhancing NTA applications. As such, we reviewed key developments, including optimizing workflows using computational tools, improved chemical structure identification, advanced quantification methods, and enhanced toxicity prediction capabilities. It also discusses challenges and future perspectives in the field, such as refining ML tools for complex mixtures, improving inter-laboratory validation, and further integrating computational models into environmental risk assessment frameworks. By addressing these challenges, ML-assisted NTA can significantly enhance the detection, quantification, and evaluation of EECs, ultimately contributing to more effective environmental monitoring and public health protection.
Collapse
Affiliation(s)
- Alexa Canchola
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States; Department of Environmental Sciences, College of Natural & Agricultural Sciences, University of California, Riverside, CA 92521, United States
| | - Lillian N Tran
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States
| | - Wonsik Woo
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States
| | - Linhui Tian
- Department of Environmental Sciences, College of Natural & Agricultural Sciences, University of California, Riverside, CA 92521, United States
| | - Ying-Hsuan Lin
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States; Department of Environmental Sciences, College of Natural & Agricultural Sciences, University of California, Riverside, CA 92521, United States.
| | - Wei-Chun Chou
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States; Department of Environmental Sciences, College of Natural & Agricultural Sciences, University of California, Riverside, CA 92521, United States.
| |
Collapse
|
2
|
Camillo-Andrade AC, Sales LA, Fischer JSG, Duran R, Santos MDM, Carvalho PC. Paired proteomic analysis reveals protein alterations in sun-exposed skin of professional drivers : 1Laboratory for structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz, Paraná, Brazil. Sci Rep 2025; 15:10955. [PMID: 40164647 PMCID: PMC11958692 DOI: 10.1038/s41598-024-82308-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Accepted: 12/04/2024] [Indexed: 04/02/2025] Open
Abstract
Professional drivers represent an ideal cohort for investigating the effects of solar radiation on skin due to their unique, asymmetric exposure to sun, a consequence of vehicle window orientations. Consequently, one side of the face is naturally subjected to more solar radiation, resulting in uneven sunlight exposure. This scenario supports a paired experimental design for precise within-individual comparisons, crucial for assessing sun exposure's impact on skin health, including signs of aging. Leveraging this approach, our study reveals sun-induced overexpression of proteins linked to photoaging through paired proteomic analysis, providing novel insights into the skin's adaptive responses to chronic solar exposure. Initially, our research focused on a dataset from ten male professional drivers, identifying a set upregulated proteins in sun-exposed skin compared to the less exposed side of the face. To validate these findings, we extended our investigation to a new cohort of seven female bus drivers. Our motivation in switching genders and utilizing different mass spectrometry equipment and sample preparation techniques was for assessing the robustness of our initial findings, encompassing not just sex differences but also methodological variations, and also for understanding the broader implications of our results for photodermatology. To enable this detailed analysis, we developed specialized software that allows precise paired proteomic analysis, significantly enhancing the robustness and clarity of our findings. Our results shortlisted keratins, S100A14, and F-box proteins-by remaining consistently overexpressed in sun-exposed skin-and hemoglobin subunit beta as downregulated across both cohorts. Our findings underscore the potential of proteomic techniques in advancing our understanding of the molecular dynamics of photoaging and highlight the value of selecting cohorts with specific exposure characteristics.
Collapse
Affiliation(s)
- Amanda C Camillo-Andrade
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz, Rua Prof. Algacyr Munhoz Mader 3775, Curitiba, Paraná, Brazil
- Analytical Biochemistry and Proteomics Unit, Instituto de Investigaciones Biológicas Clemente Estable, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Positivo University, Paraná, Brazil
| | - Lucas A Sales
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz, Rua Prof. Algacyr Munhoz Mader 3775, Curitiba, Paraná, Brazil
| | - Juliana S G Fischer
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz, Rua Prof. Algacyr Munhoz Mader 3775, Curitiba, Paraná, Brazil
| | - Rosario Duran
- Analytical Biochemistry and Proteomics Unit, Instituto de Investigaciones Biológicas Clemente Estable, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Marlon D M Santos
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz, Rua Prof. Algacyr Munhoz Mader 3775, Curitiba, Paraná, Brazil.
- Analytical Biochemistry and Proteomics Unit, Instituto de Investigaciones Biológicas Clemente Estable, Institut Pasteur de Montevideo, Montevideo, Uruguay.
| | - Paulo C Carvalho
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz, Rua Prof. Algacyr Munhoz Mader 3775, Curitiba, Paraná, Brazil.
- Analytical Biochemistry and Proteomics Unit, Instituto de Investigaciones Biológicas Clemente Estable, Institut Pasteur de Montevideo, Montevideo, Uruguay.
| |
Collapse
|
3
|
Kurt L, Clasen MA, Biembengut ÍV, Ruwolt M, Liu F, Gozzo FC, Lima DB, Carvalho PC. RawVegetable 2.0: Refining XL-MS Data Acquisition through Enhanced Quality Control. J Proteome Res 2024; 23:3141-3148. [PMID: 38301217 PMCID: PMC11301677 DOI: 10.1021/acs.jproteome.3c00791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/23/2024] [Accepted: 01/25/2024] [Indexed: 02/03/2024]
Abstract
We present RawVegetable 2.0, a software tailored for assessing mass spectrometry data quality and fine-tuned for cross-linking mass spectrometry (XL-MS) applications. Building upon the capabilities of its predecessor, RawVegetable 2.0 introduces four main modules, each providing distinct and new functionalities: 1) Pair Finder, which identifies ion doublets characteristic of cleavable cross-linking experiments; 2) Diagnostic Peak Finder, which locates potential reporter ions associated with a specific cross-linker; 3) Precursor Signal Ratio, which computes the ratio between precursor intensity and the total signal in an MS/MS scan; and 4) Xrea, which evaluates spectral quality by analyzing the heterogeneity of peak intensities within a spectrum. These modules collectively streamline the process of optimizing mass spectrometry data acquisition for both Proteomics and XL-MS experiments. RawVegetable 2.0, along with a comprehensive tutorial is freely accessible for academic use at: http://patternlabforproteomics.org/rawvegetable2.
Collapse
Affiliation(s)
- Louise
Ulrich Kurt
- Laboratory
for Structural and Computational Proteomics, Carlos Chagas Institute - Fiocruz Parana, Curitiba, Parana 81310-020, Brazil
| | - Milan Avila Clasen
- Laboratory
for Structural and Computational Proteomics, Carlos Chagas Institute - Fiocruz Parana, Curitiba, Parana 81310-020, Brazil
| | - Ísis Venturi Biembengut
- Laboratory
for Structural and Computational Proteomics, Carlos Chagas Institute - Fiocruz Parana, Curitiba, Parana 81310-020, Brazil
| | - Max Ruwolt
- Department
of Chemical Biology, Leibniz - Forschungsinstitut
für Molekulare Pharmakologie (FMP), Berlin 13125, Germany
| | - Fan Liu
- Department
of Chemical Biology, Leibniz - Forschungsinstitut
für Molekulare Pharmakologie (FMP), Berlin 13125, Germany
| | - Fabio César Gozzo
- Dalton
Mass Spectrometry Laboratory, Unicamp, Campinas, Sao Paulo 13083-970, Brazil
| | - Diogo Borges Lima
- Department
of Chemical Biology, Leibniz - Forschungsinstitut
für Molekulare Pharmakologie (FMP), Berlin 13125, Germany
| | - Paulo Costa Carvalho
- Laboratory
for Structural and Computational Proteomics, Carlos Chagas Institute - Fiocruz Parana, Curitiba, Parana 81310-020, Brazil
| |
Collapse
|
4
|
Camillo-Andrade AC, Santos MDM, Nuevo PS, Lajas ABL, Sales LA, Leyva A, Fischer JSG, Duran R, Carvalho PC. Intra-Individual Paired Mass Spectrometry Dataset for Decoding Solar-Induced Proteomic Changes in Facial Skin. Sci Data 2024; 11:441. [PMID: 38702328 PMCID: PMC11068864 DOI: 10.1038/s41597-024-03231-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 04/04/2024] [Indexed: 05/06/2024] Open
Abstract
Photoaging is the premature aging of the skin caused by prolonged exposure to solar radiation. The visual alterations manifest as wrinkles, reduced skin elasticity, uneven skin tone, as well as other signs that surpass the expected outcomes of natural aging. Beyond these surface changes, there is a complex interplay of molecular alterations, encompassing shifts in cellular function, DNA damage, and protein composition disruptions. This data descriptor introduces a unique dataset derived from ten individuals, each with a minimum of 18 years of professional experience as a driver, who are asymmetrically and chronically exposed to solar radiation due to their driving orientation. Skin samples were independently collected from each side of the face using a microdermabrasion-like procedure and analyzed on an Exploris 240 mass spectrometer. Our adapted proteomic statistical framework leverages the sample pairing to provide robust insights. This dataset delves into the molecular differences in exposed skin and serves as a foundational resource for interdisciplinary research in photodermatology, targeted skincare treatments, and computational modelling of skin health.
Collapse
Affiliation(s)
- Amanda C Camillo-Andrade
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Curitiba, Paraná, Brazil
- Analytical Biochemistry and Proteomics Unit, Instituto de Investigaciones Biológicas Clemente Estable, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Asthetics and Cosmetics, Positivo University, Curitiba, Paraná, Brazil
| | - Marlon D M Santos
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Curitiba, Paraná, Brazil
- Analytical Biochemistry and Proteomics Unit, Instituto de Investigaciones Biológicas Clemente Estable, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Patrícia S Nuevo
- Asthetics and Cosmetics, Positivo University, Curitiba, Paraná, Brazil
| | - Ana B L Lajas
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Curitiba, Paraná, Brazil
| | - Lucas A Sales
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Curitiba, Paraná, Brazil
| | - Alejandro Leyva
- Analytical Biochemistry and Proteomics Unit, Instituto de Investigaciones Biológicas Clemente Estable, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Juliana S G Fischer
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Curitiba, Paraná, Brazil
| | - Rosario Duran
- Analytical Biochemistry and Proteomics Unit, Instituto de Investigaciones Biológicas Clemente Estable, Institut Pasteur de Montevideo, Montevideo, Uruguay.
| | - Paulo C Carvalho
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Curitiba, Paraná, Brazil.
| |
Collapse
|
5
|
Lin AD, Fischer JDSDG, Santos MDM, Camillo-Andrade AC, Kurt LU, Souza TACB, Lajas ABL, Rivera B, Portela M, Duran R, Mira MT, Pillonetto M, Carvalho PC. Beyond the identifiable proteome: Delving into the proteomics of polymyxin-resistant and non-resistant Acinetobacter baumannii from Brazilian hospitals. J Proteomics 2023; 289:105012. [PMID: 37748533 DOI: 10.1016/j.jprot.2023.105012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 09/21/2023] [Accepted: 09/21/2023] [Indexed: 09/27/2023]
Abstract
This work discloses a unique, comprehensive proteomic dataset of Acinetobacter baumannii strains, both resistant and non-resistant to polymyxin B, isolated in Brazil generated using Orbitrap Fusion Lumos. From nearly 4 million tandem mass spectra, the software DiagnoMass produced 240,685 quality-filtered mass spectral clusters, of which PatternLab for proteomics identified 44,553 peptides mapping to 3479 proteins. Crucially, DiagnoMass shortlisted 3550 and 1408 unique mass spectral clusters for the resistant and non-resistant strains, respectively, with only about a third with sequences (and PTMs) identified by PatternLab. Further open-search attempts via FragPipe yielded an additional ∼20% identifications, suggesting the remaining unidentified spectra likely arise from complex combinations of post-translational modifications and amino-acid substitutions. This highlights the untapped potential of the dataset for future discoveries, particularly given the importance of PTMs, which remain elusive to nucleotide sequencing approaches but are crucial for understanding biological mechanisms. Our innovative approach extends beyond the identifications that are typically subjected to the bias of a search engine; we discern which spectral clusters are differential and subject them to increased scrutiny, akin to spectral library matching by comparing captured spectra to themselves. Our analysis reveals adaptations in the resistant strain, including enhanced detoxification, altered protein synthesis, and metabolic adjustments. SIGNIFICANCE: We present comprehensive proteomic profiles of non-resistant and resistant Acinetobacter baumannii from Brazilian Hospitals strains, and highlight the presence of discriminative and yet unidentified mass spectral clusters. Our work emphasizes the importance of exploring this overlooked data, as it could hold the key to understanding the complex dynamics of antibiotic resistance. This approach not only informs antimicrobial stewardship efforts but also paves the way for the development of innovative diagnostic tools. Thus, our findings have profound implications for the field, as far as methods for providing a new perspective on diagnosing antibiotic resistance as well as classifying proteomes in general.
Collapse
Affiliation(s)
- Amanda Dal Lin
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz - Paraná, Brazil; Laboratório Experimental Multiuso, Pontifícia Universidade Católica do Paraná, Brazil
| | - Juliana de S da G Fischer
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz - Paraná, Brazil
| | - Marlon D M Santos
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz - Paraná, Brazil; Analytical Biochemistry and Proteomics Unit, Institut Pasteur de Montevideo/IIBCE, Montevideo, Uruguay
| | - Amanda Caroline Camillo-Andrade
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz - Paraná, Brazil; Analytical Biochemistry and Proteomics Unit, Institut Pasteur de Montevideo/IIBCE, Montevideo, Uruguay
| | - Louise Ulrich Kurt
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz - Paraná, Brazil
| | - Tatiana A C B Souza
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz - Paraná, Brazil
| | - Ana Beatriz Lyrio Lajas
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz - Paraná, Brazil
| | - Bernardina Rivera
- Analytical Biochemistry and Proteomics Unit, Institut Pasteur de Montevideo/IIBCE, Montevideo, Uruguay
| | - Magdalena Portela
- Analytical Biochemistry and Proteomics Unit, Institut Pasteur de Montevideo/IIBCE, Montevideo, Uruguay
| | - Rosario Duran
- Analytical Biochemistry and Proteomics Unit, Institut Pasteur de Montevideo/IIBCE, Montevideo, Uruguay
| | - Marcelo Távora Mira
- Laboratório Experimental Multiuso, Pontifícia Universidade Católica do Paraná, Brazil
| | - Marcelo Pillonetto
- Laboratório Experimental Multiuso, Pontifícia Universidade Católica do Paraná, Brazil; Laboratório Central do Estado do Paraná, Brazil.
| | - Paulo Costa Carvalho
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz - Paraná, Brazil.
| |
Collapse
|
6
|
Clasen MA, Santos MDM, Kurt LU, Fischer J, Camillo-Andrade AC, Sales LA, de Arruda Campos Brasil de Souza T, Lima DB, Gozzo FC, Valente RH, Duran R, Barbosa VC, Carvalho PC. PatternLab V Handles Multiplex Spectra in Shotgun Proteomic Searches and Increases Identification. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2023; 34:794-796. [PMID: 36947430 DOI: 10.1021/jasms.3c00063] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Complex protein mixtures typically generate many tandem mass spectra produced by different peptides coisolated in the gas phase. Widely adopted proteomic data analysis environments usually fail to identify most of these spectra, succeeding at best in identifying only one of the multiple cofragmenting peptides. We present PatternLab V (PLV), an updated version of PatternLab that integrates the YADA 3 deconvolution algorithm to handle such cases efficiently. In general, we expect an increase of 10% in spectral identifications when dealing with complex proteomic samples. PLV is freely available at http://patternlabforproteomics.org.
Collapse
Affiliation(s)
- Milan A Clasen
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz-Parana 81310-020, Brazil
| | - Marlon D M Santos
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz-Parana 81310-020, Brazil
| | - Louise Ulrich Kurt
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz-Parana 81310-020, Brazil
| | - Juliana Fischer
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz-Parana 81310-020, Brazil
| | - Amanda C Camillo-Andrade
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz-Parana 81310-020, Brazil
| | - Lucas Albuquerque Sales
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz-Parana 81310-020, Brazil
| | | | - Diogo Borges Lima
- Department of Structural Biology, Leibniz─Forschungsinstitut für Molekulare Pharmakologie (FMP), Berlin 13125, Germany
| | - Fabio C Gozzo
- Dalton Mass Spectrometry Laboratory, Unicamp, Campinas 13083-970, Brazil
| | - Richard Hemmi Valente
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fiocruz, Rio de Janeiro 21040-900, Brazil
| | - Rosario Duran
- Analytical Biochemistry and Proteomics Unit, Institut Pasteur de Montevideo/IIBCE, Montevideo 11400, Uruguay
| | - Valmir C Barbosa
- Systems Engineering and Computer Science Program, Federal University of Rio de Janeiro, Rio de Janeiro 21941-914, Brazil
| | - Paulo C Carvalho
- Laboratory for Structural and Computational Proteomics, Carlos Chagas Institute, Fiocruz-Parana 81310-020, Brazil
| |
Collapse
|