1
|
Knox KE, Abrahamsson D, Trowbridge J, Park JS, Wang M, Carrera E, Hartmayer L, Morello-Frosch R, Rudel RA. Application of a Non-targeted Biomonitoring Method to Characterize Occupational Chemical Exposures of Women Nurses Relative to Office Workers. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2025; 59:9437-9448. [PMID: 40324159 PMCID: PMC12096436 DOI: 10.1021/acs.est.4c14790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2024] [Revised: 04/11/2025] [Accepted: 04/14/2025] [Indexed: 05/07/2025]
Abstract
We analyzed blood serum samples from two unique female occupational cohorts - 60 nurses and 40 office workers in San Francisco, CA - using liquid chromatography and high-resolution mass spectrometry (quadrupole time-of-flight). Applying a nontargeted analysis (NTA) approach, we sought to isolate occupationally related chemical exposures that were unique to nurses by flagging features that were different from office workers in abundance (mean; 95th percentile) or detection frequency. Of 9828 negative electrospray ionization (ESI-) and 6898 positive electrospray ionization (ESI+) detected chemical features, 1094 and 938, respectively, were higher in nurses, possibly due to workplace exposures. We deciphered the molecular structures of these chemical features by applying data-dependent acquisition (DDA) and targeted MS/MS approaches to pooled samples from each occupational group, and we annotated them using spectral MS/MS databases in MS-DIAL. Nurses had higher concentrations of 14 chemicals that we identified at Schymanski Level 1 (N = 6) or 2 (N = 8), as well as 20 tentatively identified chemicals without spectra. Several chemicals may be occupationally relevant for nurses, including a PFAS (6:2 fluorotelomer sulfonic acid), tridecanedioic acid, salicylic acid, and the medications acetaminophen and theophylline. To our knowledge, this study is the first to apply NTA to elucidate novel chemical exposures in nurses.
Collapse
Affiliation(s)
- Kristin E. Knox
- Silent
Spring Institute, Newton, Massachusetts02460, United States
| | - Dimitri Abrahamsson
- Program
on Reproductive Health and the Environment, Department of Obstetrics,
Gynecology and Reproductive Sciences, University
of California, San Francisco, California94143, United States
| | - Jessica Trowbridge
- Program
on Reproductive Health and the Environment, Department of Obstetrics,
Gynecology and Reproductive Sciences, University
of California, San Francisco, California94143, United States
| | - June-Soo Park
- Environmental
Chemistry Laboratory, Department of Toxic Substances Control, California Environmental Protection Agency, Berkeley, California94710, United States
| | - Miaomiao Wang
- Environmental
Chemistry Laboratory, Department of Toxic Substances Control, California Environmental Protection Agency, Berkeley, California94710, United States
| | - Erin Carrera
- Department
of Nursing, University of San Francisco, San Francisco, California94143, United States
- California
Nurses for Environmental Health & Justice, Bolinas, California94924, United States
| | - Lisa Hartmayer
- Department
of Nursing, University of San Francisco, San Francisco, California94143, United States
- California
Nurses for Environmental Health & Justice, Bolinas, California94924, United States
| | - Rachel Morello-Frosch
- Department
of Environmental Science, Policy and Management, and School of Public
Health, University of California, Berkeley, California94720, United States
| | - R. A. Rudel
- Silent
Spring Institute, Newton, Massachusetts02460, United States
| |
Collapse
|
2
|
Canchola A, Tran LN, Woo W, Tian L, Lin YH, Chou WC. Advancing non-target analysis of emerging environmental contaminants with machine learning: Current status and future implications. ENVIRONMENT INTERNATIONAL 2025; 198:109404. [PMID: 40139034 DOI: 10.1016/j.envint.2025.109404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Revised: 03/03/2025] [Accepted: 03/18/2025] [Indexed: 03/29/2025]
Abstract
Emerging environmental contaminants (EECs) such as pharmaceuticals, pesticides, and industrial chemicals pose significant challenges for detection and identification due to their structural diversity and lack of analytical standards. Traditional targeted screening methods often fail to detect these compounds, making non-target analysis (NTA) using high-resolution mass spectrometry (HRMS) essential for identifying unknown or suspected contaminants. However, interpreting the vast datasets generated by HRMS is complex and requires advanced data processing techniques. Recent advancements in machine learning (ML) models offer great potential for enhancing NTA applications. As such, we reviewed key developments, including optimizing workflows using computational tools, improved chemical structure identification, advanced quantification methods, and enhanced toxicity prediction capabilities. It also discusses challenges and future perspectives in the field, such as refining ML tools for complex mixtures, improving inter-laboratory validation, and further integrating computational models into environmental risk assessment frameworks. By addressing these challenges, ML-assisted NTA can significantly enhance the detection, quantification, and evaluation of EECs, ultimately contributing to more effective environmental monitoring and public health protection.
Collapse
Affiliation(s)
- Alexa Canchola
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States; Department of Environmental Sciences, College of Natural & Agricultural Sciences, University of California, Riverside, CA 92521, United States
| | - Lillian N Tran
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States
| | - Wonsik Woo
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States
| | - Linhui Tian
- Department of Environmental Sciences, College of Natural & Agricultural Sciences, University of California, Riverside, CA 92521, United States
| | - Ying-Hsuan Lin
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States; Department of Environmental Sciences, College of Natural & Agricultural Sciences, University of California, Riverside, CA 92521, United States.
| | - Wei-Chun Chou
- Environmental Toxicology Graduate Program, University of California, Riverside, CA 92521, United States; Department of Environmental Sciences, College of Natural & Agricultural Sciences, University of California, Riverside, CA 92521, United States.
| |
Collapse
|
3
|
Fisher CM, Miele MM, Knolhoff AM. Community Needs and Proposed Solutions for a Broadly Applicable Standard/QC Mixture for High-Resolution Mass Spectrometry-Based Non-Targeted Analysis. Anal Chem 2025; 97:5424-5433. [PMID: 40042173 DOI: 10.1021/acs.analchem.4c05710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2025]
Abstract
Non-targeted analysis (NTA) using high-resolution mass spectrometry (HRMS) is a global chemical screening approach that generates information-rich data which can be used to detect and identify unknown chemicals. NTA is a powerful approach which is increasingly being used for a variety of sample types, research fields, and goals. However, there are challenges associated with accurate assessments of data quality and method performance, comparability across laboratories/instruments/methods, and communication of results/confidence. A standard mixture containing a sufficient number and diversity of chemicals would help address these needs, but is not yet commercially available. Thus, we conducted a survey of 146 NTA researchers to examine desired requirements for the broad fields, studies, and goals where NTA can be applied. We also compare this feedback to previously published in-house standard mixtures, which, we argue, are models for a standard that can be adjusted to fit the NTA community's needs and possibly commercialized. Reversed-phase liquid chromatography HRMS is one of the most common methods used for NTA; therefore, this survey is focused on characteristics necessary for these types of methods. We intend this information to communicate the need for an interdisciplinary NTA standard mixture, the importance of implementing standards, and to lower the barriers for chemical vendor standard mixture development and distribution.
Collapse
Affiliation(s)
- Christine M Fisher
- U.S. Food and Drug Administration, Human Foods Program, 5001 Campus Drive, College Park, Maryland 20740, United States
| | - Matthew M Miele
- U.S. Food and Drug Administration, Human Foods Program, 5001 Campus Drive, College Park, Maryland 20740, United States
| | - Ann M Knolhoff
- U.S. Food and Drug Administration, Human Foods Program, 5001 Campus Drive, College Park, Maryland 20740, United States
| |
Collapse
|
4
|
Arturi K, Harris EJ, Gasser L, Escher BI, Braun G, Bosshard R, Hollender J. MLinvitroTox reloaded for high-throughput hazard-based prioritization of high-resolution mass spectrometry data. J Cheminform 2025; 17:14. [PMID: 39891244 PMCID: PMC11786476 DOI: 10.1186/s13321-025-00950-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2024] [Accepted: 01/06/2025] [Indexed: 02/03/2025] Open
Abstract
MLinvitroTox is an automated Python pipeline developed for high-throughput hazard-driven prioritization of toxicologically relevant signals detected in complex environmental samples through high-resolution tandem mass spectrometry (HRMS/MS). MLinvitroTox is a machine learning (ML) framework comprising 490 independent XGBoost classifiers trained on molecular fingerprints from chemical structures and target-specific endpoints from the ToxCast/Tox21 invitroDBv4.1 database. For each analyzed HRMS feature, MLinvitroTox generates a 490-bit bioactivity fingerprint used as a basis for prioritization, focusing the time-consuming molecular identification efforts on features most likely to cause adverse effects. The practical advantages of MLinvitroTox are demonstrated for groundwater HRMS data. Among the 874 features for which molecular fingerprints were derived from spectra, including 630 nontargets, 185 spectral matches, and 59 targets, around 4% of the feature/endpoint relationship pairs were predicted to be active. Cross-checking the predictions for targets and spectral matches with invitroDB data confirmed the bioactivity of 120 active and 6791 nonactive pairs while mislabeling 88 active and 56 non-active relationships. By filtering according to bioactivity probability, endpoint scores, and similarity to the training data, the number of potentially toxic features was reduced by at least one order of magnitude. This refinement makes the analytical confirmation of the toxicologically most relevant features feasible, offering significant benefits for cost-efficient chemical risk assessment.Scientific Contribution:In contrast to the classical ML-based approaches for toxicity prediction, MLinvitroTox predicts bioactivity for HRMS features (i.e., distinct m/z signals) based on MS2 fragmentation spectra rather than the chemical structures from the identified features. While the original proof of concept study was accompanied by the release of a MLinvitroTox v1 KNIME workflow, in this study, we release a Python MLinvitroTox v2 package, which, in addition to automation, expands functionality to include predicting toxicity from structures, cleaning up and generating chemical fingerprints, customizing models, and retraining on custom data. Furthermore, as a result of improvements in bioactivity data processing, realized in the concurrently released pytcpl Python package for the custom processing of invitroDBv4.1 input data used for training MLinvitroTox, the current release introduces enhancements in model accuracy, coverage of biological mechanistic targets, and overall interpretability.
Collapse
Affiliation(s)
- Katarzyna Arturi
- Department of Environmental Chemistry, Swiss Federal Institute of Aquatic Science and Technology (Eawag), Überlandstrasse 133, 8600, Dübendorf, Switzerland.
| | - Eliza J Harris
- Swiss Data Science Center (SDSC), Andreasstrasse 5, 8092, Zürich, Switzerland
- Now at: Climate and Environmental Physics Division, University of Bern, Sidlerstrasse 5, 3012, Bern, Switzerland
| | - Lilian Gasser
- Swiss Data Science Center (SDSC), Andreasstrasse 5, 8092, Zürich, Switzerland
| | - Beate I Escher
- Cell Toxicology, Helmholtz Centre for Environmental Research (UFZ), Permoserstr. 15, 04318, Leipzig, Germany
| | - Georg Braun
- Cell Toxicology, Helmholtz Centre for Environmental Research (UFZ), Permoserstr. 15, 04318, Leipzig, Germany
| | - Robin Bosshard
- Department of Computer Science, Eidgenössische Technische Hochschule Zürich (ETH Zürich), Universitätstrasse 6, 8092, Zürich, Switzerland
| | - Juliane Hollender
- Department of Environmental Chemistry, Swiss Federal Institute of Aquatic Science and Technology (Eawag), Überlandstrasse 133, 8600, Dübendorf, Switzerland.
- Institute of Biogeochemistry and Pollution Dynamics, Eidgenössische Technische Hochschule Zürich (ETH Zürich), Rämistrasse 101, 8092, Zürich, Switzerland.
| |
Collapse
|
5
|
Abrahamsson D, Koronaiou LA, Johnson T, Yang J, Ji X, Lambropoulou DA. Modeling the relative response factor of small molecules in positive electrospray ionization. RSC Adv 2024; 14:37470-37482. [PMID: 39582938 PMCID: PMC11583891 DOI: 10.1039/d4ra06695b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Accepted: 11/15/2024] [Indexed: 11/26/2024] Open
Abstract
Technological advancements in liquid chromatography (LC) electrospray ionization (ESI) high-resolution mass spectrometry (HRMS) have made it an increasingly popular analytical technique in non-targeted analysis (NTA) of environmental and biological samples. One critical limitation of current methods in NTA is the lack of available analytical standards for many of the compounds detected in biological and environmental samples. Computational approaches can provide estimates of concentrations by modeling the relative response factor of a compound (RRF) expressed as the peak area of a given peak divided by its concentration. In this paper, we explore the application of molecular dynamics (MD) in the development of a computational workflow for predicting RRF. We obtained measurements of RRF for 48 compounds with LC - quadrupole time-of-flight (QTOF) MS and calculated their RRF. We used the CGenFF force field to generate the topologies and GROMACS to conduct the (MD) simulations. We calculated the Lennard-Jones and Coulomb interactions between the analytes and all other molecules in the ESI droplet, which were then sampled to construct a multilinear regression model for predicting RRF using Monte Carlo simulations. The best performing model showed a coefficient of determination (R 2) of 0.82 and a mean absolute error (MAE) of 0.13 log units. This performance is comparable to other predictive models including machine learning models. While there is a need for further evaluation of diverse chemical structures, our approach showed promise in predictions of RRF.
Collapse
Affiliation(s)
- Dimitri Abrahamsson
- Department of Pediatrics, New York University Grossman School of Medicine New York 10016 USA
- Department of Obstetrics, Gynecology and Reproductive Sciences, School of Medicine, University of California San Francisco California 94158 USA
| | - Lelouda-Athanasia Koronaiou
- Laboratory of Environmental Pollution Control, Department of Chemistry, Aristotle University of Thessaloniki University Campus 54124 Thessaloniki Greece
- Center for Interdisciplinary Research and Innovation (CIRI-AUTH), Balkan Center Thessaloniki 57001 Greece
| | - Trevor Johnson
- Department of Pediatrics, New York University Grossman School of Medicine New York 10016 USA
| | - Junjie Yang
- Department of Obstetrics, Gynecology and Reproductive Sciences, School of Medicine, University of California San Francisco California 94158 USA
| | - Xiaowen Ji
- Department of Pediatrics, New York University Grossman School of Medicine New York 10016 USA
| | - Dimitra A Lambropoulou
- Laboratory of Environmental Pollution Control, Department of Chemistry, Aristotle University of Thessaloniki University Campus 54124 Thessaloniki Greece
- Center for Interdisciplinary Research and Innovation (CIRI-AUTH), Balkan Center Thessaloniki 57001 Greece
| |
Collapse
|
6
|
Ferreira CR, Lima Gomes PCFD, Robison KM, Cooper BR, Shannahan JH. Implementation of multiomic mass spectrometry approaches for the evaluation of human health following environmental exposure. Mol Omics 2024; 20:296-321. [PMID: 38623720 PMCID: PMC11163948 DOI: 10.1039/d3mo00214d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 03/22/2024] [Indexed: 04/17/2024]
Abstract
Omics analyses collectively refer to the possibility of profiling genetic variants, RNA, epigenetic markers, proteins, lipids, and metabolites. The most common analytical approaches used for detecting molecules present within biofluids related to metabolism are vibrational spectroscopy techniques, represented by infrared, Raman, and nuclear magnetic resonance (NMR) spectroscopies and mass spectrometry (MS). Omics-based assessments utilizing MS are rapidly expanding and being applied to various scientific disciplines and clinical settings. Most of the omics instruments are operated by specialists in dedicated laboratories; however, the development of miniature portable omics has made the technology more available to users for field applications. Variations in molecular information gained from omics approaches are useful for evaluating human health following environmental exposure and the development and progression of numerous diseases. As MS technology develops so do statistical and machine learning methods for the detection of molecular deviations from personalized metabolism, which are correlated to altered health conditions, and they are intended to provide a multi-disciplinary overview for researchers interested in adding multiomic analysis to their current efforts. This includes an introduction to mass spectrometry-based omics technologies, current state-of-the-art capabilities and their respective strengths and limitations for surveying molecular information. Furthermore, we describe how knowledge gained from these assessments can be applied to personalized medicine and diagnostic strategies.
Collapse
Affiliation(s)
- Christina R Ferreira
- Purdue Metabolite Profiling Facility, Purdue University, West Lafayette, IN 47907, USA.
| | | | - Kiley Marie Robison
- School of Health Sciences, College of Health and Human Sciences, Purdue University, West Lafayette, IN 47907, USA
| | - Bruce R Cooper
- Purdue Metabolite Profiling Facility, Purdue University, West Lafayette, IN 47907, USA.
| | - Jonathan H Shannahan
- School of Health Sciences, College of Health and Human Sciences, Purdue University, West Lafayette, IN 47907, USA
| |
Collapse
|
7
|
Johnson TA, Abrahamsson DP. Quantification of chemicals in non-targeted analysis without analytical standards - Understanding the mechanism of electrospray ionization and making predictions. CURRENT OPINION IN ENVIRONMENTAL SCIENCE & HEALTH 2024; 37:100529. [PMID: 38312491 PMCID: PMC10836048 DOI: 10.1016/j.coesh.2023.100529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2024]
Abstract
The constant creation and release of new chemicals to the environment is forming an ever-widening gap between available analytical standards and known chemicals. Developing non-targeted analysis (NTA) methods that have the ability to detect a broad spectrum of compounds is critical for research and analysis of emerging contaminants. There is a need to develop methods that make it possible to identify compound structures from their MS and MS/MS information and quantify them without analytical standards. Method refinements that utilize machine learning algorithms and chemical descriptors to estimate the instrument response of particular compounds have made progress in recent years. This narrative review seeks to summarize the current state of the field of non-targeted analysis (NTA) toward quantification of unknowns without the use of analytical standards. Despite the limited accumulation of validation studies on real samples, the ongoing enhancement in data processing and refinement of machine learning tools could lead to more comprehensive chemical coverage of NTA and validated quantitative NTA methods, thus boosting confidence in their usage and enhancing the utility of quantitative NTA.
Collapse
Affiliation(s)
- Trevor A Johnson
- Division of Environmental Pediatrics, Department of Pediatrics, Grossman School of Medicine, New York University
| | - Dimitri P Abrahamsson
- Division of Environmental Pediatrics, Department of Pediatrics, Grossman School of Medicine, New York University
| |
Collapse
|
8
|
Abrahamsson D, Brueck CL, Prasse C, Lambropoulou DA, Koronaiou LA, Wang M, Park JS, Woodruff TJ. Extracting Structural Information from Physicochemical Property Measurements Using Machine Learning─A New Approach for Structure Elucidation in Non-targeted Analysis. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:14827-14838. [PMID: 37746919 PMCID: PMC10569036 DOI: 10.1021/acs.est.3c03003] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 08/29/2023] [Accepted: 08/30/2023] [Indexed: 09/26/2023]
Abstract
Non-targeted analysis (NTA) has made critical contributions in the fields of environmental chemistry and environmental health. One critical bottleneck is the lack of available analytical standards for most chemicals in the environment. Our study aims to explore a novel approach that integrates measurements of equilibrium partition ratios between organic solvents and water (KSW) to predictions of molecular structures. These properties can be used as a fingerprint, which with the help of a machine learning algorithm can be converted into a series of functional groups (RDKit fragments), which can be used to search chemical databases. We conducted partitioning experiments using a chemical mixture containing 185 chemicals in 10 different organic solvents and water. Both a liquid chromatography quadrupole time-of-flight mass spectrometer (LC-QTOF MS) and a LC-Orbitrap MS were used to assess the feasibility of the experimental method and the accuracy of the algorithm at predicting the correct functional groups. The two methods showed differences in log KSW with the QTOF method showing a mean absolute error (MAE) of 0.22 and the Orbitrap method 0.33. The differences also culminated into errors in the predictions of RDKit fragments with the MAE for the QTOF method being 0.23 and for the Orbitrap method being 0.31. Our approach presents a new angle in structure elucidation for NTA and showed promise in assisting with compound identification.
Collapse
Affiliation(s)
- Dimitri Abrahamsson
- Department
of Pediatrics, New York University Grossman
School of Medicine, New York, New York 10016, United States
- Department
of Obstetrics, Gynecology and Reproductive Sciences, Program on Reproductive
Health and the Environment, University of
California, San Francisco, California 94107, United States
| | - Christopher L. Brueck
- Department
of Environmental Health and Engineering, Johns Hopkins University, Baltimore, Maryland 21205, United States
- Exponent, Environmental and Earth Sciences Practice, Bellevue, Washington 98007, United States
| | - Carsten Prasse
- Department
of Environmental Health and Engineering, Johns Hopkins University, Baltimore, Maryland 21205, United States
- Risk
Sciences
and Public Policy Institute, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, Maryland 21205, United States
| | - Dimitra A. Lambropoulou
- Department
of Chemistry, Aristotle University of Thessaloniki, University Campus, 54124 Thessaloniki Greece
- Laboratory
of Environmental Pollution Control, Department of Chemistry, Aristotle University of Thessaloniki, GR-541 24 Thessaloniki, Greece
- Center for
Interdisciplinary Research and Innovation (CIRI-AUTH), Balkan Center, Thessaloniki, GR-57001, Greece
| | - Lelouda-Athanasia Koronaiou
- Department
of Chemistry, Aristotle University of Thessaloniki, University Campus, 54124 Thessaloniki Greece
- Laboratory
of Environmental Pollution Control, Department of Chemistry, Aristotle University of Thessaloniki, GR-541 24 Thessaloniki, Greece
- Center for
Interdisciplinary Research and Innovation (CIRI-AUTH), Balkan Center, Thessaloniki, GR-57001, Greece
| | - Miaomiao Wang
- Department
of Toxic Substances Control, Environmental Chemistry Laboratory, California Environmental Agency, Berkeley, California 94710, United States
| | - June-Soo Park
- Department
of Obstetrics, Gynecology and Reproductive Sciences, Program on Reproductive
Health and the Environment, University of
California, San Francisco, California 94107, United States
- Department
of Toxic Substances Control, Environmental Chemistry Laboratory, California Environmental Agency, Berkeley, California 94710, United States
| | - Tracey J. Woodruff
- Department
of Obstetrics, Gynecology and Reproductive Sciences, Program on Reproductive
Health and the Environment, University of
California, San Francisco, California 94107, United States
| |
Collapse
|
9
|
Yesiltepe Y, Govind N, Metz TO, Renslow RS. An initial investigation of accuracy required for the identification of small molecules in complex samples using quantum chemical calculated NMR chemical shifts. J Cheminform 2022; 14:64. [PMID: 36138446 PMCID: PMC9499888 DOI: 10.1186/s13321-022-00587-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 02/06/2022] [Indexed: 11/24/2022] Open
Abstract
The majority of primary and secondary metabolites in nature have yet to be identified, representing a major challenge for metabolomics studies that currently require reference libraries from analyses of authentic compounds. Using currently available analytical methods, complete chemical characterization of metabolomes is infeasible for both technical and economic reasons. For example, unambiguous identification of metabolites is limited by the availability of authentic chemical standards, which, for the majority of molecules, do not exist. Computationally predicted or calculated data are a viable solution to expand the currently limited metabolite reference libraries, if such methods are shown to be sufficiently accurate. For example, determining nuclear magnetic resonance (NMR) spectroscopy spectra in silico has shown promise in the identification and delineation of metabolite structures. Many researchers have been taking advantage of density functional theory (DFT), a computationally inexpensive yet reputable method for the prediction of carbon and proton NMR spectra of metabolites. However, such methods are expected to have some error in predicted 13C and 1H NMR spectra with respect to experimentally measured values. This leads us to the question-what accuracy is required in predicted 13C and 1H NMR chemical shifts for confident metabolite identification? Using the set of 11,716 small molecules found in the Human Metabolome Database (HMDB), we simulated both experimental and theoretical NMR chemical shift databases. We investigated the level of accuracy required for identification of metabolites in simulated pure and impure samples by matching predicted chemical shifts to experimental data. We found 90% or more of molecules in simulated pure samples can be successfully identified when errors of 1H and 13C chemical shifts in water are below 0.6 and 7.1 ppm, respectively, and below 0.5 and 4.6 ppm in chloroform solvation, respectively. In simulated complex mixtures, as the complexity of the mixture increased, greater accuracy of the calculated chemical shifts was required, as expected. However, if the number of molecules in the mixture is known, e.g., when NMR is combined with MS and sample complexity is low, the likelihood of confident molecular identification increased by 90%.
Collapse
Affiliation(s)
- Yasemin Yesiltepe
- The Gene and Linda Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, WA, USA
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Niranjan Govind
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Thomas O Metz
- The Gene and Linda Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, WA, USA
| | - Ryan S Renslow
- The Gene and Linda Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, WA, USA.
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA.
| |
Collapse
|
10
|
Approaches for assessing performance of high-resolution mass spectrometry-based non-targeted analysis methods. Anal Bioanal Chem 2022; 414:6455-6471. [PMID: 35796784 PMCID: PMC9411239 DOI: 10.1007/s00216-022-04203-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Revised: 06/17/2022] [Accepted: 06/24/2022] [Indexed: 11/06/2022]
Abstract
Non-targeted analysis (NTA) using high-resolution mass spectrometry has enabled the detection and identification of unknown and unexpected compounds of interest in a wide range of sample matrices. Despite these benefits of NTA methods, standardized procedures do not yet exist for assessing performance, limiting stakeholders’ abilities to suitably interpret and utilize NTA results. Herein, we first summarize existing performance assessment metrics for targeted analyses to provide context and clarify terminology that may be shared between targeted and NTA methods (e.g., terms such as accuracy, precision, sensitivity, and selectivity). We then discuss promising approaches for assessing NTA method performance, listing strengths and key caveats for each approach, and highlighting areas in need of further development. To structure the discussion, we define three types of NTA study objectives: sample classification, chemical identification, and chemical quantitation. Qualitative study performance (i.e., focusing on sample classification and/or chemical identification) can be assessed using the traditional confusion matrix, with some challenges and limitations. Quantitative study performance can be assessed using estimation procedures developed for targeted methods with consideration for additional sources of uncontrolled experimental error. This article is intended to stimulate discussion and further efforts to develop and improve procedures for assessing NTA method performance. Ultimately, improved performance assessments will enable accurate communication and effective utilization of NTA results by stakeholders.
Collapse
|
11
|
Uncertainty estimation strategies for quantitative non-targeted analysis. Anal Bioanal Chem 2022; 414:4919-4933. [PMID: 35699740 DOI: 10.1007/s00216-022-04118-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2022] [Revised: 04/13/2022] [Accepted: 05/04/2022] [Indexed: 11/01/2022]
Abstract
Non-targeted analysis (NTA) methods are widely used for chemical discovery but seldom employed for quantitation due to a lack of robust methods to estimate chemical concentrations with confidence limits. Herein, we present and evaluate new statistical methods for quantitative NTA (qNTA) using high-resolution mass spectrometry (HRMS) data from EPA's Non-Targeted Analysis Collaborative Trial (ENTACT). Experimental intensities of ENTACT analytes were observed at multiple concentrations using a semi-automated NTA workflow. Chemical concentrations and corresponding confidence limits were first estimated using traditional calibration curves. Two qNTA estimation methods were then implemented using experimental response factor (RF) data (where RF = intensity/concentration). The bounded response factor method used a non-parametric bootstrap procedure to estimate select quantiles of training set RF distributions. Quantile estimates then were applied to test set HRMS intensities to inversely estimate concentrations with confidence limits. The ionization efficiency estimation method restricted the distribution of likely RFs for each analyte using ionization efficiency predictions. Given the intended future use for chemical risk characterization, predicted upper confidence limits (protective values) were compared to known chemical concentrations. Using traditional calibration curves, 95% of upper confidence limits were within ~tenfold of the true concentrations. The error increased to ~60-fold (ESI+) and ~120-fold (ESI-) for the ionization efficiency estimation method and to ~150-fold (ESI+) and ~130-fold (ESI-) for the bounded response factor method. This work demonstrates successful implementation of confidence limit estimation strategies to support qNTA studies and marks a crucial step towards translating NTA data in a risk-based context.
Collapse
|
12
|
Costalunga R, Tshepelevitsh S, Sepman H, Kull M, Kruve A. Sodium adduct formation with graph-based machine learning can aid structural elucidation in non-targeted LC/ESI/HRMS. Anal Chim Acta 2022; 1204:339402. [PMID: 35397906 DOI: 10.1016/j.aca.2021.339402] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Revised: 12/07/2021] [Accepted: 12/23/2021] [Indexed: 11/01/2022]
Abstract
Non-targeted screening with LC/ESI/HRMS aims to identify the structure of the detected compounds using their retention time, exact mass, and fragmentation pattern. Challenges remain in differentiating between isomeric compounds. One untapped possibility to facilitate identification of isomers relies on different ionic species formed in electrospray. In positive ESI mode, both protonated molecules and adducts can be formed; however, not all isomeric structures form the same ionic species. The complicated mechanism of adduct formation has hindered the use of this molecular characteristic in the structural elucidation in non-targeted screening. Here, we have studied the adduct formation for 94 small molecules with ion mobility spectra and compared collision cross-sections of the respective ions. Based on the results we developed a fast support vector machine classifier with polynomial kernels for accurately predicting the sodium adduct formation in ESI/HRMS. The model is trained on five independent data sets from different laboratories and uses the graph-based connectivity of functional groups and PubChem fingerprints to predict the sodium adduct formation in ESI/HRMS. The validation of the model showed an accuracy of 74.7% (balanced accuracy 70.0%) on a dataset from an independent laboratory, which was not used in the training of the model. Lastly, we applied the classification algorithm to the SusDat database by NORMAN network to evaluate the proportion of isomeric compounds that could be distinguished based on predicted sodium adduct formation. It was observed that sodium adduct formation probability can provide additional selectivity for about one quarter of the exact masses and, therefore, shows practical utility for structural assignment in non-targeted screening.
Collapse
Affiliation(s)
- Riccardo Costalunga
- Department of Materials and Environmental Chemistry, Stockholm University, Svante Arrhenius väg 16, 106 91, Stockholm, Sweden; Department of Food and Drug, University of Parma, via Università, 12, I 43121, Parma, Italy
| | - Sofja Tshepelevitsh
- Institute of Chemistry, University of Tartu, Ravila 14a, Tartu, 50411, Estonia
| | - Helen Sepman
- Department of Materials and Environmental Chemistry, Stockholm University, Svante Arrhenius väg 16, 106 91, Stockholm, Sweden
| | - Meelis Kull
- Institute of Computer Science, University of Tartu, Narva mnt 18, 51009, Tartu, Estonia
| | - Anneli Kruve
- Department of Materials and Environmental Chemistry, Stockholm University, Svante Arrhenius väg 16, 106 91, Stockholm, Sweden.
| |
Collapse
|
13
|
Colby SM, Chang CH, Bade JL, Nunez JR, Blumer MR, Orton DJ, Bloodsworth KJ, Nakayasu ES, Smith RD, Ibrahim YM, Renslow RS, Metz TO. DEIMoS: An Open-Source Tool for Processing High-Dimensional Mass Spectrometry Data. Anal Chem 2022; 94:6130-6138. [PMID: 35430813 PMCID: PMC9047447 DOI: 10.1021/acs.analchem.1c05017] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 04/05/2022] [Indexed: 01/06/2023]
Abstract
We present DEIMoS: Data Extraction for Integrated Multidimensional Spectrometry, a Python application programming interface (API) and command-line tool for high-dimensional mass spectrometry data analysis workflows that offers ease of development and access to efficient algorithmic implementations. Functionality includes feature detection, feature alignment, collision cross section (CCS) calibration, isotope detection, and MS/MS spectral deconvolution, with the output comprising detected features aligned across study samples and characterized by mass, CCS, tandem mass spectra, and isotopic signature. Notably, DEIMoS operates on N-dimensional data, largely agnostic to acquisition instrumentation; algorithm implementations simultaneously utilize all dimensions to (i) offer greater separation between features, thus improving detection sensitivity, (ii) increase alignment/feature matching confidence among data sets, and (iii) mitigate convolution artifacts in tandem mass spectra. We demonstrate DEIMoS with LC-IMS-MS/MS metabolomics data to illustrate the advantages of a multidimensional approach in each data processing step.
Collapse
Affiliation(s)
- Sean M. Colby
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Christine H. Chang
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Jessica L. Bade
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Jamie R. Nunez
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Madison R. Blumer
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Daniel J. Orton
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Kent J. Bloodsworth
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Ernesto S. Nakayasu
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Richard D. Smith
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Yehia M. Ibrahim
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Ryan S. Renslow
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| | - Thomas O. Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352 United States
| |
Collapse
|
14
|
Blumer MR, Chang CH, Brayfindley E, Nunez JR, Colby SM, Renslow RS, Metz TO. Mass Spectrometry Adduct Calculator. J Chem Inf Model 2021; 61:5721-5725. [PMID: 34842435 DOI: 10.1021/acs.jcim.1c00579] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We describe the Mass Spectrometry Adduct Calculator (MSAC), an automated Python tool to calculate the adduct ion masses of a parent molecule. Here, adduct refers to a version of a parent molecule [M] that is charged due to addition or loss of atoms and electrons resulting in a charged ion, for example, [M + H]+. MSAC includes a database of 147 potential adducts and adduct/neutral loss combinations and their mass-to-charge ratios (m/z) as extracted from the NIST/EPA/NIH Mass Spectral Library (NIST17), Global Natural Products Social Molecular Networking Public Spectral Libraries (GNPS), and MassBank of North America (MoNA). The calculator relies on user-selected subsets of the combined database to calculate expected m/z for adducts of molecules supplied as formulas. This tool is intended to help researchers create identification libraries to collect evidence for the presence of molecules in mass spectrometry data. While the included adduct database focuses on adducts typically detected during liquid chromatography-mass spectrometry analyses, users may supply their own lists of adducts and charge states for calculating expected m/z. We also analyzed statistics on adducts from spectra contained in the three selected mass spectral libraries. MSAC is freely available at https://github.com/pnnl/MSAC.
Collapse
Affiliation(s)
- Madison R Blumer
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Christine H Chang
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | | | - Jamie R Nunez
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Sean M Colby
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Ryan S Renslow
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Thomas O Metz
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| |
Collapse
|
15
|
Celma A, Ahrens L, Gago-Ferrero P, Hernández F, López F, Lundqvist J, Pitarch E, Sancho JV, Wiberg K, Bijlsma L. The relevant role of ion mobility separation in LC-HRMS based screening strategies for contaminants of emerging concern in the aquatic environment. CHEMOSPHERE 2021; 280:130799. [PMID: 34162120 DOI: 10.1016/j.chemosphere.2021.130799] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 04/29/2021] [Accepted: 05/01/2021] [Indexed: 05/24/2023]
Abstract
Ion mobility separation (IMS) coupled to high resolution mass spectrometry (IMS-HRMS) is a promising technique for (non-)target/suspect analysis of micropollutants in complex matrices. IMS separates ionized compounds based on their charge, shape and size facilitating the removal of co-eluting isomeric/isobaric species. Additionally, IMS data can be translated into collision cross-section (CCS) values, which can be used to increase the identification reliability. However, IMS-HRMS for the screening of contaminants of emerging concern (CECs) have been scarcely explored. In this study, the role of IMS-HRMS for the identification of CECs in complex matrices is highlighted, with emphasis on when and with which purpose is of use. The utilization of IMS can result in much cleaner mass spectra, which considerably facilitates data interpretation and the obtaining of reliable identifications. Furthermore, the robustness of IMS measurements across matrices permits the use of CCS as an additional relevant parameter during the identification step even when reference standards are not available. Moreover, an effect on the number of true and false identifications could be demonstrated by including IMS restrictions within the identification workflow. Data shown in this work is of special interest for environmental researchers dealing with the detection of CECs with state-of-the-art IMS-HRMS instruments.
Collapse
Affiliation(s)
- Alberto Celma
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Castelló, E-12071, Spain
| | - Lutz Ahrens
- Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences (SLU), Box 7050, SE-750 07, Uppsala, Sweden
| | - Pablo Gago-Ferrero
- Institute of Environmental Assessment and Water Research (IDAEA) Severo Ochoa Excellence Center, Spanish Council for Scientific Research (CSIC), Jordi Girona 18-26, E-08034, Barcelona, Spain
| | - Félix Hernández
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Castelló, E-12071, Spain
| | - Francisco López
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Castelló, E-12071, Spain
| | - Johan Lundqvist
- Department of Biomedicine and Veterinary Public Health, Swedish University of Agricultural Sciences, Box 7028, SE-750 07, Uppsala, Sweden
| | - Elena Pitarch
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Castelló, E-12071, Spain
| | - Juan Vicente Sancho
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Castelló, E-12071, Spain
| | - Karin Wiberg
- Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences (SLU), Box 7050, SE-750 07, Uppsala, Sweden
| | - Lubertus Bijlsma
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Castelló, E-12071, Spain.
| |
Collapse
|
16
|
Nuñez JR, Mcgrady M, Yesiltepe Y, Renslow RS, Metz TO. Chespa: Streamlining Expansive Chemical Space Evaluation of Molecular Sets. J Chem Inf Model 2020; 60:6251-6257. [PMID: 33283505 DOI: 10.1021/acs.jcim.0c00899] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Thousands of chemical properties can be calculated for small molecules, which can be used to place the molecules within the context of a broader "chemical space." These definitions vary based on compounds of interest and the goals for the given chemical space definition. Here, we introduce a customizable Python module, chespa, built to easily assess different chemical space definitions through clustering of compounds in these spaces and visualizing trends of these clusters. To demonstrate this, chespa currently streamlines prediction of various molecular descriptors (predicted chemical properties, molecular substructures, AI-based chemical space, and chemical class ontology) in order to test six different chemical space definitions. Furthermore, we investigated how these varying definitions trend with mass spectrometry (MS)-based observability, that is, the ability of a molecule to be observed with MS (e.g., as a function of the molecule ionizability), using an example data set from the U.S. EPA's nontargeted analysis collaborative trial, where blinded samples had been analyzed previously, providing 1398 data points. Improved understanding of observability would offer many advantages in small-molecule identification, such as (i) a priori selection of experimental conditions based on suspected sample composition, (ii) the ability to reduce the number of candidate structures during compound identification by removing those less likely to ionize, and, in turn, (iii) a reduced false discovery rate and increased confidence in identifications. Factors controlling observability are not fully understood, making prediction of this property nontrivial and a prime candidate for chemical space analysis. Chespa is available at github.com/pnnl/chespa.
Collapse
Affiliation(s)
- Jamie R Nuñez
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States.,The Gene and Linda Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, Washington 99164, United States
| | - Monee Mcgrady
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Yasemin Yesiltepe
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States.,The Gene and Linda Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, Washington 99164, United States
| | - Ryan S Renslow
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States.,The Gene and Linda Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, Washington 99164, United States
| | - Thomas O Metz
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| |
Collapse
|
17
|
Celma A, Sancho JV, Schymanski EL, Fabregat-Safont D, Ibáñez M, Goshawk J, Barknowitz G, Hernández F, Bijlsma L. Improving Target and Suspect Screening High-Resolution Mass Spectrometry Workflows in Environmental Analysis by Ion Mobility Separation. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2020; 54:15120-15131. [PMID: 33207875 DOI: 10.1021/acs.est.0c05713] [Citation(s) in RCA: 76] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Currently, the most powerful approach to monitor organic micropollutants (OMPs) in environmental samples is the combination of target, suspect, and nontarget screening strategies using high-resolution mass spectrometry (HRMS). However, the high complexity of sample matrices and the huge number of OMPs potentially present in samples at low concentrations pose an analytical challenge. Ion mobility separation (IMS) combined with HRMS instruments (IMS-HRMS) introduces an additional analytical dimension, providing extra information, which facilitates the identification of OMPs. The collision cross-section (CCS) value provided by IMS is unaffected by the matrix or chromatographic separation. Consequently, the creation of CCS databases and the inclusion of ion mobility within identification criteria are of high interest for an enhanced and robust screening strategy. In this work, a CCS library for IMS-HRMS, which is online and freely available, was developed for 556 OMPs in both positive and negative ionization modes using electrospray ionization. The inclusion of ion mobility data in widely adopted confidence levels for identification in environmental reporting is discussed. Illustrative examples of OMPs found in environmental samples are presented to highlight the potential of IMS-HRMS and to demonstrate the additional value of CCS data in various screening strategies.
Collapse
Affiliation(s)
- Alberto Celma
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Avda. Sos Baynat s/n, E-12071 Castellón, Spain
| | - Juan V Sancho
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Avda. Sos Baynat s/n, E-12071 Castellón, Spain
| | - Emma L Schymanski
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 6, Avenue du Swing, L-4367 Belvaux, Luxembourg
| | - David Fabregat-Safont
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Avda. Sos Baynat s/n, E-12071 Castellón, Spain
| | - María Ibáñez
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Avda. Sos Baynat s/n, E-12071 Castellón, Spain
| | - Jeff Goshawk
- Waters Corporation, Stamford Avenue, Altrincham Road, Wilmslow, Cheshire SK9 4AX, U.K
| | - Gitte Barknowitz
- Waters Corporation, Stamford Avenue, Altrincham Road, Wilmslow, Cheshire SK9 4AX, U.K
| | - Félix Hernández
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Avda. Sos Baynat s/n, E-12071 Castellón, Spain
| | - Lubertus Bijlsma
- Environmental and Public Health Analytical Chemistry, Research Institute for Pesticides and Water, University Jaume I, Avda. Sos Baynat s/n, E-12071 Castellón, Spain
| |
Collapse
|
18
|
Couvillion SP, Agrawal N, Colby SM, Brandvold KR, Metz TO. Who Is Metabolizing What? Discovering Novel Biomolecules in the Microbiome and the Organisms Who Make Them. Front Cell Infect Microbiol 2020; 10:388. [PMID: 32850487 PMCID: PMC7410922 DOI: 10.3389/fcimb.2020.00388] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 06/25/2020] [Indexed: 12/14/2022] Open
Abstract
Even as the field of microbiome research has made huge strides in mapping microbial community composition in a variety of environments and organisms, explaining the phenotypic influences on the host by microbial taxa-both known and unknown-and their specific functions still remain major challenges. A pressing need is the ability to assign specific functions in terms of enzymes and small molecules to specific taxa or groups of taxa in the community. This knowledge will be crucial for advancing personalized therapies based on the targeted modulation of microbes or metabolites that have predictable outcomes to benefit the human host. This perspective article advocates for the combined use of standards-free metabolomics and activity-based protein profiling strategies to address this gap in functional knowledge in microbiome research via the identification of novel biomolecules and the attribution of their production to specific microbial taxa.
Collapse
Affiliation(s)
- Sneha P. Couvillion
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | - Neha Agrawal
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | - Sean M. Colby
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | - Kristoffer R. Brandvold
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
- Elson S. Floyd College of Medicine, Washington State University, Spokane, WA, United States
| | - Thomas O. Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| |
Collapse
|