1
|
Harrison A, Eder JG, Lalli PM, Munoz N, Gao Y, Clendinen CS, Orton DJ, Zheng X, Williams SM, Couvillion SP, Chu RK, Balasubramanian VK, Bhattacharjee A, Anderton CR, Pomraning KR, Burnum-Johnson KE, Liu T, Kyle JE, Bilbao A. PeakQC: A Software Tool for Omics-Agnostic Automated Quality Control of Mass Spectrometry Data. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:2680-2689. [PMID: 39013167 PMCID: PMC11932329 DOI: 10.1021/jasms.4c00146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/18/2024]
Abstract
Mass spectrometry is broadly employed to study complex molecular mechanisms in various biological and environmental fields, enabling 'omics' research such as proteomics, metabolomics, and lipidomics. As study cohorts grow larger and more complex with dozens to hundreds of samples, the need for robust quality control (QC) measures through automated software tools becomes paramount to ensure the integrity, high quality, and validity of scientific conclusions from downstream analyses and minimize the waste of resources. Since existing QC tools are mostly dedicated to proteomics, automated solutions supporting metabolomics are needed. To address this need, we developed the software PeakQC, a tool for automated QC of MS data that is independent of omics molecular types (i.e., omics-agnostic). It allows automated extraction and inspection of peak metrics of precursor ions (e.g., errors in mass, retention time, arrival time) and supports various instrumentations and acquisition types, from infusion experiments or using liquid chromatography and/or ion mobility spectrometry front-end separations and with/without fragmentation spectra from data-dependent or independent acquisition analyses. Diagnostic plots for fragmentation spectra are also generated. Here, we describe and illustrate PeakQC's functionalities using different representative data sets, demonstrating its utility as a valuable tool for enhancing the quality and reliability of omics mass spectrometry analyses.
Collapse
Affiliation(s)
- Andrea Harrison
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Josie G Eder
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Priscila M Lalli
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Nathalie Munoz
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
- US Department of Energy Agile BioFoundry, Emeryville, California 94608, United States
| | - Yuqian Gao
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
- US Department of Energy Agile BioFoundry, Emeryville, California 94608, United States
| | - Chaevien S Clendinen
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Daniel J Orton
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Xueyun Zheng
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Sarah M Williams
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Sneha P Couvillion
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Rosalie K Chu
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Vimal K Balasubramanian
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Arunima Bhattacharjee
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Christopher R Anderton
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Kyle R Pomraning
- Energy Processes & Materials Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
- US Department of Energy Agile BioFoundry, Emeryville, California 94608, United States
| | - Kristin E Burnum-Johnson
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
- US Department of Energy Agile BioFoundry, Emeryville, California 94608, United States
| | - Tao Liu
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Jennifer E Kyle
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Aivett Bilbao
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
- US Department of Energy Agile BioFoundry, Emeryville, California 94608, United States
| |
Collapse
|
2
|
Ross DH, Bredeweg EL, Eder JG, Orton DJ, Burnet MC, Kyle JE, Nakayasu ES, Zheng X. A deep learning-guided automated workflow in LipidOz for detailed characterization of fungal fatty acid unsaturation by ozonolysis. JOURNAL OF MASS SPECTROMETRY : JMS 2024; 59:e5078. [PMID: 39132905 DOI: 10.1002/jms.5078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 06/10/2024] [Accepted: 07/02/2024] [Indexed: 08/13/2024]
Abstract
Understanding fungal lipid biology and metabolism is critical for antifungal target discovery as lipids play central roles in cellular processes. Nuances in lipid structural differences can significantly impact their functions, making it necessary to characterize lipids in detail to understand their roles in these complex systems. In particular, lipid double bond (DB) locations are an important component of lipid structure that can only be determined using a few specialized analytical techniques. Ozone-induced dissociation mass spectrometry (OzID-MS) is one such technique that uses ozone to break lipid DBs, producing pairs of characteristic fragments that allow the determination of DB positions. In this work, we apply OzID-MS and LipidOz software to analyze the complex lipids of Saccharomyces cerevisiae yeast strains transformed with different fatty acid desaturases from Histoplasma capsulatum to determine the specific unsaturated lipids produced. The automated data analysis in LipidOz made the determination of DB positions from this large dataset more practical, but manual verification for all targets was still time-consuming. The DL model reduces manual involvement in data analysis, but since it was trained using mammalian lipid extracts, the prediction accuracy on yeast-derived data was reduced. We addressed both shortcomings by retraining the DL model to act as a pre-filter to prioritize targets for automated analysis, providing confident manually verified results but requiring less computational time and manual effort. Our workflow resulted in the determination of detailed DB positions and enzymatic specificity.
Collapse
Affiliation(s)
- Dylan H Ross
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Erin L Bredeweg
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Josie G Eder
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Daniel J Orton
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Meagan C Burnet
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Jennifer E Kyle
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Ernesto S Nakayasu
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Xueyun Zheng
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, USA
| |
Collapse
|
3
|
Smith BJ, Guest PC, Martins-de-Souza D. Maximizing Analytical Performance in Biomolecular Discovery with LC-MS: Focus on Psychiatric Disorders. ANNUAL REVIEW OF ANALYTICAL CHEMISTRY (PALO ALTO, CALIF.) 2024; 17:25-46. [PMID: 38424029 DOI: 10.1146/annurev-anchem-061522-041154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/02/2024]
Abstract
In this review, we discuss the cutting-edge developments in mass spectrometry proteomics and metabolomics that have brought improvements for the identification of new disease-based biomarkers. A special focus is placed on psychiatric disorders, for example, schizophrenia, because they are considered to be not a single disease entity but rather a spectrum of disorders with many overlapping symptoms. This review includes descriptions of various types of commonly used mass spectrometry platforms for biomarker research, as well as complementary techniques to maximize data coverage, reduce sample heterogeneity, and work around potentially confounding factors. Finally, we summarize the different statistical methods that can be used for improving data quality to aid in reliability and interpretation of proteomics findings, as well as to enhance their translatability into clinical use and generalizability to new data sets.
Collapse
Affiliation(s)
- Bradley J Smith
- 1Laboratory of Neuroproteomics, Department of Biochemistry and Tissue Biology, Institute of Biology, University of Campinas, São Paulo, Brazil;
| | - Paul C Guest
- 1Laboratory of Neuroproteomics, Department of Biochemistry and Tissue Biology, Institute of Biology, University of Campinas, São Paulo, Brazil;
- 2Department of Psychiatry, Otto-von-Guericke-University Magdeburg, Magdeburg, Germany
- 3Laboratory of Translational Psychiatry, Otto-von-Guericke-University Magdeburg, Magdeburg, Germany
| | - Daniel Martins-de-Souza
- 1Laboratory of Neuroproteomics, Department of Biochemistry and Tissue Biology, Institute of Biology, University of Campinas, São Paulo, Brazil;
- 4Experimental Medicine Research Cluster, University of Campinas, São Paulo, Brazil
- 5National Institute of Biomarkers in Neuropsychiatry, National Council for Scientific and Technological Development, São Paulo, Brazil
- 6D'Or Institute for Research and Education, São Paulo, Brazil
- 7INCT in Modelling Human Complex Diseases with 3D Platforms (Model3D), São Paulo, Brazil
| |
Collapse
|
4
|
Ross DH, Bhotika H, Zheng X, Smith RD, Burnum-Johnson KE, Bilbao A. Computational tools and algorithms for ion mobility spectrometry-mass spectrometry. Proteomics 2024; 24:e2200436. [PMID: 38438732 PMCID: PMC11632599 DOI: 10.1002/pmic.202200436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 02/12/2024] [Accepted: 02/14/2024] [Indexed: 03/06/2024]
Abstract
Ion mobility spectrometry-mass spectrometry (IMS-MS or IM-MS) is a powerful analytical technique that combines the gas-phase separation capabilities of IM with the identification and quantification capabilities of MS. IM-MS can differentiate molecules with indistinguishable masses but different structures (e.g., isomers, isobars, molecular classes, and contaminant ions). The importance of this analytical technique is reflected by a staged increase in the number of applications for molecular characterization across a variety of fields, from different MS-based omics (proteomics, metabolomics, lipidomics, etc.) to the structural characterization of glycans, organic matter, proteins, and macromolecular complexes. With the increasing application of IM-MS there is a pressing need for effective and accessible computational tools. This article presents an overview of the most recent free and open-source software tools specifically tailored for the analysis and interpretation of data derived from IM-MS instrumentation. This review enumerates these tools and outlines their main algorithmic approaches, while highlighting representative applications across different fields. Finally, a discussion of current limitations and expectable improvements is presented.
Collapse
Affiliation(s)
- Dylan H. Ross
- Biological Sciences Division, Pacific Northwest National
Laboratory, Richland, WA 99354, USA
| | - Harsh Bhotika
- Environmental Molecular Sciences Laboratory, Pacific
Northwest National Laboratory, Richland, WA 99354, USA
| | - Xueyun Zheng
- Biological Sciences Division, Pacific Northwest National
Laboratory, Richland, WA 99354, USA
| | - Richard D. Smith
- Biological Sciences Division, Pacific Northwest National
Laboratory, Richland, WA 99354, USA
| | - Kristin E. Burnum-Johnson
- Environmental Molecular Sciences Laboratory, Pacific
Northwest National Laboratory, Richland, WA 99354, USA
| | - Aivett Bilbao
- Environmental Molecular Sciences Laboratory, Pacific
Northwest National Laboratory, Richland, WA 99354, USA
| |
Collapse
|
5
|
Ross D, Bilbao A, Lee JY, Zheng X. mzapy: An Open-Source Python Library Enabling Efficient Extraction and Processing of Ion Mobility Spectrometry-Mass Spectrometry Data in the MZA File Format. Anal Chem 2023. [PMID: 37307589 DOI: 10.1021/acs.analchem.3c01653] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Analysis of ion mobility spectrometry (IMS) data has been challenging and limited the full utility of these measurements. Unlike liquid chromatography-mass spectrometry, where a plethora of tools with well-established algorithms exist, the incorporation of the additional IMS dimension requires upgrading existing computational pipelines and developing new algorithms to fully exploit the advantages of the technology. We have recently reported MZA, a new and simple mass spectrometry data structure based on the broadly supported HDF5 format and created to facilitate software development. While this format is inherently supportive of application development, the availability of core libraries in popular programming languages with standard mass spectrometry utilities will facilitate fast software development and broader adoption of the format. To this end, we present a Python package, mzapy, for efficient extraction and processing of mass spectrometry data in the MZA format, especially for complex data containing ion mobility spectrometry dimension. In addition to raw data extraction, mzapy contains supporting utilities enabling tasks including calibration, signal processing, peak finding, and generating plots. Being implemented in pure Python and having minimal and largely standardized dependencies makes mzapy uniquely suited to application development in the multiomics domain. The mzapy package is free and open-source, includes comprehensive documentation, and is structured to support future extension to meet the evolving needs of the MS community. The software source code is freely available at https://github.com/PNNL-m-q/mzapy.
Collapse
Affiliation(s)
- Dylan Ross
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Aivett Bilbao
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Joon-Yong Lee
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Xueyun Zheng
- Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| |
Collapse
|