1
|
Ladislavová N, Pojmanová P, Urban Š. DA_2DCHROM - a data alignment tool for applications on real GC × GC-TOF samples. Anal Bioanal Chem 2023; 415:2641-2651. [PMID: 37036485 PMCID: PMC10149467 DOI: 10.1007/s00216-023-04679-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Revised: 03/24/2023] [Accepted: 03/30/2023] [Indexed: 04/11/2023]
Abstract
Comprehensive two-dimensional gas chromatography coupled with mass spectrometry (GC × GC-MS) has great potential for analyses of complicated mixtures and sample matrices, due to its separation power and possible high resolution. The second component of the measurement results, the mass spectra, is reproducible. However, the reproducibility of two-dimensional chromatography is affected by many factors and makes the evaluation of long-term experiments or cross-laboratory collaborations complicated. This paper presents a new open-source data alignment tool to tackle the problem of retention time shifts - with 5 different algorithms implemented: BiPACE 2D, DISCO, MSort, PAM, and TNT-DA, along with Pearson's correlation and dot product as optional methods for mass spectra comparison. The implemented data alignment algorithms and their variations were tested on real samples to demonstrate the functionality of the presented tool. The suitability of each implemented algorithm for significantly/non-significantly shifted data was discussed on the basis of the results obtained. For the evaluation of the "goodness" of the alignment, Kolmogorov-Smirnov test values were calculated, and comparison graphs were generated. The DA_2DChrom is available online with its documentation, fully open-sourced, and the user can use the tool without the need of uploading their data to external third-party servers.
Collapse
Affiliation(s)
- Nikola Ladislavová
- Department of Analytical Chemistry, University of Chemistry and Technology Prague, Prague, Czech Republic.
| | - Petra Pojmanová
- Department of Analytical Chemistry, University of Chemistry and Technology Prague, Prague, Czech Republic
| | - Štěpán Urban
- Department of Analytical Chemistry, University of Chemistry and Technology Prague, Prague, Czech Republic
| |
Collapse
|
2
|
Feizi N, Hashemi-Nasab FS, Golpelichi F, Saburouh N, Parastar H. Recent trends in application of chemometric methods for GC-MS and GC×GC-MS-based metabolomic studies. Trends Analyt Chem 2021. [DOI: 10.1016/j.trac.2021.116239] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
|
3
|
Stilo F, Bicchi C, Jimenez-Carvelo AM, Cuadros-Rodriguez L, Reichenbach SE, Cordero C. Chromatographic fingerprinting by comprehensive two-dimensional chromatography: Fundamentals and tools. Trends Analyt Chem 2021. [DOI: 10.1016/j.trac.2020.116133] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
4
|
Mishra P, Biancolillo A, Roger JM, Marini F, Rutledge DN. New data preprocessing trends based on ensemble of multiple preprocessing techniques. Trends Analyt Chem 2020. [DOI: 10.1016/j.trac.2020.116045] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
|
5
|
Li Z, Kim S, Zhong S, Zhong Z, Kato I, Zhang X. Coherent Point Drift Peak Alignment Algorithms Using Distance and Similarity Measures for Two-Dimensional Gas Chromatography Mass Spectrometry Data. JOURNAL OF CHEMOMETRICS 2020; 34:e3236. [PMID: 33505107 PMCID: PMC7837599 DOI: 10.1002/cem.3236] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Accepted: 03/18/2020] [Indexed: 06/12/2023]
Abstract
The peak alignment is a vital preprocessing step before downstream analysis, such as biomarker discovery and pathway analysis, for two-dimensional gas chromatography mass spectrometry (2DGCMS)-based metabolomics data. Due to uncontrollable experimental conditions, e.g., the differences in temperature or pressure, matrix effects on samples, and stationary phase degradation, a shift of retention times among samples inevitably occurs during 2DGCMS experiments, making it difficult to align peaks. Various peak alignment algorithms have been developed to correct retention time shifts for homogeneous, heterogeneous or both type of mass spectrometry data. However, almost all existing algorithms have been focused on a local alignment and are suffering from low accuracy especially when aligning dense biological data with many peaks. We have developed four global peak alignment (GPA) algorithms using coherent point drift (CPD) point matching algorithms: retention time-based CPD-GPA (RT), prior CPD-GPA (P), mixture CPD-GPA (M), and prior mixture CPD-GPA (P+M). The method RT performs the peak alignment based only on the retention time distance, while the methods P, M, and P+M carry out the peak alignment using both the retention time distance and mass spectral similarity. The method P incorporates the mass spectral similarity through prior information and the methods M and P+M use the mixture distance measure. Four developed algorithms are applied to homogeneous and heterogeneous spiked-in data as well as two real biological data and compared with three existing algorithms, mSPA, SWPA, and BiPACE-2D. The results show that our CPD-GPA algorithms perform better than all existing algorithms in terms of F1 score.
Collapse
Affiliation(s)
- Zeyu Li
- Department of Computer Sciences, Wayne State University, Detroit, MI 48202
| | - Seongho Kim
- Biostatistics Core, Karmanos Cancer Institute, Wayne State University, Detroit, MI 48201
- Department of Oncology, School of Medicine, Wayne State University, Detroit, MI 48201
| | - Sikai Zhong
- Department of Computer Sciences, Wayne State University, Detroit, MI 48202
| | - Zichun Zhong
- Department of Computer Sciences, Wayne State University, Detroit, MI 48202
| | - Ikuko Kato
- Department of Oncology, School of Medicine, Wayne State University, Detroit, MI 48201
- Department of Pathology, Wayne State University, Detroit, MI 48201
| | - Xiang Zhang
- Department of Chemistry, University of Louisville, Louisville, KY 40209
| |
Collapse
|
6
|
Quiroz-Moreno C, Furlan MF, Belinato JR, Augusto F, Alexandrino GL, Mogollón NGS. RGCxGC toolbox: An R-package for data processing in comprehensive two-dimensional gas chromatography-mass spectrometry. Microchem J 2020. [DOI: 10.1016/j.microc.2020.104830] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
7
|
Prebihalo SE, Berrier KL, Freye CE, Bahaghighat HD, Moore NR, Pinkerton DK, Synovec RE. Multidimensional Gas Chromatography: Advances in Instrumentation, Chemometrics, and Applications. Anal Chem 2017; 90:505-532. [DOI: 10.1021/acs.analchem.7b04226] [Citation(s) in RCA: 115] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Sarah E. Prebihalo
- Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, United States
| | - Kelsey L. Berrier
- Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, United States
| | - Chris E. Freye
- Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, United States
| | - H. Daniel Bahaghighat
- Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, United States
- Department of Chemistry and Life Science, United States Military Academy, West Point, New York 10996, United States
| | - Nicholas R. Moore
- Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, United States
| | - David K. Pinkerton
- Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, United States
| | - Robert E. Synovec
- Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, United States
| |
Collapse
|
8
|
Sampat A, Lopatka M, Sjerps M, Vivo-Truyols G, Schoenmakers P, van Asten A. Forensic potential of comprehensive two-dimensional gas chromatography. Trends Analyt Chem 2016. [DOI: 10.1016/j.trac.2015.10.011] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
9
|
Misra BB, van der Hooft JJJ. Updates in metabolomics tools and resources: 2014-2015. Electrophoresis 2015; 37:86-110. [DOI: 10.1002/elps.201500417] [Citation(s) in RCA: 84] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2015] [Revised: 10/04/2015] [Accepted: 10/05/2015] [Indexed: 12/12/2022]
Affiliation(s)
- Biswapriya B. Misra
- Department of Biology, Genetics Institute; University of Florida; Gainesville FL USA
| | | |
Collapse
|
10
|
Egert B, Weinert CH, Kulling SE. A peaklet-based generic strategy for the untargeted analysis of comprehensive two-dimensional gas chromatography mass spectrometry data sets. J Chromatogr A 2015; 1405:168-77. [PMID: 26074098 DOI: 10.1016/j.chroma.2015.05.056] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2015] [Revised: 05/26/2015] [Accepted: 05/27/2015] [Indexed: 12/17/2022]
Abstract
Comprehensive two-dimensional gas chromatography mass spectrometry (GC×GC-MS) is a well-established key technology in analytical chemistry and increasingly used in the field of untargeted metabolomics. However, automated processing of large GC×GC-MS data sets is still a major bottleneck in untargeted, large-scale metabolomics. For this reason we introduce a novel peaklet-based alignment strategy. The algorithm is capable of an untargeted deterministic alignment exploiting a density based clustering procedure within a time constrained similarity matrix. Exploiting minimal (1)D and (2)D retention time shifts between peak modulations, the alignment is done without the need for peak merging which also eliminates the need for linear or nonlinear retention time correction procedures. The approach is validated in detail using data of urine samples from a large human metabolomics study. The data was acquired by a Shimadzu GCMS-QP2010 Ultra GC×GC-qMS system and consists of 512 runs, including 312 study samples and 178 quality control sample injections, measured within a time period of 22 days. The final result table consisted of 313 analytes, each of these being detectable in at least 75% of the study samples. In summary, we present an automated, reliable and fully transparent workflow for the analysis of large GC×GC-qMS metabolomics data sets.
Collapse
Affiliation(s)
- Björn Egert
- Max Rubner-Institut, Department of Safety and Quality of Fruit and Vegetables, Haid-und-Neu-Straße 9, 76131, Karlsruhe, Germany.
| | - Christoph H Weinert
- Max Rubner-Institut, Department of Safety and Quality of Fruit and Vegetables, Haid-und-Neu-Straße 9, 76131, Karlsruhe, Germany
| | - Sabine E Kulling
- Max Rubner-Institut, Department of Safety and Quality of Fruit and Vegetables, Haid-und-Neu-Straße 9, 76131, Karlsruhe, Germany
| |
Collapse
|
11
|
Kuich PHJL, Hoffmann N, Kempa S. Maui-VIA: A User-Friendly Software for Visual Identification, Alignment, Correction, and Quantification of Gas Chromatography-Mass Spectrometry Data. Front Bioeng Biotechnol 2015; 2:84. [PMID: 25654076 PMCID: PMC4301187 DOI: 10.3389/fbioe.2014.00084] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2014] [Accepted: 12/17/2014] [Indexed: 02/01/2023] Open
Abstract
A current bottleneck in GC–MS metabolomics is the processing of raw machine data into a final datamatrix that contains the quantities of identified metabolites in each sample. While there are many bioinformatics tools available to aid the initial steps of the process, their use requires both significant technical expertise and a subsequent manual validation of identifications and alignments if high data quality is desired. The manual validation is tedious and time consuming, becoming prohibitively so as sample numbers increase. We have, therefore, developed Maui-VIA, a solution based on a visual interface that allows experts and non-experts to simultaneously and quickly process, inspect, and correct large numbers of GC–MS samples. It allows for the visual inspection of identifications and alignments, facilitating a unique and, due to its visualization and keyboard shortcuts, very fast interaction with the data. Therefore, Maui-Via fills an important niche by (1) providing functionality that optimizes the component of data processing that is currently most labor intensive to save time and (2) lowering the threshold of expertise required to process GC–MS data. Maui-VIA projects are initiated with baseline-corrected raw data, peaklists, and a database of metabolite spectra and retention indices used for identification. It provides functionality for retention index calculation, a targeted library search, the visual annotation, alignment, correction interface, and metabolite quantification, as well as the export of the final datamatrix. The high quality of data produced by Maui-VIA is illustrated by its comparison to data attained manually by an expert using vendor software on a previously published dataset concerning the response of Chlamydomonas reinhardtii to salt stress. In conclusion, Maui-VIA provides the opportunity for fast, confident, and high-quality data processing validation of large numbers of GC–MS samples by non-experts.
Collapse
Affiliation(s)
- P Henning J L Kuich
- Integrative Proteomics and Metabolomics, Berlin Institute of Health , Berlin , Germany
| | - Nils Hoffmann
- Genome Informatics, Faculty of Technology, CeBiTec, Bielefeld University , Bielefeld , Germany
| | - Stefan Kempa
- Integrative Proteomics and Metabolomics, Berlin Institute of Health , Berlin , Germany ; Integrative Proteomics and Metabolomics, Berlin Institute for Medical Systems Biology/Max Delbrück Center for Molecular Medicine , Berlin , Germany
| |
Collapse
|