1
|
Cheng H, Miller D, Southwell N, Porcari P, Fischer JL, Taylor I, Salbaum JM, Kappen C, Hu F, Yang C, Keshari KR, Gross SS, D'Aurelio M, Chen Q. Untargeted pixel-by-pixel metabolite ratio imaging as a novel tool for biomedical discovery in mass spectrometry imaging. eLife 2025; 13:RP96892. [PMID: 40100251 PMCID: PMC11919253 DOI: 10.7554/elife.96892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2025] Open
Abstract
Mass spectrometry imaging (MSI) is a powerful technology used to define the spatial distribution and relative abundance of metabolites across tissue cryosections. While software packages exist for pixel-by-pixel individual metabolite and limited target pairs of ratio imaging, the research community lacks an easy computing and application tool that images any metabolite abundance ratio pairs. Importantly, recognition of correlated metabolite pairs may contribute to the discovery of unanticipated molecules in shared metabolic pathways. Here, we describe the development and implementation of an untargeted R package workflow for pixel-by-pixel ratio imaging of all metabolites detected in an MSI experiment. Considering untargeted MSI studies of murine brain and embryogenesis, we demonstrate that ratio imaging minimizes systematic data variation introduced by sample handling, markedly enhances spatial image contrast, and reveals previously unrecognized metabotype-distinct tissue regions. Furthermore, ratio imaging facilitates identification of novel regional biomarkers and provides anatomical information regarding spatial distribution of metabolite-linked biochemical pathways. The algorithm described herein is applicable to any MSI dataset containing spatial information for metabolites, peptides or proteins, offering a potent hypothesis generation tool to enhance knowledge obtained from current spatial metabolite profiling technologies.
Collapse
Affiliation(s)
- Huiyong Cheng
- Department of Pharmacology, Weill Cornell MedicineNew YorkUnited States
| | - Dawson Miller
- Department of Pharmacology, Weill Cornell MedicineNew YorkUnited States
| | - Nneka Southwell
- Brain and Mind Research Institute, Weill Cornell MedicineNew York CityUnited States
| | - Paola Porcari
- Memorial Sloan Kettering Cancer CenterNew YorkUnited States
| | | | - Isobel Taylor
- Department of Pharmacology, Weill Cornell MedicineNew YorkUnited States
| | - J Michael Salbaum
- Pennington Biomedical Research Center, Louisiana State UniversityBaton RougeUnited States
| | - Claudia Kappen
- Pennington Biomedical Research Center, Louisiana State UniversityBaton RougeUnited States
| | - Fenghua Hu
- Cornell University, Department of Molecular Biology & GeneticsIthacaUnited States
| | - Cha Yang
- Cornell University, Department of Molecular Biology & GeneticsIthacaUnited States
| | | | - Steven S Gross
- Department of Pharmacology, Weill Cornell MedicineNew YorkUnited States
| | - Marilena D'Aurelio
- Brain and Mind Research Institute, Weill Cornell MedicineNew York CityUnited States
| | - Qiuying Chen
- Department of Pharmacology, Weill Cornell MedicineNew YorkUnited States
| |
Collapse
|
2
|
Cheng H, Miller D, Southwell N, Porcari P, Fischer JL, Taylor I, Michael Salbaum J, Kappen C, Hu F, Yang C, Keshari KR, Gross SS, D'Aurelio M, Chen Q. Untargeted Pixel-by-Pixel Imaging of Metabolite Ratio Pairs as a Novel Tool for Biomedical Discovery in Mass Spectrometry Imaging. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.10.575105. [PMID: 38370710 PMCID: PMC10871215 DOI: 10.1101/2024.01.10.575105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Mass spectrometry imaging (MSI) is a powerful technology used to define the spatial distribution and relative abundance of metabolites across tissue cryosections. While software packages exist for pixel-by-pixel individual metabolite and limited target pairs of ratio imaging, the research community lacks an easy computing and application tool that images any metabolite abundance ratio pairs. Importantly, recognition of correlated metabolite pairs may contribute to the discovery of unanticipated molecules in shared metabolic pathways. Here, we describe the development and implementation of an untargeted R package workflow for pixel-by-pixel ratio imaging of all metabolites detected in an MSI experiment. Considering untargeted MSI studies of murine brain and embryogenesis, we demonstrate that ratio imaging minimizes systematic data variation introduced by sample handling, markedly enhances spatial image contrast, and reveals previously unrecognized metabotype-distinct tissue regions. Furthermore, ratio imaging facilitates identification of novel regional biomarkers and provides anatomical information regarding spatial distribution of metabolite-linked biochemical pathways. The algorithm described herein is applicable to any MSI dataset containing spatial information for metabolites, peptides or proteins, offering a potent hypothesis generation tool to enhance knowledge obtained from current spatial metabolite profiling technologies.
Collapse
|
3
|
Arbeev KG, Bagley O, Ukraintseva SV, Kulminski A, Stallard E, Schwaiger-Haber M, Patti GJ, Gu Y, Yashin AI, Province MA. Methods for joint modelling of longitudinal omics data and time-to-event outcomes: Applications to lysophosphatidylcholines in connection to aging and mortality in the Long Life Family Study. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.07.29.24311176. [PMID: 39132492 PMCID: PMC11312646 DOI: 10.1101/2024.07.29.24311176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Studying relationships between longitudinal changes in omics variables and risks of events requires specific methodologies for joint analyses of longitudinal and time-to-event outcomes. We applied two such approaches (joint models [JM], stochastic process models [SPM]) to longitudinal metabolomics data from the Long Life Family Study focusing on understudied associations of longitudinal changes in lysophosphatidylcholines (LPC) with mortality and aging-related outcomes (23 LPC species, 5,790 measurements of each in 4,011 participants, 1,431 of whom died during follow-up). JM analyses found that higher levels of the majority of LPC species were associated with lower mortality risks, with the largest effect size observed for LPC 15:0/0:0 (hazard ratio: 0.715, 95% CI (0.649, 0.788)). SPM applications to LPC 15:0/0:0 revealed how the association found in JM reflects underlying aging-related processes: decline in robustness to deviations from optimal LPC levels, better ability of males' organisms to return to equilibrium LPC levels (which are higher in females), and increasing gaps between the optimum and equilibrium levels leading to increased mortality risks with age. Our results support LPC as a biomarker of aging and related decline in robustness/resilience, and call for further exploration of factors underlying age-dynamics of LPC in relation to mortality and diseases.
Collapse
Affiliation(s)
- Konstantin G. Arbeev
- Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, North Carolina 27708, USA
| | - Olivia Bagley
- Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, North Carolina 27708, USA
| | - Svetlana V. Ukraintseva
- Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, North Carolina 27708, USA
| | - Alexander Kulminski
- Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, North Carolina 27708, USA
| | - Eric Stallard
- Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, North Carolina 27708, USA
| | - Michaela Schwaiger-Haber
- Department of Chemistry, Washington University in St. Louis, St. Louis, Missouri 63130, United States
- Department of Medicine, Washington University in St. Louis, St. Louis, Missouri 63130, United States
- Center for Metabolomics and Isotope Tracing at Washington University in St. Louis, St. Louis, Missouri 63130, United States
| | - Gary J. Patti
- Department of Chemistry, Washington University in St. Louis, St. Louis, Missouri 63130, United States
- Department of Medicine, Washington University in St. Louis, St. Louis, Missouri 63130, United States
- Center for Metabolomics and Isotope Tracing at Washington University in St. Louis, St. Louis, Missouri 63130, United States
| | - Yian Gu
- Taub Institute for Research on Alzheimer’s Disease and the Aging Brain, Vagelos College of Physicians and Surgeons, Columbia University, New York, New York 10032, USA
- G.H. Sergievsky Center, Vagelos College of Physicians and Surgeons, Columbia University, New York, New York 10032, USA
- Department of Neurology, Vagelos College of Physicians and Surgeons, Columbia University, and the New York Presbyterian Hospital, New York, New York 10032, USA
- Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, New York 10032, USA
| | - Anatoliy I. Yashin
- Biodemography of Aging Research Unit, Social Science Research Institute, Duke University, Durham, North Carolina 27708, USA
| | - Michael A. Province
- Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| |
Collapse
|
4
|
Fu Q, Qiu Y, Zhao J, Li J, Xie S, Liao Q, Fu X, Huang Y, Yao Z, Dai Z, Qiu Y, Yang Y, Li F, Chen H. Monotonic trends of soil microbiomes, metagenomic and metabolomic functioning across ecosystems along water gradients in the Altai region, northwestern China. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 912:169351. [PMID: 38123079 DOI: 10.1016/j.scitotenv.2023.169351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 11/21/2023] [Accepted: 12/11/2023] [Indexed: 12/23/2023]
Abstract
To investigate microbial communities and their contributions to carbon and nutrient cycling along water gradients can enhance our comprehension of climate change impacts on ecosystem services. Thus, we conducted an assessment of microbial communities, metagenomic functions, and metabolomic profiles within four ecosystems, i.e., desert grassland (DG), shrub-steppe (SS), forest (FO), and marsh (MA) in the Altai region of Xinjiang, China. Our results showed that soil total carbon (TC), total nitrogen, NH4+, and NO3- increased, but pH decreased with soil water gradients. Microbial abundances and richness also increased with soil moisture except the abundances of fungi and protists being lowest in MA. A shift in microbial community composition is evident along the soil moisture gradient, with Proteobacteria, Basidiomycota, and Evosea proliferating but a decline in Actinobacteria and Cercozoa. The β-diversity of microbiomes, metagenomic, and metabolomic functioning were correlated with soil moisture gradients and have significant associations with specific soil factors of TC, NH4+, and pH. Metagenomic functions associated with carbohydrate and DNA metabolisms, as well as phages, prophages, TE, plasmids functions diminished with moisture, whereas the genes involved in nitrogen and potassium metabolism, along with certain biological interactions and environmental information processing functions, demonstrated an augmentation. Additionally, MA harbored the most abundant metabolomics dominated by lipids and lipid-like molecules and organic oxygen compounds, except certain metabolites showing decline trends along water gradients, such as N'-Hydroxymethylnorcotinine and 5-Hydroxyenterolactone. Thus, our study suggests that future ecosystem succession facilitated by changes in rainfall patterns will significantly alter soil microbial taxa, functional potential, and metabolite fractions.
Collapse
Affiliation(s)
- Qi Fu
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Yingbo Qiu
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Jiayi Zhao
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Jiaxin Li
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Siqi Xie
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Qiuchang Liao
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Xianheng Fu
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Yu Huang
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Zhiyuan Yao
- School of Civil and Environmental Engineering, Ningbo University, Ningbo, Zhejiang 315211, China
| | - Zhongmin Dai
- Institute of Soil and Water Resources and Environmental Science, College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Yunpeng Qiu
- College of Resources and Environmental Sciences, Nanjing Agricultural University, Nanjing, Jiangsu 210095, China
| | - Yuchun Yang
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Furong Li
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China.
| | - Huaihai Chen
- State Key Laboratory of Biocontrol, School of Ecology, Shenzhen Campus of Sun Yat-sen University Sun Yat-sen University, Shenzhen, Guangdong 518107, China.
| |
Collapse
|
5
|
Gong Y, Ding W, Wang P, Wu Q, Yao X, Yang Q. Evaluating Machine Learning Methods of Analyzing Multiclass Metabolomics. J Chem Inf Model 2023; 63:7628-7641. [PMID: 38079572 DOI: 10.1021/acs.jcim.3c01525] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2023]
Abstract
Multiclass metabolomic studies have become popular for revealing the differences in multiple stages of complex diseases, various lifestyles, or the effects of specific treatments. In multiclass metabolomics, there are multiple data manipulation steps for analyzing raw data, which consist of data filtering, the imputation of missing values, data normalization, marker identification, sample separation, classification, and so on. In each step, several to dozens of machine learning methods can be chosen for the given data set, with potentially hundreds or thousands of method combinations in the whole data processing chain. Therefore, a clear understanding of these machine learning methods is helpful for selecting an appropriate method combination for obtaining stable and reliable analytical results of specific data. However, there has rarely been an overall introduction or evaluation of these methods based on multiclass metabolomic data. Herein, detailed descriptions of these machine learning methods in multiple data manipulation steps are reviewed. Moreover, an assessment of these methods was performed using a benchmark data set for multiclass metabolomics. First, 12 imputation methods for imputing missing values were evaluated based on the PSS (Procrustes statistical shape analysis) and NRMSE (normalized root-mean-square error) values. Second, 17 normalization methods for processing multiclass metabolomic data were evaluated by applying the PMAD (pooled median absolute deviation) value. Third, different methods of identifying markers of multiclass metabolomics were evaluated based on the CWrel (relative weighted consistency) value. Fourth, nine classification methods for constructing multiclass models were assessed using the AUC (area under the curve) value. Performance evaluations of machine learning methods are highly recommended to select the most appropriate method combination before performing the final analysis of the given data. Overall, detailed descriptions and evaluation of various machine learning methods are expected to improve analyses of multiclass metabolomic data.
Collapse
Affiliation(s)
- Yaguo Gong
- State Key Laboratory of Quality Research in Chinese Medicine, School of Pharmacy, Macau University of Science and Technology, Macau 999078, China
| | - Wei Ding
- State Key Laboratory of Quality Research in Chinese Medicine, School of Pharmacy, Macau University of Science and Technology, Macau 999078, China
| | - Panpan Wang
- College of Chemistry and Pharmaceutical Engineering, Huanghuai University, Zhumadian 463000, China
| | - Qibiao Wu
- State Key Laboratory of Quality Research in Chinese Medicine, School of Pharmacy, Macau University of Science and Technology, Macau 999078, China
| | - Xiaojun Yao
- Centre for Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China
| | - Qingxia Yang
- Zhejiang Provincial Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women's Hospital, Zhejiang University School of Medicine, Hangzhou 310058, China
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| |
Collapse
|
6
|
Sundararaman N, Bhat A, Venkatraman V, Binek A, Dwight Z, Ariyasinghe NR, Escopete S, Joung SY, Cheng S, Parker SJ, Fert-Bober J, Van Eyk JE. BIRCH: An Automated Workflow for Evaluation, Correction, and Visualization of Batch Effect in Bottom-Up Mass Spectrometry-Based Proteomics Data. J Proteome Res 2023; 22:471-481. [PMID: 36695565 DOI: 10.1021/acs.jproteome.2c00671] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Recent surges in large-scale mass spectrometry (MS)-based proteomics studies demand a concurrent rise in methods to facilitate reliable and reproducible data analysis. Quantification of proteins in MS analysis can be affected by variations in technical factors such as sample preparation and data acquisition conditions leading to batch effects, which adds to noise in the data set. This may in turn affect the effectiveness of any biological conclusions derived from the data. Here we present Batch-effect Identification, Representation, and Correction of Heterogeneous data (BIRCH), a workflow for analysis and correction of batch effect through an automated, versatile, and easy to use web-based tool with the goal of eliminating technical variation. BIRCH also supports diagnosis of the data to check for the presence of batch effects, feasibility of batch correction, and imputation to deal with missing values in the data set. To illustrate the relevance of the tool, we explore two case studies, including an iPSC-derived cell study and a Covid vaccine study to show different context-specific use cases. Ultimately this tool can be used as an extremely powerful approach for eliminating technical bias while retaining biological bias, toward understanding disease mechanisms and potential therapeutics.
Collapse
Affiliation(s)
- Niveda Sundararaman
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States.,Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Archana Bhat
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States.,Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Vidya Venkatraman
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States.,Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Aleksandra Binek
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States.,Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Zachary Dwight
- Precision Biomarker Laboratories, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Nethika R Ariyasinghe
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States.,Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Sean Escopete
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States.,Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Sandy Y Joung
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Susan Cheng
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Sarah J Parker
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States.,Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Justyna Fert-Bober
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States.,Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Jennifer E Van Eyk
- Smidt Heart Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States.,Advanced Clinical Biosystems Research Institute, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| |
Collapse
|
7
|
Kong W, Hui HWH, Peng H, Goh WWB. Dealing with missing values in proteomics data. Proteomics 2022; 22:e2200092. [PMID: 36349819 DOI: 10.1002/pmic.202200092] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 09/15/2022] [Accepted: 10/11/2022] [Indexed: 11/10/2022]
Abstract
Proteomics data are often plagued with missingness issues. These missing values (MVs) threaten the integrity of subsequent statistical analyses by reduction of statistical power, introduction of bias, and failure to represent the true sample. Over the years, several categories of missing value imputation (MVI) methods have been developed and adapted for proteomics data. These MVI methods perform their tasks based on different prior assumptions (e.g., data is normally or independently distributed) and operating principles (e.g., the algorithm is built to address random missingness only), resulting in varying levels of performance even when dealing with the same dataset. Thus, to achieve a satisfactory outcome, a suitable MVI method must be selected. To guide decision making on suitable MVI method, we provide a decision chart which facilitates strategic considerations on datasets presenting different characteristics. We also bring attention to other issues that can impact proper MVI such as the presence of confounders (e.g., batch effects) which can influence MVI performance. Thus, these too, should be considered during or before MVI.
Collapse
Affiliation(s)
- Weijia Kong
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Harvard Wai Hann Hui
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Hui Peng
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Wilson Wen Bin Goh
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.,School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.,Centre for Biomedical Informatics, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
8
|
Ampong I, Zimmerman KD, Nathanielsz PW, Cox LA, Olivier M. Optimization of Imputation Strategies for High-Resolution Gas Chromatography-Mass Spectrometry (HR GC-MS) Metabolomics Data. Metabolites 2022; 12:429. [PMID: 35629933 PMCID: PMC9144635 DOI: 10.3390/metabo12050429] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 05/07/2022] [Accepted: 05/09/2022] [Indexed: 12/17/2022] Open
Abstract
Gas chromatography-coupled mass spectrometry (GC-MS) has been used in biomedical research to analyze volatile, non-polar, and polar metabolites in a wide array of sample types. Despite advances in technology, missing values are still common in metabolomics datasets and must be properly handled. We evaluated the performance of ten commonly used missing value imputation methods with metabolites analyzed on an HR GC-MS instrument. By introducing missing values into the complete (i.e., data without any missing values) National Institute of Standards and Technology (NIST) plasma dataset, we demonstrate that random forest (RF), glmnet ridge regression (GRR), and Bayesian principal component analysis (BPCA) shared the lowest root mean squared error (RMSE) in technical replicate data. Further examination of these three methods in data from baboon plasma and liver samples demonstrated they all maintained high accuracy. Overall, our analysis suggests that any of the three imputation methods can be applied effectively to untargeted metabolomics datasets with high accuracy. However, it is important to note that imputation will alter the correlation structure of the dataset and bias downstream regression coefficients and p-values.
Collapse
Affiliation(s)
- Isaac Ampong
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest University, Winston-Salem, NC 27157, USA; (I.A.); (K.D.Z.); (L.A.C.)
| | - Kip D. Zimmerman
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest University, Winston-Salem, NC 27157, USA; (I.A.); (K.D.Z.); (L.A.C.)
| | - Peter W. Nathanielsz
- Center for the Study of Fetal Programming, University of Wyoming, Laramie, WY 82071, USA;
- Southwest National Primate Research Center, San Antonio, TX 78227, USA
| | - Laura A. Cox
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest University, Winston-Salem, NC 27157, USA; (I.A.); (K.D.Z.); (L.A.C.)
- Southwest National Primate Research Center, San Antonio, TX 78227, USA
| | - Michael Olivier
- Center for Precision Medicine, Department of Internal Medicine, Section on Molecular Medicine, Wake Forest University, Winston-Salem, NC 27157, USA; (I.A.); (K.D.Z.); (L.A.C.)
| |
Collapse
|
9
|
Ye M, Lin Y, Pan S, Wang ZW, Zhu X. Applications of Multi-omics Approaches for Exploring the Molecular Mechanism of Ovarian Carcinogenesis. Front Oncol 2021; 11:745808. [PMID: 34631583 PMCID: PMC8497990 DOI: 10.3389/fonc.2021.745808] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Accepted: 09/08/2021] [Indexed: 12/29/2022] Open
Abstract
Ovarian cancer ranks as the fifth most common cause of cancer-related death in females. The molecular mechanisms of ovarian carcinogenesis need to be explored in order to identify effective clinical therapies for ovarian cancer. Recently, multi-omics approaches have been applied to determine the mechanisms of ovarian oncogenesis at genomics (DNA), transcriptomics (RNA), proteomics (proteins), and metabolomics (metabolites) levels. Multi-omics approaches can identify some diagnostic and prognostic biomarkers and therapeutic targets for ovarian cancer, and these molecular signatures are beneficial for clarifying the development and progression of ovarian cancer. Moreover, the discovery of molecular signatures and targeted therapy strategies could noticeably improve the prognosis of ovarian cancer patients.
Collapse
Affiliation(s)
| | | | | | - Zhi-wei Wang
- Center of Uterine Cancer Diagnosis & Therapy Research of Zhejiang Province, Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| | - Xueqiong Zhu
- Center of Uterine Cancer Diagnosis & Therapy Research of Zhejiang Province, Department of Obstetrics and Gynecology, the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
| |
Collapse
|
10
|
Chaby LE, Lasseter HC, Contrepois K, Salek RM, Turck CW, Thompson A, Vaughan T, Haas M, Jeromin A. Cross-Platform Evaluation of Commercially Targeted and Untargeted Metabolomics Approaches to Optimize the Investigation of Psychiatric Disease. Metabolites 2021; 11:609. [PMID: 34564425 PMCID: PMC8466258 DOI: 10.3390/metabo11090609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 08/24/2021] [Accepted: 08/26/2021] [Indexed: 11/17/2022] Open
Abstract
Metabolomics methods often encounter trade-offs between quantification accuracy and coverage, with truly comprehensive coverage only attainable through a multitude of complementary assays. Due to the lack of standardization and the variety of metabolomics assays, it is difficult to integrate datasets across studies or assays. To inform metabolomics platform selection, with a focus on posttraumatic stress disorder (PTSD), we review platform use and sample sizes in psychiatric metabolomics studies and then evaluate five prominent metabolomics platforms for coverage and performance, including intra-/inter-assay precision, accuracy, and linearity. We found performance was variable between metabolite classes, but comparable across targeted and untargeted approaches. Within all platforms, precision and accuracy were highly variable across classes, ranging from 0.9-63.2% (coefficient of variation) and 0.6-99.1% for accuracy to reference plasma. Several classes had high inter-assay variance, potentially impeding dissociation of a biological signal, including glycerophospholipids, organooxygen compounds, and fatty acids. Coverage was platform-specific and ranged from 16-70% of PTSD-associated metabolites. Non-overlapping coverage is challenging; however, benefits of applying multiple metabolomics technologies must be weighed against cost, biospecimen availability, platform-specific normative levels, and challenges in merging datasets. Our findings and open-access cross-platform dataset can inform platform selection and dataset integration based on platform-specific coverage breadth/overlap and metabolite-specific performance.
Collapse
Affiliation(s)
- Lauren E. Chaby
- Cohen Veterans Bioscience, New York, NY 10018, USA; (L.E.C.); (H.C.L.); (A.T.); (T.V.); (M.H.)
| | - Heather C. Lasseter
- Cohen Veterans Bioscience, New York, NY 10018, USA; (L.E.C.); (H.C.L.); (A.T.); (T.V.); (M.H.)
| | - Kévin Contrepois
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA;
| | - Reza M. Salek
- International Agency for Research on Cancer, Nutrition and Metabolism Branch, World Health Organisation, 150 Cours Albert Thomas, CEDEX 08, 69372 Lyon, France;
| | - Christoph W. Turck
- Max Planck Institute of Psychiatry, Proteomics and Biomarkers, 80804 Munich, Germany;
| | - Andrew Thompson
- Cohen Veterans Bioscience, New York, NY 10018, USA; (L.E.C.); (H.C.L.); (A.T.); (T.V.); (M.H.)
| | - Timothy Vaughan
- Cohen Veterans Bioscience, New York, NY 10018, USA; (L.E.C.); (H.C.L.); (A.T.); (T.V.); (M.H.)
| | - Magali Haas
- Cohen Veterans Bioscience, New York, NY 10018, USA; (L.E.C.); (H.C.L.); (A.T.); (T.V.); (M.H.)
| | - Andreas Jeromin
- Cohen Veterans Bioscience, New York, NY 10018, USA; (L.E.C.); (H.C.L.); (A.T.); (T.V.); (M.H.)
| |
Collapse
|
11
|
Hasan MK, Alam MA, Roy S, Dutta A, Jawad MT, Das S. Missing value imputation affects the performance of machine learning: A review and analysis of the literature (2010–2021). INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100799] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
|
12
|
|
13
|
Li Q, Fisher K, Meng W, Fang B, Welsh E, Haura EB, Koomen JM, Eschrich SA, Fridley BL, Chen YA. GMSimpute: a generalized two-step Lasso approach to impute missing values in label-free mass spectrum analysis. Bioinformatics 2020; 36:257-263. [PMID: 31199438 PMCID: PMC6956786 DOI: 10.1093/bioinformatics/btz488] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 05/06/2019] [Accepted: 06/10/2019] [Indexed: 12/16/2022] Open
Abstract
Motivation Missingness in label-free mass spectrometry is inherent to the technology. A computational approach to recover missing values in metabolomics and proteomics datasets is important. Most existing methods are designed under a particular assumption, either missing at random or under the detection limit. If the missing pattern deviates from the assumption, it may lead to biased results. Hence, we investigate the missing patterns in free mass spectrometry data and develop an omnibus approach GMSimpute, to allow effective imputation accommodating different missing patterns. Results Three proteomics datasets and one metabolomics dataset indicate missing values could be a mixture of abundance-dependent and abundance-independent missingness. We assess the performance of GMSimpute using simulated data (with a wide range of 80 missing patterns) and metabolomics data from the Cancer Genome Atlas breast cancer and clear cell renal cell carcinoma studies. Using Pearson correlation and normalized root mean square errors between the true and imputed abundance, we compare its performance to K-nearest neighbors’ type approaches, Random Forest, GSimp, a model-based method implemented in DanteR and minimum values. The results indicate GMSimpute provides higher accuracy in imputation and exhibits stable performance across different missing patterns. In addition, GMSimpute is able to identify the features in downstream differential expression analysis with high accuracy when applied to the Cancer Genome Atlas datasets. Availability and implementation GMSimpute is on CRAN: https://cran.r-project.org/web/packages/GMSimpute/index.html. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Qian Li
- Health Informatics Institute, University of South Florida, Tampa, FL, USA
| | - Kate Fisher
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA.,Department of Biostatistics, IDDI Inc., Raleigh, NC, USA
| | - Wenjun Meng
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA
| | - Bin Fang
- Proteomics and Metabolomics Core Facility, Moffitt Cancer Center, Tampa, FL, USA
| | - Eric Welsh
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA
| | - Eric B Haura
- Department of Thoracic Oncology, Moffitt Cancer Center, Tampa, FL, USA
| | - John M Koomen
- Department of Molecular Oncology, Moffitt Cancer Center, Tampa, FL, USA
| | - Steven A Eschrich
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA
| | - Brooke L Fridley
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA
| | - Y Ann Chen
- Department of Biostatistics and Bioinformatics, Moffitt Cancer Center, Tampa, FL, USA
| |
Collapse
|
14
|
Long NP, Nghi TD, Kang YP, Anh NH, Kim HM, Park SK, Kwon SW. Toward a Standardized Strategy of Clinical Metabolomics for the Advancement of Precision Medicine. Metabolites 2020; 10:E51. [PMID: 32013105 PMCID: PMC7074059 DOI: 10.3390/metabo10020051] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Revised: 01/17/2020] [Accepted: 01/21/2020] [Indexed: 12/18/2022] Open
Abstract
Despite the tremendous success, pitfalls have been observed in every step of a clinical metabolomics workflow, which impedes the internal validity of the study. Furthermore, the demand for logistics, instrumentations, and computational resources for metabolic phenotyping studies has far exceeded our expectations. In this conceptual review, we will cover inclusive barriers of a metabolomics-based clinical study and suggest potential solutions in the hope of enhancing study robustness, usability, and transferability. The importance of quality assurance and quality control procedures is discussed, followed by a practical rule containing five phases, including two additional "pre-pre-" and "post-post-" analytical steps. Besides, we will elucidate the potential involvement of machine learning and demonstrate that the need for automated data mining algorithms to improve the quality of future research is undeniable. Consequently, we propose a comprehensive metabolomics framework, along with an appropriate checklist refined from current guidelines and our previously published assessment, in the attempt to accurately translate achievements in metabolomics into clinical and epidemiological research. Furthermore, the integration of multifaceted multi-omics approaches with metabolomics as the pillar member is in urgent need. When combining with other social or nutritional factors, we can gather complete omics profiles for a particular disease. Our discussion reflects the current obstacles and potential solutions toward the progressing trend of utilizing metabolomics in clinical research to create the next-generation healthcare system.
Collapse
Affiliation(s)
- Nguyen Phuoc Long
- College of Pharmacy, Seoul National University, Seoul 08826, Korea; (N.P.L.); (N.H.A.); (H.M.K.)
| | - Tran Diem Nghi
- Department of Life Sciences, Pohang University of Science and Technology, Pohang 790-784, Korea; (T.D.N.); (S.K.P.)
| | - Yun Pyo Kang
- Department of Cancer Physiology, Moffitt Cancer Center and Research Institute, Tampa, FL 33612, USA;
| | - Nguyen Hoang Anh
- College of Pharmacy, Seoul National University, Seoul 08826, Korea; (N.P.L.); (N.H.A.); (H.M.K.)
| | - Hyung Min Kim
- College of Pharmacy, Seoul National University, Seoul 08826, Korea; (N.P.L.); (N.H.A.); (H.M.K.)
| | - Sang Ki Park
- Department of Life Sciences, Pohang University of Science and Technology, Pohang 790-784, Korea; (T.D.N.); (S.K.P.)
| | - Sung Won Kwon
- College of Pharmacy, Seoul National University, Seoul 08826, Korea; (N.P.L.); (N.H.A.); (H.M.K.)
| |
Collapse
|
15
|
Chiou SH, Betensky RA, Balasubramanian R. The missing indicator approach for censored covariates subject to limit of detection in logistic regression models. Ann Epidemiol 2019; 38:57-64. [PMID: 31604610 PMCID: PMC6812630 DOI: 10.1016/j.annepidem.2019.07.014] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Revised: 07/12/2019] [Accepted: 07/24/2019] [Indexed: 12/14/2022]
Abstract
PURPOSE In several biomedical studies, one or more exposures of interest may be subject to nonrandom missingness because of the failure of the measurement assay at levels below its limit of detection. This issue is commonly encountered in studies of the metabolome using tandem mass spectrometry-based technologies. Owing to a large number of metabolites measured in these studies, preserving statistical power is of utmost interest. In this article, we evaluate the small sample properties of the missing indicator approach in logistic and conditional logistic regression models. METHODS For nested case-control or matched case control study designs, we evaluate the bias, power, and type I error associated with the missing indicator method using simulation. We compare the missing indicator approach to complete case analysis and several imputation approaches. RESULTS We show that under a variety of settings, the missing indicator approach outperforms complete case analysis and other imputation approaches with regard to bias, mean squared error, and power. CONCLUSIONS For nested case-control and matched study designs of modest sample sizes, the missing indicator model minimizes loss of information and thus provides an attractive alternative to the oft-used complete case analysis and other imputation approaches.
Collapse
Affiliation(s)
- Sy Han Chiou
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA
| | - Rebecca A Betensky
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA
| | - Raji Balasubramanian
- Department of Biostatistics and Epidemiology, University of Massachusetts - Amherst, Amherst, MA.
| |
Collapse
|
16
|
Anjos S, Feiteira E, Cerveira F, Melo T, Reboredo A, Colombo S, Dantas R, Costa E, Moreira A, Santos S, Campos A, Ferreira R, Domingues P, Domingues MRM. Lipidomics Reveals Similar Changes in Serum Phospholipid Signatures of Overweight and Obese Pediatric Subjects. J Proteome Res 2019; 18:3174-3183. [DOI: 10.1021/acs.jproteome.9b00249] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Sara Anjos
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Eva Feiteira
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | | | - Tânia Melo
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
- Department of Chemistry and CESAM and ECOMARE, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Andrea Reboredo
- Clinical Pathology, Centro Hospitalar do Baixo Vouga, Aveiro, Portugal
| | - Simone Colombo
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Rosa Dantas
- Endocrinology, Diabetes and Nutrition, Centro Hospitalar do Baixo Vouga, Aveiro, Portugal
| | - Elisabete Costa
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Ana Moreira
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Sónia Santos
- Department of Chemistry and CICECO, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Ana Campos
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Rita Ferreira
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - Pedro Domingues
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| | - M. Rosário M. Domingues
- Mass Spectrometry Centre, Department of Chemistry and QOPNA, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
- Department of Chemistry and CESAM and ECOMARE, University of Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal
| |
Collapse
|
17
|
Piasecka A, Kachlicki P, Stobiecki M. Analytical Methods for Detection of Plant Metabolomes Changes in Response to Biotic and Abiotic Stresses. Int J Mol Sci 2019; 20:E379. [PMID: 30658398 PMCID: PMC6358739 DOI: 10.3390/ijms20020379] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2018] [Revised: 01/08/2019] [Accepted: 01/15/2019] [Indexed: 11/17/2022] Open
Abstract
Abiotic and biotic stresses are the main reasons of substantial crop yield losses worldwide. Research devoted to reveal mechanisms of plant reactions during their interactions with the environment are conducted on the level of genome, transcriptome, proteome, and metabolome. Data obtained during these studies would permit to define biochemical and physiological mechanisms of plant resistance or susceptibility to affecting factors/stresses. Metabolomics based on mass spectrometric techniques is an important part of research conducted in the direction of breeding new varieties of crop plants tolerant to the affecting stresses and possessing good agronomical features. Studies of this kind are carried out on model, crop and resurrection plants. Metabolites profiling yields large sets of data and due to this fact numerous advanced statistical and bioinformatic methods permitting to obtain qualitative and quantitative evaluation of the results have been developed. Moreover, advanced integration of metabolomics data with these obtained on other omics levels: genome, transcriptome and proteome should be carried out. Such a holistic approach would bring us closer to understanding biochemical and physiological processes of the cell and whole plant interacting with the environment and further apply these observations in successful breeding of stress tolerant or resistant crop plants.
Collapse
Affiliation(s)
- Anna Piasecka
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznań, Poland.
- Institute of Plant Genetics, Polish Academy of Sciences, Strzeszynska 34, 60-479 Poznań, Poland.
| | - Piotr Kachlicki
- Institute of Plant Genetics, Polish Academy of Sciences, Strzeszynska 34, 60-479 Poznań, Poland.
| | - Maciej Stobiecki
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznań, Poland.
| |
Collapse
|
18
|
Impact of Blood Collection Tubes and Sample Handling Time on Serum and Plasma Metabolome and Lipidome. Metabolites 2018; 8:metabo8040088. [PMID: 30518126 PMCID: PMC6316012 DOI: 10.3390/metabo8040088] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2018] [Revised: 11/26/2018] [Accepted: 11/29/2018] [Indexed: 01/06/2023] Open
Abstract
Background: Metabolomics is emerging as a valuable tool in clinical science. However, one major challenge in clinical metabolomics is the limited use of standardized guidelines for sample collection and handling. In this study, we conducted a pilot analysis of serum and plasma to determine the effects of processing time and collection tube on the metabolome. Methods: Blood was collected in 3 tubes: Vacutainer serum separator tube (SST, serum), EDTA (plasma) and P100 (plasma) and stored at 4 degrees for 0, 0.5, 1, 2, 4 and 24 h prior to centrifugation. Compounds were extracted using liquid-liquid extraction to obtain a hydrophilic and a hydrophobic fraction and analyzed using liquid chromatography mass spectrometry. Differences among the blood collection tubes and sample processing time were evaluated (ANOVA, Bonferroni FWER ≤ 0.05 and ANOVA, Benjamini Hochberg FDR ≤ 0.1, respectively). Results: Among the serum and plasma tubes 93.5% of compounds overlapped, 382 compounds were unique to serum and one compound was unique to plasma. There were 46, 50 and 86 compounds affected by processing time in SST, EDTA and P100 tubes, respectively, including many lipids. In contrast, 496 hydrophilic and 242 hydrophobic compounds differed by collection tube. Forty-five different chemical classes including alcohols, sugars, amino acids and prenol lipids were affected by the choice of blood collection tube. Conclusion: Our results suggest that the choice of blood collection tube has a significant effect on detected metabolites and their overall abundances. Perhaps surprisingly, variation in sample processing time has less of an effect compared to collection tube; however, a larger sample size is needed to confirm this.
Collapse
|