1
|
Momenzadeh A, Meyer JG. Single-Cell Proteomics Using Mass Spectrometry. ARXIV 2025:arXiv:2502.11982v2. [PMID: 40034135 PMCID: PMC11875278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Single-cell proteomics (SCP) is transforming our understanding of biological complexity by shifting from bulk proteomics, where signals are averaged over thousands of cells, to the proteome analysis of individual cells. This granular perspective reveals distinct cell states, population heterogeneity, and the underpinnings of disease pathogenesis that bulk approaches may obscure. However, SCP demands exceptional sensitivity, precise cell handling, and robust data processing to overcome the inherent challenges of analyzing picogram-level protein samples without amplification. Recent innovations in sample preparation, separations, data acquisition strategies, and specialized mass spectrometry instrumentation have substantially improved proteome coverage and throughput. Approaches that integrate complementary omics, streamline multi-step sample processing, and automate workflows through microfluidics and specialized platforms promise to further push SCP boundaries. Advances in computational methods, especially for data normalization and imputation, address the pervasive issue of missing values, enabling more reliable downstream biological interpretations. Despite these strides, higher throughput, reproducibility, and consensus best practices remain pressing needs in the field. This mini review summarizes the latest progress in SCP technology and software solutions, highlighting how closer integration of analytical, computational, and experimental strategies will facilitate deeper and broader coverage of single-cell proteomes.
Collapse
|
2
|
Segers A, Castiglione C, Vanderaa C, De Baere E, Martens L, Risso D, Clement L. omicsGMF: a multi-tool for dimensionality reduction, batch correction and imputation applied to bulk- and single cell proteomics data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.24.644996. [PMID: 40196514 PMCID: PMC11974731 DOI: 10.1101/2025.03.24.644996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
The unprecedented speed and sensitivity of mass spectrometry (MS) unlocked large-scale applications of proteomics and even enabled proteome profiling of single cells. However, this fast-evolving field is hindered by a lack of scalable dimensionality reduction tools that can compensate for substantial batch effects and missingness across MS runs. Therefore, we present omicsGMF, a fast, scalable, and interpretable matrix factorization method, tailored for bulk and single-cell proteomics data. Unlike current workflows that sequentially apply imputation, batch correction, and principal component analysis, omicsGMF integrates these steps into a unified framework, dramatically enhancing data processing and dimensionality reduction. Additionally, omicsGMF provides robust imputation of missing values, outperforming bespoke state-of-the-art imputation tools. We further demonstrate how this integrated approach increases statistical power to detect differentially abundant proteins in the downstream data analysis. Hence, omicsGMF is a highly scalable approach to dimensionality reduction in proteomics, that dramatically improves many important steps in proteomics data analysis.
Collapse
Affiliation(s)
- Alexandre Segers
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University. Ghent, Belgium
- Center for Medical Genetics Ghent, Ghent University and Ghent University Hospital. Ghent, Belgium
| | - Cristian Castiglione
- Bocconi Institute for Data Science and Analytics, Bocconi University. Milan, Italy
| | - Christophe Vanderaa
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University. Ghent, Belgium
| | - Elfride De Baere
- Center for Medical Genetics Ghent, Ghent University and Ghent University Hospital. Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University. Ghent, Belgium
| | - Lennart Martens
- Department of Biomolecular Medicine, Ghent University. Ghent, Belgium
- VIB-UGent Center for Medical Biotechnology, VIB. Ghent, Belgium
| | - Davide Risso
- Department of Statistical Sciences, University of Padova. Padova, Italy
| | - Lieven Clement
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University. Ghent, Belgium
| |
Collapse
|
3
|
Gong Y, Dai Y, Wu Q, Guo L, Yao X, Yang Q. Benchmark of Data Integration in Single-Cell Proteomics. Anal Chem 2025; 97:1254-1263. [PMID: 39761355 DOI: 10.1021/acs.analchem.4c04933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2025]
Abstract
Single-cell proteomics (SCP) detected based on different technologies always involves batch-specific variations because of differences in sample processing and other potential biases. How to integrate SCP data effectively has become a great challenge. Integration of SCP data not only requires the conservation of true biological variances, but also realizes the removal of unwanted batch effects. In this study, benchmarking analysis of popular data integration methods was conducted to determine the most suitable method for SCP data. To comprehensively evaluate the performance of these integration methods, a novel evaluation system was proposed for integrating SCP data. This evaluation system consists of three objective measures from different perspectives: category (a), the efficacy of correcting batch effects; category (b), the power of conserving biological variances; and category (c), the ability to identify consistent markers. For this comprehensive evaluation, five benchmark data sets under different scenarios (containing substantial proteins, substantial cells, multiple batches, multiple cell types, and unbalanced data) were utilized for selecting the most suitable data integration method. As a result, three methods, ComBat, Scanorama, and Seurat version 3 CCA, were identified as the most recommended methods for integrating SCP data. Overall, this systematic evaluation might provide valuable guidance in choosing the appropriate method for data integration in the SCP.
Collapse
Affiliation(s)
- Yaguo Gong
- School of Pharmacy, State Key Laboratory of Quality Research in Chinese Medicine, Macau University of Science and Technology, Macao 999078, China
| | - Yangbo Dai
- State Key Laboratory for Organic Electronics and Information Displays, Institute of Advanced Materials (IAM), Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Qibiao Wu
- School of Pharmacy, State Key Laboratory of Quality Research in Chinese Medicine, Macau University of Science and Technology, Macao 999078, China
| | - Li Guo
- State Key Laboratory for Organic Electronics and Information Displays, Institute of Advanced Materials (IAM), Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Xiaojun Yao
- Centre for Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China
| | - Qingxia Yang
- Zhejiang Provincial Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women's Hospital, Zhejiang University School of Medicine, Hangzhou 310058, China
| |
Collapse
|
4
|
Shopsowitz K, Lofroth J, Chan G, Kim J, Rana M, Brinkman R, Weng A, Medvedev N, Wang X. MAGIC-DR: An interpretable machine-learning guided approach for acute myeloid leukemia measurable residual disease analysis. CYTOMETRY. PART B, CLINICAL CYTOMETRY 2024; 106:239-251. [PMID: 38415807 DOI: 10.1002/cyto.b.22168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 02/08/2024] [Accepted: 02/13/2024] [Indexed: 02/29/2024]
Abstract
Multiparameter flow cytometry is widely used for acute myeloid leukemia minimal residual disease testing (AML MRD) but is time consuming and demands substantial expertise. Machine learning offers potential advancements in accuracy and efficiency, but has yet to be widely adopted for this application. To explore this, we trained single cell XGBoost classifiers from 98 diagnostic AML cell populations and 30 MRD negative samples. Performance was assessed by cross-validation. Predictions were integrated with UMAP as a heatmap parameter for an augmented/interactive AML MRD analysis framework, which was benchmarked against traditional MRD analysis for 25 test cases. The results showed that XGBoost achieved a median AUC of 0.97, effectively distinguishing diverse AML cell populations from normal cells. When integrated with UMAP, the classifiers highlighted MRD populations against the background of normal events. Our pipeline, MAGIC-DR, incorporated classifier predictions and UMAP into flow cytometry standard (FCS) files. This enabled a human-in-the-loop machine learning guided MRD workflow. Validation against conventional analysis for 25 MRD samples showed 100% concordance in myeloid blast detection, with MAGIC-DR also identifying several immature monocytic populations not readily found by conventional analysis. In conclusion, Integrating a supervised classifier with unsupervised dimension reduction offers a robust method for AML MRD analysis that can be seamlessly integrated into conventional workflows. Our approach can support and augment human analysis by highlighting abnormal populations that can be gated on for quantification and further assessment. This has the potential to speed up MRD analysis, and potentially improve detection sensitivity for certain AML immunophenotypes.
Collapse
Affiliation(s)
- Kevin Shopsowitz
- Division of Hematopathology, Vancouver General Hospital, Vancouver, British Columbia, Canada
- Department of pathology and laboratory medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Jack Lofroth
- Faculty of Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Geoffrey Chan
- Division of Hematopathology, Vancouver General Hospital, Vancouver, British Columbia, Canada
| | - Jubin Kim
- Terry Fox Lab, BC Cancer, Vancouver, British Columbia, Canada
| | - Makhan Rana
- Division of Hematopathology, Vancouver General Hospital, Vancouver, British Columbia, Canada
| | - Ryan Brinkman
- Terry Fox Lab, BC Cancer, Vancouver, British Columbia, Canada
| | - Andrew Weng
- Department of pathology and laboratory medicine, University of British Columbia, Vancouver, British Columbia, Canada
- Terry Fox Lab, BC Cancer, Vancouver, British Columbia, Canada
| | - Nadia Medvedev
- Division of Hematopathology, Vancouver General Hospital, Vancouver, British Columbia, Canada
- Department of pathology and laboratory medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Xuehai Wang
- Division of Hematopathology, Vancouver General Hospital, Vancouver, British Columbia, Canada
- Department of pathology and laboratory medicine, University of British Columbia, Vancouver, British Columbia, Canada
| |
Collapse
|
5
|
Yang J, Zhou Y, Wang T, Li N, Chao Y, Gao S, Zhang Q, Wu S, Zhao L, Dong X. A multi-omics study to monitor senescence-associated secretory phenotypes of Alzheimer's disease. Ann Clin Transl Neurol 2024; 11:1310-1324. [PMID: 38605603 DOI: 10.1002/acn3.52047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 03/04/2024] [Accepted: 03/10/2024] [Indexed: 04/13/2024] Open
Abstract
OBJECTIVE Alzheimer's disease (AD) is characterized by the progressive degeneration and damage of neurons in the brain. However, developing an accurate diagnostic assay using blood samples remains a challenge in clinic practice. The aim of this study was to explore senescence-associated secretory phenotypes (SASPs) in peripheral blood using mass spectrometry based multi-omics approach and to establish diagnostic assays for AD. METHODS This retrospective study included 88 participants, consisting of 29 AD patients and 59 cognitively normal (CN) individuals. Plasma and serum samples were examined using high-resolution mass spectrometry to identify proteomic and metabolomic profiles. Receiver operating characteristic (ROC) analysis was employed to screen biomarkers with diagnostic potential. K-nearest neighbors (KNN) algorithm was utilized to construct a multi-dimensional model for distinguishing AD from CN. RESULTS Proteomics analysis revealed upregulation of five plasma proteins in AD, including RNA helicase aquarius (AQR), zinc finger protein 587B (ZNF587B), C-reactive protein (CRP), fibronectin (FN1), and serum amyloid A-1 protein (SAA1), indicating their potential for AD classification. Interestingly, KNN-based three-dimensional model, comprising AQR, ZNF587B, and CRP, demonstrated its high accuracy in AD recognition, with evaluation possibilities of 0.941, 1.000, and 1.000 for the training, testing, and validation datasets, respectively. Besides, metabolomics analysis suggested elevated levels of serum phenylacetylglutamine (PAGIn) in AD. INTERPRETATION The multi-omics outcomes highlighted the significance of the SASPs, specifically AQR, ZNF587B, CRP, and PAGIn, in terms of their potential for diagnosing AD and suggested neuronal aging-associated pathophysiology.
Collapse
Affiliation(s)
- Jingzhi Yang
- Institute of Translational Medicine, Shanghai University, Shanghai, 200444, China
| | - Yinge Zhou
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - Tianjiao Wang
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - Na Li
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - Yufan Chao
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - Songyan Gao
- Institute of Translational Medicine, Shanghai University, Shanghai, 200444, China
| | - Qun Zhang
- Department of Internal Medicine, Shanghai Baoshan Elderly Nursing Hospital, Shanghai, 200435, China
| | - Shuo Wu
- Neurology Department, Shanghai Baoshan Luodian Hospital, Shanghai, 201908, China
| | - Liang Zhao
- Department of Pharmacy, Shanghai Baoshan Luodian Hospital, Shanghai, 201908, China
| | - Xin Dong
- Institute of Translational Medicine, Shanghai University, Shanghai, 200444, China
- School of Medicine, Shanghai University, Shanghai, 200444, China
- Suzhou Innovation Center of Shanghai University, Suzhou, 215000, Jiangsu, China
| |
Collapse
|