1
|
Khan S, Conover R, Asthagiri AR, Slavov N. Dynamics of Single-Cell Protein Covariation during Epithelial-Mesenchymal Transition. J Proteome Res 2024. [PMID: 38663020 DOI: 10.1021/acs.jproteome.4c00277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2024]
Abstract
Physiological processes, such as the epithelial-mesenchymal transition (EMT), are mediated by changes in protein interactions. These changes may be better reflected in protein covariation within a cellular cluster than in the temporal dynamics of cluster-average protein abundance. To explore this possibility, we quantified proteins in single human cells undergoing EMT. Covariation analysis of the data revealed that functionally coherent protein clusters dynamically changed their protein-protein correlations without concomitant changes in the cluster-average protein abundance. These dynamics of protein-protein correlations were monotonic in time and delineated protein modules functioning in actin cytoskeleton organization, energy metabolism, and protein transport. These protein modules are defined by protein covariation within the same time point and cluster and, thus, reflect biological regulation masked by the cluster-average protein dynamics. Thus, protein correlation dynamics across single cells offers a window into protein regulation during physiological transitions.
Collapse
Affiliation(s)
- Saad Khan
- Department of Bioengineering, Northeastern University, Boston, Massachusetts 02115, United States
- Department of Biology, Northeastern University, Boston, Massachusetts 02115, United States
| | - Rachel Conover
- Department of Bioengineering, Northeastern University, Boston, Massachusetts 02115, United States
| | - Anand R Asthagiri
- Department of Bioengineering, Northeastern University, Boston, Massachusetts 02115, United States
- Department of Biology, Northeastern University, Boston, Massachusetts 02115, United States
- Department of Chemical Engineering, Northeastern University, Boston, Massachusetts 02115, United States
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, Massachusetts 02115, United States
- Department of Biology, Northeastern University, Boston, Massachusetts 02115, United States
- Parallel Squared Technology Institute, Watertown, Massachusetts 02472, United States
| |
Collapse
|
2
|
Younus S, Rönnstrand L, Kazi JU. Xputer: bridging data gaps with NMF, XGBoost, and a streamlined GUI experience. Front Artif Intell 2024; 7:1345179. [PMID: 38720912 PMCID: PMC11076752 DOI: 10.3389/frai.2024.1345179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 04/08/2024] [Indexed: 05/12/2024] Open
Abstract
The rapid proliferation of data across diverse fields has accentuated the importance of accurate imputation for missing values. This task is crucial for ensuring data integrity and deriving meaningful insights. In response to this challenge, we present Xputer, a novel imputation tool that adeptly integrates Non-negative Matrix Factorization (NMF) with the predictive strengths of XGBoost. One of Xputer's standout features is its versatility: it supports zero imputation, enables hyperparameter optimization through Optuna, and allows users to define the number of iterations. For enhanced user experience and accessibility, we have equipped Xputer with an intuitive Graphical User Interface (GUI) ensuring ease of handling, even for those less familiar with computational tools. In performance benchmarks, Xputer often outperforms IterativeImputer in terms of imputation accuracy. Furthermore, Xputer autonomously handles a diverse spectrum of data types, including categorical, continuous, and Boolean, eliminating the need for prior preprocessing. Given its blend of performance, flexibility, and user-friendly design, Xputer emerges as a state-of-the-art solution in the realm of data imputation.
Collapse
Affiliation(s)
- Saleena Younus
- Division of Translational Cancer Research, Department of Laboratory Medicine, Lund University, Lund, Sweden
- Lund Stem Cell Center, Lund University, Lund, Sweden
- Lund University Cancer Centre (LUCC), Lund University, Lund, Sweden
| | - Lars Rönnstrand
- Division of Translational Cancer Research, Department of Laboratory Medicine, Lund University, Lund, Sweden
- Lund Stem Cell Center, Lund University, Lund, Sweden
- Lund University Cancer Centre (LUCC), Lund University, Lund, Sweden
- Department of Hematology, Oncology and Radiation Physics, Skåne University Hospital, Lund, Sweden
| | - Julhash U. Kazi
- Division of Translational Cancer Research, Department of Laboratory Medicine, Lund University, Lund, Sweden
- Lund Stem Cell Center, Lund University, Lund, Sweden
- Lund University Cancer Centre (LUCC), Lund University, Lund, Sweden
| |
Collapse
|
3
|
Ziegler AR, Dufour A, Scott NE, Edgington-Mitchell LE. Ion Mobility-Based Enrichment-Free N-Terminomics Analysis Reveals Novel Legumain Substrates in Murine Spleen. Mol Cell Proteomics 2024; 23:100714. [PMID: 38199506 PMCID: PMC10862022 DOI: 10.1016/j.mcpro.2024.100714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 12/19/2023] [Accepted: 01/02/2024] [Indexed: 01/12/2024] Open
Abstract
Aberrant levels of the asparaginyl endopeptidase legumain have been linked to inflammation, neurodegeneration, and cancer, yet our understanding of this protease is incomplete. Systematic attempts to identify legumain substrates have been previously confined to in vitro studies, which fail to mirror physiological conditions and obscure biologically relevant cleavage events. Using high-field asymmetric waveform ion mobility spectrometry (FAIMS), we developed a streamlined approach for proteome and N-terminome analyses without the need for N-termini enrichment. Compared to unfractionated proteomic analysis, we demonstrate FAIMS fractionation improves N-termini identification by >2.5 fold, resulting in the identification of >2882 unique N-termini from limited sample amounts. In murine spleens, this approach identifies 6366 proteins and 2528 unique N-termini, with 235 cleavage events enriched in WT compared to legumain-deficient spleens. Among these, 119 neo-N-termini arose from asparaginyl endopeptidase activities, representing novel putative physiological legumain substrates. The direct cleavage of selected substrates by legumain was confirmed using in vitro assays, providing support for the existence of physiologically relevant extra-lysosomal legumain activity. Combined, these data shed critical light on the functions of legumain and demonstrate the utility of FAIMS as an accessible method to improve depth and quality of N-terminomics studies.
Collapse
Affiliation(s)
- Alexander R Ziegler
- Department of Biochemistry and Pharmacology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, Victoria, Australia
| | - Antoine Dufour
- Department of Physiology and Pharmacology, University of Calgary, Calgary, Alberta, Canada; McCaig Institute for Bone and Joint Health, University of Calgary, Calgary, Alberta, Canada
| | - Nichollas E Scott
- Department of Microbiology and Immunology, Peter Doherty Institute, The University of Melbourne, Parkville, Victoria, Australia.
| | - Laura E Edgington-Mitchell
- Department of Biochemistry and Pharmacology, Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, Victoria, Australia.
| |
Collapse
|
4
|
Wang F, Liu C, Li J, Yang F, Song J, Zang T, Yao J, Wang G. SPDB: a comprehensive resource and knowledgebase for proteomic data at the single-cell resolution. Nucleic Acids Res 2024; 52:D562-D571. [PMID: 37953313 PMCID: PMC10767837 DOI: 10.1093/nar/gkad1018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 09/28/2023] [Accepted: 10/23/2023] [Indexed: 11/14/2023] Open
Abstract
The single-cell proteomics enables the direct quantification of protein abundance at the single-cell resolution, providing valuable insights into cellular phenotypes beyond what can be inferred from transcriptome analysis alone. However, insufficient large-scale integrated databases hinder researchers from accessing and exploring single-cell proteomics, impeding the advancement of this field. To fill this deficiency, we present a comprehensive database, namely Single-cell Proteomic DataBase (SPDB, https://scproteomicsdb.com/), for general single-cell proteomic data, including antibody-based or mass spectrometry-based single-cell proteomics. Equipped with standardized data process and a user-friendly web interface, SPDB provides unified data formats for convenient interaction with downstream analysis, and offers not only dataset-level but also protein-level data search and exploration capabilities. To enable detailed exhibition of single-cell proteomic data, SPDB also provides a module for visualizing data from the perspectives of cell metadata or protein features. The current version of SPDB encompasses 133 antibody-based single-cell proteomic datasets involving more than 300 million cells and over 800 marker/surface proteins, and 10 mass spectrometry-based single-cell proteomic datasets involving more than 4000 cells and over 7000 proteins. Overall, SPDB is envisioned to be explored as a useful resource that will facilitate the wider research communities by providing detailed insights into proteomics from the single-cell perspective.
Collapse
Affiliation(s)
- Fang Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
- AI Lab, Tencent, Shenzhen 518000, China
| | - Chunpu Liu
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | - Jiawei Li
- College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
| | - Fan Yang
- AI Lab, Tencent, Shenzhen 518000, China
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Tianyi Zang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | | | - Guohua Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| |
Collapse
|