1
|
Lejun G, Like Y, Xinyi W, Shehai Z, Shuhua X. SeqBMC: Single-cell data processing using iterative block matrix completion algorithm based on matrix factorisation. IET Syst Biol 2025; 19:e70003. [PMID: 39943646 PMCID: PMC11821729 DOI: 10.1049/syb2.70003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2024] [Revised: 12/27/2024] [Accepted: 01/16/2025] [Indexed: 02/16/2025] Open
Abstract
With the development of high-throughput sequencing technology, the analysis of single-cell RNA sequencing data has become the focus of current research. Matrix analysis and processing of downstream gene expression after preprocessing is a hot topic for researchers. This paper proposed an iterative block matrix completion algorithm, called SeqBMC, based on matrix factorisation. The algorithm is used to complete the missing value of the gene expression matrix caused by the defect of sequencing technology. The gene frequency of the matrix is used to block the matrix, and then the matrix factorisation algorithm is used to complete the small matrix after the block, and then the biological zeros that may exist in the block matrix are retained. Experimental results show that the matrix completion algorithm can significantly improve the classification performance of the gene expression matrix after completion with 86.81% F1 score, which is conducive to the recognition of cell types in sequencing data. Moreover, this completion method can be completed only by the machine learning method without too much prior knowledge related to biology and has good effects. Compared with ALRA, SeqBMC increased 5.47% accuracy and 5.03% F1 score. It indicates that SeqBMC has significant advantages in the matrix completion of single-cell RNA sequencing data.
Collapse
Affiliation(s)
- Gong Lejun
- Jiangsu Key Lab of Big Data Security & Intelligent ProcessingSchool of Computer ScienceNanjing University of Posts and TelecommunicationsNanjingChina
| | - Yu Like
- Jiangsu Key Lab of Big Data Security & Intelligent ProcessingSchool of Computer ScienceNanjing University of Posts and TelecommunicationsNanjingChina
| | - Wei Xinyi
- Jiangsu Key Lab of Big Data Security & Intelligent ProcessingSchool of Computer ScienceNanjing University of Posts and TelecommunicationsNanjingChina
| | - Zhou Shehai
- Jiangsu Key Lab of Big Data Security & Intelligent ProcessingSchool of Computer ScienceNanjing University of Posts and TelecommunicationsNanjingChina
| | - Xu Shuhua
- School of Data Science and Artificial IntelligenceWenzhou University of TechnologyWenzhouChina
| |
Collapse
|
2
|
Gulati GS, D'Silva JP, Liu Y, Wang L, Newman AM. Profiling cell identity and tissue architecture with single-cell and spatial transcriptomics. Nat Rev Mol Cell Biol 2025; 26:11-31. [PMID: 39169166 DOI: 10.1038/s41580-024-00768-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/16/2024] [Indexed: 08/23/2024]
Abstract
Single-cell transcriptomics has broadened our understanding of cellular diversity and gene expression dynamics in healthy and diseased tissues. Recently, spatial transcriptomics has emerged as a tool to contextualize single cells in multicellular neighbourhoods and to identify spatially recurrent phenotypes, or ecotypes. These technologies have generated vast datasets with targeted-transcriptome and whole-transcriptome profiles of hundreds to millions of cells. Such data have provided new insights into developmental hierarchies, cellular plasticity and diverse tissue microenvironments, and spurred a burst of innovation in computational methods for single-cell analysis. In this Review, we discuss recent advancements, ongoing challenges and prospects in identifying and characterizing cell states and multicellular neighbourhoods. We discuss recent progress in sample processing, data integration, identification of subtle cell states, trajectory modelling, deconvolution and spatial analysis. Furthermore, we discuss the increasing application of deep learning, including foundation models, in analysing single-cell and spatial transcriptomics data. Finally, we discuss recent applications of these tools in the fields of stem cell biology, immunology, and tumour biology, and the future of single-cell and spatial transcriptomics in biological research and its translation to the clinic.
Collapse
Affiliation(s)
- Gunsagar S Gulati
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | | | - Yunhe Liu
- Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Linghua Wang
- Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, USA
- The University of Texas MD Anderson Cancer Center UTHealth Houston Graduate School of Biomedical Sciences, Houston, TX, USA
| | - Aaron M Newman
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, CA, USA.
- Stanford Cancer Institute, Stanford University, Stanford, CA, USA.
- Chan Zuckerberg Biohub - San Francisco, San Francisco, CA, USA.
| |
Collapse
|
3
|
Ramos Zapatero M, Tong A, Opzoomer JW, O'Sullivan R, Cardoso Rodriguez F, Sufi J, Vlckova P, Nattress C, Qin X, Claus J, Hochhauser D, Krishnaswamy S, Tape CJ. Trellis tree-based analysis reveals stromal regulation of patient-derived organoid drug responses. Cell 2023; 186:5606-5619.e24. [PMID: 38065081 DOI: 10.1016/j.cell.2023.11.005] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 07/27/2023] [Accepted: 11/02/2023] [Indexed: 12/18/2023]
Abstract
Patient-derived organoids (PDOs) can model personalized therapy responses; however, current screening technologies cannot reveal drug response mechanisms or how tumor microenvironment cells alter therapeutic performance. To address this, we developed a highly multiplexed mass cytometry platform to measure post-translational modification (PTM) signaling, DNA damage, cell-cycle activity, and apoptosis in >2,500 colorectal cancer (CRC) PDOs and cancer-associated fibroblasts (CAFs) in response to clinical therapies at single-cell resolution. To compare patient- and microenvironment-specific drug responses in thousands of single-cell datasets, we developed "Trellis"-a highly scalable, tree-based treatment effect analysis method. Trellis single-cell screening revealed that on-target cell-cycle blockage and DNA-damage drug effects are common, even in chemorefractory PDOs. However, drug-induced apoptosis is rarer, patient-specific, and aligns with cancer cell PTM signaling. We find that CAFs can regulate PDO plasticity-shifting proliferative colonic stem cells (proCSCs) to slow-cycling revival colonic stem cells (revCSCs) to protect cancer cells from chemotherapy.
Collapse
Affiliation(s)
- María Ramos Zapatero
- Cell Communication Lab, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK
| | - Alexander Tong
- Department of Computer Science, Yale University, New Haven, CT, USA; Department of Computer Science and Operations Research, Université de Montréal, Montreal, QC, Canada; Mila - Quebec AI Institute, Montréal, QC, Canada
| | - James W Opzoomer
- Cell Communication Lab, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK
| | - Rhianna O'Sullivan
- Cell Communication Lab, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK
| | - Ferran Cardoso Rodriguez
- Cell Communication Lab, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK
| | - Jahangir Sufi
- Cell Communication Lab, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK
| | - Petra Vlckova
- Cell Communication Lab, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK
| | - Callum Nattress
- Cell Communication Lab, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK
| | - Xiao Qin
- Cell Communication Lab, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK
| | - Jeroen Claus
- Phospho Biomedical Animation, The Greenhouse Studio 6, London N17 9QU, UK
| | - Daniel Hochhauser
- Drug-DNA Interactions Group, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK
| | - Smita Krishnaswamy
- Department of Computer Science, Yale University, New Haven, CT, USA; Department of Genetics, Yale University, New Haven, CT, USA; Program for Computational Biology & Bioinformatics, Yale University, New Haven, CT, USA; Program for Applied Math, Yale University, New Haven, CT, USA; Wu-Tsai Institute, Yale University, New Haven, CT, USA.
| | - Christopher J Tape
- Cell Communication Lab, Department of Oncology, University College London Cancer Institute, London WC1E 6DD, UK.
| |
Collapse
|
4
|
Senra D, Guisoni N, Diambra L. Cell annotation using scRNA-seq data: A protein-protein interaction network approach. MethodsX 2023; 10:102179. [PMID: 37128282 PMCID: PMC10148184 DOI: 10.1016/j.mex.2023.102179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 04/09/2023] [Indexed: 05/03/2023] Open
Abstract
Pathway analysis is an important step in the interpretation of single cell transcriptomic data, as it provides powerful information to detect which cellular processes are active in each individual cell. We have recently developed a protein-protein interaction network-based framework to quantify pluripotency associated pathways from scRNA-seq data. On this occasion, we extend this approach to quantify the activity of a pathway associated with any biological process, or even any list of genes. A systems-level characterization of pathway activities across multiple cell types provides a broadly applicable tool for the analysis of pathways in both healthy and disease conditions. Dysregulated cellular functions are a hallmark of a wide spectrum of human disorders, including cancer and autoimmune diseases. Here, we illustrate our method by analyzing various biological processes in healthy and cancer breast samples. Using this approach we found that tumor breast cells, even when they form a single group in the UMAP space, keep diverse biological programs active in a differentiated manner within the cluster.•We implement a protein-protein interaction network-based approach to quantify the activity of different biological processes.•The methodology can be used for cell annotation in scRNA-seq studies and is freely available as R package.
Collapse
|