1
|
White RB, Wild AR, O'Leary TP, Thompson AJ, Flibotte S, Peng A, Rogalski JC, Mair M, Derhami N, Bamji SX. The X-Linked Intellectual Disability Gene, ZDHHC9 , Is Important for Oligodendrocyte Subtype Determination and Myelination. Glia 2025; 73:1452-1466. [PMID: 40105030 PMCID: PMC12121472 DOI: 10.1002/glia.70016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 03/04/2025] [Accepted: 03/06/2025] [Indexed: 03/20/2025]
Abstract
Two percent of patients with X-linked intellectual disability (XLID) exhibit loss-of-function mutations in the enzyme, ZDHHC9. One of the main anatomical deficits observed in these patients is a decrease in corpus callosum volume and a concurrent disruption in white matter integrity. In this study, we demonstrate that deletion of Zdhhc9 in mice disrupts the balance of mature oligodendrocyte subtypes within the corpus callosum. While overall mature oligodendrocyte numbers are unchanged, there is a marked increase in MOL5/6 cells that are enriched in genes associated with cell adhesion and synapses, and a concomitant decrease in MOL2/3 cells that are enriched in genes associated with myelination. In line with this, we observed a decrease in the density of myelinated axons and disruptions in myelin compaction in the corpus callosum of Zdhhc9 knockout mice. RNA sequencing and proteomic analysis further revealed a reduction in genes and proteins essential for lipid metabolism, cholesterol synthesis, gene expression, and myelin compaction, offering insights into the underlying mechanisms of the pathology. These findings reveal a previously underappreciated and fundamental role for ZDHHC9 and protein palmitoylation in regulating oligodendrocyte subtype determination and myelinogenesis, offering mechanistic insights into the deficits observed in white matter volume in patients with mutations in ZDHHC9.
Collapse
Affiliation(s)
- Rocio B. White
- Department of Cellular and Physiological Sciences, Life Sciences Institute and Djavad Mowafaghian Centre for Brain HealthUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Angela R. Wild
- Department of Cellular and Physiological Sciences, Life Sciences Institute and Djavad Mowafaghian Centre for Brain HealthUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Timothy P. O'Leary
- Department of Cellular and Physiological Sciences, Life Sciences Institute and Djavad Mowafaghian Centre for Brain HealthUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Andrew J. Thompson
- Department of Cellular and Physiological Sciences, Life Sciences Institute and Djavad Mowafaghian Centre for Brain HealthUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Stephane Flibotte
- Life Sciences Institute Bioinformatics FacilityUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Angie Peng
- Department of Cellular and Physiological Sciences, Life Sciences Institute and Djavad Mowafaghian Centre for Brain HealthUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Jason C. Rogalski
- Department of Biochemistry and Molecular BiologyUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Mila Mair
- Department of Cellular and Physiological Sciences, Life Sciences Institute and Djavad Mowafaghian Centre for Brain HealthUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Neeki Derhami
- Department of Cellular and Physiological Sciences, Life Sciences Institute and Djavad Mowafaghian Centre for Brain HealthUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Shernaz X. Bamji
- Department of Cellular and Physiological Sciences, Life Sciences Institute and Djavad Mowafaghian Centre for Brain HealthUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| |
Collapse
|
2
|
Xu Y, Yu B, Chen X, Peng A, Tao Q, He Y, Wang Y, Li XM. DSCT: a novel deep-learning framework for rapid and accurate spatial transcriptomic cell typing. Natl Sci Rev 2025; 12:nwaf030. [PMID: 40313458 PMCID: PMC12045154 DOI: 10.1093/nsr/nwaf030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Revised: 01/05/2025] [Accepted: 01/09/2025] [Indexed: 05/03/2025] Open
Abstract
Unraveling complex cell-type-composition and gene-expression patterns at the cellular spatial resolution is crucial for understanding intricate cell functions in the brain. In this study, we developed Deep Neural Network-based Spatial Cell Typing (DSCT)-an innovative framework for spatial cell typing within spatial transcriptomic data sets. This approach utilizes a synergistic integration of an enhanced gene-selection strategy and a lightweight deep neural network for data training, offering a more rapid and accurate solution for the analysis of spatial transcriptomic data. Based on comprehensive analysis, DSCT achieved exceptional accuracy in cell-type identification across various brain regions, species and spatial transcriptomic platforms. It also performed well in mapping finer cell types, thereby showcasing its versatility and adaptability across diverse data sets. Strikingly, DSCT exhibited high efficiency and remarkable processing speed, with fewer computational resource demands. As such, this novel approach opens new avenues for exploring the spatial organization of cell types and gene-expression patterns, advancing our understanding of biological functions and pathologies within the nervous system.
Collapse
Affiliation(s)
- Yiheng Xu
- Department of Neurology and Department of Psychiatry, the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310058, China
- NHC and CAMS Key Laboratory of Medical Neurobiology, MOE Frontier Center of Brain Science and Brain-machine Integration, School of Brain Science and Brain Medicine, Zhejiang University, Hangzhou 310058, China
- College of Computer Science and Technology, Zhejiang University, Hangzhou 310000, China
| | - Bin Yu
- Key Laboratory of Novel Targets and Drug Study for Neural Repair of Zhejiang Province, Institute of brain and cognitive science, Hangzhou City University School of Medicine, Hangzhou 310015, China
| | - Xuan Chen
- NHC and CAMS Key Laboratory of Medical Neurobiology, MOE Frontier Center of Brain Science and Brain-machine Integration, School of Brain Science and Brain Medicine, Zhejiang University, Hangzhou 310058, China
- Department of Neurobiology, Affiliated Mental Health Center & Hangzhou Seventh People's Hospital, Zhejiang University School of Medicine, Hangzhou 310058, China
| | - Aibing Peng
- NHC and CAMS Key Laboratory of Medical Neurobiology, MOE Frontier Center of Brain Science and Brain-machine Integration, School of Brain Science and Brain Medicine, Zhejiang University, Hangzhou 310058, China
| | | | - Youzhe He
- BGI Research, Hangzhou 310030, China
| | - Yueming Wang
- College of Computer Science and Technology, Zhejiang University, Hangzhou 310000, China
- Qiushi Academy for Advanced Studies, Zhejiang University, Hangzhou 310058, China
- The Nanhu Brain-computer Interface institute, Hangzhou 311100, China
| | - Xiao-Ming Li
- Department of Neurology and Department of Psychiatry, the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310058, China
- NHC and CAMS Key Laboratory of Medical Neurobiology, MOE Frontier Center of Brain Science and Brain-machine Integration, School of Brain Science and Brain Medicine, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
3
|
Wang Y, Luo J, Jiao S, Xie X, Wang T, Liu J, Shang X, Peng J. STExplore: An Integrated Online Platform for Comprehensive Analysis and Visualization of Spatial Transcriptomics Data. SMALL METHODS 2025; 9:e2401272. [PMID: 40045664 DOI: 10.1002/smtd.202401272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Revised: 02/16/2025] [Indexed: 05/26/2025]
Abstract
Spatial transcriptomics revolutionizes the understanding of tissue organization and cellular interactions by combining high-resolution spatial information with gene expression profiles. Existing spatial transcriptomics analysis platforms face challenges in accommodating diverse techniques, integrating multi-omics data, and providing comprehensive analytical workflows. STExplore, an advanced online platform, is developed to address these limitations. STExplore supports a wide range of technologies, including sequencing-based and image-based methods, and offers a complete analysis workflow encompassing preprocessing, integration with single-cell RNA sequencing (scRNA-seq), cluster-level and gene-level analyses, and cell-cell communication studies. The platform features dynamic parameter adjustments and interactive visualizations at each analytical stage, enabling users to gain deeper insights into the spatial transcriptomic landscape. Case studies on neurogenesis in embryonic brain development, Alzheimer's disease, and brain tissue architecture demonstrate STExplore's capabilities in enhancing gene expression analysis, revealing cellular spatial organizations, and uncovering intercellular communication patterns. STExplore provides a comprehensive and user-friendly solution for the expanding demands of spatial transcriptomics research. The platform is accessible at http://120.77.47.2:3000/.
Collapse
Affiliation(s)
- Yongtian Wang
- School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- AI for Science Interdisciplinary Research Center, School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, No.1 Dongxiang Road, Xi'an, 710129, China
- Research & Development Institute of Northwestern Polytechnical University in Shenzhen, Sanhang Science & Technology Buliding, No.45th, Gaoxin South 9th Road, Nanshan District, Shenzhen City, 518063, China
| | - Jintian Luo
- School of Software, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
| | - Shaoqing Jiao
- School of Software, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
| | - Xiaohan Xie
- School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- AI for Science Interdisciplinary Research Center, School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, No.1 Dongxiang Road, Xi'an, 710129, China
| | - Tao Wang
- School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- AI for Science Interdisciplinary Research Center, School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, No.1 Dongxiang Road, Xi'an, 710129, China
| | - Jie Liu
- School of Software, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
| | - Xuequn Shang
- School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- AI for Science Interdisciplinary Research Center, School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, No.1 Dongxiang Road, Xi'an, 710129, China
| | - Jiajie Peng
- School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- AI for Science Interdisciplinary Research Center, School of Computer Science, Northwestern Polytechnical University, No.1 Dongxiang Road, Xi'an, Shaanxi, 710129, China
- Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, No.1 Dongxiang Road, Xi'an, 710129, China
- Research & Development Institute of Northwestern Polytechnical University in Shenzhen, Sanhang Science & Technology Buliding, No.45th, Gaoxin South 9th Road, Nanshan District, Shenzhen City, 518063, China
| |
Collapse
|
4
|
Jia BB, Sun BK, Lee EY, Ren B. Emerging Techniques in Spatial Multiomics: Fundamental Principles and Applications to Dermatology. J Invest Dermatol 2025; 145:1017-1032. [PMID: 39503694 DOI: 10.1016/j.jid.2024.09.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2024] [Revised: 09/09/2024] [Accepted: 09/09/2024] [Indexed: 04/25/2025]
Abstract
Molecular pathology, such as high-throughput genomic and proteomic profiling, identifies precise disease targets from biopsies but require tissue dissociation, losing valuable histologic and spatial context. Emerging spatial multi-omic technologies now enable multiplexed visualization of genomic, proteomic, and epigenomic targets within a single tissue slice, eliminating the need for labeling multiple adjacent slices. Although early work focused on RNA (spatial transcriptomics), spatial technologies can now concurrently capture DNA, genome accessibility, histone modifications, and proteins with spatially-resolved single-cell resolution. This review outlines the principles, advantages, limitations, and potential for spatial technologies to advance dermatologic research. By jointly profiling multiple molecular channels, spatial multiomics enables novel studies of copy number variations, clonal heterogeneity, and enhancer dysregulation, replete with spatial context, illuminating the skin's complex heterogeneity.
Collapse
Affiliation(s)
- Bojing B Jia
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, California, USA; Medical Scientist Training Program, University of California, San Diego, La Jolla, California, USA
| | - Bryan K Sun
- Department of Dermatology, University of California, Irvine, Irvine, California, USA
| | - Ernest Y Lee
- Department of Dermatology, University of California, San Francisco, San Francisco, California, USA
| | - Bing Ren
- Center for Epigenomics, Department of Cellular & Molecular Medicine, University of California, San Diego, La Jolla, California, USA; Institute of Genomic Medicine, Moores Cancer Center, School of Medicine, University of California, San Diego, La Jolla, California, USA.
| |
Collapse
|
5
|
Hui T, Zhou J, Yao M, Xie Y, Zeng H. Advances in Spatial Omics Technologies. SMALL METHODS 2025; 9:e2401171. [PMID: 40099571 DOI: 10.1002/smtd.202401171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2024] [Revised: 03/03/2025] [Indexed: 03/20/2025]
Abstract
Rapidly developing spatial omics technologies provide us with new approaches to deeply understanding the diversity and functions of cell types within organisms. Unlike traditional approaches, spatial omics technologies enable researchers to dissect the complex relationships between tissue structure and function at the cellular or even subcellular level. The application of spatial omics technologies provides new perspectives on key biological processes such as nervous system development, organ development, and tumor microenvironment. This review focuses on the advancements and strategies of spatial omics technologies, summarizes their applications in biomedical research, and highlights the power of spatial omics technologies in advancing the understanding of life sciences related to development and disease.
Collapse
Affiliation(s)
- Tianxiao Hui
- State Key Laboratory of Gene Function and Modulation Research, College of Future Technology, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
| | - Jian Zhou
- Peking-Tsinghua Center for Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Muchen Yao
- College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Yige Xie
- School of Nursing, Peking University, Beijing, 100871, China
| | - Hu Zeng
- State Key Laboratory of Gene Function and Modulation Research, College of Future Technology, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
- Beijing Advanced Center of RNA Biology (BEACON), Peking University, Beijing, 100871, China
| |
Collapse
|
6
|
Wang Z, Dai R, Wang M, Lei L, Zhang Z, Han K, Wang Z, Guo Q. KanCell: dissecting cellular heterogeneity in biological tissues through integrated single-cell and spatial transcriptomics. J Genet Genomics 2025; 52:689-705. [PMID: 39577768 DOI: 10.1016/j.jgg.2024.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2024] [Revised: 11/07/2024] [Accepted: 11/10/2024] [Indexed: 11/24/2024]
Abstract
KanCell is a deep learning model based on Kolmogorov-Arnold networks (KAN) designed to enhance cellular heterogeneity analysis by integrating single-cell RNA sequencing and spatial transcriptomics (ST) data. ST technologies provide insights into gene expression within tissue context, revealing cellular interactions and microenvironments. To fully leverage this potential, effective computational models are crucial. We evaluate KanCell on both simulated and real datasets from technologies such as STARmap, Slide-seq, Visium, and Spatial Transcriptomics. Our results demonstrate that KanCell outperforms existing methods across metrics like PCC, SSIM, COSSIM, RMSE, JSD, ARS, and ROC, with robust performance under varying cell numbers and background noise. Real-world applications on human lymph nodes, hearts, melanoma, breast cancer, dorsolateral prefrontal cortex, and mouse embryo brains confirmed its reliability. Compared with traditional approaches, KanCell effectively captures non-linear relationships and optimizes computational efficiency through KAN, providing an accurate and efficient tool for ST. By improving data accuracy and resolving cell type composition, KanCell reveals cellular heterogeneity, clarifies disease microenvironments, and identifies therapeutic targets, addressing complex biological challenges.
Collapse
Affiliation(s)
- Zhenghui Wang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Ruoyan Dai
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Mengqiu Wang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Lixin Lei
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Zhiwei Zhang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Kaitai Han
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Zijun Wang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Qianjin Guo
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China.
| |
Collapse
|
7
|
Wang J, Ye F, Chai H, Jiang Y, Wang T, Ran X, Xia Q, Xu Z, Fu Y, Zhang G, Wu H, Guo G, Guo H, Ruan Y, Wang Y, Xing D, Xu X, Zhang Z. Advances and applications in single-cell and spatial genomics. SCIENCE CHINA. LIFE SCIENCES 2025; 68:1226-1282. [PMID: 39792333 DOI: 10.1007/s11427-024-2770-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Accepted: 10/10/2024] [Indexed: 01/12/2025]
Abstract
The applications of single-cell and spatial technologies in recent times have revolutionized the present understanding of cellular states and the cellular heterogeneity inherent in complex biological systems. These advancements offer unprecedented resolution in the examination of the functional genomics of individual cells and their spatial context within tissues. In this review, we have comprehensively discussed the historical development and recent progress in the field of single-cell and spatial genomics. We have reviewed the breakthroughs in single-cell multi-omics technologies, spatial genomics methods, and the computational strategies employed toward the analyses of single-cell atlas data. Furthermore, we have highlighted the advances made in constructing cellular atlases and their clinical applications, particularly in the context of disease. Finally, we have discussed the emerging trends, challenges, and opportunities in this rapidly evolving field.
Collapse
Affiliation(s)
- Jingjing Wang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Fang Ye
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Haoxi Chai
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China
| | - Yujia Jiang
- BGI Research, Shenzhen, 518083, China
- BGI Research, Hangzhou, 310030, China
| | - Teng Wang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
| | - Xia Ran
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China
| | - Qimin Xia
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
| | - Ziye Xu
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Yuting Fu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guodong Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Hanyu Wu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guoji Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Zhejiang Provincial Key Lab for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Hongshan Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Yijun Ruan
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China.
| | - Yongcheng Wang
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
| | - Dong Xing
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, 100871, China.
| | - Xun Xu
- BGI Research, Shenzhen, 518083, China.
- BGI Research, Hangzhou, 310030, China.
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI Research, Shenzhen, 518083, China.
| | - Zemin Zhang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
| |
Collapse
|
8
|
Bao X, Bai X, Liu X, Shi Q, Zhang C. Spatially informed graph transformers for spatially resolved transcriptomics. Commun Biol 2025; 8:574. [PMID: 40188303 PMCID: PMC11972348 DOI: 10.1038/s42003-025-08015-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 03/28/2025] [Indexed: 04/07/2025] Open
Abstract
Spatially resolved transcriptomics (SRT) has emerged as a powerful technique for mapping gene expression landscapes within spatial contexts. However, significant challenges persist in effectively integrating gene expression with spatial information to elucidate the heterogeneity of biological tissues. Here, we present a Spatially informed Graph Transformers framework, SpaGT, which leverages both node and edge channels to model spatially aware graph representation for denoising gene expression and identifying spatial domains. Unlike conventional graph neural networks, which rely on static, localized convolutional aggregation, SpaGT employs a structure-reinforced self-attention mechanism that iteratively evolves topological structural information and transcriptional signal representation. By replacing graph convolution with global self-attention, SpaGT enables the integration of both global and spatially localized information, thereby improving the detection of fine-grained spatial domains. We demonstrate that SpaGT achieves superior performance in identifying spatial domains and denoising gene expression data across diverse platforms and species. Furthermore, SpaGT facilitates the discovery of spatially variable genes with significant prognostic potential in cancer tissues. These findings establish SpaGT as a powerful tool for unraveling the complexities of biological tissues.
Collapse
Affiliation(s)
- Xinyu Bao
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Xiaosheng Bai
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Xiaoping Liu
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China.
| | - Qianqian Shi
- Hubei Engineering Technology Research Center of Agricultural Big Data, Huazhong Agricultural University, Wuhan, China.
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China.
| | - Chuanchao Zhang
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China.
| |
Collapse
|
9
|
Frazzette N, Jour G. Novel Molecular Methods in Soft Tissue Sarcomas: From Diagnostics to Theragnostics. Cancers (Basel) 2025; 17:1215. [PMID: 40227789 PMCID: PMC11987812 DOI: 10.3390/cancers17071215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2025] [Revised: 03/27/2025] [Accepted: 03/27/2025] [Indexed: 04/15/2025] Open
Abstract
Soft tissue sarcomas (STSs) are a diverse group of malignant tumors derived from mesenchymal tissues [...].
Collapse
Affiliation(s)
| | - George Jour
- Department of Pathology, NYU Grossman School of Medicine, New York, NY 10016, USA;
| |
Collapse
|
10
|
Min W, Fang D, Chen J, Zhang S. SpaMask: Dual masking graph autoencoder with contrastive learning for spatial transcriptomics. PLoS Comput Biol 2025; 21:e1012881. [PMID: 40179332 PMCID: PMC11968113 DOI: 10.1371/journal.pcbi.1012881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2024] [Accepted: 02/17/2025] [Indexed: 04/05/2025] Open
Abstract
Understanding the spatial locations of cell within tissues is crucial for unraveling the organization of cellular diversity. Recent advancements in spatial resolved transcriptomics (SRT) have enabled the analysis of gene expression while preserving the spatial context within tissues. Spatial domain characterization is a critical first step in SRT data analysis, providing the foundation for subsequent analyses and insights into biological implications. Graph neural networks (GNNs) have emerged as a common tool for addressing this challenge due to the structural nature of SRT data. However, current graph-based deep learning approaches often overlook the instability caused by the high sparsity of SRT data. Masking mechanisms, as an effective self-supervised learning strategy, can enhance the robustness of these models. To this end, we propose SpaMask, dual masking graph autoencoder with contrastive learning for SRT analysis. Unlike previous GNNs, SpaMask masks a portion of spot nodes and spot-to-spot edges to enhance its performance and robustness. SpaMask combines Masked Graph Autoencoders (MGAE) and Masked Graph Contrastive Learning (MGCL) modules, with MGAE using node masking to leverage spatial neighbors for improved clustering accuracy, while MGCL applies edge masking to create a contrastive loss framework that tightens embeddings of adjacent nodes based on spatial proximity and feature similarity. We conducted a comprehensive evaluation of SpaMask on eight datasets from five different platforms. Compared to existing methods, SpaMask achieves superior clustering accuracy and effective batch correction.
Collapse
Affiliation(s)
- Wenwen Min
- School of Information Science and Engineering, Yunnan University, Kunming, Yunnan, China
| | - Donghai Fang
- School of Information Science and Engineering, Yunnan University, Kunming, Yunnan, China
| | - Jinyu Chen
- School of Mathematics, Statistics and Mechanics, Beijing University of Technology, Beijing, China
| | - Shihua Zhang
- NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, Zhejiang, China
| |
Collapse
|
11
|
Marco Salas S, Kuemmerle LB, Mattsson-Langseth C, Tismeyer S, Avenel C, Hu T, Rehman H, Grillo M, Czarnewski P, Helgadottir S, Tiklova K, Andersson A, Rafati N, Chatzinikolaou M, Theis FJ, Luecken MD, Wählby C, Ishaque N, Nilsson M. Optimizing Xenium In Situ data utility by quality assessment and best-practice analysis workflows. Nat Methods 2025; 22:813-823. [PMID: 40082609 PMCID: PMC11978515 DOI: 10.1038/s41592-025-02617-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 02/04/2025] [Indexed: 03/16/2025]
Abstract
The Xenium In Situ platform is a new spatial transcriptomics product commercialized by 10x Genomics, capable of mapping hundreds of genes in situ at subcellular resolution. Given the multitude of commercially available spatial transcriptomics technologies, recommendations in choice of platform and analysis guidelines are increasingly important. Herein, we explore 25 Xenium datasets generated from multiple tissues and species, comparing scalability, resolution, data quality, capacities and limitations with eight other spatially resolved transcriptomics technologies and commercial platforms. In addition, we benchmark the performance of multiple open-source computational tools, when applied to Xenium datasets, in tasks including preprocessing, cell segmentation, selection of spatially variable features and domain identification. This study serves as an independent analysis of the performance of Xenium, and provides best practices and recommendations for analysis of such datasets.
Collapse
Affiliation(s)
- Sergio Marco Salas
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden.
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany.
| | - Louis B Kuemmerle
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany
- Institute for Tissue Engineering and Regenerative Medicine (iTERM), Helmholtz Zentrum München, Munich, Germany
| | | | - Sebastian Tismeyer
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Digital Health Center, Berlin, Germany
| | - Christophe Avenel
- Department of Information Technology and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Taobo Hu
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Habib Rehman
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Champaign, IL, USA
| | - Marco Grillo
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Paulo Czarnewski
- Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden
- Science for Life Laboratory, Department of Biochemistry and Biophysics, National Bioinformatics Infrastructure Sweden, Stockholm University, Stockholm, Sweden
| | - Saga Helgadottir
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Katarina Tiklova
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Axel Andersson
- Department of Information Technology and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Nima Rafati
- National Bioinformatics Infrastructure Sweden, Uppsala University, SciLifeLab, Department of Medical Biochemistry and Microbiology, Uppsala, Sweden
| | - Maria Chatzinikolaou
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Fabian J Theis
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany
| | - Malte D Luecken
- Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Munich, Germany
- Institute of Lung Health & Immunity, Helmholtz Munich; Member of the German Center for Lung Research (DZL), Munich, Germany
| | - Carolina Wählby
- Department of Information Technology and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Naveed Ishaque
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Digital Health Center, Berlin, Germany
| | - Mats Nilsson
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden.
| |
Collapse
|
12
|
Liu T, Lin Y, Luo X, Sun Y, Zhao H. VISTA Uncovers Missing Gene Expression and Spatial-induced Information for Spatial Transcriptomic Data Analysis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.08.26.609718. [PMID: 40166134 PMCID: PMC11957009 DOI: 10.1101/2024.08.26.609718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Characterizing cell activities within a spatially resolved context is essential to enhance our understanding of spatially-induced cellular states and features. While single-cell RNA-seq (scRNA-seq) offers comprehensive profiling of cells within a tissue, it fails to capture spatial context. Conversely, subcellular spatial transcriptomics (SST) technologies provide high-resolution spatial profiles of gene expression, yet their utility is constrained by the limited number of genes they can simultaneously profile. To address this limitation, we introduce VISTA, a novel approach designed to predict the expression levels of unobserved genes specifically tailored for SST data. VISTA jointly models scRNA-seq data and SST data based on variational inference and geometric deep learning, and incorporates uncertainty quantification. Using four SST datasets, we demonstrate VISTA's superior performance in imputation and in analyzing large-scale SST datasets with satisfactory time efficiency and memory consumption. The imputation of VISTA enables a multitude of downstream applications, including the detection of new spatially variable genes, the discovery of novel ligand-receptor interactions, the inference of spatial RNA velocity, the generation for spatial transcriptomics with in-silico perturbation, and an improved decomposition of spatial and intrinsic variations.
Collapse
Affiliation(s)
- Tianyu Liu
- Interdepartmental Program in Computational Biology & Bioinformatics, Yale University, New Haven, 06511, CT, USA
| | - Yingxin Lin
- Department of Biostatistics, Yale University, New Haven, 06511, CT, USA
| | - Xiao Luo
- Department of Computer Science, University of California, Los Angeles, Los Angeles, 90095, CA, USA
| | - Yizhou Sun
- Department of Computer Science, University of California, Los Angeles, Los Angeles, 90095, CA, USA
| | - Hongyu Zhao
- Interdepartmental Program in Computational Biology & Bioinformatics, Yale University, New Haven, 06511, CT, USA
- Department of Biostatistics, Yale University, New Haven, 06511, CT, USA
| |
Collapse
|
13
|
Wang Y, Liu Z, Ma X. MuCST: restoring and integrating heterogeneous morphology images and spatial transcriptomics data with contrastive learning. Genome Med 2025; 17:21. [PMID: 40082941 PMCID: PMC11907906 DOI: 10.1186/s13073-025-01449-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2024] [Accepted: 03/07/2025] [Indexed: 03/16/2025] Open
Abstract
Spatially resolved transcriptomics (SRT) simultaneously measure spatial location, histology images, and transcriptional profiles of cells or regions in undissociated tissues. Integrative analysis of multi-modal SRT data holds immense potential for understanding biological mechanisms. Here, we present a flexible multi-modal contrastive learning for the integration of SRT data (MuCST), which joins denoising, heterogeneity elimination, and compatible feature learning. MuCST accurately identifies spatial domains and is applicable to diverse datasets platforms. Overall, MuCST provides an alternative for integrative analysis of multi-modal SRT data ( https://github.com/xkmaxidian/MuCST ).
Collapse
Affiliation(s)
- Yu Wang
- School of Computer Science and Technology, Xidian University, No.2 South Taibai Road, Xi'an, 710071, Shaanxi, China
- Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province, Xidian University, No.2 South Taibai Road, Xi'an, 710071, Shaanxi, China
| | - Zaiyi Liu
- Department of Radiology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, 106 Zhongshan Er Road, Guangzhou, 510080, Guangdong, China
- Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application, 106 Zhongshan Er Road, Guangzhou, 510080, Guangdong, China
| | - Xiaoke Ma
- School of Computer Science and Technology, Xidian University, No.2 South Taibai Road, Xi'an, 710071, Shaanxi, China.
- Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province, Xidian University, No.2 South Taibai Road, Xi'an, 710071, Shaanxi, China.
| |
Collapse
|
14
|
Wei X, Chen T, Wang X, Shen W, Liu C, Wu S, Wong HS. COME: contrastive mapping learning for spatial reconstruction of single-cell RNA sequencing data. Bioinformatics 2025; 41:btaf083. [PMID: 39992219 PMCID: PMC11897431 DOI: 10.1093/bioinformatics/btaf083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Revised: 02/12/2025] [Accepted: 02/20/2025] [Indexed: 02/25/2025] Open
Abstract
MOTIVATION Single-cell RNA sequencing (scRNA-seq) enables high-throughput transcriptomic profiling at single-cell resolution. The inherent spatial location is crucial for understanding how single cells orchestrate multicellular functions and drive diseases. However, spatial information is often lost during tissue dissociation. Spatial transcriptomic (ST) technologies can provide precise spatial gene expression atlas, while their practicality is constrained by the number of genes they can assay or the associated costs at a larger scale and the fine-grained cell-type annotation. By transferring knowledge between scRNA-seq and ST data through cell correspondence learning, it is possible to recover the spatial properties inherent in scRNA-seq datasets. RESULTS In this study, we introduce COME, a COntrastive Mapping lEarning approach that learns mapping between ST and scRNA-seq data to recover the spatial information of scRNA-seq data. Extensive experiments demonstrate that the proposed COME method effectively captures precise cell-spot relationships and outperforms previous methods in recovering spatial location for scRNA-seq data. More importantly, our method is capable of precisely identifying biologically meaningful information within the data, such as the spatial structure of missing genes, spatial hierarchical patterns, and the cell-type compositions for each spot. These results indicate that the proposed COME method can help to understand the heterogeneity and activities among cells within tissue environments. AVAILABILITY AND IMPLEMENTATION The COME is freely available in GitHub (https://github.com/cindyway/COME).
Collapse
Affiliation(s)
- Xindian Wei
- Department of Computer Science, City University of Hong Kong, Kowloon 999077, Hong Kong
| | - Tianyi Chen
- Department of Computer Science, City University of Hong Kong, Kowloon 999077, Hong Kong
| | - Xibiao Wang
- Department of Computer Science, Shantou University, Shantou 515063, China
| | - Wenjun Shen
- Department of Bioinformatics, Shantou University Medical College, Shantou 515041, China
| | - Cheng Liu
- Department of Computer Science, Shantou University, Shantou 515063, China
| | - Si Wu
- Department of Computer Science and Engineering, South China University of Technology, Guangzhou, Guangdong 510006, China
| | - Hau-San Wong
- Department of Computer Science, City University of Hong Kong, Kowloon 999077, Hong Kong
| |
Collapse
|
15
|
Liu C, Li X, Hu Q, Jia Z, Ye Q, Wang X, Zhao K, Liu L, Wang M. Decoding the blueprints of embryo development with single-cell and spatial omics. Semin Cell Dev Biol 2025; 167:22-39. [PMID: 39889540 DOI: 10.1016/j.semcdb.2025.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 01/18/2025] [Accepted: 01/18/2025] [Indexed: 02/03/2025]
Abstract
Embryonic development is a complex and intricately regulated process that encompasses precise control over cell differentiation, morphogenesis, and the underlying gene expression changes. Recent years have witnessed a remarkable acceleration in the development of single-cell and spatial omic technologies, enabling high-throughput profiling of transcriptomic and other multi-omic information at the individual cell level. These innovations offer fresh and multifaceted perspectives for investigating the intricate cellular and molecular mechanisms that govern embryonic development. In this review, we provide an in-depth exploration of the latest technical advancements in single-cell and spatial multi-omic methodologies and compile a systematic catalog of their applications in the field of embryonic development. We deconstruct the research strategies employed by recent studies that leverage single-cell sequencing techniques and underscore the unique advantages of spatial transcriptomics. Furthermore, we delve into both the current applications, data analysis algorithms and the untapped potential of these technologies in advancing our understanding of embryonic development. With the continuous evolution of multi-omic technologies, we anticipate their widespread adoption and profound contributions to unraveling the intricate molecular foundations underpinning embryo development in the foreseeable future.
Collapse
Affiliation(s)
- Chang Liu
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China; Shanxi Medical University-BGI Collaborative Center for Future Medicine, Shanxi Medical University, Taiyuan 030001, China; Shenzhen Proof-of-Concept Center of Digital Cytopathology, BGI Research, Shenzhen 518083, China
| | | | - Qinan Hu
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518005, China; Department of Pharmacology, School of Medicine, Southern University of Science and Technology, Shenzhen 518005, China
| | - Zihan Jia
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qing Ye
- BGI Research, Hangzhou 310030, China; China Jiliang University, Hangzhou 310018, China
| | | | - Kaichen Zhao
- College of Biomedicine and Health, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, Hubei 430070, China
| | - Longqi Liu
- BGI Research, Hangzhou 310030, China; Shanxi Medical University-BGI Collaborative Center for Future Medicine, Shanxi Medical University, Taiyuan 030001, China.
| | - Mingyue Wang
- BGI Research, Hangzhou 310030, China; Key Laboratory of Spatial Omics of Zhejiang Province, BGI Research, Hangzhou 310030, China.
| |
Collapse
|
16
|
Kumaran G, Carroll L, Muirhead N, Bottomley MJ. How Can Spatial Transcriptomic Profiling Advance Our Understanding of Skin Diseases? J Invest Dermatol 2025; 145:522-535. [PMID: 39177547 DOI: 10.1016/j.jid.2024.07.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 05/23/2024] [Accepted: 07/04/2024] [Indexed: 08/24/2024]
Abstract
Spatial transcriptomic (ST) profiling is the mapping of gene expression within cell populations with preservation of positional context and represents an exciting new approach to develop our understanding of local and regional influences upon skin biology in health and disease. With the ability to probe from a few hundred transcripts to the entire transcriptome, multiple ST approaches are now widely available. In this paper, we review the ST field and discuss its application to dermatology. Its potential to advance our understanding of skin biology in health and disease is highlighted through the illustrative examples of 3 research areas: cutaneous aging, tumorigenesis, and psoriasis.
Collapse
Affiliation(s)
- Girishkumar Kumaran
- Chinese Academy of Medical Sciences Oxford Institute, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Liam Carroll
- Chinese Academy of Medical Sciences Oxford Institute, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | | | - Matthew J Bottomley
- Chinese Academy of Medical Sciences Oxford Institute, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom.
| |
Collapse
|
17
|
Zou G, Shen Q, Li L, Zhang S. stAI: a deep learning-based model for missing gene imputation and cell-type annotation of spatial transcriptomics. Nucleic Acids Res 2025; 53:gkaf158. [PMID: 40057378 PMCID: PMC11890069 DOI: 10.1093/nar/gkaf158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2024] [Revised: 12/29/2024] [Accepted: 02/18/2025] [Indexed: 05/13/2025] Open
Abstract
Spatial transcriptomics technology has revolutionized our understanding of cellular systems by capturing RNA transcript levels in their original spatial context. Single-cell spatial transcriptomics (scST) offers single-cell resolution expression level and precise spatial information of RNA transcripts, while it has a limited capacity for simultaneously detecting a wide range of RNA transcripts, hindering its broader applications. Characterizing the whole transcriptome level and comprehensively annotating cell types represent two significant challenges in scST applications. Despite several proposed methods for one or both tasks, their performance remains inadequate. In this work, we introduce stAI, a deep learning-based model designed to address both missing gene imputation and cell-type annotation for scST data. stAI leverages a joint embedding for the scST and the reference scRNA-seq data with two separate encoder-decoder modules. Both the imputation and annotation are performed within the latent space in a supervised manner, utilizing scRNA-seq data to guide the processes. Experiments for datasets generated from diverse platforms with varying numbers of measured genes were conducted and compared with the updated methods. The results demonstrate that stAI can predict the unmeasured genes, especially the marker genes, with much higher accuracy, and annotate the cell types, including those of small size, with high precision.
Collapse
Affiliation(s)
- Guangsheng Zou
- School of Mathematical Sciences, Fudan University, Shanghai 200433, China
| | - Qunlun Shen
- School of Mathematical Sciences, Fudan University, Shanghai 200433, China
| | - Limin Li
- School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, China
| | - Shuqin Zhang
- School of Mathematical Sciences, Fudan University, Shanghai 200433, China
- Center for Applied Mathematics, Research Institute of Intelligent Complex Systems, and Shanghai Key Laboratory for Contemporary Applied Mathematics, Fudan University, Shanghai 200433, China
| |
Collapse
|
18
|
Millard N, Chen JH, Palshikar MG, Pelka K, Spurrell M, Price C, He J, Hacohen N, Raychaudhuri S, Korsunsky I. Batch correcting single-cell spatial transcriptomics count data with Crescendo improves visualization and detection of spatial gene patterns. Genome Biol 2025; 26:36. [PMID: 40001084 PMCID: PMC11863647 DOI: 10.1186/s13059-025-03479-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 01/21/2025] [Indexed: 02/27/2025] Open
Abstract
Spatial transcriptomics facilitates gene expression analysis of cells in their spatial anatomical context. Batch effects hinder visualization of gene spatial patterns across samples. We present the Crescendo algorithm to correct for batch effects at the gene expression level and enable accurate visualization of gene expression patterns across multiple samples. We show Crescendo's utility and scalability across three datasets ranging from 170,000 to 7 million single cells across spatial and single-cell RNA sequencing technologies. By correcting for batch effects, Crescendo enhances spatial transcriptomics analyses to detect gene colocalization and ligand-receptor interactions and enables cross-technology information transfer.
Collapse
Affiliation(s)
- Nghia Millard
- Division of Rheumatology, Inflammation and Immunity, Brigham and Women's Hospital, Boston, MA, USA
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
- Center for Data Sciences, Brigham and Women's Hospital, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jonathan H Chen
- Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts General Hospital (MGH) Cancer Center, Harvard Medical School, Boston, MA, USA
- Department of Pathology, MGH, Boston, MA, USA
| | - Mukta G Palshikar
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
- Center for Data Sciences, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Karin Pelka
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts General Hospital (MGH) Cancer Center, Harvard Medical School, Boston, MA, USA
- UCSF Institute of Genomic Immunology, Gladstone Institutes, San Francisco, CA, USA
| | - Maxwell Spurrell
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts General Hospital (MGH) Cancer Center, Harvard Medical School, Boston, MA, USA
- Department of Pathology, MGH, Boston, MA, USA
| | | | | | - Nir Hacohen
- Department of Immunology, Harvard Medical School, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Massachusetts General Hospital (MGH) Cancer Center, Harvard Medical School, Boston, MA, USA
| | - Soumya Raychaudhuri
- Division of Rheumatology, Inflammation and Immunity, Brigham and Women's Hospital, Boston, MA, USA.
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA.
- Center for Data Sciences, Brigham and Women's Hospital, Boston, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, USA.
- Harvard Medical School, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Ilya Korsunsky
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA.
- Center for Data Sciences, Brigham and Women's Hospital, Boston, MA, USA.
- Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
19
|
Xu K, Xu Y, Wang Z, Zhou XM, Zhang L. stDyer enables spatial domain clustering with dynamic graph embedding. Genome Biol 2025; 26:34. [PMID: 39980033 PMCID: PMC11843776 DOI: 10.1186/s13059-025-03503-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Accepted: 02/12/2025] [Indexed: 02/22/2025] Open
Abstract
Spatially resolved transcriptomics (SRT) data provide critical insights into gene expression patterns within tissue contexts, necessitating effective methods for identifying spatial domains. We introduce stDyer, an end-to-end deep learning framework for spatial domain clustering in SRT data. stDyer combines Gaussian Mixture Variational AutoEncoder with graph attention networks to learn embeddings and perform clustering. Its dynamic graphs adaptively link units based on Gaussian Mixture assignments, improving clustering and producing smoother domain boundaries. stDyer's mini-batch strategy and multi-GPU support facilitate scalability to large datasets. Benchmarking against state-of-the-art tools, stDyer demonstrates superior performance in spatial domain clustering, multi-slice analysis, and large-scale dataset handling.
Collapse
Affiliation(s)
- Ke Xu
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | - Yu Xu
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | - Zirui Wang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
| | - Xin Maizie Zhou
- Department of Biomedical Engineering, Vanderbilt University, 2301 Vanderbilt Place, 37235, Nashville, Tennessee, USA.
| | - Lu Zhang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong, China.
- Institute of Systems Medicine and Health Sciences, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, China.
| |
Collapse
|
20
|
Torok J, Maia PD, Anand C, Raj A. Searching for the cellular underpinnings of the selective vulnerability to tauopathic insults in Alzheimer's disease. Commun Biol 2025; 8:195. [PMID: 39920421 PMCID: PMC11806020 DOI: 10.1038/s42003-025-07575-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 01/17/2025] [Indexed: 02/09/2025] Open
Abstract
Neurodegenerative diseases such as Alzheimer's disease exhibit pathological changes in the brain that proceed in a stereotyped and regionally specific fashion. However, the cellular underpinnings of regional vulnerability are poorly understood, in part because whole-brain maps of a comprehensive collection of cell types have been inaccessible. Here, we deployed a recent cell-type mapping pipeline, Matrix Inversion and Subset Selection (MISS), to determine the brain-wide distributions of pan-hippocampal and neocortical cells in the mouse, and then used these maps to identify general principles of cell-type-based selective vulnerability in PS19 mouse models. We found that hippocampal glutamatergic neurons as a whole were significantly positively associated with regional tau deposition, suggesting vulnerability, while cortical glutamatergic and GABAergic neurons were negatively associated. We also identified oligodendrocytes as the single-most strongly negatively associated cell type. Further, cell-type distributions were more predictive of end-time-point tau pathology than AD-risk-gene expression. Using gene ontology analysis, we found that the genes that are directly correlated to tau pathology are functionally distinct from those that constitutively embody the vulnerable cells. In short, we have elucidated cell-type correlates of tau deposition across mouse models of tauopathy, advancing our understanding of selective cellular vulnerability at a whole-brain level.
Collapse
Affiliation(s)
- Justin Torok
- University of CAlifornia, San Francisco, Department of Radiology, San Francisco, CA, 94143, USA
| | - Pedro D Maia
- University of Texas at Arlington, Department of Mathematics, Arlington, TX, 76019, USA
| | - Chaitali Anand
- University of CAlifornia, San Francisco, Institute for Neurodegenerative Diseases, San Francisco, CA, 94143, USA
| | - Ashish Raj
- University of CAlifornia, San Francisco, Department of Radiology, San Francisco, CA, 94143, USA.
| |
Collapse
|
21
|
Hu B, He R, Pang K, Wang G, Wang N, Zhu W, Sui X, Teng H, Liu T, Zhu J, Jiang Z, Zhang J, Zuo Z, Wang W, Ji P, Zhao F. High-resolution spatially resolved proteomics of complex tissues based on microfluidics and transfer learning. Cell 2025; 188:734-748.e22. [PMID: 39855194 DOI: 10.1016/j.cell.2024.12.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 10/12/2024] [Accepted: 12/17/2024] [Indexed: 01/27/2025]
Abstract
Despite recent advances in imaging- and antibody-based methods, achieving in-depth, high-resolution protein mapping across entire tissues remains a significant challenge in spatial proteomics. Here, we present parallel-flow projection and transfer learning across omics data (PLATO), an integrated framework combining microfluidics with deep learning to enable high-resolution mapping of thousands of proteins in whole tissue sections. We validated the PLATO framework by profiling the spatial proteome of the mouse cerebellum, identifying 2,564 protein groups in a single run. We then applied PLATO to rat villus and human breast cancer samples, achieving a spatial resolution of 25 μm and uncovering proteomic dynamics associated with disease states. This approach revealed spatially distinct tumor subtypes, identified key dysregulated proteins, and provided novel insights into the complexity of the tumor microenvironment. We believe that PLATO represents a transformative platform for exploring spatial proteomic regulation and its interplay with genetic and environmental factors.
Collapse
Affiliation(s)
- Beiyu Hu
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Ruiqiao He
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Kun Pang
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guibin Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Ning Wang
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Wenzhuo Zhu
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Xin Sui
- Key Laboratory of Carcinogenesis and Translational Research, Department of Radiation Oncology, Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Huajing Teng
- Key Laboratory of Carcinogenesis and Translational Research, Department of Radiation Oncology, Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Tianxin Liu
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Junjie Zhu
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Zewen Jiang
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Jinyang Zhang
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhenqiang Zuo
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Weihu Wang
- Key Laboratory of Carcinogenesis and Translational Research, Department of Radiation Oncology, Peking University Cancer Hospital and Institute, Beijing 100142, China
| | - Peifeng Ji
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China.
| | - Fangqing Zhao
- Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
22
|
Haviv D, Remšík J, Gatie M, Snopkowski C, Takizawa M, Pereira N, Bashkin J, Jovanovich S, Nawy T, Chaligne R, Boire A, Hadjantonakis AK, Pe'er D. The covariance environment defines cellular niches for spatial inference. Nat Biotechnol 2025; 43:269-280. [PMID: 38565973 PMCID: PMC11445396 DOI: 10.1038/s41587-024-02193-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 02/28/2024] [Indexed: 04/04/2024]
Abstract
A key challenge of analyzing data from high-resolution spatial profiling technologies is to suitably represent the features of cellular neighborhoods or niches. Here we introduce the covariance environment (COVET), a representation that leverages the gene-gene covariate structure across cells in the niche to capture the multivariate nature of cellular interactions within it. We define a principled optimal transport-based distance metric between COVET niches that scales to millions of cells. Using COVET to encode spatial context, we developed environmental variational inference (ENVI), a conditional variational autoencoder that jointly embeds spatial and single-cell RNA sequencing data into a latent space. ENVI includes two decoders: one to impute gene expression across the spatial modality and a second to project spatial information onto single-cell data. ENVI can confer spatial context to genomics data from single dissociated cells and outperforms alternatives for imputing gene expression on diverse spatial datasets.
Collapse
Affiliation(s)
- Doron Haviv
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Ján Remšík
- Human Oncology & Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Mohamed Gatie
- Developmental Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Catherine Snopkowski
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Meril Takizawa
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | | | | | | | - Tal Nawy
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ronan Chaligne
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Adrienne Boire
- Human Oncology & Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Department of Neurology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Brain Tumor Center, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Anna-Katerina Hadjantonakis
- Developmental Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Dana Pe'er
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
- Howard Hughes Medical Institute, New York, NY, USA.
| |
Collapse
|
23
|
Zhang M, Parker J, An L, Liu Y, Sun X. Flexible analysis of spatial transcriptomics data (FAST): a deconvolution approach. BMC Bioinformatics 2025; 26:35. [PMID: 39891065 PMCID: PMC11786350 DOI: 10.1186/s12859-025-06054-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 01/16/2025] [Indexed: 02/03/2025] Open
Abstract
MOTIVATION Spatial transcriptomics is a state-of-art technique that allows researchers to study gene expression patterns in tissues over the spatial domain. As a result of technical limitations, the majority of spatial transcriptomics techniques provide bulk data for each sequencing spot. Consequently, in order to obtain high-resolution spatial transcriptomics data, performing deconvolution becomes essential. Most existing deconvolution methods rely on reference data (e.g., single-cell data), which may not be available in real applications. Current reference-free methods encounter limitations due to their dependence on distribution assumptions, reliance on marker genes, or the absence of leveraging histology and spatial information. Consequently, there is a critical need for the development of highly flexible, robust, and user-friendly reference-free deconvolution methods capable of unifying or leveraging case-specific information in the analysis of spatial transcriptomics data. RESULTS We propose a novel reference-free method based on regularized non-negative matrix factorization (NMF), named Flexible Analysis of Spatial Transcriptomics (FAST), that can effectively incorporate gene expression data, spatial, and histology information into a unified deconvolution framework. Compared to existing methods, FAST imposes fewer distribution assumptions, utilizes the spatial structure information of tissues, and encourages interpretable factorization results. These features enable greater flexibility and accuracy, making FAST an effective tool for deciphering the complex cell-type composition of tissues and advancing our understanding of various biological processes and diseases. Extensive simulation studies have shown that FAST outperforms other existing reference-free methods. In real data applications, FAST is able to uncover the underlying tissue structures and identify the corresponding marker genes.
Collapse
Affiliation(s)
- Meng Zhang
- Department of Mathematics, University of Arizona, 617 N. Santa Rita Ave., Tucson, AZ, 85721, USA
| | - Joel Parker
- Department of Epidemiology and Biostatistics, University of Arizona, 1295 N. Martin Ave., Tucson, AZ, 85721, USA
| | - Lingling An
- Department of Agricultural and Biosystems Engineering, University of Arizona, 1177 East Fourth Street, Tucson, AZ, 85721, USA
| | - Yiwen Liu
- Department of Epidemiology and Biostatistics, University of Arizona, 1295 N. Martin Ave., Tucson, AZ, 85721, USA.
| | - Xiaoxiao Sun
- Department of Epidemiology and Biostatistics, University of Arizona, 1295 N. Martin Ave., Tucson, AZ, 85721, USA.
| |
Collapse
|
24
|
Cui T, Li YY, Li BL, Zhang H, Yu TT, Zhang JN, Qian FC, Yin MX, Fang QL, Hu ZH, Yan YX, Wang QY, Li CQ, Shang DS. SpatialRef: a reference of spatial omics with known spot annotation. Nucleic Acids Res 2025; 53:D1215-D1223. [PMID: 39417483 PMCID: PMC11701618 DOI: 10.1093/nar/gkae892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 09/19/2024] [Accepted: 09/26/2024] [Indexed: 10/19/2024] Open
Abstract
Spatial omics technologies have enabled the creation of intricate spatial maps that capture molecular features and tissue morphology, providing valuable insights into the spatial associations and functional organization of tissues. Accurate annotation of spot or domain types is essential for downstream spatial omics analyses, but this remains challenging. Therefore, this study aimed to develop a manually curated spatial omics database (SpatialRef, https://bio.liclab.net/spatialref/), to provide comprehensive and high-quality spatial omics data with known spot labels across multiple species. The current version of SpatialRef aggregates >9 million manually annotated spots across 17 Human, Mouse and Drosophila tissue types through extensive review and strict quality control, covering multiple spatial sequencing technologies and >400 spot/domain types from original studies. Furthermore, SpatialRef supports various spatial omics analyses about known spot types, including differentially expressed genes, spatially variable genes, Gene Ontology (GO)/KEGG annotation, spatial communication and spatial trajectories. With a user-friendly interface, SpatialRef facilitates querying, browsing and visualizing, thereby aiding in elucidating the functional relevance of spatial domains within the tissue and uncovering potential biological effects.
Collapse
Affiliation(s)
- Ting Cui
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Yan-Yu Li
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Bing-Long Li
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Han Zhang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Ting-Ting Yu
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Jia-Ning Zhang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Feng-Cui Qian
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Ming-Xue Yin
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Qiao-Li Fang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Zi-Hao Hu
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Yu-Xiang Yan
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Qiu-Yu Wang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Chun-Quan Li
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Key Laboratory of Rare Pediatric Diseases, Ministry of Education, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - De-Si Shang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Insititute of Biochemistry and Molecular Biology, Hengyang Medical College, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| |
Collapse
|
25
|
Sun F, Li H, Sun D, Fu S, Gu L, Shao X, Wang Q, Dong X, Duan B, Xing F, Wu J, Xiao M, Zhao F, Han JDJ, Liu Q, Fan X, Li C, Wang C, Shi T. Single-cell omics: experimental workflow, data analyses and applications. SCIENCE CHINA. LIFE SCIENCES 2025; 68:5-102. [PMID: 39060615 DOI: 10.1007/s11427-023-2561-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 04/18/2024] [Indexed: 07/28/2024]
Abstract
Cells are the fundamental units of biological systems and exhibit unique development trajectories and molecular features. Our exploration of how the genomes orchestrate the formation and maintenance of each cell, and control the cellular phenotypes of various organismsis, is both captivating and intricate. Since the inception of the first single-cell RNA technology, technologies related to single-cell sequencing have experienced rapid advancements in recent years. These technologies have expanded horizontally to include single-cell genome, epigenome, proteome, and metabolome, while vertically, they have progressed to integrate multiple omics data and incorporate additional information such as spatial scRNA-seq and CRISPR screening. Single-cell omics represent a groundbreaking advancement in the biomedical field, offering profound insights into the understanding of complex diseases, including cancers. Here, we comprehensively summarize recent advances in single-cell omics technologies, with a specific focus on the methodology section. This overview aims to guide researchers in selecting appropriate methods for single-cell sequencing and related data analysis.
Collapse
Affiliation(s)
- Fengying Sun
- Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China
| | - Haoyan Li
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Dongqing Sun
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Shaliu Fu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
| | - Lei Gu
- Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Xin Shao
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
- National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, 314103, China
| | - Qinqin Wang
- Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Xin Dong
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Bin Duan
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
| | - Feiyang Xing
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Jun Wu
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241, China
| | - Minmin Xiao
- Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China.
| | - Fangqing Zhao
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, 100101, China.
| | - Jing-Dong J Han
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China.
| | - Qi Liu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China.
- Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China.
- Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China.
- Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China.
| | - Xiaohui Fan
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.
- National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, 314103, China.
- Zhejiang Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China.
| | - Chen Li
- Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
| | - Chenfei Wang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China.
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China.
| | - Tieliu Shi
- Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China.
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241, China.
- Key Laboratory of Advanced Theory and Application in Statistics and Data Science-MOE, School of Statistics, East China Normal University, Shanghai, 200062, China.
| |
Collapse
|
26
|
Yu Z, Yang Y, Chen X, Wong K, Zhang Z, Zhao Y, Li X. Accurate Spatial Heterogeneity Dissection and Gene Regulation Interpretation for Spatial Transcriptomics using Dual Graph Contrastive Learning. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2410081. [PMID: 39605202 PMCID: PMC11744562 DOI: 10.1002/advs.202410081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 10/31/2024] [Indexed: 11/29/2024]
Abstract
Recent advances in spatial transcriptomics have enabled simultaneous preservation of high-throughput gene expression profiles and the spatial context, enabling high-resolution exploration of distinct regional characterization in tissue. To effectively understand the underlying biological mechanisms within tissue microenvironments, there is a requisite for methods that can accurately capture external spatial heterogeneity and interpret internal gene regulation from spatial transcriptomics data. However, current methods for region identification often lack the simultaneous characterizing of spatial structure and gene regulation, thereby limiting the ability of spatial dissection and gene interpretation. Here, stDCL is developed, a dual graph contrastive learning method to identify spatial domains and interpret gene regulation in spatial transcriptomics data. stDCL adaptively incorporates gene expression data and spatial information via a graph embedding autoencoder, thereby preserving critical information within the latent embedding representations. In addition, dual graph contrastive learning is proposed to train the model, ensuring that the latent embedding representation closely resembles the actual spatial distribution and exhibits cluster similarity. Benchmarking stDCL against other state-of-the-art clustering methods using complex cortex datasets demonstrates its superior accuracy and effectiveness in identifying spatial domains. Our analysis of the imputation matrices generated by stDCL reveals its capability to reconstruct spatial hierarchical structures and refine differential expression assessment. Furthermore, it is demonstrated that the versatility of stDCL in interpretability of gene regulation, spatial heterogeneity at high resolution, and embryonic developmental patterns. In addition, it is also showed that stDCL can successfully annotate disease-associated astrocyte subtypes in Alzheimer's disease and unravel multiple relevant pathways and regulatory mechanisms.
Collapse
Affiliation(s)
- Zhuohan Yu
- School of Artificial IntelligenceJilin UniversityJilin130012China
| | - Yuning Yang
- Terrence Donnelly Centre for Cellular and Biomolecular ResearchUniversity of TorontoTorontoONM5S 3E1Canada
| | - Xingjian Chen
- Cutaneous Biology Research Center, Massachusetts General HospitalHarvard Medical SchoolBostonMA02115USA
| | - Ka‐Chun Wong
- Department of Computer ScienceCity University of Hong KongHong KongSAR999077Hong Kong
| | - Zhaolei Zhang
- Terrence Donnelly Centre for Cellular and Biomolecular ResearchUniversity of TorontoTorontoONM5S 3E1Canada
| | - Yuming Zhao
- College of Computer and Control EngineeringNortheast Forestry UniversityHarbin150040China
| | - Xiangtao Li
- School of Artificial IntelligenceJilin UniversityJilin130012China
- Department of Computer ScienceCity University of Hong KongHong KongSAR999077Hong Kong
| |
Collapse
|
27
|
Defard T, Desrentes A, Fouillade C, Mueller F. Homebuilt Imaging-Based Spatial Transcriptomics: Tertiary Lymphoid Structures as a Case Example. Methods Mol Biol 2025; 2864:77-105. [PMID: 39527218 DOI: 10.1007/978-1-0716-4184-2_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2024]
Abstract
Spatial transcriptomics methods provide insight into the cellular heterogeneity and spatial architecture of complex, multicellular systems. Combining molecular and spatial information provides important clues to study tissue architecture in development and disease. Here, we present a comprehensive do-it-yourself (DIY) guide to perform such experiments at reduced costs leveraging open-source approaches. This guide spans the entire life cycle of a project, from its initial definition to experimental choices, wet lab approaches, instrumentation, and analysis. As a concrete example, we focus on tertiary lymphoid structures (TLS), which we use to develop typical questions that can be addressed by these approaches.
Collapse
Affiliation(s)
- Thomas Defard
- Institut Pasteur, Université Paris Cité, Photonic Bio-Imaging, Centre de Ressources et Recherches Technologiques (UTechS-PBI, C2RT), Paris, France
- Institut Pasteur, Université Paris Cité, Imaging and Modeling Unit, Paris, France
- Centre for Computational Biology (CBIO), Mines Paris, PSL University, Paris, France
- Institut Curie, PSL University, Paris, France
- INSERM, U900, Paris, France
| | - Auxence Desrentes
- UMRS1135 Sorbonne University, Paris, France
- INSERM U1135, Paris, France
- Team "Immune Microenvironment and Immunotherapy", Centre for Immunology and Microbial Infections (CIMI), Paris, France
| | - Charles Fouillade
- Institut Curie, Inserm U1021-CNRS UMR 3347, University Paris-Saclay, PSL Research University, Centre Universitaire, Orsay, France
| | - Florian Mueller
- Institut Pasteur, Université Paris Cité, Photonic Bio-Imaging, Centre de Ressources et Recherches Technologiques (UTechS-PBI, C2RT), Paris, France.
- Institut Pasteur, Université Paris Cité, Imaging and Modeling Unit, Paris, France.
| |
Collapse
|
28
|
Zhang F, Shen Z, Huang S, Zhu Y, Yi M. SpaInGNN: Enhanced clustering and integration of spatial transcriptomics based on refined graph neural networks. Methods 2025; 233:42-51. [PMID: 39542070 DOI: 10.1016/j.ymeth.2024.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2024] [Revised: 10/22/2024] [Accepted: 11/07/2024] [Indexed: 11/17/2024] Open
Abstract
Recent developments in spatial transcriptomics (ST) technology have markedly enhanced the proposed capacity to comprehensively characterize gene expression patterns within tissue microenvironments while crucially preserving spatial context. However, the identification of spatial domains at the single-cell level remains a significant challenge in elucidating biological processes. To address this, SpaInGNN was developed, a sophisticated graph neural network (GNN) framework that accurately delineates spatial domains by integrating spatial location data, histological information, and gene expression profiles into low-dimensional latent embeddings. Additionally, to fully leverage spatial coordinate data, spatial integration using graph neural network (SpaInGNN) refines the graph constructed for spatial locations by incorporating both tissue image distance and Euclidean distance, following a pre-clustering of gene expression profiles. This refined graph is then embedded using a self-supervised GNN, which minimizes self-reconfiguration loss. By applying SpaInGNN to refined graphs across multiple consecutive tissue slices, this study mitigates the impact of batch effects in data analysis. The proposed method demonstrates substantial improvements in the accuracy of spatial domain recognition, providing a more faithful representation of the tissue organization in both mouse olfactory bulb and human lateral prefrontal cortex samples.
Collapse
Affiliation(s)
- Fangqin Zhang
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China
| | - Zhan Shen
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China
| | - Siyi Huang
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China
| | - Yuan Zhu
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China
| | - Ming Yi
- Shool of Mathematics and Physics, China University of Geosciences, Wuhan 430074, China.
| |
Collapse
|
29
|
Keller MS, Gold I, McCallum C, Manz T, Kharchenko PV, Gehlenborg N. Vitessce: integrative visualization of multimodal and spatially resolved single-cell data. Nat Methods 2025; 22:63-67. [PMID: 39333268 PMCID: PMC11725496 DOI: 10.1038/s41592-024-02436-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 08/16/2024] [Indexed: 09/29/2024]
Abstract
Multiomics technologies with single-cell and spatial resolution make it possible to measure thousands of features across millions of cells. However, visual analysis of high-dimensional transcriptomic, proteomic, genome-mapped and imaging data types simultaneously remains a challenge. Here we describe Vitessce, an interactive web-based visualization framework for exploration of multimodal and spatially resolved single-cell data. We demonstrate integrative visualization of millions of data points, including cell-type annotations, gene expression quantities, spatially resolved transcripts and cell segmentations, across multiple coordinated views. The open-source software is available at http://vitessce.io .
Collapse
Affiliation(s)
- Mark S Keller
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Ilan Gold
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Chuck McCallum
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Trevor Manz
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Peter V Kharchenko
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Altos Labs, San Diego, CA, USA
| | - Nils Gehlenborg
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
30
|
Hong F. Programmable DNA Reactions for Advanced Fluorescence Microscopy in Bioimaging. SMALL METHODS 2024:e2401279. [PMID: 39679773 DOI: 10.1002/smtd.202401279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 11/14/2024] [Indexed: 12/17/2024]
Abstract
Biological organisms are composed of billions of molecules organized across various length scales. Direct visualization of these biomolecules in situ enables the retrieval of vast molecular information, including their location, species, and quantities, which is essential for understanding biological processes. The programmability of DNA interactions has made DNA-based reactions a major driving force in extending the limits of fluorescence microscopy, allowing for the study of biological complexity at different scales. This review article provides an overview of recent technological advancements in DNA-based fluorescence microscopy, highlighting how these innovations have expanded the technique's capabilities in terms of target multiplexity, signal amplification, super-resolution, and mechanical properties. These advanced DNA-based fluorescence microscopy techniques have been widely used to uncover new biological insights at the molecular level.
Collapse
Affiliation(s)
- Fan Hong
- Department of Chemistry, University of Florida, Gainesville, FL, 32611, USA
- J. Crayton Pruitt Family Department of Biomedical Engineering, University of Florida, Gainesville, FL, 32611, USA
| |
Collapse
|
31
|
Liu T, Fang ZY, Zhang Z, Yu Y, Li M, Yin MZ. A comprehensive overview of graph neural network-based approaches to clustering for spatial transcriptomics. Comput Struct Biotechnol J 2024; 23:106-128. [PMID: 38089467 PMCID: PMC10714345 DOI: 10.1016/j.csbj.2023.11.055] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 11/24/2023] [Accepted: 11/27/2023] [Indexed: 10/16/2024] Open
Abstract
Spatial transcriptomics technologies enable researchers to accurately quantify and localize messenger ribonucleic acid (mRNA) transcripts at a high resolution while preserving their spatial context. The identification of spatial domains, or the task of spatial clustering, plays a crucial role in investigating data on spatial transcriptomes. One promising approach for classifying spatial domains involves the use of graph neural networks (GNNs) by leveraging gene expressions, spatial locations, and histological images. This study provided a comprehensive overview of the most recent GNN-based methods of spatial clustering methods for the analysis of data on spatial transcriptomics. We extensively evaluated the performance of current methods on prevalent datasets of spatial transcriptomics by considering their accuracy of clustering, robustness, data stabilization, relevant requirements, computational efficiency, and memory use. To this end, we explored 60 clustering scenarios by extending the essential frameworks of spatial clustering for the selection of the GNNs, algorithms of downstream clustering, principal component analysis (PCA)-based reduction, and refined methods of correction. We comparatively analyzed the performance of the methods in terms of spatial clustering to identify their limitations and outline future directions of research in the field. Our survey yielded novel insights, and provided motivation for further investigating spatial transcriptomics.
Collapse
Affiliation(s)
- Teng Liu
- Clinical Research Center (CRC), Clinical Pathology Center (CPC), Cancer Early Detection and Treatment Center (CEDTC) and Translational Medicine Research Center (TMRC), Chongqing University Three Gorges Hospital, Chongqing University, Wanzhou, Chongqing, China
- Chongqing Technical Innovation Center for Quality Evaluation and Identification of Authentic Medicinal Herbs, Chongqing, China
| | - Zhao-Yu Fang
- School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Zongbo Zhang
- Clinical Research Center (CRC), Clinical Pathology Center (CPC), Cancer Early Detection and Treatment Center (CEDTC) and Translational Medicine Research Center (TMRC), Chongqing University Three Gorges Hospital, Chongqing University, Wanzhou, Chongqing, China
| | - Yongxiang Yu
- Clinical Research Center (CRC), Clinical Pathology Center (CPC), Cancer Early Detection and Treatment Center (CEDTC) and Translational Medicine Research Center (TMRC), Chongqing University Three Gorges Hospital, Chongqing University, Wanzhou, Chongqing, China
| | - Min Li
- School of Computer Science and Engineering, Central South University, Changsha 410083, China
- Hunan Provincial Engineering Research Center of Intelligent Computing in Biology and Medicine, Central South University, Changsha 410083, China
| | - Ming-Zhu Yin
- Clinical Research Center (CRC), Clinical Pathology Center (CPC), Cancer Early Detection and Treatment Center (CEDTC) and Translational Medicine Research Center (TMRC), Chongqing University Three Gorges Hospital, Chongqing University, Wanzhou, Chongqing, China
- Chongqing Technical Innovation Center for Quality Evaluation and Identification of Authentic Medicinal Herbs, Chongqing, China
| |
Collapse
|
32
|
Nie W, Yu Y, Wang X, Wang R, Li SC. Spatially Informed Graph Structure Learning Extracts Insights from Spatial Transcriptomics. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2403572. [PMID: 39382177 PMCID: PMC11615819 DOI: 10.1002/advs.202403572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 08/04/2024] [Indexed: 10/10/2024]
Abstract
Embeddings derived from cell graphs hold significant potential for exploring spatial transcriptomics (ST) datasets. Nevertheless, existing methodologies rely on a graph structure defined by spatial proximity, which inadequately represents the diversity inherent in cell-cell interactions (CCIs). This study introduces STAGUE, an innovative framework that concurrently learns a cell graph structure and a low-dimensional embedding from ST data. STAGUE employs graph structure learning to parameterize and refine a cell graph adjacency matrix, enabling the generation of learnable graph views for effective contrastive learning. The derived embeddings and cell graph improve spatial clustering accuracy and facilitate the discovery of novel CCIs. Experimental benchmarks across 86 real and simulated ST datasets show that STAGUE outperforms 15 comparison methods in clustering performance. Additionally, STAGUE delineates the heterogeneity in human breast cancer tissues, revealing the activation of epithelial-to-mesenchymal transition and PI3K/AKT signaling in specific sub-regions. Furthermore, STAGUE identifies CCIs with greater alignment to established biological knowledge than those ascertained by existing graph autoencoder-based methods. STAGUE also reveals the regulatory genes that participate in these CCIs, including those enriched in neuropeptide signaling and receptor tyrosine kinase signaling pathways, thereby providing insights into the underlying biological processes.
Collapse
Affiliation(s)
- Wan Nie
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
| | - Yingying Yu
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
| | - Xueying Wang
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
- City University of Hong Kong (Dongguan)Dongguan523000China
| | - Ruohan Wang
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
| | - Shuai Cheng Li
- Department of Computer ScienceCity University of Hong KongHong Kong SARChina
| |
Collapse
|
33
|
Kuemmerle LB, Luecken MD, Firsova AB, Barros de Andrade E Sousa L, Straßer L, Mekki II, Campi F, Heumos L, Shulman M, Beliaeva V, Hediyeh-Zadeh S, Schaar AC, Mahbubani KT, Sountoulidis A, Balassa T, Kovacs F, Horvath P, Piraud M, Ertürk A, Samakovlis C, Theis FJ. Probe set selection for targeted spatial transcriptomics. Nat Methods 2024; 21:2260-2270. [PMID: 39558096 PMCID: PMC11621025 DOI: 10.1038/s41592-024-02496-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 09/30/2024] [Indexed: 11/20/2024]
Abstract
Targeted spatial transcriptomic methods capture the topology of cell types and states in tissues at single-cell and subcellular resolution by measuring the expression of a predefined set of genes. The selection of an optimal set of probed genes is crucial for capturing the spatial signals present in a tissue. This requires selecting the most informative, yet minimal, set of genes to profile (gene set selection) for which it is possible to build probes (probe design). However, current selections often rely on marker genes, precluding them from detecting continuous spatial signals or new states. We present Spapros, an end-to-end probe set selection pipeline that optimizes both gene set specificity for cell type identification and within-cell type expression variation to resolve spatially distinct populations while considering prior knowledge as well as probe design and expression constraints. We evaluated Spapros and show that it outperforms other selection approaches in both cell type recovery and recovering expression variation beyond cell types. Furthermore, we used Spapros to design a single-cell resolution in situ hybridization on tissues (SCRINSHOT) experiment of adult lung tissue to demonstrate how probes selected with Spapros identify cell types of interest and detect spatial variation even within cell types.
Collapse
Affiliation(s)
- Louis B Kuemmerle
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- Institute for Tissue Engineering and Regenerative Medicine, Helmholtz Zentrum München, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Malte D Luecken
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- Institute of Lung Health & Immunity, Helmholtz Munich, Member of the German Center for Lung Research (DZL), Munich, Germany
- German Center for Lung Research (DZL), Gießen, Germany
| | - Alexandra B Firsova
- SciLifeLab and Department of Molecular Biosciences, Stockholm University, Stockholm, Sweden
| | | | - Lena Straßer
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
| | | | - Francesco Campi
- Helmholtz AI, Helmholtz Zentrum München, Neuherberg, Germany
| | - Lukas Heumos
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
- Institute of Lung Biology and Disease and Comprehensive Pneumology Center, Helmholtz Zentrum München, German Center for Lung Research (DZL), Munich, Germany
| | - Maiia Shulman
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
| | - Valentina Beliaeva
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
| | - Soroor Hediyeh-Zadeh
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Anna C Schaar
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany
- TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
- Munich Center for Machine Learning, Technical University of Munich, Munich, Germany
| | - Krishnaa T Mahbubani
- Department of Surgery, University of Cambridge and Cambridge NIHR Biomedical Research Centre, Cambridge, UK
| | | | - Tamás Balassa
- Synthetic and Systems Biology Unit, Biological Research Centre, Eötvös Loránd Research Network, Szeged, Hungary
| | | | - Peter Horvath
- Synthetic and Systems Biology Unit, Biological Research Centre, Eötvös Loránd Research Network, Szeged, Hungary
- Institute of AI for Health, Helmholtz Zentrum München, Neuherberg, Germany
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Marie Piraud
- Helmholtz AI, Helmholtz Zentrum München, Neuherberg, Germany
| | - Ali Ertürk
- Institute for Tissue Engineering and Regenerative Medicine, Helmholtz Zentrum München, Neuherberg, Germany
- Institute for Stroke and Dementia Research, Klinikum der Universität München, Ludwig-Maximilians University Munich, Munich, Germany
- Munich Cluster for Systems Neurology (SyNergy), Munich, Germany
- School of Medicine, Koç University, İstanbul, Turkey
| | - Christos Samakovlis
- SciLifeLab and Department of Molecular Biosciences, Stockholm University, Stockholm, Sweden
- Cardiopulmonary Institute, Justus Liebig University, Giessen, Germany
| | - Fabian J Theis
- Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany.
- School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany.
- School of Computation, Information and Technology, Technical University of Munich, Munich, Germany.
| |
Collapse
|
34
|
Liu Y, Yang C. Computational methods for alignment and integration of spatially resolved transcriptomics data. Comput Struct Biotechnol J 2024; 23:1094-1105. [PMID: 38495555 PMCID: PMC10940867 DOI: 10.1016/j.csbj.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Revised: 03/02/2024] [Accepted: 03/04/2024] [Indexed: 03/19/2024] Open
Abstract
Most of the complex biological regulatory activities occur in three dimensions (3D). To better analyze biological processes, it is essential not only to decipher the molecular information of numerous cells but also to understand how their spatial contexts influence their behavior. With the development of spatially resolved transcriptomics (SRT) technologies, SRT datasets are being generated to simultaneously characterize gene expression and spatial arrangement information within tissues, organs or organisms. To fully leverage spatial information, the focus extends beyond individual two-dimensional (2D) slices. Two tasks known as slices alignment and data integration have been introduced to establish correlations between multiple slices, enhancing the effectiveness of downstream tasks. Currently, numerous related methods have been developed. In this review, we first elucidate the details and principles behind several representative methods. Then we report the testing results of these methods on various SRT datasets, and assess their performance in representative downstream tasks. Insights into the strengths and weaknesses of each method and the reasons behind their performance are discussed. Finally, we provide an outlook on future developments. The codes and details of experiments are now publicly available at https://github.com/YangLabHKUST/SRT_alignment_and_integration.
Collapse
Affiliation(s)
- Yuyao Liu
- Department of Automation, School of Information Science and Technology, Tsinghua University, Beijing, China
| | - Can Yang
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong, China
| |
Collapse
|
35
|
Zhang D, Yu N, Sun X, Li H, Zhang W, Qiao X, Zhang W, Gao R. Deciphering spatial domains from spatially resolved transcriptomics through spatially regularized deep graph networks. BMC Genomics 2024; 25:1160. [PMID: 39614161 DOI: 10.1186/s12864-024-11072-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Accepted: 11/21/2024] [Indexed: 12/01/2024] Open
Abstract
BACKGROUND Recent advancements in spatially resolved transcriptomics (SRT) have opened up unprecedented opportunities to explore gene expression patterns within spatial contexts. Deciphering spatial domains is a critical task in spatial transcriptomic data analysis, aiding in the elucidation of tissue structural heterogeneity and biological functions. However, existing spatial domain detection methods ignore the consistency of expression patterns and spatial arrangements between spots, as well as the severe gene dropout phenomenon present in SRT data, resulting in suboptimal performance in identifying tissue spatial heterogeneity. RESULTS In this paper, we introduce a novel framework, spatially regularized deep graph networks (SR-DGN), which integrates gene expression profiles with spatial information to learn spatially-consistent and informative spot representations. Specifically, SR-DGN employs graph attention networks (GAT) to adaptively aggregate gene expression information from neighboring spots, considering local expression patterns between spots. In addition, the spatial regularization constraint ensures the consistency of neighborhood relationships between physical and embedded spaces in an end-to-end manner. SR-DGN also employs cross-entropy (CE) loss to model gene expression states, effectively mitigating the impact of noisy gene dropouts. CONCLUSIONS Experimental results demonstrate that SR-DGN outperforms state-of-the-art methods in spatial domain identification across SRT data from different sequencing platforms. Moreover, SR-DGN is capable of recovering known microanatomical structures, yielding clearer low-dimensional visualizations and more accurate spatial trajectory inferences.
Collapse
Affiliation(s)
- Daoliang Zhang
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Na Yu
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Xue Sun
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Haoyang Li
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Wenjing Zhang
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China
| | - Xu Qiao
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China.
| | - Wei Zhang
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China.
| | - Rui Gao
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong, 250061, China.
| |
Collapse
|
36
|
Li J, Xiang S, Wei D. Deciphering progressive lesion areas in breast cancer spatial transcriptomics via TGR-NMF. Brief Bioinform 2024; 26:bbae707. [PMID: 39780487 PMCID: PMC11711100 DOI: 10.1093/bib/bbae707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Revised: 11/04/2024] [Accepted: 12/26/2024] [Indexed: 01/11/2025] Open
Abstract
Identifying spatial domains is critical for understanding breast cancer tissue heterogeneity and providing insights into tumor progression. However, dropout events introduces computational challenges and the lack of transparency in methods such as graph neural networks limits their interpretability. This study aimed to decipher disease progression-related spatial domains in breast cancer spatial transcriptomics by developing the three graph regularized non-negative matrix factorization (TGR-NMF). A unitization strategy was proposed to mitigate the impact of dropout events on the computational process, enabling utilization of the complete gene expression count data. By integrating one gene expression neighbor topology and two spatial position neighbor topologies, TGR-NMF was developed for constructing an interpretable low-dimensional representation of spatial transcriptomic data. The progressive lesion area that can reveal the progression of breast cancer was uncovered through heterogeneity analysis. Moreover, several related pathogenic genes and signal pathways on this area were identified by using gene enrichment and cell communication analysis.
Collapse
Affiliation(s)
- Juntao Li
- School of Mathematics and Statistics, Henan Normal University, 46 Jianshe East Road, 453007 Xinxiang, China
| | - Shan Xiang
- School of Mathematics and Statistics, Henan Normal University, 46 Jianshe East Road, 453007 Xinxiang, China
| | - Dongqing Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, 200240 Shanghai, China
| |
Collapse
|
37
|
Mo Y, Liu J, Zhang L. Deconvolution of spatial transcriptomics data via graph contrastive learning and partial least square regression. Brief Bioinform 2024; 26:bbaf052. [PMID: 39924717 PMCID: PMC11807730 DOI: 10.1093/bib/bbaf052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Revised: 12/19/2024] [Accepted: 01/24/2025] [Indexed: 02/11/2025] Open
Abstract
Deciphering the cellular abundance in spatial transcriptomics (ST) is crucial for revealing the spatial architecture of cellular heterogeneity within tissues. However, some of the current spatial sequencing technologies are in low resolutions, leading to each spot having multiple heterogeneous cells. Additionally, current spatial deconvolution methods lack the ability to utilize multi-modality information such as gene expression and chromatin accessibility from single-cell multi-omics data. In this study, we introduce a graph Contrastive Learning and Partial Least Squares regression-based method, CLPLS, to deconvolute ST data. CLPLS is a flexible method that it can be extended to integrate ST data and single-cell multi-omics data, enabling the exploration of the spatially epigenomic heterogeneity. We applied CLPLS to both simulated and real datasets coming from different platforms. Benchmark analyses with other methods on these datasets show the superior performance of CLPLS in deconvoluting spots in single cell level.
Collapse
Affiliation(s)
- Yuanyuan Mo
- School of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan 430072, China
| | - Juan Liu
- School of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan 430072, China
| | - Lihua Zhang
- School of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan 430072, China
| |
Collapse
|
38
|
Peng W, Zhang Z, Dai W, Ping Z, Fu X, Liu L, Liu L, Yu N. MVCLST: A spatial transcriptome data analysis pipeline for cell type classification based on multi-view comparative learning. Methods 2024; 232:115-128. [PMID: 39542071 DOI: 10.1016/j.ymeth.2024.11.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 11/03/2024] [Accepted: 11/04/2024] [Indexed: 11/17/2024] Open
Abstract
Recent advancements in spatial transcriptomics sequencing technologies can not only provide gene expression within individual cells or cell clusters (spots) in a tissue but also pinpoint the exact location of this expression and generate detailed images of stained tissue sections, which offers invaluable insights into cell type identification and cell function exploration. However, effectively integratingthegene expression data, spatial location information, and tissue images from spatial transcriptomics data presents a significant challenge for computational methodsin cell classification. In this work, we propose MVCLST, a multi-view comparative learningmethod to analyze spatial transcriptomicsdata for accurate cell type classification. MVCLSTconstructs two views based on gene expression profiles, cell coordinates and image features. The multi-view method we proposed can significantly enhance the effectiveness of feature extraction while avoiding the impact of erroneous information in organizing image or gene expression data. The model employs four separate encoders to capture shared and unique features within each view. To ensure consistency and facilitate information exchange between the two views, MVCLST incorporates a contrastive learning loss function. The extracted shared and private features from both views are fused using corresponding decoders. Finally, the model utilizes the Leiden algorithm to clusterthe learned featuresfor cell type identification. Additionally, we establish a framework called MVCLST-CCFS for spatial transcriptomicsdata analysis based on MVCLST and consistent clustering. Our method achieves excellent results in clustering on human dorsolateral prefrontal cortex data and the mouse brain tissue data. Italso outperforms state-of-the-art techniques in the subsequent search for highly variable genes across cell types on the mouse olfactory bulbdata.
Collapse
Affiliation(s)
- Wei Peng
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China.
| | - Zhihao Zhang
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China
| | - Wei Dai
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China.
| | - Zhihao Ping
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China
| | - Xiaodong Fu
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China.
| | - Li Liu
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China
| | - Lijun Liu
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650050, PR China; Computer Technology Application Key Lab of Yunnan Province, Kunming University of Science and Technology, Kunming 650050, PR China
| | - Ning Yu
- State University of New York, The College at Brockport, Department of Computing Sciences, 350 New Campus Drive, Brockport, NY 14422, United States.
| |
Collapse
|
39
|
Yang J, Wang L, Liu L, Zheng X. GraphPCA: a fast and interpretable dimension reduction algorithm for spatial transcriptomics data. Genome Biol 2024; 25:287. [PMID: 39511664 PMCID: PMC11545739 DOI: 10.1186/s13059-024-03429-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 10/29/2024] [Indexed: 11/15/2024] Open
Abstract
The rapid advancement of spatial transcriptomics technologies has revolutionized our understanding of cell heterogeneity and intricate spatial structures within tissues and organs. However, the high dimensionality and noise in spatial transcriptomic data present significant challenges for downstream data analyses. Here, we develop GraphPCA, an interpretable and quasi-linear dimension reduction algorithm that leverages the strengths of graphical regularization and principal component analysis. Comprehensive evaluations on simulated and multi-resolution spatial transcriptomic datasets generated from various platforms demonstrate the capacity of GraphPCA to enhance downstream analysis tasks including spatial domain detection, denoising, and trajectory inference compared to other state-of-the-art methods.
Collapse
Affiliation(s)
- Jiyuan Yang
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Lu Wang
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- The Guangxi Key Laboratory of Intelligent Precision Medicine, Guangxi Zhuang Autonomous Region, Nanning, China
| | - Lin Liu
- Institute of Natural Sciences, MOE-LSC, School of Mathematical Sciences, CMA-Shanghai, SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University and Shanghai Artificial Intelligence Laboratory, Shanghai, China
| | - Xiaoqi Zheng
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| |
Collapse
|
40
|
Bartels T, Rowitch DH, Bayraktar OA. Generation of Mammalian Astrocyte Functional Heterogeneity. Cold Spring Harb Perspect Biol 2024; 16:a041351. [PMID: 38692833 PMCID: PMC11529848 DOI: 10.1101/cshperspect.a041351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2024]
Abstract
Mammalian astrocytes have regional roles within the brain parenchyma. Indeed, the notion that astrocytes are molecularly heterogeneous could help explain how the central nervous system (CNS) retains embryonic positional information through development into specialized regions into adulthood. A growing body of evidence supports the concept of morphological and molecular differences between astrocytes in different brain regions, which might relate to their derivation from regionally patterned radial glia and/or local neuron inductive cues. Here, we review evidence for regionally encoded functions of astrocytes to provide an integrated concept on lineage origins and heterogeneity to understand regional brain organization, as well as emerging technologies to identify and further investigate novel roles for astrocytes.
Collapse
Affiliation(s)
- Theresa Bartels
- Department of Paediatrics and Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge CB2 0AW, United Kingdom
| | - David H Rowitch
- Department of Paediatrics and Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge CB2 0AW, United Kingdom
| | - Omer Ali Bayraktar
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
| |
Collapse
|
41
|
Han S, Xu Q, Du Y, Tang C, Cui H, Xia X, Zheng R, Sun Y, Shang H. Single-cell spatial transcriptomics in cardiovascular development, disease, and medicine. Genes Dis 2024; 11:101163. [PMID: 39224111 PMCID: PMC11367031 DOI: 10.1016/j.gendis.2023.101163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 10/17/2023] [Accepted: 10/29/2023] [Indexed: 09/04/2024] Open
Abstract
Cardiovascular diseases (CVDs) impose a significant burden worldwide. Despite the elucidation of the etiology and underlying molecular mechanisms of CVDs by numerous studies and recent discovery of effective drugs, their morbidity, disability, and mortality are still high. Therefore, precise risk stratification and effective targeted therapies for CVDs are warranted. Recent improvements in single-cell RNA sequencing and spatial transcriptomics have improved our understanding of the mechanisms and cells involved in cardiovascular phylogeny and CVDs. Single-cell RNA sequencing can facilitate the study of the human heart at remarkably high resolution and cellular and molecular heterogeneity. However, this technique does not provide spatial information, which is essential for understanding homeostasis and disease. Spatial transcriptomics can elucidate intracellular interactions, transcription factor distribution, cell spatial localization, and molecular profiles of mRNA and identify cell populations causing the disease and their underlying mechanisms, including cell crosstalk. Herein, we introduce the main methods of RNA-seq and spatial transcriptomics analysis and highlight the latest advances in cardiovascular research. We conclude that single-cell RNA sequencing interprets disease progression in multiple dimensions, levels, perspectives, and dynamics by combining spatial and temporal characterization of the clinical phenome with multidisciplinary techniques such as spatial transcriptomics. This aligns with the dynamic evolution of CVDs (e.g., "angina-myocardial infarction-heart failure" in coronary artery disease). The study of pathways for disease onset and mechanisms (e.g., age, sex, comorbidities) in different patient subgroups should improve disease diagnosis and risk stratification. This can facilitate precise individualized treatment of CVDs.
Collapse
Affiliation(s)
- Songjie Han
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Qianqian Xu
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Yawen Du
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Chuwei Tang
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Herong Cui
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
- School of Life Sciences, Beijing University of Chinese Medicine, Beijing 102488, China
| | - Xiaofeng Xia
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Rui Zheng
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Yang Sun
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| | - Hongcai Shang
- Key Laboratory of Chinese Internal Medicine of Ministry of Education, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing 100700, China
| |
Collapse
|
42
|
Xiu YH, Sun SL, Zhou BW, Wan Y, Tang H, Long HX. DGSIST: Clustering spatial transcriptome data based on deep graph structure Infomax. Methods 2024; 231:226-236. [PMID: 39413889 DOI: 10.1016/j.ymeth.2024.10.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Revised: 09/26/2024] [Accepted: 10/04/2024] [Indexed: 10/18/2024] Open
Abstract
Although spatial transcriptomics data provide valuable insights into gene expression profiles and the spatial structure of tissues, most studies rely solely on gene expression information, underutilizing the spatial data. To fully leverage the potential of spatial transcriptomics and graph neural networks, the DGSI (Deep Graph Structure Infomax) model is proposed. This innovative graph data processing model uses graph convolutional neural networks and employs an unsupervised learning approach. It maximizes the mutual information between graph-level and node-level representations, emphasizing flexible sampling and aggregation of nodes and their neighbors. This effectively captures and incorporates local information from nodes into the overall graph structure. Additionally, this paper developed the DGSIST framework, an unsupervised cell clustering method that integrates the DGSI model, SVD dimensionality reduction algorithm, and k-means++ clustering algorithm. This aims to identify cell types accurately. DGSIST fully uses spatial transcriptomics data and outperforms existing methods in accuracy. Demonstrations of DGSIST's capability across various tissue types and technological platforms have shown its effectiveness in accurately identifying spatial domains in multiple tissue sections. Compared to other spatial clustering methods, DGSIST excels in cell clustering and effectively eliminates batch effects without needing batch correction. DGSIST excels in spatial clustering analysis, spatial variation identification, and differential gene expression detection and directly applies to graph analysis tasks, such as node classification, link prediction, or graph clustering. Anticipation lies in the contribution of the DGSIST framework to a deeper understanding of the spatial organizational structures of diseases such as cancer.
Collapse
Affiliation(s)
- Yu-Han Xiu
- College of Information Science Technology, Hainan Normal University, HaiKou City 571158, China; Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, HaiKou City 571158, China
| | - Si-Lin Sun
- College of Information Science Technology, Hainan Normal University, HaiKou City 571158, China; Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, HaiKou City 571158, China
| | - Bing-Wei Zhou
- College of Information Science Technology, Hainan Normal University, HaiKou City 571158, China; Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, HaiKou City 571158, China
| | - Ying Wan
- School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, China
| | - Hua Tang
- School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, China; Medical Engineering & Medical Informatics Integration and Transformational Medicine Key Laboratory of Luzhou City, Luzhou 646000, China.
| | - Hai-Xia Long
- College of Information Science Technology, Hainan Normal University, HaiKou City 571158, China; Key Laboratory of Data Science and Smart Education, Ministry of Education, Hainan Normal University, HaiKou City 571158, China.
| |
Collapse
|
43
|
Zeng Y, Song Y, Zhang C, Li H, Zhao Y, Yu W, Zhang S, Zhang H, Dai Z, Yang Y. Imputing spatial transcriptomics through gene network constructed from protein language model. Commun Biol 2024; 7:1271. [PMID: 39369061 PMCID: PMC11455941 DOI: 10.1038/s42003-024-06964-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 09/25/2024] [Indexed: 10/07/2024] Open
Abstract
Image-based spatial transcriptomic sequencing technologies have enabled the measurement of gene expression at single-cell resolution, but with a limited number of genes. Current computational approaches attempt to overcome these limitations by imputing missing genes, but face challenges regarding prediction accuracy and identification of cell populations due to the neglect of gene-gene relationships. In this context, we present stImpute, a method to impute spatial transcriptomics according to reference scRNA-seq data based on the gene network constructed from the protein language model ESM-2. Specifically, stImpute employs an autoencoder to create gene expression embeddings for both spatial transcriptomics and scRNA-seq data, which are used to identify the nearest neighboring cells between scRNA-seq and spatial transcriptomics datasets. According to the neighbored cells, the gene expressions of spatial transcriptomics cells are imputed through a graph neural network, where nodes are genes, and edges are based on cosine similarity between the ESM-2 embeddings of the gene-encoding proteins. The gene prediction uncertainty is further measured through a deep learning model. stImpute was shown to consistently outperform state-of-the-art methods across multiple datasets concerning imputation and clustering. stImpute also demonstrates robustness in producing consistent results that are insensitive to model parameters.
Collapse
Affiliation(s)
- Yuansong Zeng
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
- Jinfeng Laboratory, Chongqing, China
| | - Yujie Song
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Chengyang Zhang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Haoxuan Li
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
| | - Yongkang Zhao
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
| | - Weijiang Yu
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China
| | - Shiqi Zhang
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
| | - Hongyu Zhang
- School of Big Data and Software Engineering, Chongqing University, Chongqing, China
| | - Zhiming Dai
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China.
| | - Yuedong Yang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
44
|
Wang N, Hong W, Wu Y, Chen Z, Bai M, Wang W, Zhu J. Next-generation spatial transcriptomics: unleashing the power to gear up translational oncology. MedComm (Beijing) 2024; 5:e765. [PMID: 39376738 PMCID: PMC11456678 DOI: 10.1002/mco2.765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2024] [Revised: 08/30/2024] [Accepted: 09/03/2024] [Indexed: 10/09/2024] Open
Abstract
The growing advances in spatial transcriptomics (ST) stand as the new frontier bringing unprecedented influences in the realm of translational oncology. This has triggered systemic experimental design, analytical scope, and depth alongside with thorough bioinformatics approaches being constantly developed in the last few years. However, harnessing the power of spatial biology and streamlining an array of ST tools to achieve designated research goals are fundamental and require real-world experiences. We present a systemic review by updating the technical scope of ST across different principal basis in a timeline manner hinting on the generally adopted ST techniques used within the community. We also review the current progress of bioinformatic tools and propose in a pipelined workflow with a toolbox available for ST data exploration. With particular interests in tumor microenvironment where ST is being broadly utilized, we summarize the up-to-date progress made via ST-based technologies by narrating studies categorized into either mechanistic elucidation or biomarker profiling (translational oncology) across multiple cancer types and their ways of deploying the research through ST. This updated review offers as a guidance with forward-looking viewpoints endorsed by many high-resolution ST tools being utilized to disentangle biological questions that may lead to clinical significance in the future.
Collapse
Affiliation(s)
- Nan Wang
- Cosmos Wisdom Biotech Co. LtdHangzhouChina
| | - Weifeng Hong
- Department of Radiation OncologyZhejiang Cancer HospitalHangzhouChina
- Hangzhou Institute of Medicine (HIM)Chinese Academy of SciencesHangzhouChina
- Zhejiang Key Laboratory of Radiation OncologyHangzhouChina
| | - Yixing Wu
- Department of Pulmonary and Critical Care MedicineZhongshan HospitalFudan UniversityShanghaiChina
| | - Zhe‐Sheng Chen
- Department of Pharmaceutical SciencesCollege of Pharmacy and Health SciencesInstitute for BiotechnologySt. John's UniversityQueensNew YorkUSA
| | - Minghua Bai
- Department of Radiation OncologyZhejiang Cancer HospitalHangzhouChina
- Hangzhou Institute of Medicine (HIM)Chinese Academy of SciencesHangzhouChina
- Zhejiang Key Laboratory of Radiation OncologyHangzhouChina
| | | | - Ji Zhu
- Department of Radiation OncologyZhejiang Cancer HospitalHangzhouChina
- Hangzhou Institute of Medicine (HIM)Chinese Academy of SciencesHangzhouChina
- Zhejiang Key Laboratory of Radiation OncologyHangzhouChina
| |
Collapse
|
45
|
Shen R, Cheng M, Wang W, Fan Q, Yan H, Wen J, Yuan Z, Yao J, Li Y, Yuan J. Graph domain adaptation-based framework for gene expression enhancement and cell type identification in large-scale spatially resolved transcriptomics. Brief Bioinform 2024; 25:bbae576. [PMID: 39508445 PMCID: PMC11541786 DOI: 10.1093/bib/bbae576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 09/25/2024] [Accepted: 10/29/2024] [Indexed: 11/15/2024] Open
Abstract
Spatially resolved transcriptomics (SRT) technologies facilitate gene expression profiling with spatial resolution in a naïve state. Nevertheless, current SRT technologies exhibit limitations, manifesting as either low transcript detection sensitivity or restricted gene throughput. These constraints result in diminished precision and coverage in gene measurement. In response, we introduce SpaGDA, a sophisticated deep learning-based graph domain adaptation framework for both scenarios of gene expression imputation and cell type identification in spatially resolved transcriptomics data by impartially transferring knowledge from reference scRNA-seq data. Systematic benchmarking analyses across several SRT datasets generated from different technologies have demonstrated SpaGDA's superior effectiveness compared to state-of-the-art methods in both scenarios. Further applied to three SRT datasets of different biological contexts, SpaGDA not only better recovers the well-established knowledge sourced from public atlases and existing scientific literature but also yields a more informative spatial expression pattern of genes. Together, these results demonstrate that SpaGDA can be used to overcome the challenges of current SRT data and provide more accurate insights into biological processes or disease development. The SpaGDA is available in https://github.com/shenrb/SpaGDA.
Collapse
Affiliation(s)
- Rongbo Shen
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou Medical University, No. 1 Xinzao Road, Xinzao Town, Panyu District, Guangzhou 510005, China
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
- Tencent AI Lab, Shenzhen 518000, China
| | - Meiling Cheng
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Luoyu Road 1037, Wuhan 430074, China
| | - Wencang Wang
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Qi Fan
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Huan Yan
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Jiayue Wen
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Zhiyuan Yuan
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Handan Road, Shanghai 200433, China
| | | | - Yixue Li
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou Medical University, No. 1 Xinzao Road, Xinzao Town, Panyu District, Guangzhou 510005, China
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| | - Jiao Yuan
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou Medical University, No. 1 Xinzao Road, Xinzao Town, Panyu District, Guangzhou 510005, China
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou 510005, Guangdong Province, China
| |
Collapse
|
46
|
Coullomb A, Monsarrat P, Pancaldi V. mosna reveals different types of cellular interactions predictive of response to immunotherapies and survival in cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.03.16.532947. [PMID: 36993595 PMCID: PMC10055099 DOI: 10.1101/2023.03.16.532947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Spatially resolved omics enable the discovery of tissue organization of biological or clinical importance. Despite the existence of several methods, performing a rational analysis including multiple algorithms while integrating different conditions such as clinical data is still not trivial. To make such investigations more accessible, we developed mosna, a Python package to analyze spatial omics data with respect to clinical or biological data and to gain insight on cell interaction patterns or tissue architecture of biological relevance. mosna is compatible with all spatial omics methods, it leverages tysserand to build accurate spatial networks, and is compatible with Squidpy. It proposes an analysis pipeline, in which increasingly complex features computed at each step can be explored in integration with clinical data, either with easy-to-use descriptive statistics and data visualization, or by seamlessly training machine learning models and identifying variables with the most predictive power. mosna can take as input any dataset produced by spatial omics methods, including sub-cellular resolved transcriptomics (MERFISH, seqFISH) and proteomics (CODEX, MIBI-TOF, low-plex immuno-fluorescence), as well as spot-based spatial transcriptomics (10x Visium). Integration with experimental metadata or clinical data is adapted to binary conditions, such as biological treatments or response status of patients, and to survival data. We demonstrate the proposed analysis pipeline on two spatially resolved proteomic datasets containing either binary response to immunotherapy or survival data. mosna identifies features describing cellular composition and spatial distribution that can provide biological insight regarding factors that affect response to immunotherapies or survival.
Collapse
Affiliation(s)
- Alexis Coullomb
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
- RESTORE Research Center, Université de Toulouse, INSERM 1301, CNRS 5070, EFS, ENVT, Toulouse, France
| | - Paul Monsarrat
- RESTORE Research Center, Université de Toulouse, INSERM 1301, CNRS 5070, EFS, ENVT, Toulouse, France
- Oral Medicine Department and Hospital of Toulouse - Toulouse Institute of Oral Medicine and Science, CHU de Toulouse, Toulouse, France
- Artificial and Natural Intelligence Toulouse Institute ANITI, Toulouse, France
| | - Vera Pancaldi
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
| |
Collapse
|
47
|
Liu W, Wang B, Bai Y, Liang X, Xue L, Luo J. SpaGIC: graph-informed clustering in spatial transcriptomics via self-supervised contrastive learning. Brief Bioinform 2024; 25:bbae578. [PMID: 39541189 PMCID: PMC11562840 DOI: 10.1093/bib/bbae578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 09/30/2024] [Accepted: 10/29/2024] [Indexed: 11/16/2024] Open
Abstract
Spatial transcriptomics technologies enable the generation of gene expression profiles while preserving spatial context, providing the potential for in-depth understanding of spatial-specific tissue heterogeneity. Leveraging gene and spatial data effectively is fundamental to accurately identifying spatial domains in spatial transcriptomics analysis. However, many existing methods have not yet fully exploited the local neighborhood details within spatial information. To address this issue, we introduce SpaGIC, a novel graph-based deep learning framework integrating graph convolutional networks and self-supervised contrastive learning techniques. SpaGIC learns meaningful latent embeddings of spots by maximizing both edge-wise and local neighborhood-wise mutual information of graph structures, as well as minimizing the embedding distance between spatially adjacent spots. We evaluated SpaGIC on seven spatial transcriptomics datasets across various technology platforms. The experimental results demonstrated that SpaGIC consistently outperformed existing state-of-the-art methods in several tasks, such as spatial domain identification, data denoising, visualization, and trajectory inference. Additionally, SpaGIC is capable of performing joint analyses of multiple slices, further underscoring its versatility and effectiveness in spatial transcriptomics research.
Collapse
Affiliation(s)
- Wei Liu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Bo Wang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Yuting Bai
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Xiao Liang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Li Xue
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| | - Jiawei Luo
- College of Computer Science and Electronic Engineering, Hunan University, Changsha 410083, China
| |
Collapse
|
48
|
Samaran J, Peyré G, Cantini L. scConfluence: single-cell diagonal integration with regularized Inverse Optimal Transport on weakly connected features. Nat Commun 2024; 15:7762. [PMID: 39237488 PMCID: PMC11377776 DOI: 10.1038/s41467-024-51382-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 08/06/2024] [Indexed: 09/07/2024] Open
Abstract
The abundance of unpaired multimodal single-cell data has motivated a growing body of research into the development of diagonal integration methods. However, the state-of-the-art suffers from the loss of biological information due to feature conversion and struggles with modality-specific populations. To overcome these crucial limitations, we here introduce scConfluence, a method for single-cell diagonal integration. scConfluence combines uncoupled autoencoders on the complete set of features with regularized Inverse Optimal Transport on weakly connected features. We extensively benchmark scConfluence in several single-cell integration scenarios proving that it outperforms the state-of-the-art. We then demonstrate the biological relevance of scConfluence in three applications. We predict spatial patterns for Scgn, Synpr and Olah in scRNA-smFISH integration. We improve the classification of B cells and Monocytes in highly heterogeneous scRNA-scATAC-CyTOF integration. Finally, we reveal the joint contribution of Fezf2 and apical dendrite morphology in Intra Telencephalic neurons, based on morphological images and scRNA.
Collapse
Affiliation(s)
- Jules Samaran
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France
| | - Gabriel Peyré
- CNRS and DMA de l'Ecole Normale Supérieure, CNRS, Ecole Normale Supérieure, Université PSL, Paris, France
| | - Laura Cantini
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France.
| |
Collapse
|
49
|
Duan H, Zhang Q, Cui F, Zou Q, Zhang Z. MVST: Identifying spatial domains of spatial transcriptomes from multiple views using multi-view graph convolutional networks. PLoS Comput Biol 2024; 20:e1012409. [PMID: 39235988 PMCID: PMC11376559 DOI: 10.1371/journal.pcbi.1012409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Accepted: 08/09/2024] [Indexed: 09/07/2024] Open
Abstract
Spatial transcriptome technology can parse transcriptomic data at the spatial level to detect high-throughput gene expression and preserve information regarding the spatial structure of tissues. Identifying spatial domains, that is identifying regions with similarities in gene expression and histology, is the most basic and critical aspect of spatial transcriptome data analysis. Most current methods identify spatial domains only through a single view, which may obscure certain important information and thus fail to make full use of the information embedded in spatial transcriptome data. Therefore, we propose an unsupervised clustering framework based on multiview graph convolutional networks (MVST) to achieve accurate spatial domain recognition by the learning graph embedding features of neighborhood graphs constructed from gene expression information, spatial location information, and histopathological image information through multiview graph convolutional networks. By exploring spatial transcriptomes from multiple views, MVST enables data from all parts of the spatial transcriptome to be comprehensively and fully utilized to obtain more accurate spatial expression patterns. We verified the effectiveness of MVST on real spatial transcriptome datasets, the robustness of MVST on some simulated datasets, and the reasonableness of the framework structure of MVST in ablation experiments, and from the experimental results, it is clear that MVST can achieve a more accurate spatial domain identification compared with the current more advanced methods. In conclusion, MVST is a powerful tool for spatial transcriptome research with improved spatial domain recognition.
Collapse
Affiliation(s)
- Hao Duan
- School of Computer Science and Technology, Hainan University, Haikou, China
| | - Qingchen Zhang
- School of Computer Science and Technology, Hainan University, Haikou, China
| | - Feifei Cui
- School of Computer Science and Technology, Hainan University, Haikou, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| | - Zilong Zhang
- School of Computer Science and Technology, Hainan University, Haikou, China
| |
Collapse
|
50
|
Andersson A, Behanova A, Avenel C, Windhager J, Malmberg F, Wählby C. Points2Regions: Fast, interactive clustering of imaging-based spatial transcriptomics data. Cytometry A 2024; 105:677-687. [PMID: 38958502 DOI: 10.1002/cyto.a.24884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Revised: 05/30/2024] [Accepted: 06/13/2024] [Indexed: 07/04/2024]
Abstract
Imaging-based spatial transcriptomics techniques generate data in the form of spatial points belonging to different mRNA classes. A crucial part of analyzing the data involves the identification of regions with similar composition of mRNA classes. These biologically interesting regions can manifest at different spatial scales. For example, the composition of mRNA classes on a cellular scale corresponds to cell types, whereas compositions on a millimeter scale correspond to tissue-level structures. Traditional techniques for identifying such regions often rely on complementary data, such as pre-segmented cells, or lengthy optimization. This limits their applicability to tasks on a particular scale, restricting their capabilities in exploratory analysis. This article introduces "Points2Regions," a computational tool for identifying regions with similar mRNA compositions. The tool's novelty lies in its rapid feature extraction by rasterizing points (representing mRNAs) onto a pyramidal grid and its efficient clustering using a combination of hierarchical and k -means clustering. This enables fast and efficient region discovery across multiple scales without relying on additional data, making it a valuable resource for exploratory analysis. Points2Regions has demonstrated performance similar to state-of-the-art methods on two simulated datasets, without relying on segmented cells, while being several times faster. Experiments on real-world datasets show that regions identified by Points2Regions are similar to those identified in other studies, confirming that Points2Regions can be used to extract biologically relevant regions. The tool is shared as a Python package integrated into TissUUmaps and a Napari plugin, offering interactive clustering and visualization, significantly enhancing user experience in data exploration.
Collapse
Affiliation(s)
- Axel Andersson
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Andrea Behanova
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Christophe Avenel
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Jonas Windhager
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Filip Malmberg
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| | - Carolina Wählby
- Department of IT and SciLifeLab BioImage Informatics Facility, Uppsala University, Uppsala, Sweden
| |
Collapse
|