1
|
Luo B, Teng F, Tang G, Cen W, Liu X, Chen J, Qu C, Liu X, Liu X, Jiang W, Huang H, Feng Y, Zhang X, Jian M, Li M, Xi F, Li G, Liao S, Chen A, Yu W, Xu X, Zhang J. StereoMM: a graph fusion model for integrating spatial transcriptomic data and pathological images. Brief Bioinform 2025; 26:bbaf210. [PMID: 40407386 PMCID: PMC12100622 DOI: 10.1093/bib/bbaf210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2024] [Revised: 03/27/2025] [Accepted: 04/10/2025] [Indexed: 05/26/2025] Open
Abstract
Spatial omics technologies, generating high-throughput and multimodal data, have necessitated the development of advanced data integration methods to facilitate comprehensive biological and clinical treatment discoveries. Based on the cross-attention concept, we developed an AI learning based toolchain called StereoMM, a graph based fusion model that can incorporate omics data such as gene expression, histological images, and spatial location. StereoMM uses an attention module for omics data interaction and a graph autoencoder to integrate spatial positions and omics data in a self-supervised manner. Applying StereoMM across various cancer types and platforms has demonstrated its robust capability. StereoMM outperforms competitors in identifying spatial regions reflecting tumour progression and shows promise in classifying colorectal cancer patients into deficient mismatch repair and proficient mismatch repair groups. The comprehensive inter-modal integration and efficiency of StereoMM enable researchers to construct spatial views of integrated multimodal features efficiently, advancing thorough tissue and patient characterization.
Collapse
Affiliation(s)
- Bingying Luo
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Fei Teng
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Guo Tang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Weixuan Cen
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Xing Liu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Jinmiao Chen
- Center for Computational Biology and Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, 8 College Road, Singapore 169857, Singapore
| | - Chi Qu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Xuanzhu Liu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Xin Liu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Wenyan Jiang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Huaqiang Huang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Yu Feng
- State Key Laboratory of Genome and Multi-omics Technologies, BGI Research, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
- BGI Collaborative Center for Future Medicine, Shanxi Medical University, No. 1258, Xinjiannan Road, Yingze District, Taiyuan 030001, China
| | - Xue Zhang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Min Jian
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Mei Li
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Feng Xi
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
| | - Guibo Li
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Sha Liao
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Ao Chen
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| | - Weimiao Yu
- School of Biological Science, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
| | - Xun Xu
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
- BGI Research, Hangzhou, No. 203, Zhenzhong Road, Xihu District, Hangzhou 310030, China
| | - Jiajun Zhang
- BGI Research, Chongqing, No. 313, Jinyue road, Jiulongpo District, Chongqing 401329, China
- BGI Research, Shenzhen, No. 9, Yunhua Road, Yantian District, Shenzhen 518083, China
| |
Collapse
|
2
|
Wang L, Bai X, Zhang C, Shi Q, Chen L. Spatially Aware Domain Adaptation Enables Cell Type Deconvolution from Multi-Modal Spatially Resolved Transcriptomics. SMALL METHODS 2025; 9:e2401163. [PMID: 39623794 DOI: 10.1002/smtd.202401163] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Revised: 11/16/2024] [Indexed: 05/26/2025]
Abstract
Spatially Resolved Transcriptomics (SRT) offers unprecedented opportunities to elucidate the cellular arrangements within tissues. Nevertheless, the absence of deconvolution methods that simultaneously model multi-modal features has impeded progress in understanding cellular heterogeneity in spatial contexts. To address this issue, SpaDA is developed, a novel spatially aware domain adaptation method that integrates multi-modal data (i.e., transcriptomics, histological images, and spatial locations) from SRT to accurately estimate the spatial distribution of cell types. SpaDA utilizes a self-expressive variational autoencoder, coupled with deep spatial distribution alignment, to learn and align spatial and graph representations from spatial multi-modal SRT data and single-cell RNA sequencing (scRNA-seq) data. This strategy facilitates the transfer of cell type annotation information across these two similarity graphs, thereby enhancing the prediction accuracy of cell type composition. The results demonstrate that SpaDA surpasses existing methods in cell type deconvolution and the identification of cell types and spatial domains across diverse platforms. Moreover, SpaDA excels in identifying spatially colocalized cell types and key marker genes in regions of low-quality measurements, exemplified by high-resolution mouse cerebellum SRT data. In conclusion, SpaDA offers a powerful and flexible framework for the analysis of multi-modal SRT datasets, advancing the understanding of complex biological systems.
Collapse
Affiliation(s)
- Lequn Wang
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, 200031, China
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Xiaosheng Bai
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
| | - Chuanchao Zhang
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
| | - Qianqian Shi
- Hubei Engineering Technology Research Center of Agricultural Big Data, Huazhong Agricultural University, Wuhan, 430070, China
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Luonan Chen
- Key Laboratory of Systems Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, 200031, China
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| |
Collapse
|
3
|
Bao X, Bai X, Liu X, Shi Q, Zhang C. Spatially informed graph transformers for spatially resolved transcriptomics. Commun Biol 2025; 8:574. [PMID: 40188303 PMCID: PMC11972348 DOI: 10.1038/s42003-025-08015-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Accepted: 03/28/2025] [Indexed: 04/07/2025] Open
Abstract
Spatially resolved transcriptomics (SRT) has emerged as a powerful technique for mapping gene expression landscapes within spatial contexts. However, significant challenges persist in effectively integrating gene expression with spatial information to elucidate the heterogeneity of biological tissues. Here, we present a Spatially informed Graph Transformers framework, SpaGT, which leverages both node and edge channels to model spatially aware graph representation for denoising gene expression and identifying spatial domains. Unlike conventional graph neural networks, which rely on static, localized convolutional aggregation, SpaGT employs a structure-reinforced self-attention mechanism that iteratively evolves topological structural information and transcriptional signal representation. By replacing graph convolution with global self-attention, SpaGT enables the integration of both global and spatially localized information, thereby improving the detection of fine-grained spatial domains. We demonstrate that SpaGT achieves superior performance in identifying spatial domains and denoising gene expression data across diverse platforms and species. Furthermore, SpaGT facilitates the discovery of spatially variable genes with significant prognostic potential in cancer tissues. These findings establish SpaGT as a powerful tool for unraveling the complexities of biological tissues.
Collapse
Affiliation(s)
- Xinyu Bao
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Xiaosheng Bai
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China
| | - Xiaoping Liu
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China.
| | - Qianqian Shi
- Hubei Engineering Technology Research Center of Agricultural Big Data, Huazhong Agricultural University, Wuhan, China.
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China.
| | - Chuanchao Zhang
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, China.
| |
Collapse
|
4
|
Huang W, Hu Y, Wang L, Wu G, Zhang C, Shi Q. Spatially aligned graph transfer learning for characterizing spatial regulatory heterogeneity. Brief Bioinform 2024; 26:bbaf021. [PMID: 39841593 PMCID: PMC11752617 DOI: 10.1093/bib/bbaf021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Revised: 12/26/2024] [Accepted: 01/08/2025] [Indexed: 01/24/2025] Open
Abstract
Spatially resolved transcriptomics (SRT) technologies facilitate the exploration of cell fates or states within tissue microenvironments. Despite these advances, the field has not adequately addressed the regulatory heterogeneity influenced by microenvironmental factors. Here, we propose a novel Spatially Aligned Graph Transfer Learning (SpaGTL), pretrained on a large-scale multi-modal SRT data of about 100 million cells/spots to enable inference of context-specific spatial gene regulatory networks across multiple scales in data-limited settings. As a novel cross-dimensional transfer learning architecture, SpaGTL aligns spatial graph representations across gene-level graph transformers and cell/spot-level manifold-dominated variational autoencoder. This alignment facilitates the exploration of microenvironmental variations in cell types and functional domains from a molecular regulatory perspective, all within a self-supervised framework. We verified SpaGTL's precision, robustness, and speed over existing state-of-the-art algorithms and show SpaGTL's potential that facilitates the discovery of novel regulatory programs that exhibit strong associations with tissue functional regions and cell types. Importantly, SpaGTL could be extended to process multi-slice SRT data and map molecular regulatory landscape associated with three-dimensional spatial-temporal changes during development.
Collapse
Affiliation(s)
- Wendong Huang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
- Hubei Engineering Technology Research Center of Agricultural Big Data, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Yaofeng Hu
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China
| | - Lequn Wang
- State Key Laboratory of Cell Biology, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai 200031, China
| | - Guangsheng Wu
- School of Mathematics and Computer Science, Xinyu University, Xinyu 338004, Jiangxi, China
| | - Chuanchao Zhang
- Key Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China
| | - Qianqian Shi
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
- Hubei Engineering Technology Research Center of Agricultural Big Data, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|