1
|
Banecki KH, Chai H, Ruan Y, Plewczynski D. ChromMovie: A Molecular Dynamics Approach for Simultaneous Modeling of Chromatin Conformation Changes from Multiple Single-Cell Hi-C Maps. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.05.16.654550. [PMID: 40475498 PMCID: PMC12139908 DOI: 10.1101/2025.05.16.654550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2025]
Abstract
The development of 3C-based techniques for analyzing three-dimensional chromatin structure dynamics has driven significant interest in computational methods for 3D chromatin reconstruction. In particular, models based on Hi-C and its single-cell variants, such as scHi-C, have gained widespread popularity. Current approaches for reconstructing the chromatin structure from scHi-C data typically operate by processing one scHi-C map at a time, generating a corresponding 3D chromatin structure as output. Here, we introduce an alternative approach to the whole genome 3D chromatin structure reconstruction that builds upon existing methods while incorporating the broader context of dynamic cellular processes, such as the cell cycle or cell maturation. Our approach integrates scHi-C contact data with single-cell trajectory information and is based on applying simultaneous modeling of a number of cells ordered along the progression of a given cellular process. The approach is able to successfully recreate known nuclear structures while simultaneously achieving smooth, continuous changes in chromatin structure throughout the cell cycle trajectory. Although both Hi-C-based chromatin reconstruction and cellular trajectory inference are well-developed fields, little effort has been made to bridge the gap between them. To address this, we present ChromMovie, a comprehensive molecular dynamics framework for modeling 3D chromatin structure changes in the context of cellular trajectories. To our knowledge, no existing method effectively leverages both the variability of single-cell Hi-C data and explicit information from estimated cellular trajectories, such as cell cycle progression, to improve chromatin structure reconstruction.
Collapse
Affiliation(s)
- Krzysztof H. Banecki
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Koszykowa 75, 00-662, Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Stefana Banacha 2c, 02-097, Warsaw, Poland
| | - Haoxi Chai
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Yijun Ruan
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Dariusz Plewczynski
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Koszykowa 75, 00-662, Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Stefana Banacha 2c, 02-097, Warsaw, Poland
| |
Collapse
|
2
|
Ramirez A, Orcutt-Jahns BT, Pascoe S, Abraham A, Remigio B, Thomas N, Meyer AS. Integrative, high-resolution analysis of single-cell gene expression across experimental conditions with PARAFAC2-RISE. Cell Syst 2025:101294. [PMID: 40378843 DOI: 10.1016/j.cels.2025.101294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Revised: 02/20/2025] [Accepted: 04/22/2025] [Indexed: 05/19/2025]
Abstract
Effective exploration and analysis tools are vital for the extraction of insights from single-cell data. However, current techniques for modeling single-cell studies performed across experimental conditions (e.g., samples) require restrictive assumptions or do not adequately deconvolute condition-to-condition variation from cell-to-cell variation. Here, we report that reduction and insight in single-cell exploration (RISE), an adaptation of the tensor decomposition method PARAFAC2, enables the dimensionality reduction and analysis of single-cell data across conditions. We demonstrate the benefits of RISE across distinct examples of single-cell RNA-sequencing experiments of peripheral immune cells: pharmacologic drug perturbations and systemic lupus erythematosus patient samples. RISE enables associations of gene variation patterns with patients or perturbations while connecting each coordinated change to single cells without requiring cell-type annotations. The theoretical grounding of RISE suggests a unified framework for many single-cell data modeling tasks while providing an intuitive dimensionality reduction approach for multi-sample single-cell studies across biological contexts. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Andrew Ramirez
- Department of Bioengineering, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA
| | - Brian T Orcutt-Jahns
- Department of Bioengineering, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA
| | - Sean Pascoe
- Department of Bioengineering, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA; Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208, USA
| | - Armaan Abraham
- Department of Bioengineering, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA
| | - Breanna Remigio
- Computational and Systems Biology, UCLA, Los Angeles, CA 90095, USA
| | - Nathaniel Thomas
- Department of Computer Science, UCLA, Los Angeles, CA 90095, USA
| | - Aaron S Meyer
- Department of Bioengineering, University of California, Los Angeles (UCLA), Los Angeles, CA 90095, USA; Jonsson Comprehensive Cancer Center, UCLA, Los Angeles, CA 90095, USA; Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, UCLA, Los Angeles, CA 90095, USA.
| |
Collapse
|
3
|
Lee DI, Roy S. Examining the dynamics of three-dimensional genome organization with multitask matrix factorization. Genome Res 2025; 35:1179-1193. [PMID: 40113262 PMCID: PMC12047540 DOI: 10.1101/gr.279930.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Accepted: 02/20/2025] [Indexed: 03/22/2025]
Abstract
Three-dimensional (3D) genome organization, which determines how the DNA is packaged inside the nucleus, has emerged as a key component of the gene regulation machinery. High-throughput chromosome conformation data sets, such as Hi-C, have become available across multiple conditions and time points, offering a unique opportunity to examine changes in 3D genome organization and link them to phenotypic changes in normal and disease processes. However, systematic detection of higher-order structural changes across multiple Hi-C data sets remains a major challenge. Existing computational methods either do not model higher-order structural units or cannot model dynamics across more than two conditions of interest. We address these limitations with tree-guided integrated factorization (TGIF), a generalizable multitask nonnegative matrix factorization (NMF) approach that can be applied to time series or hierarchically related biological conditions. TGIF can identify large-scale changes at the compartment or subcompartment levels, as well as local changes at boundaries of topologically associated domains (TADs). Based on benchmarking in simulated and real Hi-C data, TGIF boundaries are more accurate and reproducible across differential levels of noise and sources of technical artifacts, and are more enriched in CTCF. Application to three multisample mammalian data sets shows that TGIF can detect differential regions at compartment, subcompartment, and boundary levels that are associated with significant changes in regulatory signals and gene expression enriched in tissue-specific processes. Finally, we leverage TGIF boundaries to prioritize sequence variants for multiple phenotypes from the NHGRI GWAS catalog. Taken together, TGIF is a flexible tool to examine 3D genome organization dynamics across disease and developmental processes.
Collapse
Affiliation(s)
- Da-Inn Lee
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin 53715, USA
| | - Sushmita Roy
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin 53715, USA;
- Wisconsin Institute for Discovery, Madison, Wisconsin 53715, USA
| |
Collapse
|
4
|
Wang F, Lin J, Alinejad-Rokny H, Ma W, Meng L, Huang L, Yu J, Chen N, Wang Y, Yao Z, Xie W, Wong KC, Li X. Unveiling Multi-Scale Architectural Features in Single-Cell Hi-C Data Using scCAFE. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025:e2416432. [PMID: 40270467 DOI: 10.1002/advs.202416432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2024] [Revised: 03/12/2025] [Indexed: 04/25/2025]
Abstract
Single-cell Hi-C (scHi-C) has provided unprecedented insights into the heterogeneity of 3D genome organization. However, its sparse and noisy nature poses challenges for computational analyses, such as chromatin architectural feature identification. Here, scCAFE is introduced, which is a deep learning model for the multi-scale detection of architectural features at the single-cell level. scCAFE provides a unified framework for annotating chromatin loops, TAD-like domains (TLDs), and compartments across individual cells. This model outperforms previous scHi-C loop calling methods and delivers accurate predictions of TLDs and compartments that are biologically consistent with previous studies. The resulting single-cell annotations also offer a measure to characterize the heterogeneity of different levels of architectural features across cell types. This heterogeneity is then leveraged to identify a series of marker loop anchors, demontrating the potential of the 3D genome data to annotate cell identities without the aid of simultaneously sequenced omics data. Overall, scCAFE not only serves as a useful tool for analyzing single-cell genomic architecture, but also paves the way for precise cell-type annotations solely based on 3D genome features.
Collapse
Affiliation(s)
- Fuzhou Wang
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Jiecong Lin
- Department of Computer Science, The University of Hong Kong, Pok Fu Lam, 000000, Hong Kong SAR
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Department of Pathology, Harvard Medical School, Boston, MA, 02129, USA
| | - Hamid Alinejad-Rokny
- BioMedical Machine Learning Lab, Graduate School of Biomedical Engineering, University of New South Wales, Sydney, 2052, Australia
| | - Wenjing Ma
- School of Artificial Intelligence, Jilin University, Changchun, 132000, China
| | - Lingkuan Meng
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Lei Huang
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Jixiang Yu
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Nanjun Chen
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Yuchen Wang
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Zhongyu Yao
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Weidun Xie
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
| | - Ka-Chun Wong
- Department of Computer Science, City University of Hong Kong, Kowloon Tong, 000000, Hong Kong SAR
- Shenzhen Research Institute, City University of Hong Kong, Shenzhen, 518057, China
| | - Xiangtao Li
- School of Artificial Intelligence, Jilin University, Changchun, 132000, China
| |
Collapse
|
5
|
Xu H, Chi Y, Yin C, Li C, Chen Y, Liu Z, Liu X, Xie H, Chen ZJ, Zhao H, Wu K, Zhao S, Xing D. Three-dimensional genome structures of single mammalian sperm. Nat Commun 2025; 16:3805. [PMID: 40268951 PMCID: PMC12019598 DOI: 10.1038/s41467-025-59055-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Accepted: 04/09/2025] [Indexed: 04/25/2025] Open
Abstract
The three-dimensional (3D) organization of chromosomes is crucial for packaging a large mammalian genome into a confined nucleus and ensuring proper nuclear functions in somatic cells. However, the packaging of the much more condensed sperm genome is challenging to study with traditional imaging or sequencing approaches. In this study, we develop an enhanced chromosome conformation capture assay, and resolve the 3D whole-genome structures of single mammalian sperm. The reconstructed genome structures accurately delineate the species-specific nuclear morphologies for both human and mouse sperm. We discover that sperm genomes are divided into chromosomal territories and A/B compartments, similarly to somatic cells. However, neither human nor mouse sperm chromosomes contain topologically associating domains or chromatin loops. These results suggest that the fine-scale chromosomal organization of mammalian sperm fundamentally differs from that of somatic cells. The discoveries and methods established in this work will be valuable for future studies of sperm related infertility.
Collapse
Affiliation(s)
- Heming Xu
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
| | - Yi Chi
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Changjian Yin
- State Key Laboratory of Reproductive Medicine and Offspring Health, Center for Reproductive Medicine, Institute of Women, Children and Reproductive Health, Shandong University, Jinan, China
- National Research Center for Assisted Reproductive Technology and Reproductive Genetics, Shandong University, Jinan, China
- Key Laboratory of Reproductive Endocrinology (Shandong University), Ministry of Education, Jinan, China
- Shandong Technology Innovation Center for Reproductive Health, Jinan, China
- Shandong Provincial Clinical Research Center for Reproductive Health, Jinan, China
- Shandong Key Laboratory of Reproductive Research and Birth Defect Prevention, Jinan, China
- Research Unit of Gametogenesis and Health of ART-Offspring, Chinese Academy of Medical Sciences (No.2021RU001), Jinan, China
| | - Cheng Li
- State Key Laboratory of Reproductive Medicine and Offspring Health, Center for Reproductive Medicine, Institute of Women, Children and Reproductive Health, Shandong University, Jinan, China
- National Research Center for Assisted Reproductive Technology and Reproductive Genetics, Shandong University, Jinan, China
- Key Laboratory of Reproductive Endocrinology (Shandong University), Ministry of Education, Jinan, China
- Shandong Technology Innovation Center for Reproductive Health, Jinan, China
- Shandong Provincial Clinical Research Center for Reproductive Health, Jinan, China
- Shandong Key Laboratory of Reproductive Research and Birth Defect Prevention, Jinan, China
- Research Unit of Gametogenesis and Health of ART-Offspring, Chinese Academy of Medical Sciences (No.2021RU001), Jinan, China
| | - Yujie Chen
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
| | - Zhiyuan Liu
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
| | - Xiaowen Liu
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, Peking University, Beijing, China
| | - Hao Xie
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
| | - Zi-Jiang Chen
- State Key Laboratory of Reproductive Medicine and Offspring Health, Center for Reproductive Medicine, Institute of Women, Children and Reproductive Health, Shandong University, Jinan, China
- National Research Center for Assisted Reproductive Technology and Reproductive Genetics, Shandong University, Jinan, China
- Key Laboratory of Reproductive Endocrinology (Shandong University), Ministry of Education, Jinan, China
- Shandong Technology Innovation Center for Reproductive Health, Jinan, China
- Shandong Provincial Clinical Research Center for Reproductive Health, Jinan, China
- Shandong Key Laboratory of Reproductive Research and Birth Defect Prevention, Jinan, China
- Research Unit of Gametogenesis and Health of ART-Offspring, Chinese Academy of Medical Sciences (No.2021RU001), Jinan, China
- Shanghai Key Laboratory for Assisted Reproduction and Reproductive Genetics, Shanghai, China
- Department of Reproductive Medicine, Ren Ji Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Han Zhao
- State Key Laboratory of Reproductive Medicine and Offspring Health, Center for Reproductive Medicine, Institute of Women, Children and Reproductive Health, Shandong University, Jinan, China
- National Research Center for Assisted Reproductive Technology and Reproductive Genetics, Shandong University, Jinan, China
- Key Laboratory of Reproductive Endocrinology (Shandong University), Ministry of Education, Jinan, China
- Shandong Technology Innovation Center for Reproductive Health, Jinan, China
- Shandong Provincial Clinical Research Center for Reproductive Health, Jinan, China
- Shandong Key Laboratory of Reproductive Research and Birth Defect Prevention, Jinan, China
- Research Unit of Gametogenesis and Health of ART-Offspring, Chinese Academy of Medical Sciences (No.2021RU001), Jinan, China
| | - Keliang Wu
- State Key Laboratory of Reproductive Medicine and Offspring Health, Center for Reproductive Medicine, Institute of Women, Children and Reproductive Health, Shandong University, Jinan, China
- National Research Center for Assisted Reproductive Technology and Reproductive Genetics, Shandong University, Jinan, China
- Key Laboratory of Reproductive Endocrinology (Shandong University), Ministry of Education, Jinan, China
- Shandong Technology Innovation Center for Reproductive Health, Jinan, China
- Shandong Provincial Clinical Research Center for Reproductive Health, Jinan, China
- Shandong Key Laboratory of Reproductive Research and Birth Defect Prevention, Jinan, China
- Research Unit of Gametogenesis and Health of ART-Offspring, Chinese Academy of Medical Sciences (No.2021RU001), Jinan, China
| | - Shigang Zhao
- State Key Laboratory of Reproductive Medicine and Offspring Health, Center for Reproductive Medicine, Institute of Women, Children and Reproductive Health, Shandong University, Jinan, China.
- National Research Center for Assisted Reproductive Technology and Reproductive Genetics, Shandong University, Jinan, China.
- Key Laboratory of Reproductive Endocrinology (Shandong University), Ministry of Education, Jinan, China.
- Shandong Technology Innovation Center for Reproductive Health, Jinan, China.
- Shandong Provincial Clinical Research Center for Reproductive Health, Jinan, China.
- Shandong Key Laboratory of Reproductive Research and Birth Defect Prevention, Jinan, China.
- Research Unit of Gametogenesis and Health of ART-Offspring, Chinese Academy of Medical Sciences (No.2021RU001), Jinan, China.
| | - Dong Xing
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China.
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China.
| |
Collapse
|
6
|
Gao R, Ferraro TN, Chen L, Zhang S, Chen Y. Enhancing Single-Cell and Bulk Hi-C Data Using a Generative Transformer Model. BIOLOGY 2025; 14:288. [PMID: 40136544 PMCID: PMC11940666 DOI: 10.3390/biology14030288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2025] [Revised: 03/01/2025] [Accepted: 03/10/2025] [Indexed: 03/27/2025]
Abstract
The 3D organization of chromatin in the nucleus plays a critical role in regulating gene expression and maintaining cellular functions in eukaryotic cells. High-throughput chromosome conformation capture (Hi-C) and its derivative technologies have been developed to map genome-wide chromatin interactions at the population and single-cell levels. However, insufficient sequencing depth and high noise levels in bulk Hi-C data, particularly in single-cell Hi-C (scHi-C) data, result in low-resolution contact matrices, thereby limiting diverse downstream computational analyses in identifying complex chromosomal organizations. To address these challenges, we developed a transformer-based deep learning model, HiCENT, to impute and enhance both scHi-C and Hi-C contact matrices. Validation experiments on large-scale bulk Hi-C and scHi-C datasets demonstrated that HiCENT achieves superior enhancement effects compared to five popular methods. When applied to real Hi-C data from the GM12878 cell line, HiCENT effectively enhanced 3D structural features at the scales of topologically associated domains and chromosomal loops. Furthermore, when applied to scHi-C data from five human cell lines, it significantly improved clustering performance, outperforming five widely used methods. The adaptability of HiCENT across different datasets and its capacity to improve the quality of chromatin interaction data will facilitate diverse downstream computational analyses in 3D genome research, single-cell studies and other large-scale omics investigations.
Collapse
Affiliation(s)
- Ruoying Gao
- College of Computer and Information Engineering, Tianjin Normal University, Tianjin 300387, China; (R.G.); (L.C.)
| | - Thomas N. Ferraro
- Department of Biomedical Sciences, Cooper Medical School of Rowan University, Camden, NJ 08103, USA;
| | - Liang Chen
- College of Computer and Information Engineering, Tianjin Normal University, Tianjin 300387, China; (R.G.); (L.C.)
| | - Shaoqiang Zhang
- College of Computer and Information Engineering, Tianjin Normal University, Tianjin 300387, China; (R.G.); (L.C.)
| | - Yong Chen
- Department of Biological and Biomedical Sciences, Rowan University, Glassboro, NJ 08028, USA
| |
Collapse
|
7
|
Dautle MA, Chen Y. Single-Cell Hi-C Technologies and Computational Data Analysis. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2412232. [PMID: 39887949 PMCID: PMC11884588 DOI: 10.1002/advs.202412232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Revised: 01/14/2025] [Indexed: 02/01/2025]
Abstract
Single-cell chromatin conformation capture (scHi-C) techniques have evolved to provide significant insights into the structural organization and regulatory mechanisms in individual cells. Although many scHi-C protocols have been developed, they often involve intricate procedures and the resulting data are sparse, leading to computational challenges for systematic data analysis and limited applicability. This review provides a comprehensive overview, quantitative evaluation of thirteen protocols and practical guidance on computational topics. It is first assessed the efficiency of these protocols based on the total number of contacts recovered per cell and the cis/trans ratio. It is then provided systematic considerations for scHi-C quality control and data imputation. Additionally, the capabilities and implementations of various analysis methods, covering cell clustering, A/B compartment calling, topologically associating domain (TAD) calling, loop calling, 3D reconstruction, scHi-C data simulation and differential interaction analysis is summarized. It is further highlighted key computational challenges associated with the specific complexities of scHi-C data and propose potential solutions.
Collapse
Affiliation(s)
- Madison A Dautle
- Department of Biological and Biomedical SciencesRowan UniversityGlassboroNJ08028USA
| | - Yong Chen
- Department of Biological and Biomedical SciencesRowan UniversityGlassboroNJ08028USA
| |
Collapse
|
8
|
Kang B, Lee H, Roh TY. Deciphering single-cell genomic architecture: insights into cellular heterogeneity and regulatory dynamics. Genomics Inform 2025; 23:5. [PMID: 39934929 DOI: 10.1186/s44342-025-00037-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Accepted: 01/19/2025] [Indexed: 02/13/2025] Open
Abstract
BACKGROUND The genomic architecture of eukaryotes exhibits dynamic spatial and temporal changes, enabling cellular processes critical for maintaining viability and functional diversity. Recent advances in sequencing technologies have facilitated the dissection of genomic architecture and functional activity at single-cell resolution, moving beyond the averaged signals typically derived from bulk cell analyses. MAIN BODY The advent of single-cell genomics and epigenomics has yielded transformative insights into cellular heterogeneity, behavior, and biological complexity with unparalleled genomic resolution and reproducibility. This review summarizes recent progress in the characterization of genomic architecture at the single-cell level, emphasizing the impact of structural variation and chromatin organization on gene regulatory networks and cellular identity. CONCLUSION Future directions in single-cell genomics and high-resolution epigenomic methodologies are explored, focusing on emerging challenges and potential impacts on the understanding of cellular states, regulatory dynamics, and the intricate mechanisms driving cellular function and diversity. Future perspectives on the challenges and potential implications of single-cell genomics, along with high-resolution genomic and epigenomic technologies for understanding cellular states and regulatory dynamics, are also discussed.
Collapse
Affiliation(s)
- Byunghee Kang
- Department of Life Sciences, Pohang University of Science and Technology (POSTECH), Pohang, 37673, Republic of Korea
| | - Hyeonji Lee
- Department of Life Sciences, Pohang University of Science and Technology (POSTECH), Pohang, 37673, Republic of Korea
| | - Tae-Young Roh
- Department of Life Sciences, Ewha Womans University, Seoul, 03760, Republic of Korea.
| |
Collapse
|
9
|
Liu X, Ling X, Tian Q, Huang Z, Ding J. Nuclear remodeling during cell fate transitions. Curr Opin Genet Dev 2025; 90:102287. [PMID: 39631291 DOI: 10.1016/j.gde.2024.102287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Revised: 10/08/2024] [Accepted: 11/12/2024] [Indexed: 12/07/2024]
Abstract
Totipotent stem cells, the earliest cells in embryonic development, can differentiate into complete embryos and extra-embryonic tissues, making them essential for understanding both development and regenerative medicine. This review examines recent advances in the dynamic remodeling of nuclear structures during the transition between totipotency and pluripotency, as well as other cell fate transition processes. Additionally, we highlight innovative experimental and computational methods that elucidate the relationship between nuclear architecture and cell fate decisions. By integrating these insights, we aim to enhance our understanding of how nuclear remodeling influences totipotency and other cell fate transitions, paving the way for future research in this critical field.
Collapse
Affiliation(s)
- Xinyi Liu
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Xiaoru Ling
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Qi Tian
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Zibin Huang
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China
| | - Junjun Ding
- Center for Stem Cell Biology and Tissue Engineering, Key Laboratory for Stem Cells and Tissue Engineering, Ministry of Education, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, Guangdong, China.
| |
Collapse
|
10
|
Wu H, Wang M, Zheng Y, Xie XS. Droplet-based high-throughput 3D genome structure mapping of single cells with simultaneous transcriptomics. Cell Discov 2025; 11:8. [PMID: 39837831 PMCID: PMC11751028 DOI: 10.1038/s41421-025-00770-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2024] [Accepted: 12/30/2024] [Indexed: 01/23/2025] Open
Abstract
Single-cell three-dimensional (3D) genome techniques have advanced our understanding of cell-type-specific chromatin structures in complex tissues, yet current methodologies are limited in cell throughput. Here we introduce a high-throughput single-cell Hi-C (dscHi-C) approach and its transcriptome co-assay (dscHi-C-multiome) using droplet microfluidics. Using dscHi-C, we investigate chromatin structural changes during mouse brain aging by profiling 32,777 single cells across three developmental stages (3 months, 12 months, and 23 months), yielding a median of 78,220 unique contacts. Our results show that genes with significant structural changes are enriched in pathways related to metabolic process and morphology change in neurons, and innate immune response in glial cells, highlighting the role of 3D genome organization in physiological brain aging. Furthermore, our multi-omics joint assay, dscHi-C-multiome, enables precise cell type identification in the adult mouse brain and uncovers the intricate relationship between genome architecture and gene expression. Collectively, we developed the sensitive, high-throughput dscHi-C and its multi-omics derivative, dscHi-C-multiome, demonstrating their potential for large-scale cell atlas studies in development and disease.
Collapse
Affiliation(s)
- Honggui Wu
- Biomedical Pioneering Innovation Center (BIOPIC), and School of Life Sciences, Peking University, Beijing, China
- Changping Laboratory, Beijing, China
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Maoxu Wang
- Biomedical Pioneering Innovation Center (BIOPIC), and School of Life Sciences, Peking University, Beijing, China
- Changping Laboratory, Beijing, China
| | - Yinghui Zheng
- Biomedical Pioneering Innovation Center (BIOPIC), and School of Life Sciences, Peking University, Beijing, China
- Changping Laboratory, Beijing, China
| | - X Sunney Xie
- Biomedical Pioneering Innovation Center (BIOPIC), and School of Life Sciences, Peking University, Beijing, China.
- Changping Laboratory, Beijing, China.
| |
Collapse
|
11
|
Grant ZL, Kuang S, Zhang S, Horrillo AJ, Rao KS, Kameswaran V, Joubran C, Lau PK, Dong K, Yang B, Bartosik WM, Zemke NR, Ren B, Kathiriya IS, Pollard KS, Bruneau BG. Dose-dependent sensitivity of human 3D chromatin to a heart disease-linked transcription factor. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.09.632202. [PMID: 39829922 PMCID: PMC11741296 DOI: 10.1101/2025.01.09.632202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/22/2025]
Abstract
Dosage-sensitive transcription factors (TFs) underlie altered gene regulation in human developmental disorders, and cell-type specific gene regulation is linked to the reorganization of 3D chromatin during cellular differentiation. Here, we show dose-dependent regulation of chromatin organization by the congenital heart disease (CHD)-linked, lineage-restricted TF TBX5 in human cardiomyocyte differentiation. Genome organization, including compartments, topologically associated domains, and chromatin loops, are sensitive to reduced TBX5 dosage in a human model of CHD, with variations in response across individual cells. Regions normally bound by TBX5 are especially sensitive, while co-occupancy with CTCF partially protects TBX5-bound TAD boundaries and loop anchors. These results highlight the importance of lineage-restricted TF dosage in cell-type specific 3D chromatin dynamics, suggesting a new mechanism for TF-dependent disease.
Collapse
Affiliation(s)
| | | | - Shu Zhang
- Gladstone Institutes; San Francisco, CA, USA
- Bioinformatics Graduate Program, University of California, San Francisco; San Francisco, CA, USA
| | - Abraham J. Horrillo
- Gladstone Institutes; San Francisco, CA, USA
- TETRAD Graduate Program, University of California, San Francisco; San Francisco, CA, USA
| | | | | | | | - Pik Ki Lau
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine; La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine; La Jolla, CA, USA
| | - Keyi Dong
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine; La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine; La Jolla, CA, USA
| | - Bing Yang
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine; La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine; La Jolla, CA, USA
| | - Weronika M. Bartosik
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine; La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine; La Jolla, CA, USA
| | - Nathan R. Zemke
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine; La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine; La Jolla, CA, USA
| | - Bing Ren
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine; La Jolla, CA, USA
- Center for Epigenomics, University of California, San Diego School of Medicine; La Jolla, CA, USA
| | - Irfan S. Kathiriya
- Gladstone Institutes; San Francisco, CA, USA
- Department of Anesthesia and Perioperative Care, University of California, San Francisco; San Francisco, CA, USA
| | - Katherine S. Pollard
- Gladstone Institutes; San Francisco, CA, USA
- Department of Epidemiology & Biostatistics, University of California, San Francisco; San Francisco, CA, USA
- Chan Zuckerberg Biohub; San Francisco, CA, USA
| | - Benoit G. Bruneau
- Gladstone Institutes; San Francisco, CA, USA
- Roddenberry Center for Stem Cell Biology and Medicine at Gladstone, San Francisco, CA, USA
- Department of Pediatrics, Cardiovascular Research Institute, Institute for Human Genetics, and the Eli and Edythe Broad Center for Regeneration Medicine and Stem Cell Research, University of California, San Francisco; San Francisco, CA, USA
| |
Collapse
|
12
|
Banecki K, Korsak S, Plewczynski D. Advancements and future directions in single-cell Hi-C based 3D chromatin modeling. Comput Struct Biotechnol J 2024; 23:3549-3558. [PMID: 39963420 PMCID: PMC11832020 DOI: 10.1016/j.csbj.2024.09.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 09/27/2024] [Accepted: 09/29/2024] [Indexed: 02/20/2025] Open
Abstract
Single-cell Hi-C data provides valuable insights into the three-dimensional organization of chromatin within individual cells, yet modeling this data poses significant challenges due to its inherent sparsity and variability. This review comprehensively explores the predominant approaches to reconstructing 3D chromatin structures from single-cell Hi-C data, positioning these methods within the broader contexts of single-cell Hi-C research and bulk Hi-C data modeling. We categorize the modeling strategies based on their objective functions, which are framed in terms of force fields, potentials, cost functions, or likelihood probabilities. Despite their diverse methodologies, these approaches exhibit deep underlying similarities. We further dissect the basic components of these models, such as attractive restraint forces and repulsive forces, and discuss additional terms like fluid viscosity and variation penalties. The review also critically evaluates the current state of model validation, highlighting the inconsistencies across various studies and emphasizing the need for a comprehensive validation framework. We detail common validation techniques, including the comparison of distance matrices and the assessment of contact violations. We argue that the future of single-cell Hi-C modeling lies in integrating multiple data modalities and incorporating cell cycle trajectory information. Such integration could significantly advance our understanding of chromatin conformation dynamics during cell cycle progression and cell differentiation. We also foresee the continued growth of optimization-based and molecular dynamics approaches, supported by general molecular dynamics toolkits.
Collapse
Affiliation(s)
- Krzysztof Banecki
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Warsaw, Poland
| | - Sevastianos Korsak
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Warsaw, Poland
| | - Dariusz Plewczynski
- Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Warsaw, Poland
| |
Collapse
|
13
|
Ma R, Huang J, Jiang T, Ma W. A mini-review of single-cell Hi-C embedding methods. Comput Struct Biotechnol J 2024; 23:4027-4035. [PMID: 39610904 PMCID: PMC11603012 DOI: 10.1016/j.csbj.2024.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2024] [Revised: 11/01/2024] [Accepted: 11/01/2024] [Indexed: 11/30/2024] Open
Abstract
Single-cell Hi-C (scHi-C) techniques have significantly advanced our understanding of the 3D genome organization, providing crucial insights into the spatial genome architecture within individual nuclei. Numerous computational and statistical methods have been developed to analyze scHi-C data, with embedding methods playing a key role. Embedding reduces the dimensionality of complex scHi-C contact maps, making it easier to extract biologically meaningful patterns. These methods not only enhance cell clustering based on chromatin structures but also facilitate visualization and other downstream analyses. Most scHi-C embedding methods incorporate strategies such as normalization and imputation to address the inherent sparsity of scHi-C data, thereby further improving data quality and interpretability. In this review, we systematically examine the existing methods designed for scHi-C embedding, outlining their methodologies and discussing their capabilities in handling normalization and imputation. Additionally, we present a comprehensive benchmarking analysis to compare both embedding techniques and their clustering performances. This review serves as a practical guide for researchers seeking to select suitable scHi-C embedding tools, ultimately contributing to the understanding of the 3D organization of the genome.
Collapse
Affiliation(s)
- Rui Ma
- Department of Statistics, University of California Riverside, 900 University Ave., Riverside, 92521, CA, USA
| | - Jingong Huang
- Department of Computer Science and Engineering, University of California Riverside, 900 University Ave., Riverside, 92521, CA, USA
| | - Tao Jiang
- Department of Computer Science and Engineering, University of California Riverside, 900 University Ave., Riverside, 92521, CA, USA
- Institute of Integrative Genome Biology, University of California Riverside, 900 University Ave., Riverside, 92521, CA, USA
| | - Wenxiu Ma
- Department of Statistics, University of California Riverside, 900 University Ave., Riverside, 92521, CA, USA
- Institute of Integrative Genome Biology, University of California Riverside, 900 University Ave., Riverside, 92521, CA, USA
| |
Collapse
|
14
|
Zhou X, Wu H. scHiClassifier: a deep learning framework for cell type prediction by fusing multiple feature sets from single-cell Hi-C data. Brief Bioinform 2024; 26:bbaf009. [PMID: 39831891 PMCID: PMC11744636 DOI: 10.1093/bib/bbaf009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Revised: 12/01/2024] [Accepted: 01/06/2025] [Indexed: 01/22/2025] Open
Abstract
Single-cell high-throughput chromosome conformation capture (Hi-C) technology enables capturing chromosomal spatial structure information at the cellular level. However, to effectively investigate changes in chromosomal structure across different cell types, there is a requisite for methods that can identify cell types utilizing single-cell Hi-C data. Current frameworks for cell type prediction based on single-cell Hi-C data are limited, often struggling with features interpretability and biological significance, and lacking convincing and robust classification performance validation. In this study, we propose four new feature sets based on the contact matrix with clear interpretability and biological significance. Furthermore, we develop a novel deep learning framework named scHiClassifier based on multi-head self-attention encoder, 1D convolution and feature fusion, which integrates information from these four feature sets to predict cell types accurately. Through comprehensive comparison experiments with benchmark frameworks on six datasets, we demonstrate the superior classification performance and the universality of the scHiClassifier framework. We further assess the robustness of scHiClassifier through data perturbation experiments and data dropout experiments. Moreover, we demonstrate that using all feature sets in the scHiClassifier framework yields optimal performance, supported by comparisons of different feature set combinations. The effectiveness and the superiority of the multiple feature set extraction are proven by comparison with four unsupervised dimensionality reduction methods. Additionally, we analyze the importance of different feature sets and chromosomes using the "SHapley Additive exPlanations" method. Furthermore, the accuracy and reliability of the scHiClassifier framework in cell classification for single-cell Hi-C data are supported through enrichment analysis. The source code of scHiClassifier is freely available at https://github.com/HaoWuLab-Bioinformatics/scHiClassifier.
Collapse
Affiliation(s)
- Xiangfei Zhou
- School of Software, Shandong University, No. 1500, Shunhua Road, Hi-Tech Industrial Development Zone, Jinan 250100, Shandong, China
| | - Hao Wu
- School of Software, Shandong University, No. 1500, Shunhua Road, Hi-Tech Industrial Development Zone, Jinan 250100, Shandong, China
- Shenzhen Research Institute of Shandong University, Shandong University, No. 19, Gaoxin South 4th Road, Nanshan District, Shenzhen 518063, Guangdong, China
| |
Collapse
|
15
|
Zhou T, Zhang R, Jia D, Doty RT, Munday AD, Gao D, Xin L, Abkowitz JL, Duan Z, Ma J. GAGE-seq concurrently profiles multiscale 3D genome organization and gene expression in single cells. Nat Genet 2024; 56:1701-1711. [PMID: 38744973 PMCID: PMC11323187 DOI: 10.1038/s41588-024-01745-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 04/05/2024] [Indexed: 05/16/2024]
Abstract
The organization of mammalian genomes features a complex, multiscale three-dimensional (3D) architecture, whose functional significance remains elusive because of limited single-cell technologies that can concurrently profile genome organization and transcriptional activities. Here, we introduce genome architecture and gene expression by sequencing (GAGE-seq), a scalable, robust single-cell co-assay measuring 3D genome structure and transcriptome simultaneously within the same cell. Applied to mouse brain cortex and human bone marrow CD34+ cells, GAGE-seq characterized the intricate relationships between 3D genome and gene expression, showing that multiscale 3D genome features inform cell-type-specific gene expression and link regulatory elements to target genes. Integration with spatial transcriptomic data revealed in situ 3D genome variations in mouse cortex. Observations in human hematopoiesis unveiled discordant changes between 3D genome organization and gene expression, underscoring a complex, temporal interplay at the single-cell level. GAGE-seq provides a powerful, cost-effective approach for exploring genome structure and gene expression relationships at the single-cell level across diverse biological contexts.
Collapse
Affiliation(s)
- Tianming Zhou
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Ruochi Zhang
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
- Eric and Wendy Schmidt Center, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Deyong Jia
- Department of Urology, University of Washington, Seattle, WA, USA
| | - Raymond T Doty
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA
| | - Adam D Munday
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA
| | - Daniel Gao
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
- Department of Chemistry, Pomona College, Claremont, CA, USA
| | - Li Xin
- Department of Urology, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
| | - Janis L Abkowitz
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
| | - Zhijun Duan
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA.
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA.
| | - Jian Ma
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
16
|
Liu W, Zhong W, Giusti-Rodríguez P, Jiang Z, Wang GW, Sun H, Hu M, Li Y. SnapHiC-G: identifying long-range enhancer-promoter interactions from single-cell Hi-C data via a global background model. Brief Bioinform 2024; 25:bbae426. [PMID: 39222061 PMCID: PMC11367764 DOI: 10.1093/bib/bbae426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2024] [Revised: 07/05/2024] [Accepted: 08/13/2024] [Indexed: 09/04/2024] Open
Abstract
Harnessing the power of single-cell genomics technologies, single-cell Hi-C (scHi-C) and its derived technologies provide powerful tools to measure spatial proximity between regulatory elements and their target genes in individual cells. Using a global background model, we propose SnapHiC-G, a computational method, to identify long-range enhancer-promoter interactions from scHi-C data. We applied SnapHiC-G to scHi-C datasets generated from mouse embryonic stem cells and human brain cortical cells. SnapHiC-G achieved high sensitivity in identifying long-range enhancer-promoter interactions. Moreover, SnapHiC-G can identify putative target genes for noncoding genome-wide association study (GWAS) variants, and the genetic heritability of neuropsychiatric diseases is enriched for single-nucleotide polymorphisms (SNPs) within SnapHiC-G-identified interactions in a cell-type-specific manner. In sum, SnapHiC-G is a powerful tool for characterizing cell-type-specific enhancer-promoter interactions from complex tissues and can facilitate the discovery of chromatin interactions important for gene regulation in biologically relevant cell types.
Collapse
Affiliation(s)
- Weifang Liu
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599, United States
| | - Wujuan Zhong
- Biostatistics and Research Decision Sciences, Merck & Co., Inc., 126 East Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Paola Giusti-Rodríguez
- Department of Psychiatry, University of Florida, 1149 Newel Dr., Gainesville, FL 32611, United States
| | - Zhiyun Jiang
- Department of Genetics, University of North Carolina at Chapel Hill, 120 Mason Farm Road, Chapel Hill, NC 27599, United States
| | - Geoffery W Wang
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599, United States
| | - Huaigu Sun
- Department of Genetics, University of North Carolina at Chapel Hill, 120 Mason Farm Road, Chapel Hill, NC 27599, United States
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, 9500 Euclid Avenue, Cleveland, OH 44196, United States
| | - Yun Li
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599, United States
- Department of Genetics, University of North Carolina at Chapel Hill, 120 Mason Farm Road, Chapel Hill, NC 27599, United States
- Department of Computer Science, University of North Carolina at Chapel Hill, 201 S. Columbia St, Chapel Hill, NC 27599, United States
| |
Collapse
|
17
|
Zhang L, Bartosovic M. Single-cell mapping of cell-type specific chromatin architecture in the central nervous system. Curr Opin Struct Biol 2024; 86:102824. [PMID: 38723561 DOI: 10.1016/j.sbi.2024.102824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 03/22/2024] [Accepted: 04/08/2024] [Indexed: 05/19/2024]
Abstract
Determining how chromatin is structured in the nucleus is critical to studying its role in gene regulation. Recent advances in the analysis of single-cell chromatin architecture have considerably improved our understanding of cell-type-specific chromosome conformation and nuclear architecture. In this review, we discuss the methods used for analysis of 3D chromatin conformation, including sequencing-based methods, imaging-based techniques, and computational approaches. We further review the application of these methods in the study of the role of chromatin topology in neural development and disorders.
Collapse
Affiliation(s)
- Letian Zhang
- Department of Biochemistry and Biophysics, Svante Arrhenius väg 16C, 162 53, Stockholm, Sweden. https://twitter.com/LetianZHANG_
| | - Marek Bartosovic
- Department of Biochemistry and Biophysics, Svante Arrhenius väg 16C, 162 53, Stockholm, Sweden.
| |
Collapse
|
18
|
Xiong K, Zhang R, Ma J. scGHOST: identifying single-cell 3D genome subcompartments. Nat Methods 2024; 21:814-822. [PMID: 38589516 PMCID: PMC11127718 DOI: 10.1038/s41592-024-02230-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 03/01/2024] [Indexed: 04/10/2024]
Abstract
Single-cell Hi-C (scHi-C) technologies allow for probing of genome-wide cell-to-cell variability in three-dimensional (3D) genome organization from individual cells. Computational methods have been developed to reveal single-cell 3D genome features based on scHi-C, including A/B compartments, topologically associating domains and chromatin loops. However, no method exists for annotating single-cell subcompartments, which is important for understanding chromosome spatial localization in single cells. Here we present scGHOST, a single-cell subcompartment annotation method using graph embedding with constrained random walk sampling. Applications of scGHOST to scHi-C data and contact maps derived from single-cell 3D genome imaging demonstrate reliable identification of single-cell subcompartments, offering insights into cell-to-cell variability of nuclear subcompartments. Using scHi-C data from complex tissues, scGHOST identifies cell-type-specific or allele-specific subcompartments linked to gene transcription across various cell types and developmental stages, suggesting functional implications of single-cell subcompartments. scGHOST is an effective method for annotating single-cell 3D genome subcompartments in a broad range of biological contexts.
Collapse
Affiliation(s)
- Kyle Xiong
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Ruochi Zhang
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
- Eric and Wendy Schmidt Center, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jian Ma
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
19
|
Zhang Y, Boninsegna L, Yang M, Misteli T, Alber F, Ma J. Computational methods for analysing multiscale 3D genome organization. Nat Rev Genet 2024; 25:123-141. [PMID: 37673975 PMCID: PMC11127719 DOI: 10.1038/s41576-023-00638-1] [Citation(s) in RCA: 26] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/12/2023] [Indexed: 09/08/2023]
Abstract
Recent progress in whole-genome mapping and imaging technologies has enabled the characterization of the spatial organization and folding of the genome in the nucleus. In parallel, advanced computational methods have been developed to leverage these mapping data to reveal multiscale three-dimensional (3D) genome features and to provide a more complete view of genome structure and its connections to genome functions such as transcription. Here, we discuss how recently developed computational tools, including machine-learning-based methods and integrative structure-modelling frameworks, have led to a systematic, multiscale delineation of the connections among different scales of 3D genome organization, genomic and epigenomic features, functional nuclear components and genome function. However, approaches that more comprehensively integrate a wide variety of genomic and imaging datasets are still needed to uncover the functional role of 3D genome structure in defining cellular phenotypes in health and disease.
Collapse
Affiliation(s)
- Yang Zhang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Lorenzo Boninsegna
- Department of Microbiology, Immunology and Molecular Genetics and Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, CA, USA
| | - Muyu Yang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Tom Misteli
- Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA.
| | - Frank Alber
- Department of Microbiology, Immunology and Molecular Genetics and Institute for Quantitative and Computational Biosciences, University of California Los Angeles, Los Angeles, CA, USA.
| | - Jian Ma
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
20
|
Hua D, Gu M, Zhang X, Du Y, Xie H, Qi L, Du X, Bai Z, Zhu X, Tian D. DiffDomain enables identification of structurally reorganized topologically associating domains. Nat Commun 2024; 15:502. [PMID: 38218905 PMCID: PMC10787792 DOI: 10.1038/s41467-024-44782-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 01/02/2024] [Indexed: 01/15/2024] Open
Abstract
Topologically associating domains (TADs) are critical structural units in three-dimensional genome organization of mammalian genome. Dynamic reorganizations of TADs between health and disease states are associated with essential genome functions. However, computational methods for identifying reorganized TADs are still in the early stages of development. Here, we present DiffDomain, an algorithm leveraging high-dimensional random matrix theory to identify structurally reorganized TADs using high-throughput chromosome conformation capture (Hi-C) contact maps. Method comparison using multiple real Hi-C datasets reveals that DiffDomain outperforms alternative methods for false positive rates, true positive rates, and identifying a new subtype of reorganized TADs. Applying DiffDomain to Hi-C data from different cell types and disease states demonstrates its biological relevance. Identified reorganized TADs are associated with structural variations and epigenomic changes such as changes in CTCF binding sites. By applying to a single-cell Hi-C data from mouse neuronal development, DiffDomain can identify reorganized TADs between cell types with reasonable reproducibility using pseudo-bulk Hi-C data from as few as 100 cells per condition. Moreover, DiffDomain reveals differential cell-to-population variability and heterogeneous cell-to-cell variability in TADs. Therefore, DiffDomain is a statistically sound method for better comparative analysis of TADs using both Hi-C and single-cell Hi-C data.
Collapse
Affiliation(s)
- Dunming Hua
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Ming Gu
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Xiao Zhang
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Yanyi Du
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Hangcheng Xie
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Li Qi
- Chongqing Municipal Center for Disease Control and Prevention, Chongqing, 400042, China
| | - Xiangjun Du
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
| | - Zhidong Bai
- KLASMOE & School of Mathematics and Statistics, Northeast Normal University, Changchun, Jilin, 130024, China
| | - Xiaopeng Zhu
- MyCellome LLC., Allison Park, PA, 15101, USA
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - Dechao Tian
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen, Guangdong, 510275, China.
- Department of Biostatistics and Systems Biology, School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China.
| |
Collapse
|
21
|
Gunsalus LM, Keiser MJ, Pollard KS. ChromaFactor: deconvolution of single-molecule chromatin organization with non-negative matrix factorization. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.22.568268. [PMID: 38045231 PMCID: PMC10690235 DOI: 10.1101/2023.11.22.568268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The investigation of chromatin organization in single cells holds great promise for identifying causal relationships between genome structure and function. However, analysis of single-molecule data is hampered by extreme yet inherent heterogeneity, making it challenging to determine the contributions of individual chromatin fibers to bulk trends. To address this challenge, we propose ChromaFactor, a novel computational approach based on non-negative matrix factorization that deconvolves single-molecule chromatin organization datasets into their most salient primary components. ChromaFactor provides the ability to identify trends accounting for the maximum variance in the dataset while simultaneously describing the contribution of individual molecules to each component. Applying our approach to two single-molecule imaging datasets across different genomic scales, we find that these primary components demonstrate significant correlation with key functional phenotypes, including active transcription, enhancer-promoter distance, and genomic compartment. ChromaFactor offers a robust tool for understanding the complex interplay between chromatin structure and function on individual DNA molecules, pinpointing which subpopulations drive functional changes and fostering new insights into cellular heterogeneity and its implications for bulk genomic phenomena.
Collapse
Affiliation(s)
- Laura M. Gunsalus
- Gladstone Institutes, San Francisco, CA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
| | - Michael J. Keiser
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA
- Kavli Institute for Fundamental Neuroscience, University of California, San Francisco, San Francisco, CA
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA
- Department of Epidemiology & Biostatistics, University of California, San Francisco, San Francisco, CA
| | - Katherine S. Pollard
- Gladstone Institutes, San Francisco, CA
- Institute for Neurodegenerative Diseases, University of California, San Francisco, San Francisco, CA
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA
- Chan Zuckerberg Biohub, San Francisco, CA
| |
Collapse
|
22
|
龚 海, 麻 付, 张 晓. [Advances in methods and applications of single-cell Hi-C data analysis]. SHENG WU YI XUE GONG CHENG XUE ZA ZHI = JOURNAL OF BIOMEDICAL ENGINEERING = SHENGWU YIXUE GONGCHENGXUE ZAZHI 2023; 40:1033-1039. [PMID: 37879935 PMCID: PMC10600426 DOI: 10.7507/1001-5515.202303046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 08/29/2023] [Indexed: 10/27/2023]
Abstract
Chromatin three-dimensional genome structure plays a key role in cell function and gene regulation. Single-cell Hi-C techniques can capture genomic structure information at the cellular level, which provides an opportunity to study changes in genomic structure between different cell types. Recently, some excellent computational methods have been developed for single-cell Hi-C data analysis. In this paper, the available methods for single-cell Hi-C data analysis were first reviewed, including preprocessing of single-cell Hi-C data, multi-scale structure recognition based on single-cell Hi-C data, bulk-like Hi-C contact matrix generation based on single-cell Hi-C data sets, pseudo-time series analysis, and cell classification. Then the application of single-cell Hi-C data in cell differentiation and structural variation was described. Finally, the future development direction of single-cell Hi-C data analysis was also prospected.
Collapse
Affiliation(s)
- 海燕 龚
- 北京科技大学 新材料技术研究院 (北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
- 北京科技大学 计算机与通信工程学院(北京 100083)School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China
| | - 付强 麻
- 北京科技大学 新材料技术研究院 (北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
| | - 晓彤 张
- 北京科技大学 新材料技术研究院 (北京 100083)Institute for Advanced Materials and Technology, University of Science and Technology Beijing, Beijing 100083, P. R. China
- 北京科技大学 计算机与通信工程学院(北京 100083)School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing 100083, P. R. China
| |
Collapse
|
23
|
Tian W, Zhou J, Bartlett A, Zeng Q, Liu H, Castanon RG, Kenworthy M, Altshul J, Valadon C, Aldridge A, Nery JR, Chen H, Xu J, Johnson ND, Lucero J, Osteen JK, Emerson N, Rink J, Lee J, Li Y, Siletti K, Liem M, Claffey N, O’Connor C, Yanny AM, Nyhus J, Dee N, Casper T, Shapovalova N, Hirschstein D, Ding SL, Hodge R, Levi BP, Keene CD, Linnarsson S, Lein E, Ren B, Behrens MM, Ecker JR. Single-cell DNA methylation and 3D genome architecture in the human brain. Science 2023; 382:eadf5357. [PMID: 37824674 PMCID: PMC10572106 DOI: 10.1126/science.adf5357] [Citation(s) in RCA: 61] [Impact Index Per Article: 30.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 09/05/2023] [Indexed: 10/14/2023]
Abstract
Delineating the gene-regulatory programs underlying complex cell types is fundamental for understanding brain function in health and disease. Here, we comprehensively examined human brain cell epigenomes by probing DNA methylation and chromatin conformation at single-cell resolution in 517 thousand cells (399 thousand neurons and 118 thousand non-neurons) from 46 regions of three adult male brains. We identified 188 cell types and characterized their molecular signatures. Integrative analyses revealed concordant changes in DNA methylation, chromatin accessibility, chromatin organization, and gene expression across cell types, cortical areas, and basal ganglia structures. We further developed single-cell methylation barcodes that reliably predict brain cell types using the methylation status of select genomic sites. This multimodal epigenomic brain cell atlas provides new insights into the complexity of cell-type-specific gene regulation in adult human brains.
Collapse
Affiliation(s)
- Wei Tian
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jingtian Zhou
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA 92037, USA
| | - Anna Bartlett
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Qiurui Zeng
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Division of Biological Sciences, University of California, San Diego, La Jolla, CA 92037, USA
| | - Hanqing Liu
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Rosa G. Castanon
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Mia Kenworthy
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jordan Altshul
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Cynthia Valadon
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Andrew Aldridge
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Joseph R. Nery
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Huaming Chen
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jiaying Xu
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Nicholas D. Johnson
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jacinta Lucero
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Julia K. Osteen
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Nora Emerson
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jon Rink
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Jasper Lee
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Yang Li
- Ludwig Institute for Cancer Research, La Jolla, CA 92037, USA
| | - Kimberly Siletti
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet; 171 77 Stockholm, Sweden
| | - Michelle Liem
- Flow Cytometry Core Facility, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Naomi Claffey
- Flow Cytometry Core Facility, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Caz O’Connor
- Flow Cytometry Core Facility, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | | | - Julie Nyhus
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Nick Dee
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Tamara Casper
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | | | | | - Song-Lin Ding
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Rebecca Hodge
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Boaz P. Levi
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - C. Dirk Keene
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
| | - Sten Linnarsson
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet; 171 77 Stockholm, Sweden
| | - Ed Lein
- Allen Institute for Brain Science; Seattle, WA 98109, USA
| | - Bing Ren
- Ludwig Institute for Cancer Research, La Jolla, CA 92037, USA
- Center for Epigenomics, University of California, San Diego School of Medicine, La Jolla, CA 92037, USA
- Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, CA 92037, USA
- Institute of Genomic Medicine, University of California, San Diego School of Medicine, La Jolla, CA 92037, USA
- Moores Cancer Center, University of California, San Diego School of Medicine, La Jolla, CA 92037, USA
| | - M. Margarita Behrens
- Computational Neurobiology Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Joseph R. Ecker
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
- Howard Hughes Medical Institute, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| |
Collapse
|
24
|
Lee L, Yu M, Li X, Zhu C, Zhang Y, Yu H, Chen Z, Mishra S, Ren B, Li Y, Hu M. SnapHiC-D: a computational pipeline to identify differential chromatin contacts from single-cell Hi-C data. Brief Bioinform 2023; 24:bbad315. [PMID: 37649383 PMCID: PMC10516352 DOI: 10.1093/bib/bbad315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 08/04/2023] [Accepted: 08/07/2023] [Indexed: 09/01/2023] Open
Abstract
Single-cell high-throughput chromatin conformation capture technologies (scHi-C) has been used to map chromatin spatial organization in complex tissues. However, computational tools to detect differential chromatin contacts (DCCs) from scHi-C datasets in development and through disease pathogenesis are still lacking. Here, we present SnapHiC-D, a computational pipeline to identify DCCs between two scHi-C datasets. Compared to methods designed for bulk Hi-C data, SnapHiC-D detects DCCs with high sensitivity and accuracy. We used SnapHiC-D to identify cell-type-specific chromatin contacts at 10 Kb resolution in mouse hippocampal and human prefrontal cortical tissues, demonstrating that DCCs detected in the hippocampal and cortical cell types are generally associated with cell-type-specific gene expression patterns and epigenomic features. SnapHiC-D is freely available at https://github.com/HuMingLab/SnapHiC-D.
Collapse
Affiliation(s)
- Lindsay Lee
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
| | - Miao Yu
- Ludwig Institute for Cancer Research, La Jolla, CA, USA
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Xiaoqi Li
- Carolina Health Informatics Program, University of North Carolina, Chapel Hill, NC, USA
| | - Chenxu Zhu
- Ludwig Institute for Cancer Research, La Jolla, CA, USA
- New York Genome Center, New York, NY, USA
- Department of Physiology and Biophysics, Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Yanxiao Zhang
- Ludwig Institute for Cancer Research, La Jolla, CA, USA
- Westlake University, Hangzhou, Zhejiang, China
| | - Hongyu Yu
- Department of Statistics, University of Wisconsin Madison, Madison, WI, USA
- Department of Biochemistry, University of Wisconsin Madison, Madison, WI, USA
| | - Ziyin Chen
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Shreya Mishra
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
| | - Bing Ren
- Ludwig Institute for Cancer Research, La Jolla, CA, USA
- Center for Epigenomics & Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Yun Li
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
- Department of Computer Science, University of North Carolina, Chapel Hill, NC, USA
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Foundation, Cleveland, OH, USA
| |
Collapse
|
25
|
Tan L, Shi J, Moghadami S, Parasar B, Wright CP, Seo Y, Vallejo K, Cobos I, Duncan L, Chen R, Deisseroth K. Lifelong restructuring of 3D genome architecture in cerebellar granule cells. Science 2023; 381:1112-1119. [PMID: 37676945 PMCID: PMC11059189 DOI: 10.1126/science.adh3253] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2023] [Accepted: 08/03/2023] [Indexed: 09/09/2023]
Abstract
The cerebellum contains most of the neurons in the human brain and exhibits distinctive modes of development and aging. In this work, by developing our single-cell three-dimensional (3D) genome assay-diploid chromosome conformation capture, or Dip-C-into population-scale (Pop-C) and virus-enriched (vDip-C) modes, we resolved the first 3D genome structures of single cerebellar cells, created life-spanning 3D genome atlases for both humans and mice, and jointly measured transcriptome and chromatin accessibility during development. We found that although the transcriptome and chromatin accessibility of cerebellar granule neurons mature in early postnatal life, 3D genome architecture gradually remodels throughout life, establishing ultra-long-range intrachromosomal contacts and specific interchromosomal contacts that are rarely seen in neurons. These results reveal unexpected evolutionarily conserved molecular processes that underlie distinctive features of neural development and aging across the mammalian life span.
Collapse
Affiliation(s)
- Longzhi Tan
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
- Department of Bioengineering, Stanford University, Stanford, CA, 94305
| | - Jenny Shi
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
- Department of Bioengineering, Stanford University, Stanford, CA, 94305
- Department of Chemistry, Stanford University, Stanford, CA, 94305
| | - Siavash Moghadami
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA, 94305
| | - Bibudha Parasar
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
| | - Cydney P. Wright
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
- Department of Biology, Stanford University, Stanford, CA, 94305
| | - Yunji Seo
- Department of Neurobiology, Stanford University, Stanford, CA, 94305
| | - Kristen Vallejo
- Department of Pathology, Stanford University, Stanford, CA, 94305
| | - Inma Cobos
- Department of Pathology, Stanford University, Stanford, CA, 94305
| | - Laramie Duncan
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, 94305
| | - Ritchie Chen
- Department of Bioengineering, Stanford University, Stanford, CA, 94305
| | - Karl Deisseroth
- Department of Bioengineering, Stanford University, Stanford, CA, 94305
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, 94305
- Howard Hughes Medical Institute, Stanford, CA, 94305
| |
Collapse
|
26
|
Dekker J, Alber F, Aufmkolk S, Beliveau BJ, Bruneau BG, Belmont AS, Bintu L, Boettiger A, Calandrelli R, Disteche CM, Gilbert DM, Gregor T, Hansen AS, Huang B, Huangfu D, Kalhor R, Leslie CS, Li W, Li Y, Ma J, Noble WS, Park PJ, Phillips-Cremins JE, Pollard KS, Rafelski SM, Ren B, Ruan Y, Shav-Tal Y, Shen Y, Shendure J, Shu X, Strambio-De-Castillia C, Vertii A, Zhang H, Zhong S. Spatial and temporal organization of the genome: Current state and future aims of the 4D nucleome project. Mol Cell 2023; 83:2624-2640. [PMID: 37419111 PMCID: PMC10528254 DOI: 10.1016/j.molcel.2023.06.018] [Citation(s) in RCA: 47] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 06/10/2023] [Accepted: 06/12/2023] [Indexed: 07/09/2023]
Abstract
The four-dimensional nucleome (4DN) consortium studies the architecture of the genome and the nucleus in space and time. We summarize progress by the consortium and highlight the development of technologies for (1) mapping genome folding and identifying roles of nuclear components and bodies, proteins, and RNA, (2) characterizing nuclear organization with time or single-cell resolution, and (3) imaging of nuclear organization. With these tools, the consortium has provided over 2,000 public datasets. Integrative computational models based on these data are starting to reveal connections between genome structure and function. We then present a forward-looking perspective and outline current aims to (1) delineate dynamics of nuclear architecture at different timescales, from minutes to weeks as cells differentiate, in populations and in single cells, (2) characterize cis-determinants and trans-modulators of genome organization, (3) test functional consequences of changes in cis- and trans-regulators, and (4) develop predictive models of genome structure and function.
Collapse
Affiliation(s)
- Job Dekker
- University of Massachusetts Chan Medical School, Boston, MA, USA; Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| | - Frank Alber
- University of California, Los Angeles, Los Angeles, CA, USA
| | | | | | - Benoit G Bruneau
- Gladstone Institutes, San Francisco, CA, USA; University of California, San Francisco, San Francisco, CA, USA
| | | | | | | | | | | | | | | | | | - Bo Huang
- University of California, San Francisco, San Francisco, CA, USA
| | - Danwei Huangfu
- Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Reza Kalhor
- Johns Hopkins University, Baltimore, MD, USA
| | | | - Wenbo Li
- University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Yun Li
- University of North Carolina, Gillings School of Global Public Health, Chapel Hill, NC, USA
| | - Jian Ma
- Carnegie Mellon University, Pittsburgh, PA, USA
| | | | | | | | - Katherine S Pollard
- Gladstone Institutes, San Francisco, CA, USA; University of California, San Francisco, San Francisco, CA, USA; Chan Zuckerberg Biohub, San Francisco, San Francisco, CA, USA
| | | | - Bing Ren
- University of California, San Diego, La Jolla, CA, USA
| | - Yijun Ruan
- Zhejiang University, Hangzhou, Zhejiang, China
| | | | - Yin Shen
- University of California, San Francisco, San Francisco, CA, USA
| | | | - Xiaokun Shu
- University of California, San Francisco, San Francisco, CA, USA
| | | | | | | | - Sheng Zhong
- University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
27
|
Gaulton KJ, Preissl S, Ren B. Interpreting non-coding disease-associated human variants using single-cell epigenomics. Nat Rev Genet 2023; 24:516-534. [PMID: 37161089 PMCID: PMC10629587 DOI: 10.1038/s41576-023-00598-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/27/2023] [Indexed: 05/11/2023]
Abstract
Genome-wide association studies (GWAS) have linked hundreds of thousands of sequence variants in the human genome to common traits and diseases. However, translating this knowledge into a mechanistic understanding of disease-relevant biology remains challenging, largely because such variants are predominantly in non-protein-coding sequences that still lack functional annotation at cell-type resolution. Recent advances in single-cell epigenomics assays have enabled the generation of cell type-, subtype- and state-resolved maps of the epigenome in heterogeneous human tissues. These maps have facilitated cell type-specific annotation of candidate cis-regulatory elements and their gene targets in the human genome, enhancing our ability to interpret the genetic basis of common traits and diseases.
Collapse
Affiliation(s)
- Kyle J Gaulton
- Department of Paediatrics, Paediatric Diabetes Research Center, University of California San Diego School of Medicine, La Jolla, CA, USA.
| | - Sebastian Preissl
- Center for Epigenomics, University of California San Diego School of Medicine, La Jolla, CA, USA.
- Institute of Experimental and Clinical Pharmacology and Toxicology, Faculty of Medicine, University of Freiburg, Freiburg, Germany.
| | - Bing Ren
- Center for Epigenomics, University of California San Diego School of Medicine, La Jolla, CA, USA.
- Department of Cellular and Molecular Medicine, University of California San Diego School of Medicine, La Jolla, CA, USA.
- Ludwig Institute for Cancer Research, La Jolla, CA, USA.
| |
Collapse
|
28
|
Zhou T, Zhang R, Jia D, Doty RT, Munday AD, Gao D, Xin L, Abkowitz JL, Duan Z, Ma J. Concurrent profiling of multiscale 3D genome organization and gene expression in single mammalian cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.20.549578. [PMID: 37546900 PMCID: PMC10401946 DOI: 10.1101/2023.07.20.549578] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
The organization of mammalian genomes within the nucleus features a complex, multiscale three-dimensional (3D) architecture. The functional significance of these 3D genome features, however, remains largely elusive due to limited single-cell technologies that can concurrently profile genome organization and transcriptional activities. Here, we report GAGE-seq, a highly scalable, robust single-cell co-assay that simultaneously measures 3D genome structure and transcriptome within the same cell. Employing GAGE-seq on mouse brain cortex and human bone marrow CD34+ cells, we comprehensively characterized the intricate relationships between 3D genome and gene expression. We found that these multiscale 3D genome features collectively inform cell type-specific gene expressions, hence contributing to defining cell identity at the single-cell level. Integration of GAGE-seq data with spatial transcriptomic data revealed in situ variations of the 3D genome in mouse cortex. Moreover, our observations of lineage commitment in normal human hematopoiesis unveiled notable discordant changes between 3D genome organization and gene expression, underscoring a complex, temporal interplay at the single-cell level that is more nuanced than previously appreciated. Together, GAGE-seq provides a powerful, cost-effective approach for interrogating genome structure and gene expression relationships at the single-cell level across diverse biological contexts.
Collapse
Affiliation(s)
- Tianming Zhou
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Ruochi Zhang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Present address: Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Deyong Jia
- Department of Urology, University of Washington, Seattle, WA 98195, USA
| | - Raymond T. Doty
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Adam D. Munday
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
| | - Daniel Gao
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
- Present address: Department of Chemistry, Pomona College, Claremont, CA 91711, USA
| | - Li Xin
- Department of Urology, University of Washington, Seattle, WA 98195, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
| | - Janis L. Abkowitz
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
| | - Zhijun Duan
- Division of Hematology, Department of Medicine, University of Washington, Seattle, WA 98195, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA 98109, USA
| | - Jian Ma
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
29
|
Liu Z, Chen Y, Xia Q, Liu M, Xu H, Chi Y, Deng Y, Xing D. Linking genome structures to functions by simultaneous single-cell Hi-C and RNA-seq. Science 2023; 380:1070-1076. [PMID: 37289875 DOI: 10.1126/science.adg3797] [Citation(s) in RCA: 58] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 05/07/2023] [Indexed: 06/10/2023]
Abstract
Much progress has been made recently in single-cell chromosome conformation capture technologies. However, a method that allows simultaneous profiling of chromatin architecture and gene expression has not been reported. Here, we developed an assay named "Hi-C and RNA-seq employed simultaneously" (HiRES) and performed it on thousands of single cells from developing mouse embryos. Single-cell three-dimensional genome structures, despite being heavily determined by the cell cycle and developmental stages, gradually diverged in a cell type-specific manner as development progressed. By comparing the pseudotemporal dynamics of chromatin interactions with gene expression, we found a widespread chromatin rewiring that occurred before transcription activation. Our results demonstrate that the establishment of specific chromatin interactions is tightly related to transcriptional control and cell functions during lineage specification.
Collapse
Affiliation(s)
- Zhiyuan Liu
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
| | - Yujie Chen
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
| | - Qimin Xia
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
| | - Menghan Liu
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
| | - Heming Xu
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
| | - Yi Chi
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
| | - Yujing Deng
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
| | - Dong Xing
- Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, Peking University, Beijing, China
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, China
| |
Collapse
|
30
|
Xiong K, Zhang R, Ma J. scGHOST: Identifying single-cell 3D genome subcompartments. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.24.542032. [PMID: 37292994 PMCID: PMC10245874 DOI: 10.1101/2023.05.24.542032] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
New single-cell Hi-C (scHi-C) technologies enable probing of the genome-wide cell-to-cell variability in 3D genome organization from individual cells. Several computational methods have been developed to reveal single-cell 3D genome features based on scHi-C data, including A/B compartments, topologically-associating domains, and chromatin loops. However, no scHi-C analysis method currently exists for annotating single-cell subcompartments, which are crucial for providing a more refined view of large-scale chromosome spatial localization in single cells. Here, we present scGhost, a single-cell subcompartment annotation method based on graph embedding with constrained random walk sampling. Applications of scGhost to scHi-C data and single-cell 3D genome imaging data demonstrate the reliable identification of single-cell subcompartments and offer new insights into cell-to-cell variability of nuclear subcompartments. Using scHi-C data from the human prefrontal cortex, scGhost identifies cell type-specific subcompartments that are strongly connected to cell type-specific gene expression, suggesting the functional implications of single-cell subcompartments. Overall, scGhost is an effective new method for single-cell 3D genome subcompartment annotation based on scHi-C data for a broad range of biological contexts.
Collapse
Affiliation(s)
- Kyle Xiong
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Ruochi Zhang
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Jian Ma
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|