1
|
Liu Y, Pei W, Chen L, Xia Y, Yan H, Hu X. scCorrect: Cross-modality label transfer from scRNA-seq to scATAC-seq using domain adaptation. Anal Biochem 2025; 702:115847. [PMID: 40154828 DOI: 10.1016/j.ab.2025.115847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2024] [Revised: 03/10/2025] [Accepted: 03/15/2025] [Indexed: 04/01/2025]
Abstract
Cell type annotation in single-cell chromatin accessibility sequencing (scATAC-seq) is crucial for enabling researchers to identify subpopulations of cells associated with specific diseases, elucidate gene regulatory networks, and discover markers indicative of disease states. The prevailing approach for cell type annotation in single-cell research involves transferring well-delineated cell types from single-cell RNA sequencing (scRNA-seq) data to scATAC-seq data using a label propagation algorithm. However, the inherent modal discrepancies (i.e.biological interpretation) between scRNA-seq and scATAC-seq data, coupled with the intrinsic sparsity and high dimensionality of scATAC-seq data, pose significant challenges to the efficacy of this strategy. To address these challenges, we introduce a novel neural network framework, scCorrect, which operates in two distinct phases. In the first phase, scCorrect aligns the scRNA-seq and scATAC-seq datasets, generating initial annotation results. The second phase involves training a corrective network specifically designed to amend any erroneous annotations produced during the first phase. Empirical tests across multiple datasets have demonstrated that scCorrect consistently achieves superior recognition accuracy, underscoring its significant potential to enhance disease-related research in humans.
Collapse
Affiliation(s)
- Yan Liu
- Department of Computer Science, Yangzhou University, Yangzhou, 225100, PR China.
| | - Wenyi Pei
- Geriatric Department, Shanghai Baoshan District Wusong Central Hospital, Tongtai North Road 101, Shanghai, 200940, PR China
| | - Li Chen
- Department of Computer Science, Yangzhou University, Yangzhou, 225100, PR China
| | - Yu Xia
- Department of Computer Science, Yangzhou University, Yangzhou, 225100, PR China
| | - He Yan
- College of Information Science and Technology, Nanjing Forestry University, Nanjing, 210037, PR China
| | - Xiaohua Hu
- Geriatric Department, Shanghai Baoshan District Wusong Central Hospital, Tongtai North Road 101, Shanghai, 200940, PR China; Digital Innovation Laboratory, The First Affiliated Hospital of Naval Medical University, Changhai Road 168, Shanghai, 200433, PR China.
| |
Collapse
|
2
|
Adil M, Kolarova TR, Doebley AL, Chen LA, Tobey CL, Galipeau P, Rosen S, Yang M, Colbert B, Patton RD, Persse TW, Kawelo E, Reichel JB, Pritchard CC, Akilesh S, Lockwood CM, Ha G, Shree R. Preeclampsia risk prediction from prenatal cell-free DNA screening. Nat Med 2025; 31:1312-1318. [PMID: 39939524 PMCID: PMC12003088 DOI: 10.1038/s41591-025-03509-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 01/14/2025] [Indexed: 02/14/2025]
Abstract
Preeclampsia is characterized by placental dysfunction and results in significant morbidity, but reliable early prediction remains challenging. We investigated whether clinically obtained prenatal cell-free DNA (cfDNA) screening (PDNAS) using whole-genome sequencing (WGS) data can be leveraged to predict preeclampsia risk early in pregnancy (≤16 weeks). Using 1,854 routinely collected clinical PDNAS samples (median, 12.1 weeks) with low-coverage (0.5×) WGS data, we developed a framework to quantify maternal and fetal tissue signatures using nucleosome accessibility, revealing early placental and endothelial dysfunction. These signatures informed a prediction model for preeclampsia risk, which achieved a validation performance of 0.85 area under the receiver operating characteristic curve (AUC) (81% sensitivity at 80% specificity) for preterm phenotypes several months prior to disease onset in a separate cohort of 831 consecutively collected samples, and subsequently confirmed in an external cohort of 141 samples (AUC 0.84, 79% sensitivity). We demonstrate that assessment of cfDNA nucleosome accessibility from early-pregnancy cfDNA sequence data enables the detection of early placental and endothelial-tissue aberrations and may aid in the determination of preeclampsia risk.
Collapse
Affiliation(s)
- Mohamed Adil
- Divisions of Public Health Sciences and Human Biology, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, USA
- Molecular Medicine and Mechanisms of Disease (M3D) Program, Seattle, WA, USA
| | - Teodora R Kolarova
- Department of Obstetrics and Gynecology, Division of Maternal Fetal Medicine, University of Washington, Seattle, WA, USA
| | - Anna-Lisa Doebley
- Divisions of Public Health Sciences and Human Biology, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Medical Scientist Training Program, University of Washington, Seattle, WA, USA
| | - Leah A Chen
- School of Medicine, University of Washington, Seattle, WA, USA
| | - Cara L Tobey
- Department of Obstetrics and Gynecology, Division of Maternal Fetal Medicine, University of Washington, Seattle, WA, USA
| | - Patricia Galipeau
- Divisions of Public Health Sciences and Human Biology, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Sam Rosen
- Department of Obstetrics and Gynecology, Division of Maternal Fetal Medicine, University of Washington, Seattle, WA, USA
| | - Michael Yang
- Divisions of Public Health Sciences and Human Biology, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Brice Colbert
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, USA
| | - Robert D Patton
- Divisions of Public Health Sciences and Human Biology, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Thomas W Persse
- Divisions of Public Health Sciences and Human Biology, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Erin Kawelo
- Divisions of Public Health Sciences and Human Biology, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Jonathan B Reichel
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, USA
| | - Colin C Pritchard
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Shreeram Akilesh
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Christina M Lockwood
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Gavin Ha
- Divisions of Public Health Sciences and Human Biology, Fred Hutchinson Cancer Center, Seattle, WA, USA.
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA.
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
| | - Raj Shree
- Department of Obstetrics and Gynecology, Division of Maternal Fetal Medicine, University of Washington, Seattle, WA, USA.
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA.
| |
Collapse
|
3
|
Houdjedj A, Marouf Y, Myradov M, Doğan SO, Erten BO, Tastan O, Erten C, Kazan H. SCITUNA: single-cell data integration tool using network alignment. BMC Bioinformatics 2025; 26:92. [PMID: 40148808 PMCID: PMC11951583 DOI: 10.1186/s12859-025-06087-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2024] [Accepted: 02/17/2025] [Indexed: 03/29/2025] Open
Abstract
BACKGROUND As single-cell genomics experiments increase in complexity and scale, the need to integrate multiple datasets has grown. Such integration enhances cellular feature identification by leveraging larger data volumes. However, batch effects-technical variations arising from differences in labs, times, or protocols-pose a significant challenge. Despite numerous proposed batch correction methods, many still have limitations, such as outputting only dimension-reduced data, relying on computationally intensive models, or resulting in overcorrection for batches with diverse cell type composition. RESULTS We introduce a novel method for batch effect correction named SCITUNA, a Single-Cell data Integration Tool Using Network Alignment. We perform evaluations on 39 individual batches from four real datasets and a simulated dataset, which include both scRNA-seq and scATAC-seq datasets, spanning multiple organisms and tissues. A thorough comparison of existing batch correction methods using 13 metrics reveals that SCITUNA outperforms current approaches and is successful at preserving biological signals present in the original data. In particular, SCITUNA shows a better performance than the current methods in all the comparisons except for the multiple batch integration of the lung dataset where the difference is 0.004. CONCLUSION SCITUNA effectively removes batch effects while retaining the biological signals present in the data. Our extensive experiments reveal that SCITUNA will be a valuable tool for diverse integration tasks.
Collapse
Affiliation(s)
- Aissa Houdjedj
- Antalya Bilim University, 07190, Antalya, Turkey
- Akdeniz University, 07058, Antalya, Turkey
| | | | | | | | | | | | | | - Hilal Kazan
- Antalya Bilim University, 07190, Antalya, Turkey.
| |
Collapse
|
4
|
Wu W, Wang S, Zhang K, Li H, Qiao S, Zhang Y, Pang S. scMDCL: A Deep Collaborative Contrastive Learning Framework for Matched Single-Cell Multiomics Data Clustering. J Chem Inf Model 2025; 65:3048-3063. [PMID: 40068854 DOI: 10.1021/acs.jcim.4c02114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2025]
Abstract
Single-cell multiomics clustering integrates multiple omics data to analyze cellular heterogeneity and is crucial for uncovering complex biological processes and disease mechanisms. However, existing matched single-cell multiomics clustering methods often neglect the full utilization of intercellular relationships and the interactions and synergy between features from different omics, leading to suboptimal clustering performance. In this paper, we propose a deep collaborative contrastive learning framework for matched single-cell multiomics data clustering, named scMDCL. This framework fully leverages intercell relationships while enhancing feature interactions among identical cells across different omics data, thereby facilitating efficient clustering of multiomics data. Specifically, to fully utilize the topological information between cells, a graph autoencoder and a feature information enhancement module are designed for different omics, enabling the extraction and augmentation of cell features. Additionally, contrastive learning techniques are employed to strengthen the interactions among the different omics features of the same cell. Ultimately, multiomics deep collaborative clustering modules are utilized to achieve single-cell multiomics clustering. Extensive experiments conducted on nine publicly available single-cell multiomics datasets demonstrate the superior performance of the proposed framework in integrating multiomics data for clustering tasks.
Collapse
Affiliation(s)
- Wenhao Wu
- Qingdao Institute of Software, College of Computer Science and Technology, State Key Laboratory of Chemical Safety, Shandong Key Laboratory of Intelligent Oil & Gas Industrial Software, China University of Petroleum (East China), Qingdao 266580, China
| | - Shudong Wang
- Qingdao Institute of Software, College of Computer Science and Technology, State Key Laboratory of Chemical Safety, Shandong Key Laboratory of Intelligent Oil & Gas Industrial Software, China University of Petroleum (East China), Qingdao 266580, China
| | - Kuijie Zhang
- Qingdao Institute of Software, College of Computer Science and Technology, State Key Laboratory of Chemical Safety, Shandong Key Laboratory of Intelligent Oil & Gas Industrial Software, China University of Petroleum (East China), Qingdao 266580, China
| | - Hengxiao Li
- Qingdao Institute of Software, College of Computer Science and Technology, State Key Laboratory of Chemical Safety, Shandong Key Laboratory of Intelligent Oil & Gas Industrial Software, China University of Petroleum (East China), Qingdao 266580, China
| | - Sibo Qiao
- School of software, Tiangong university, Tianjin 300387, China
| | - Yuanyuan Zhang
- The College of Information and Control Engineering, Qingdao University of Technology, Qingdao, Shandong 266520, China
| | - Shanchen Pang
- Qingdao Institute of Software, College of Computer Science and Technology, State Key Laboratory of Chemical Safety, Shandong Key Laboratory of Intelligent Oil & Gas Industrial Software, China University of Petroleum (East China), Qingdao 266580, China
| |
Collapse
|
5
|
Braccioli L, van den Brand T, Alonso Saiz N, Fountas C, Celie PHN, Kazokaitė-Adomaitienė J, de Wit E. Identifying cross-lineage dependencies of cell-type-specific regulators in mouse gastruloids. Dev Cell 2025:S1534-5807(25)00118-2. [PMID: 40101716 DOI: 10.1016/j.devcel.2025.02.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/09/2024] [Accepted: 02/21/2025] [Indexed: 03/20/2025]
Abstract
Correct gene expression levels are crucial for normal development. Advances in genomics enable the inference of gene regulatory programs active during development but cannot capture the complex multicellular interactions occurring during mammalian embryogenesis in utero. In vitro models of mammalian development, like gastruloids, can overcome this limitation. Using time-resolved single-cell chromatin accessibility analysis, we delineated the regulatory profile during mouse gastruloid development, identifying critical drivers of developmental transitions. Gastruloids develop from bipotent progenitor cells driven by the transcription factors (TFs) OCT4, SOX2, and TBXT, differentiating into the mesoderm (characterized by the mesogenin 1 [MSGN1]) and spinal cord (characterized by CDX2). ΔCDX gastruloids fail to form spinal cord, while Msgn1 ablation inhibits paraxial mesoderm and spinal cord development. Chimeric gastruloids with ΔMSGN1 and wild-type cells formed both tissues, indicating that inter-tissue communication is necessary for spinal cord formation. Our work has important implications for studying inter-tissue communication and gene regulatory programs in development.
Collapse
Affiliation(s)
- Luca Braccioli
- Division of Gene Regulation, Netherlands Cancer Institute, 1066 CX Amsterdam, the Netherlands.
| | - Teun van den Brand
- Division of Gene Regulation, Netherlands Cancer Institute, 1066 CX Amsterdam, the Netherlands
| | - Noemi Alonso Saiz
- Division of Gene Regulation, Netherlands Cancer Institute, 1066 CX Amsterdam, the Netherlands
| | - Charis Fountas
- Division of Gene Regulation, Netherlands Cancer Institute, 1066 CX Amsterdam, the Netherlands
| | - Patrick H N Celie
- Protein Facility, Division of Biochemistry, Netherlands Cancer Institute, 1066 CX Amsterdam, the Netherlands; Oncode Institute, 3521 AL Utrecht, the Netherlands
| | - Justina Kazokaitė-Adomaitienė
- Protein Facility, Division of Biochemistry, Netherlands Cancer Institute, 1066 CX Amsterdam, the Netherlands; Oncode Institute, 3521 AL Utrecht, the Netherlands
| | - Elzo de Wit
- Division of Gene Regulation, Netherlands Cancer Institute, 1066 CX Amsterdam, the Netherlands.
| |
Collapse
|
6
|
Malekpour SA, Kalirad A, Majidian S. Inferring the Selective History of CNVs Using a Maximum Likelihood Model. Genome Biol Evol 2025; 17:evaf050. [PMID: 40100752 PMCID: PMC11950529 DOI: 10.1093/gbe/evaf050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 02/27/2025] [Accepted: 03/13/2025] [Indexed: 03/20/2025] Open
Abstract
Copy number variations (CNVs)-structural variations generated by deletion and/or duplication that result in a change in DNA dosage-are prevalent in nature. CNVs can drastically affect the phenotype of an organism and have been shown to be both involved in genetic disorders and be used as raw material in adaptive evolution. Unlike single-nucleotide variations, the often large and varied effects of CNVs on phenotype hinders our ability to infer their selective advantage based on the population genetics data. Here, we present a likelihood-based approach, dubbed PoMoCNV (POlymorphism-aware phylogenetic MOdel for CNVs), that estimates the evolutionary parameters such as mutation rates among different copy numbers and relative fitness loss per copy deletion at a genomic locus based on population genetics data. As a case study, we analyze the genomics data of 40 strains of Caenorhabditis elegans, representing four different populations. We take advantage of the data on chromatin accessibility to interpret the mutation rate and fitness of copy numbers, as inferred by PoMoCNV, specifically in open or closed chromatin loci. We further test the reliability of PoMoCNV by estimating the evolutionary parameters of CNVs for mutation-accumulation experiments in C. elegans with varying levels of genetic drift.
Collapse
Affiliation(s)
- Seyed Amir Malekpour
- School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran 19395-5746, Iran
| | - Ata Kalirad
- Department for Integrative Evolutionary Biology, Max Planck Institute for Biology Tübingen, Tübingen 72076, Germany
| | - Sina Majidian
- SIB Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
- Department of Computational Biology, University of Lausanne, Lausanne 1015, Switzerland
| |
Collapse
|
7
|
Clarence T, Bendl J, Cao X, Wang X, Zheng S, Hoffman GE, Kozlenkov A, Hong A, Iskhakova M, Jaiswal MK, Murphy S, Yu A, Haroutunian V, Dracheva S, Akbarian S, Fullard JF, Yuan GC, Lee D, Roussos P. Multiomic single-cell profiling identifies critical regulators of postnatal brain. Nat Genet 2025; 57:591-603. [PMID: 39962241 DOI: 10.1038/s41588-025-02083-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2024] [Accepted: 01/08/2025] [Indexed: 03/15/2025]
Abstract
Human brain development spans from embryogenesis to adulthood, with dynamic gene expression controlled by cell-type-specific cis-regulatory element activity and three-dimensional genome organization. To advance our understanding of postnatal brain development, we simultaneously profiled gene expression and chromatin accessibility in 101,924 single nuclei from four brain regions across ten donors, covering five key postnatal stages from infancy to late adulthood. Using this dataset and chromosome conformation capture data, we constructed enhancer-based gene regulatory networks to identify cell-type-specific regulators of brain development and interpret genome-wide association study loci for ten main brain disorders. Our analysis connected 2,318 cell-specific loci to 1,149 unique genes, representing 41% of loci linked to the investigated traits, and highlighted 55 genes influencing several disease phenotypes. Pseudotime analysis revealed distinct stages of postnatal oligodendrogenesis and their regulatory programs. These findings provide a comprehensive dataset of cell-type-specific gene regulation at critical timepoints in postnatal brain development.
Collapse
Affiliation(s)
- Tereza Clarence
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| | - Jaroslav Bendl
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Xuan Cao
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Xinyi Wang
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Shiwei Zheng
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Gabriel E Hoffman
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Alexey Kozlenkov
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Aram Hong
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Marina Iskhakova
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Manoj K Jaiswal
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mental Illness Research Education and Clinical Center, James J. Peters VA Medical Center, Bronx, NY, USA
| | - Sarah Murphy
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Alexander Yu
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Vahram Haroutunian
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mental Illness Research Education and Clinical Center, James J. Peters VA Medical Center, Bronx, NY, USA
| | - Stella Dracheva
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mental Illness Research Education and Clinical Center, James J. Peters VA Medical Center, Bronx, NY, USA
| | - Schahram Akbarian
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - John F Fullard
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Guo-Cheng Yuan
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Donghoon Lee
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Panos Roussos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Mental Illness Research Education and Clinical Center, James J. Peters VA Medical Center, Bronx, NY, USA.
- Center for Precision Medicine and Translational Therapeutics, James J. Peters VA Medical Center, Bronx, NY, USA.
| |
Collapse
|
8
|
Conrad DN, Phong KT, Korotkevich E, McGinnis CS, Zhu Q, Chow ED, Gartner ZJ. Reducing batch effects in single cell chromatin accessibility measurements by pooled transposition with MULTI-ATAC. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.14.638353. [PMID: 40027737 PMCID: PMC11870453 DOI: 10.1101/2025.02.14.638353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Large-scale scATAC-seq experiments are challenging because of their costs, lengthy protocols, and confounding batch effects. Several sample multiplexing technologies aim to address these challenges, but do not remove batch effects introduced when performing transposition reactions in parallel. We demonstrate that sample-to-sample variability in nuclei-to-Tn5 ratios is a major cause of batch effects and develop MULTI-ATAC, a multiplexing method that pools samples prior to transposition, as a solution. MULTI-ATAC provides high accuracy in sample classification and doublet detection while eliminating batch effects associated with variable nucleus-to-Tn5 ratio. We illustrate the power of MULTI-ATAC by performing a 96-plex multiomic drug assay targeting epigenetic remodelers in a model of primary immune cell activation, uncovering tens of thousands of drug-responsive chromatin regions, cell-type specific effects, and potent differences between matched inhibitors and degraders. MULTI-ATAC therefore enables batch-free and scalable scATAC-seq workflows, providing deeper insights into complex biological processes and potential therapeutic targets.
Collapse
Affiliation(s)
- Daniel N. Conrad
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Kiet T. Phong
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Ekaterina Korotkevich
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Christopher S. McGinnis
- Department of Pathology, Stanford University, Stanford, CA 94305, USA
- Gladstone-UCSF Institute of Genomic Immunology, San Francisco, CA 94158, USA
- Parker Institute for Cancer Immunotherapy, San Francisco, CA 94129, USA
| | - Qin Zhu
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Eric D. Chow
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, CA 94158, USA
- Laboratory for Genomics Research, San Francisco, CA, 94158, USA
| | - Zev J. Gartner
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
- Center for Cellular Construction, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
9
|
Wu J, Wan C, Ji Z, Zhou Y, Hou W. EpiFoundation: A Foundation Model for Single-Cell ATAC-seq via Peak-to-Gene Alignment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.05.636688. [PMID: 39975086 PMCID: PMC11839112 DOI: 10.1101/2025.02.05.636688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Foundation models exhibit strong capabilities for downstream tasks by learning generalized representations through self-supervised pre-training on large datasets. While several foundation models have been developed for single-cell RNA-seq (scRNA-seq) data, there is still a lack of models specifically tailored for single-cell ATAC-seq (scATAC-seq), which measures epigenetic information in individual cells. The principal challenge in developing such a model lies in the vast number of scATAC peaks and the significant sparsity of the data, which complicates the formulation of peak-to-peak correlations. To address this challenge, we introduce EpiFoundation, a foundation model for learning cell representations from the high-dimensional and sparse space of peaks. EpiFoundation relies on an innovative cross-modality pre-training procedure with two key technical innovations. First, EpiFoundation exclusively processes the non-zero peak set, thereby enhancing the density of cell-specific information within the input data. Second, EpiFoundation utilizes dense gene expression information to supervise the pre-training process, aligning peak-to-gene correlations. EpiFoundation can handle various types of downstream tasks, including cell-type annotation, batch correction, and gene expression prediction. To train and validate EpiFoundation, we curated MiniAtlas, a dataset of 100,000+ single cells with paired scRNA-seq and scATAC-seq data, along with diverse test sets spanning various tissues and cell types for robust evaluation. EpiFoundation demonstrates state-of-the-art performance across multiple tissues and diverse downstream tasks.
Collapse
Affiliation(s)
- Juncheng Wu
- Department of Computer Science and Engineering, UC Santa Cruz
| | - Changxin Wan
- Department of Biostatistics and Bioinformatics, Duke University
| | - Zhicheng Ji
- Department of Biostatistics and Bioinformatics, Duke University
| | - Yuyin Zhou
- Department of Computer Science and Engineering, UC Santa Cruz
| | - Wenpin Hou
- Department of Biostatistics, Mailman School of Public Health, Columbia University
| |
Collapse
|
10
|
Stanley JS, Yang J, Li R, Lindenbaum O, Kobak D, Landa B, Kluger Y. Principled PCA separates signal from noise in omics count data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.03.636129. [PMID: 39975320 PMCID: PMC11838471 DOI: 10.1101/2025.02.03.636129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Principal component analysis (PCA) is indispensable for processing high-throughput omics datasets, as it can extract meaningful biological variability while minimizing the influence of noise. However, the suitability of PCA is contingent on appropriate normalization and transformation of count data, and accurate selection of the number of principal components; improper choices can result in the loss of biological information or corruption of the signal due to excessive noise. Typical approaches to these challenges rely on heuristics that lack theoretical foundations. In this work, we present Biwhitened PCA (BiPCA), a theoretically grounded framework for rank estimation and data denoising across a wide range of omics modalities. BiPCA overcomes a fundamental difficulty with handling count noise in omics data by adaptively rescaling the rows and columns - a rigorous procedure that standardizes the noise variances across both dimensions. Through simulations and analysis of over 100 datasets spanning seven omics modalities, we demonstrate that BiPCA reliably recovers the data rank and enhances the biological interpretability of count data. In particular, BiPCA enhances marker gene expression, preserves cell neighborhoods, and mitigates batch effects. Our results establish BiPCA as a robust and versatile framework for high-throughput count data analysis.
Collapse
Affiliation(s)
- Jay S. Stanley
- Program in Applied Mathematics, Yale University, New Haven, CT, USA
| | - Junchen Yang
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Ruiqi Li
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
| | - Ofir Lindenbaum
- Faculty of Engineering, Bar Ilan University, Ramat-Gan, Israel
| | - Dmitry Kobak
- Hertie Institute for AI in Brain Health, University of Tübingen, Germany
| | - Boris Landa
- Program in Applied Mathematics, Yale University, New Haven, CT, USA
- Department of Electrical and Computer Engineering, Yale University, New Haven, CT, USA
| | - Yuval Kluger
- Program in Applied Mathematics, Yale University, New Haven, CT, USA
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, USA
- Department of Pathology, Yale University, New Haven, CT, USA
| |
Collapse
|
11
|
Wang X, Cheng L, Lu X, Jin H, Cui L, Guo Y, Guo J, Xu EY. Cross-species comparative single-cell transcriptomics highlights the molecular evolution and genetic basis of male infertility. Cell Rep 2025; 44:115118. [PMID: 39739532 DOI: 10.1016/j.celrep.2024.115118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 09/24/2024] [Accepted: 12/05/2024] [Indexed: 01/02/2025] Open
Abstract
In male animals, spermatogonia in testes differentiate into sperm, one of the most diverse cell types across species. Despite the evolutionary retention of key genes essential for spermatogenesis, the extent of their conservation remains unclear. To explore the genetic basis of spermatogenesis under strong selective pressure, we compare single-cell RNA sequencing (scRNA-seq) datasets from the testes of humans, mice, and fruit flies. Our analysis identifies conserved genes involved in key molecular programs, such as post-transcriptional regulation, meiosis, and energy metabolism. We perform gene knockout experiments of 20 candidate genes, three of which, when mutated in fruit flies, result in reduced male fertility, emphasizing the conservation of sperm centriole and steroid lipid processes across mammals and Drosophila. Additionally, deep-learning analysis uncovers potential transcriptional mechanisms driving gene-expression evolution. These findings establish a core genetic foundation for spermatogenesis, offering insights into sperm-phenotype evolution and the underlying mechanisms of male infertility.
Collapse
Affiliation(s)
- Xiaoyan Wang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China
| | - Liping Cheng
- State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing, Jiangsu, China; The Third Affiliated Hospital of Shenzhen University - Shenzhen Luohu District People's Hospital, Shenzhen, Guangdong, China
| | - Xiaojian Lu
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - He Jin
- State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing, Jiangsu, China
| | - Lina Cui
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Yifei Guo
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Jingtao Guo
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | - Eugene Yujun Xu
- State Key Laboratory of Reproductive Medicine and Offspring Health, Nanjing Medical University, Nanjing, Jiangsu, China; Cellular Screening Center, The University of Chicago, Chicago, IL, USA; Department of Neurology, Center for Reproductive Sciences, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.
| |
Collapse
|
12
|
Lin H, Ye X, Chen W, Hong D, Liu L, Chen F, Sun N, Ye K, Hong J, Zhang Y, Lu F, Li L, Huang J. Modular organization of enhancer network provides transcriptional robustness in mammalian development. Nucleic Acids Res 2025; 53:gkae1323. [PMID: 39817516 PMCID: PMC11736433 DOI: 10.1093/nar/gkae1323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2024] [Revised: 11/27/2024] [Accepted: 12/30/2024] [Indexed: 01/18/2025] Open
Abstract
Enhancer clusters, pivotal in mammalian development and diseases, can organize as enhancer networks to control cell identity and disease genes; however, the underlying mechanism remains largely unexplored. Here, we introduce eNet 2.0, a comprehensive tool for enhancer networks analysis during development and diseases based on single-cell chromatin accessibility data. eNet 2.0 extends our previous work eNet 1.0 by adding network topology, comparison and dynamics analyses to its network construction function. We reveal modularly organized enhancer networks, where inter-module interactions synergistically affect gene expression. Moreover, network alterations correlate with abnormal and dynamic gene expression in disease and development. eNet 2.0 is robust across diverse datasets. To facilitate application, we introduce eNetDB (https://enetdb.huanglabxmu.com), an enhancer network database leveraging extensive scATAC-seq (single-cell assay for transposase-accessible chromatin sequencing) datasets from human and mouse tissues. Together, our work provides a powerful computational tool and reveals that modularly organized enhancer networks contribute to gene expression robustness in mammalian development and diseases.
Collapse
Affiliation(s)
- Hongli Lin
- State Key Laboratory of Cellular Stress Biology, Xiang’an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang’an South Road, Xiamen, Fujian 361102, China
| | - Xinyun Ye
- State Key Laboratory of Cellular Stress Biology, Xiang’an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang’an South Road, Xiamen, Fujian 361102, China
| | - Wenyan Chen
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, Guangqiao Road, Shenzhen 518055, China
| | - Danni Hong
- State Key Laboratory of Cellular Stress Biology, Xiang’an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang’an South Road, Xiamen, Fujian 361102, China
| | - Lifang Liu
- State Key Laboratory of Cellular Stress Biology, Xiang’an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang’an South Road, Xiamen, Fujian 361102, China
| | - Feng Chen
- Chengdu Wiser Matrix Technology Co. Ltd, No. 399, Fucheng Road, Chengdu, Sichuan 614001, China
| | - Ning Sun
- Chengdu Wiser Matrix Technology Co. Ltd, No. 399, Fucheng Road, Chengdu, Sichuan 614001, China
| | - Keying Ye
- State Key Laboratory of Cellular Stress Biology, Xiang’an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang’an South Road, Xiamen, Fujian 361102, China
| | - Jizhou Hong
- State Key Laboratory of Cellular Stress Biology, Xiang’an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang’an South Road, Xiamen, Fujian 361102, China
| | - Yalin Zhang
- State Key Laboratory of Cellular Stress Biology, Xiang’an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang’an South Road, Xiamen, Fujian 361102, China
| | - Falong Lu
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, No. 2, Beichen West Road, Beijing 100101, China
- College of Advanced Agricultural Sciences, University of Chinese Academy of Sciences, No. 1, Yanqihu East Road, Beijing 101408, China
| | - Lei Li
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, Guangqiao Road, Shenzhen 518055, China
| | - Jialiang Huang
- State Key Laboratory of Cellular Stress Biology, Xiang’an Hospital, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang’an South Road, Xiamen, Fujian 361102, China
- National Institute for Data Science in Health and Medicine, Xiamen University, No. 4221, Xiang’an South Road, Xiamen, Fujian 361102, China
| |
Collapse
|
13
|
Sabit H, Arneth B, Pawlik TM, Abdel-Ghany S, Ghazy A, Abdelazeem RM, Alqosaibi A, Al-Dhuayan IS, Almulhim J, Alrabiah NA, Hashash A. Leveraging Single-Cell Multi-Omics to Decode Tumor Microenvironment Diversity and Therapeutic Resistance. Pharmaceuticals (Basel) 2025; 18:75. [PMID: 39861138 PMCID: PMC11768313 DOI: 10.3390/ph18010075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Revised: 01/03/2025] [Accepted: 01/08/2025] [Indexed: 01/27/2025] Open
Abstract
Recent developments in single-cell multi-omics technologies have provided the ability to identify diverse cell types and decipher key components of the tumor microenvironment (TME), leading to important advancements toward a much deeper understanding of how tumor microenvironment heterogeneity contributes to cancer progression and therapeutic resistance. These technologies are able to integrate data from molecular genomic, transcriptomic, proteomics, and metabolomics studies of cells at a single-cell resolution scale that give rise to the full cellular and molecular complexity in the TME. Understanding the complex and sometimes reciprocal relationships among cancer cells, CAFs, immune cells, and ECs has led to novel insights into their immense heterogeneity in functions, which can have important consequences on tumor behavior. In-depth studies have uncovered immune evasion mechanisms, including the exhaustion of T cells and metabolic reprogramming in response to hypoxia from cancer cells. Single-cell multi-omics also revealed resistance mechanisms, such as stromal cell-secreted factors and physical barriers in the extracellular matrix. Future studies examining specific metabolic pathways and targeting approaches to reduce the heterogeneity in the TME will likely lead to better outcomes with immunotherapies, drug delivery, etc., for cancer treatments. Future studies will incorporate multi-omics data, spatial relationships in tumor micro-environments, and their translation into personalized cancer therapies. This review emphasizes how single-cell multi-omics can provide insights into the cellular and molecular heterogeneity of the TME, revealing immune evasion mechanisms, metabolic reprogramming, and stromal cell influences. These insights aim to guide the development of personalized and targeted cancer therapies, highlighting the role of TME diversity in shaping tumor behavior and treatment outcomes.
Collapse
Affiliation(s)
- Hussein Sabit
- Department of Medical Biotechnology, College of Biotechnology, Misr University for Science and Technology, P.O. Box 77, Giza 3237101, Egypt
| | - Borros Arneth
- Institute of Laboratory Medicine and Pathobiochemistry, Molecular Diagnostics, Hospital of the Universities of Giessen and Marburg (UKGM), Philipps University Marburg, Baldingerstr. 1, 35043 Marburg, Germany
| | - Timothy M. Pawlik
- Department of Surgery, The Ohio State University, Wexner Medical Center, Columbus, OH 43210, USA
| | - Shaimaa Abdel-Ghany
- Department of Environmental Biotechnology, College of Biotechnology, Misr University for Science and Technology, P.O. Box 77, Giza 3237101, Egypt
| | - Aysha Ghazy
- Department of Agricultural Biotechnology, College of Biotechnology, Misr University for Science and Technology, P.O. Box 77, Giza 3237101, Egypt
| | - Rawan M. Abdelazeem
- Department of Medical Biotechnology, College of Biotechnology, Misr University for Science and Technology, P.O. Box 77, Giza 3237101, Egypt
| | - Amany Alqosaibi
- Department of Biology, College of Science, Imam Abdulrahman bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Ibtesam S. Al-Dhuayan
- Department of Biology, College of Science, Imam Abdulrahman bin Faisal University, P.O. Box 1982, Dammam 31441, Saudi Arabia
| | - Jawaher Almulhim
- Department of Biological Sciences, King Faisal University, Alahsa 31982, Saudi Arabia
| | - Noof A. Alrabiah
- Department of Biological Sciences, King Faisal University, Alahsa 31982, Saudi Arabia
| | - Ahmed Hashash
- Department of Biomedicine, Texas A&M University, College Station, TX 77843, USA
| |
Collapse
|
14
|
Nichols RV, Rylaarsdam LE, O'Connell BL, Shipony Z, Iremadze N, Acharya SN, Adey AC. Atlas-scale single-cell DNA methylation profiling with sciMETv3. CELL GENOMICS 2025; 5:100726. [PMID: 39719707 PMCID: PMC11770211 DOI: 10.1016/j.xgen.2024.100726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Revised: 10/25/2024] [Accepted: 11/26/2024] [Indexed: 12/26/2024]
Abstract
Single-cell methods to assess DNA methylation have not achieved the same level of cell throughput per experiment compared to other modalities, with large-scale datasets requiring extensive automation, time, and other resources. Here, we describe sciMETv3, a combinatorial indexing-based technique that enables atlas-scale libraries to be produced in a single experiment. To reduce the sequencing burden, we demonstrate the compatibility of sciMETv3 with capture techniques to enrich regulatory regions, as well as the ability to leverage enzymatic conversion, which can yield higher library diversity. We showcase the throughput of sciMETv3 by producing a >140,000 cell library from human middle frontal gyrus split across four multiplexed individuals using both Illumina and Ultima sequencing instrumentation. Finally, we introduce sciMET+ATAC to enable high-throughput exploration of the interplay between chromatin accessibility and DNA methylation within the same cell.
Collapse
Affiliation(s)
- Ruth V Nichols
- Department of Molecular & Medical Genetics, Oregon Health & Science University, Portland, OR, USA
| | - Lauren E Rylaarsdam
- Department of Molecular & Medical Genetics, Oregon Health & Science University, Portland, OR, USA
| | - Brendan L O'Connell
- Department of Molecular & Medical Genetics, Oregon Health & Science University, Portland, OR, USA; Cancer Early Detection Advanced Research Institute, Oregon Health & Science University, Portland, OR, USA
| | | | | | - Sonia N Acharya
- Department of Molecular & Medical Genetics, Oregon Health & Science University, Portland, OR, USA
| | - Andrew C Adey
- Department of Molecular & Medical Genetics, Oregon Health & Science University, Portland, OR, USA; Cancer Early Detection Advanced Research Institute, Oregon Health & Science University, Portland, OR, USA; Knight Cardiovascular Institute, Oregon Health & Science University, Portland, OR, USA; Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA.
| |
Collapse
|
15
|
Miao Z, Wang J, Park K, Kuang D, Kim J. Depth-corrected multi-factor dissection of chromatin accessibility for scATAC-seq data with PACS. Nat Commun 2025; 16:401. [PMID: 39757254 DOI: 10.1038/s41467-024-55580-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 12/10/2024] [Indexed: 01/07/2025] Open
Abstract
Single cell ATAC-seq (scATAC-seq) experimental designs have become increasingly complex, with multiple factors that might affect chromatin accessibility, including genotype, cell type, tissue of origin, sample location, batch, etc., whose compound effects are difficult to test by existing methods. In addition, current scATAC-seq data present statistical difficulties due to their sparsity and variations in individual sequence capture. To address these problems, we present a zero-adjusted statistical model, Probability model of Accessible Chromatin of Single cells (PACS), that allows complex hypothesis testing of accessibility-modulating factors while accounting for sparse and incomplete data. For differential accessibility analysis, PACS controls the false positive rate and achieves a 17% to 122% higher power on average than existing tools. We demonstrate the effectiveness of PACS through several analysis tasks, including supervised cell type annotation, compound hypothesis testing, batch effect correction, and spatiotemporal modeling. We apply PACS to datasets from various tissues and show its ability to reveal previously undiscovered insights in scATAC-seq data.
Collapse
Affiliation(s)
- Zhen Miao
- Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - Jianqiao Wang
- Department of Biostatistics, Harvard T.H. Chan School of Health, Boston, MA, USA
- Department of Statistics and Data Science, Tsinghua University, Beijing, China
| | - Kernyu Park
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - Da Kuang
- Deptartment Computer and Information Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Junhyong Kim
- Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA.
- Deptartment Computer and Information Science, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
16
|
Kotliar M, Kartashov A, Barski A. Accelerating Single-Cell Sequencing Data Analysis with SciDAP: A User-Friendly Approach. Methods Mol Biol 2025; 2880:255-292. [PMID: 39900764 DOI: 10.1007/978-1-0716-4276-4_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2025]
Abstract
Single-cell (sc) RNA, ATAC, and Multiome sequencing became powerful tools for uncovering biological and disease mechanisms. Unfortunately, manual analysis of sc data presents multiple challenges due to large data volumes and complexity of configuration parameters. This complexity, as well as not being able to reproduce a computational environment, affects the reproducibility of analysis results. The Scientific Data Analysis Platform ( https://SciDAP.com ) allows biologists without computational expertise to analyze sequencing-based data using portable and reproducible pipelines written in Common Workflow Language (CWL). Our suite of computational pipelines addresses the most common needs in scRNA-Seq, scATAC-Seq and scMultiome data analysis. When executed on SciDAP, it offers a user-friendly alternative to manual data processing, eliminating the need for coding expertise. In this protocol, we describe the use of SciDAP to analyze scMultiome data. Similar approaches can be used for analysis of scRNA-Seq, scATAC-Seq and scVDJ-Seq datasets.
Collapse
Affiliation(s)
- Michael Kotliar
- Division of Allergy and Immunology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | | | - Artem Barski
- Division of Allergy and Immunology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA.
- Datirium, LLC, Cincinnati, OH, USA.
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA.
- University of Cincinnati College of Medicine, Cincinnati, OH, USA.
| |
Collapse
|
17
|
Kan Y, Qi Y, Zhang Z, Liang X, Wang W, Jin S. Integration of unpaired single cell omics data by deep transfer graph convolutional network. PLoS Comput Biol 2025; 21:e1012625. [PMID: 39821189 PMCID: PMC11778791 DOI: 10.1371/journal.pcbi.1012625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 01/29/2025] [Accepted: 11/09/2024] [Indexed: 01/19/2025] Open
Abstract
The rapid advance of large-scale atlas-level single cell RNA sequences and single-cell chromatin accessibility data provide extraordinary avenues to broad and deep insight into complex biological mechanism. Leveraging the datasets and transfering labels from scRNA-seq to scATAC-seq will empower the exploration of single-cell omics data. However, the current label transfer methods have limited performance, largely due to the lower capable of preserving fine-grained cell populations and intrinsic or extrinsic heterogeneity between datasets. Here, we present a robust deep transfer model based graph convolutional network, scTGCN, which achieves versatile performance in preserving biological variation, while achieving integration hundreds of thousands cells in minutes with low memory consumption. We show that scTGCN is powerful to the integration of mouse atlas data and multimodal data generated from APSA-seq and CITE-seq. Thus, scTGCN shows high label transfer accuracy and effectively knowledge transfer across different modalities.
Collapse
Affiliation(s)
- Yulong Kan
- School of Mathematics/Harbin Institute of Technology, Harbin, China
| | - Yunjing Qi
- School of Mathematics/Harbin Institute of Technology, Harbin, China
| | - Zhongxiao Zhang
- School of Mathematics/Harbin Institute of Technology, Harbin, China
| | - Xikeng Liang
- School of Mathematics/Harbin Institute of Technology, Harbin, China
| | - Weihao Wang
- School of Mathematics/Harbin Institute of Technology, Harbin, China
| | - Shuilin Jin
- School of Mathematics/Harbin Institute of Technology, Harbin, China
| |
Collapse
|
18
|
Yoshikawa T, Yanagita M. Single-Cell Analysis Provides New Insights into the Roles of Tertiary Lymphoid Structures and Immune Cell Infiltration in Kidney Injury and Chronic Kidney Disease. THE AMERICAN JOURNAL OF PATHOLOGY 2025; 195:40-54. [PMID: 39097168 DOI: 10.1016/j.ajpath.2024.07.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 06/27/2024] [Accepted: 07/02/2024] [Indexed: 08/05/2024]
Abstract
Chronic kidney disease (CKD) is a global health concern with high morbidity and mortality. Acute kidney injury (AKI) is a pivotal risk factor for the progression of CKD, and the rate of AKI-to-CKD progression increases with aging. Intrarenal inflammation is a fundamental mechanism underlying AKI-to-CKD progression. Tertiary lymphoid structures (TLSs), ectopic lymphoid aggregates formed in nonlymphoid organs, develop in aged injured kidneys, but not in young kidneys, with prolonged inflammation and maladaptive repair, which potentially exacerbates AKI-to-CKD progression in aged individuals. Dysregulated immune responses are involved in the pathogenesis of various kidney diseases, such as IgA nephropathy, lupus nephritis, and diabetic kidney diseases, thereby deteriorating kidney function. TLSs also develop in several kidney diseases, including transplanted kidneys and renal cell carcinoma. However, the precise immunologic mechanisms driving AKI-to-CKD progression and development of these kidney diseases remain unclear, which hinders the development of novel therapeutic approaches. This review aims to describe recent findings from single-cell analysis of cellular heterogeneity and complex interactions among immune and renal parenchymal cells, which potentially contribute to the pathogenesis of AKI-to-CKD progression and other kidney diseases, highlighting the mechanisms of formation and pathogenic roles of TLSs in aged injured kidneys.
Collapse
Affiliation(s)
- Takahisa Yoshikawa
- Department of Nephrology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Motoko Yanagita
- Department of Nephrology, Graduate School of Medicine, Kyoto University, Kyoto, Japan; Institute for the Advanced Study of Human Biology, Kyoto University, Kyoto, Japan.
| |
Collapse
|
19
|
Qiu X, Zhu DY, Lu Y, Yao J, Jing Z, Min KH, Cheng M, Pan H, Zuo L, King S, Fang Q, Zheng H, Wang M, Wang S, Zhang Q, Yu S, Liao S, Liu C, Wu X, Lai Y, Hao S, Zhang Z, Wu L, Zhang Y, Li M, Tu Z, Lin J, Yang Z, Li Y, Gu Y, Ellison D, Chen A, Liu L, Weissman JS, Ma J, Xu X, Liu S, Bai Y. Spatiotemporal modeling of molecular holograms. Cell 2024; 187:7351-7373.e61. [PMID: 39532097 DOI: 10.1016/j.cell.2024.10.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 05/29/2024] [Accepted: 10/08/2024] [Indexed: 11/16/2024]
Abstract
Quantifying spatiotemporal dynamics during embryogenesis is crucial for understanding congenital diseases. We developed Spateo (https://github.com/aristoteleo/spateo-release), a 3D spatiotemporal modeling framework, and applied it to a 3D mouse embryogenesis atlas at E9.5 and E11.5, capturing eight million cells. Spateo enables scalable, partial, non-rigid alignment, multi-slice refinement, and mesh correction to create molecular holograms of whole embryos. It introduces digitization methods to uncover multi-level biology from subcellular to whole organ, identifying expression gradients along orthogonal axes of emergent 3D structures, e.g., secondary organizers such as midbrain-hindbrain boundary (MHB). Spateo further jointly models intercellular and intracellular interaction to dissect signaling landscapes in 3D structures, including the zona limitans intrathalamica (ZLI). Lastly, Spateo introduces "morphometric vector fields" of cell migration and integrates spatial differential geometry to unveil molecular programs underlying asymmetrical murine heart organogenesis and others, bridging macroscopic changes with molecular dynamics. Thus, Spateo enables the study of organ ecology at a molecular level in 3D space over time.
Collapse
Affiliation(s)
- Xiaojie Qiu
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA; Basic Sciences and Engineering Initiative, Betty Irene Moore Children's Heart Center, Lucile Packard Children's Hospital, Stanford, CA, USA; Department of Computer Science, Stanford University, Stanford, CA 94305, USA; Stanford Cardiovascular Institute, Stanford University, Stanford, CA, USA.
| | - Daniel Y Zhu
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Yifan Lu
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA; Basic Sciences and Engineering Initiative, Betty Irene Moore Children's Heart Center, Lucile Packard Children's Hospital, Stanford, CA, USA; Department of Computer Science, Stanford University, Stanford, CA 94305, USA; Electronic Information School, Wuhan University, Wuhan 430072, China
| | - Jiajun Yao
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China; College of Life Sciences, Northwest University, Xi'an 710069, China
| | - Zehua Jing
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Kyung Hoi Min
- Ginkgo Bioworks, The Innovation and Design Building, Boston, MA 02210, USA
| | - Mengnan Cheng
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China
| | | | - Lulu Zuo
- BGI Research, Shenzhen 518083, China
| | - Samuel King
- Department of Bioengineering, Stanford University School of Medicine, Stanford, CA, USA
| | - Qi Fang
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China
| | - Huiwen Zheng
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mingyue Wang
- BGI Research, Hangzhou 310030, China; Shenzhen Key Laboratory of Gene Regulation and Systems Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Shuai Wang
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qingquan Zhang
- Department of Medicine, Division of Cardiology, University of California, San Diego, La Jolla, CA, USA
| | - Sichao Yu
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
| | - Sha Liao
- BGI Research, Shenzhen 518083, China; STOmics Tech Co., Ltd, Shenzhen 518083, China; BGI Research, Chongqing 401329, China
| | - Chao Liu
- BGI Research, Wuhan 430074, China
| | - Xinchao Wu
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China; School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Yiwei Lai
- BGI Research, Shenzhen 518083, China
| | | | - Zhewei Zhang
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China; School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Liang Wu
- BGI Research, Chongqing 401329, China
| | | | - Mei Li
- STOmics Tech Co., Ltd, Shenzhen 518083, China
| | - Zhencheng Tu
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jinpei Lin
- BGI Research, Hangzhou 310030, China; BGI Research, Sanya 572025, China
| | - Zhuoxuan Yang
- BGI Research, Hangzhou 310030, China; School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | | | - Ying Gu
- BGI Research, Hangzhou 310030, China; BGI Research, Shenzhen 518083, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | | | - Ao Chen
- BGI Research, Shenzhen 518083, China; STOmics Tech Co., Ltd, Shenzhen 518083, China; BGI Research, Chongqing 401329, China
| | - Longqi Liu
- BGI Research, Hangzhou 310030, China; Shenzhen Bay Laboratory, Shenzhen 518132, China; Shenzhen Key Laboratory of Single-Cell Omics, BGI-Shenzhen, Shenzhen 518120, China
| | - Jonathan S Weissman
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA; Koch Institute for Integrative Cancer Research at MIT, MIT, Cambridge, MA, USA
| | - Jiayi Ma
- Electronic Information School, Wuhan University, Wuhan 430072, China.
| | - Xun Xu
- BGI Research, Hangzhou 310030, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; Guangdong Provincial Key Laboratory of Genome Read and Write, BGI-Shenzhen, Shenzhen 518120, China.
| | - Shiping Liu
- BGI Research, Hangzhou 310030, China; Shenzhen Bay Laboratory, Shenzhen 518132, China; Shenzhen Key Laboratory of Single-Cell Omics, BGI-Shenzhen, Shenzhen 518120, China; The Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangzhou, Guangdong, China.
| | - Yinqi Bai
- BGI Research, Sanya 572025, China; Hainan Technology Innovation Center for Marine Biological Resources Utilization (Preparatory Period), BGI Research, Sanya 572025, China.
| |
Collapse
|
20
|
Wang J, Ye F, Chai H, Jiang Y, Wang T, Ran X, Xia Q, Xu Z, Fu Y, Zhang G, Wu H, Guo G, Guo H, Ruan Y, Wang Y, Xing D, Xu X, Zhang Z. Advances and applications in single-cell and spatial genomics. SCIENCE CHINA. LIFE SCIENCES 2024:10.1007/s11427-024-2770-x. [PMID: 39792333 DOI: 10.1007/s11427-024-2770-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Accepted: 10/10/2024] [Indexed: 01/12/2025]
Abstract
The applications of single-cell and spatial technologies in recent times have revolutionized the present understanding of cellular states and the cellular heterogeneity inherent in complex biological systems. These advancements offer unprecedented resolution in the examination of the functional genomics of individual cells and their spatial context within tissues. In this review, we have comprehensively discussed the historical development and recent progress in the field of single-cell and spatial genomics. We have reviewed the breakthroughs in single-cell multi-omics technologies, spatial genomics methods, and the computational strategies employed toward the analyses of single-cell atlas data. Furthermore, we have highlighted the advances made in constructing cellular atlases and their clinical applications, particularly in the context of disease. Finally, we have discussed the emerging trends, challenges, and opportunities in this rapidly evolving field.
Collapse
Affiliation(s)
- Jingjing Wang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Fang Ye
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Haoxi Chai
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China
| | - Yujia Jiang
- BGI Research, Shenzhen, 518083, China
- BGI Research, Hangzhou, 310030, China
| | - Teng Wang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
| | - Xia Ran
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China
| | - Qimin Xia
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
| | - Ziye Xu
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Yuting Fu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guodong Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Hanyu Wu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guoji Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Zhejiang Provincial Key Lab for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Hongshan Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Yijun Ruan
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China.
| | - Yongcheng Wang
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
| | - Dong Xing
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, 100871, China.
| | - Xun Xu
- BGI Research, Shenzhen, 518083, China.
- BGI Research, Hangzhou, 310030, China.
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI Research, Shenzhen, 518083, China.
| | - Zemin Zhang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
| |
Collapse
|
21
|
Bongrand P. Should Artificial Intelligence Play a Durable Role in Biomedical Research and Practice? Int J Mol Sci 2024; 25:13371. [PMID: 39769135 PMCID: PMC11676049 DOI: 10.3390/ijms252413371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Revised: 11/26/2024] [Accepted: 12/09/2024] [Indexed: 01/11/2025] Open
Abstract
During the last decade, artificial intelligence (AI) was applied to nearly all domains of human activity, including scientific research. It is thus warranted to ask whether AI thinking should be durably involved in biomedical research. This problem was addressed by examining three complementary questions (i) What are the major barriers currently met by biomedical investigators? It is suggested that during the last 2 decades there was a shift towards a growing need to elucidate complex systems, and that this was not sufficiently fulfilled by previously successful methods such as theoretical modeling or computer simulation (ii) What is the potential of AI to meet the aforementioned need? it is suggested that recent AI methods are well-suited to perform classification and prediction tasks on multivariate systems, and possibly help in data interpretation, provided their efficiency is properly validated. (iii) Recent representative results obtained with machine learning suggest that AI efficiency may be comparable to that displayed by human operators. It is concluded that AI should durably play an important role in biomedical practice. Also, as already suggested in other scientific domains such as physics, combining AI with conventional methods might generate further progress and new applications, involving heuristic and data interpretation.
Collapse
Affiliation(s)
- Pierre Bongrand
- Laboratory Adhesion and Inflammation (LAI), Inserm UMR 1067, Cnrs Umr 7333, Aix-Marseille Université UM 61, 13009 Marseille, France
| |
Collapse
|
22
|
Kwok AWC, Shim H, McCarthy DJ. Going beyond cell clustering and feature aggregation: Is there single cell level information in single-cell ATAC-seq data? BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.04.626927. [PMID: 39713401 PMCID: PMC11661094 DOI: 10.1101/2024.12.04.626927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/24/2024]
Abstract
Single-cell Assay for Transposase Accessible Chromatin with sequencing (scATAC-seq) has become a widely used method for investigating chromatin accessibility at single-cell resolution. However, the resulting data is highly sparse with most data entries being zeros. As such, currently available computational methods for scATAC-seq feature a range of transformation procedures to extract meaningful information from the sparse data. Most notably, these transformations can be categorized into: 1) feature aggregation with known biological associations, 2) pseudo-bulking cells of similar biology, and 3) binarisation of count data. These strategies beg the question of whether or not scATAC-seq data actually has usable single-cell and single-region information as intended from the assay. If we can go beyond aggregated features and pooled cells, it opens up the possibility of more complex statistical tasks that require that degree of granularity. To reach the finest possible resolution of single-cell, single-region information there are inevitably many computational challenges to overcome. Here, we review the major data analysis challenges lying between raw data readout and biological discovery, and discuss the limitations of current data analysis approaches. Lastly, we conclude that chromatin accessibility profiling at true single-cell resolution is not yet achieved with current technology, but that it may be achieved with promising developments in optimising the efficiency of scATAC-seq assays.
Collapse
Affiliation(s)
- Aaron Wing Cheung Kwok
- Bioinformatics and Cellular Genomics, St Vincent's Institute of Medical Research, Fitzroy, VIC 3065, Australia
- Melbourne Integrative Genomics, University of Melbourne, Parkville, VIC, 3010, Australia
- School of Mathematics and Statistics, Faculty of Science, University of Melbourne, Parkville, VIC, 3010, Australia
| | - Heejung Shim
- Melbourne Integrative Genomics, University of Melbourne, Parkville, VIC, 3010, Australia
- School of Mathematics and Statistics, Faculty of Science, University of Melbourne, Parkville, VIC, 3010, Australia
| | - Davis J McCarthy
- Bioinformatics and Cellular Genomics, St Vincent's Institute of Medical Research, Fitzroy, VIC 3065, Australia
- Melbourne Integrative Genomics, University of Melbourne, Parkville, VIC, 3010, Australia
- School of Mathematics and Statistics, Faculty of Science, University of Melbourne, Parkville, VIC, 3010, Australia
- Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Parkville, VIC, 3010, Australia
| |
Collapse
|
23
|
Soubeyrand S, Lau P, Nikpay M, Ma L, Bjorkegren JLM, McPherson R. Long Noncoding RNA TRIBAL Links the 8q24.13 Locus to Hepatic Lipid Metabolism and Coronary Artery Disease. CIRCULATION. GENOMIC AND PRECISION MEDICINE 2024; 17:e004674. [PMID: 39624902 DOI: 10.1161/circgen.124.004674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 10/11/2024] [Indexed: 12/19/2024]
Abstract
BACKGROUND Genome-wide association studies identified a 20-Kb region of chromosome 8 (8q24.13) associated with plasma lipids, hepatic steatosis, and risk for coronary artery disease. The region is proximal to TRIB1, and given its well-established role in lipid regulation in animal models, TRIB1 has been proposed to mediate the contribution of the 8q24.13 locus to these traits. This region overlaps a gene encoding the primate-specific long noncoding RNA transcript TRIBAL/TRIB1AL (TRIB1-associated locus), but the contribution of TRIBAL to coronary artery disease risk remains untested. METHODS Using recently available expression quantitative trait loci data and hepatocyte models, we further investigated this locus by Mendelian randomization analysis. Following antisense oligonucleotide targeting of TRIBAL, transcription array, quantitative reverse transcription polymerase chain reaction, and enrichment analyses were performed and effects on apoB and triglyceride secretion were determined. RESULTS Mendelian randomization analysis supports a causal relationship between genetically determined hepatic TRIBAL expression and markers of hepatic steatosis and coronary artery disease risk. By contrast, expression data sets did not support expression quantitative trait loci relationships between coronary artery disease-associated variants and TRIB1. TRIBAL suppression reduced the expression of key regulators of triglyceride metabolism and bile acid synthesis. Enrichment analyses identified patterns consistent with impaired metabolic functions, including reduced triglyceride and cholesterol handling ability. Furthermore, TRIBAL suppression was associated with reduced hepatocyte secretion of triglycerides. CONCLUSIONS This work identifies TRIBAL as a gene bridging the genotype-phenotype relationship at the 8q24.13 locus with effects on genes regulating hepatocyte lipid metabolism and triglyceride secretion.
Collapse
Affiliation(s)
- Sébastien Soubeyrand
- Atherogenomics Laboratory (S.S., P.L., M.N., R.M.), University of Ottawa Heart Institute, Canada
| | - Paulina Lau
- Atherogenomics Laboratory (S.S., P.L., M.N., R.M.), University of Ottawa Heart Institute, Canada
| | - Majid Nikpay
- Atherogenomics Laboratory (S.S., P.L., M.N., R.M.), University of Ottawa Heart Institute, Canada
| | - Lijiang Ma
- Department of Genetics and Genomic Sciences, Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY (L.M., J.L.M.B.)
| | - Johan L M Bjorkegren
- Department of Genetics and Genomic Sciences, Institute of Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY (L.M., J.L.M.B.)
- Department of Medicine, Karolinska Institutet, Karolinska Universitetssjukhuset, Huddinge, Sweden (J.L.M.B.)
| | - Ruth McPherson
- Atherogenomics Laboratory (S.S., P.L., M.N., R.M.), University of Ottawa Heart Institute, Canada
- Division of Cardiology, Ruddy Canadian Cardiovascular Genetics Centre (R.M.), University of Ottawa Heart Institute, Canada
| |
Collapse
|
24
|
Zhao Y, Yu ZM, Cui T, Li LD, Li YY, Qian FC, Zhou LW, Li Y, Fang QL, Huang XM, Zhang QY, Cai FH, Dong FJ, Shang DS, Li CQ, Wang QY. scBlood: A comprehensive single-cell accessible chromatin database of blood cells. Comput Struct Biotechnol J 2024; 23:2746-2753. [PMID: 39050785 PMCID: PMC11266868 DOI: 10.1016/j.csbj.2024.06.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 06/17/2024] [Accepted: 06/18/2024] [Indexed: 07/27/2024] Open
Abstract
The advent of single cell transposase-accessible chromatin sequencing (scATAC-seq) technology enables us to explore the genomic characteristics and chromatin accessibility of blood cells at the single-cell level. To fully make sense of the roles and regulatory complexities of blood cells, it is critical to collect and analyze these rapidly accumulating scATAC-seq datasets at a system level. Here, we present scBlood (https://bio.liclab.net/scBlood/), a comprehensive single-cell accessible chromatin database of blood cells. The current version of scBlood catalogs 770,907 blood cells and 452,247 non-blood cells from ∼400 high-quality scATAC-seq samples covering 30 tissues and 21 disease types. All data hosted on scBlood have undergone preprocessing from raw fastq files and multiple standards of quality control. Furthermore, we conducted comprehensive downstream analyses, including multi-sample integration analysis, cell clustering and annotation, differential chromatin accessibility analysis, functional enrichment analysis, co-accessibility analysis, gene activity score calculation, and transcription factor (TF) enrichment analysis. In summary, scBlood provides a user-friendly interface for searching, browsing, analyzing, visualizing, and downloading scATAC-seq data of interest. This platform facilitates insights into the functions and regulatory mechanisms of blood cells, as well as their involvement in blood-related diseases.
Collapse
Affiliation(s)
- Yu Zhao
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Zheng-Min Yu
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Ting Cui
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Li-Dong Li
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Yan-Yu Li
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing 163319, China
| | - Feng-Cui Qian
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Li-Wei Zhou
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Ye Li
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Qiao-Li Fang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Xue-Mei Huang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Qin-Yi Zhang
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Fu-Hong Cai
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Fu-Juan Dong
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - De-Si Shang
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Chun-Quan Li
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Maternal and Child Health Care Hospital, National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Qiu-Yu Wang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Maternal and Child Health Care Hospital, National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| |
Collapse
|
25
|
de Martin X, Oliva B, Santpere G. Recruitment of homodimeric proneural factors by conserved CAT-CAT E-boxes drives major epigenetic reconfiguration in cortical neurogenesis. Nucleic Acids Res 2024; 52:12895-12917. [PMID: 39494521 PMCID: PMC11602148 DOI: 10.1093/nar/gkae950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 10/03/2024] [Accepted: 10/09/2024] [Indexed: 11/05/2024] Open
Abstract
Proneural factors of the basic helix-loop-helix family coordinate neurogenesis and neurodifferentiation. Among them, NEUROG2 and NEUROD2 subsequently act to specify neurons of the glutamatergic lineage. Disruption of these factors, their target genes and binding DNA motifs has been linked to various neuropsychiatric disorders. Proneural factors bind to specific DNA motifs called E-boxes (hexanucleotides of the form CANNTG, composed of two CAN half sites on opposed strands). While corticogenesis heavily relies on E-box activity, the collaboration of proneural factors on different E-box types and their chromatin remodeling mechanisms remain largely unknown. Here, we conducted a comprehensive analysis using chromatin immunoprecipitation followed by sequencing (ChIP-seq) data for NEUROG2 and NEUROD2, along with time-matched single-cell RNA-seq, ATAC-seq and DNA methylation data from the developing mouse cortex. Our findings show that these factors are highly enriched in transiently active genomic regions during intermediate stages of neuronal differentiation. Although they primarily bind CAG-containing E-boxes, their binding in dynamic regions is notably enriched in CAT-CAT E-boxes (i.e. CATATG, denoted as 5'3' half sites for dimers), which undergo significant DNA demethylation and exhibit the highest levels of evolutionary constraint. Aided by HT-SELEX data reanalysis, structural modeling and DNA footprinting, we propose that these proneural factors exert maximal chromatin remodeling influence during intermediate stages of neurogenesis by binding as homodimers to CAT-CAT motifs. This study provides an in-depth integrative analysis of the dynamic regulation of E-boxes during neuronal development, enhancing our understanding of the mechanisms underlying the binding specificity of critical proneural factors.
Collapse
Affiliation(s)
- Xabier de Martin
- Neurogenomics Group, Hospital del Mar Research Institute, Parc de Recerca Biomèdica de Barcelona (PRBB), Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
| | - Baldomero Oliva
- Structural Bioinformatics Lab (GRIB-IMIM), Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Dr. Aiguader, 88, Barcelona 08003 Catalonia, Spain
| | - Gabriel Santpere
- Neurogenomics Group, Hospital del Mar Research Institute, Parc de Recerca Biomèdica de Barcelona (PRBB), Dr. Aiguader, 88, Barcelona 08003, Catalonia, Spain
- Department of Neuroscience, Yale School of Medicine, 333 Cedar st., New Haven, CT 06510, USA
| |
Collapse
|
26
|
Dai W, Qiao X, Fang Y, Guo R, Bai P, Liu S, Li T, Jiang Y, Wei S, Na Z, Xiao X, Li D. Epigenetics-targeted drugs: current paradigms and future challenges. Signal Transduct Target Ther 2024; 9:332. [PMID: 39592582 PMCID: PMC11627502 DOI: 10.1038/s41392-024-02039-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Revised: 10/14/2024] [Accepted: 10/29/2024] [Indexed: 11/28/2024] Open
Abstract
Epigenetics governs a chromatin state regulatory system through five key mechanisms: DNA modification, histone modification, RNA modification, chromatin remodeling, and non-coding RNA regulation. These mechanisms and their associated enzymes convey genetic information independently of DNA base sequences, playing essential roles in organismal development and homeostasis. Conversely, disruptions in epigenetic landscapes critically influence the pathogenesis of various human diseases. This understanding has laid a robust theoretical groundwork for developing drugs that target epigenetics-modifying enzymes in pathological conditions. Over the past two decades, a growing array of small molecule drugs targeting epigenetic enzymes such as DNA methyltransferase, histone deacetylase, isocitrate dehydrogenase, and enhancer of zeste homolog 2, have been thoroughly investigated and implemented as therapeutic options, particularly in oncology. Additionally, numerous epigenetics-targeted drugs are undergoing clinical trials, offering promising prospects for clinical benefits. This review delineates the roles of epigenetics in physiological and pathological contexts and underscores pioneering studies on the discovery and clinical implementation of epigenetics-targeted drugs. These include inhibitors, agonists, degraders, and multitarget agents, aiming to identify practical challenges and promising avenues for future research. Ultimately, this review aims to deepen the understanding of epigenetics-oriented therapeutic strategies and their further application in clinical settings.
Collapse
Affiliation(s)
- Wanlin Dai
- Center of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China
| | - Xinbo Qiao
- Department of Orthopedics, Shengjing Hospital of China Medical University, Shenyang, China
| | - Yuanyuan Fang
- Center of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China
| | - Renhao Guo
- Center of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China
| | - Peng Bai
- Department of Forensic Genetics, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, China
| | - Shuang Liu
- Shenyang Maternity and Child Health Hospital, Shenyang, China
| | - Tingting Li
- Department of General Internal Medicine VIP Ward, Liaoning Cancer Hospital & Institute, Shenyang, China
| | - Yutao Jiang
- Center of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China
| | - Shuang Wei
- Center of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China
| | - Zhijing Na
- Center of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China.
- NHC Key Laboratory of Advanced Reproductive Medicine and Fertility (China Medical University), National Health Commission, Shenyang, China.
| | - Xue Xiao
- Department of Gynecology and Obstetrics, West China Second Hospital, Sichuan University, Chengdu, China.
- Key Laboratory of Birth Defects and Related Diseases of Women and Children (Sichuan University), Ministry of Education, West China Second Hospital, Sichuan University, Chengdu, China.
| | - Da Li
- Center of Reproductive Medicine, Department of Obstetrics and Gynecology, Shengjing Hospital of China Medical University, Shenyang, China.
- NHC Key Laboratory of Advanced Reproductive Medicine and Fertility (China Medical University), National Health Commission, Shenyang, China.
- Key Laboratory of Reproductive Dysfunction Diseases and Fertility Remodeling of Liaoning Province, Shenyang, China.
| |
Collapse
|
27
|
Luo S, Zhu M, Lin L, Xie J, Lin S, Chen Y, Zhu J, Huang J. DECA: harnessing interpretable transformer model for cellular deconvolution of chromatin accessibility profile. Brief Bioinform 2024; 26:bbaf069. [PMID: 39987573 PMCID: PMC11847511 DOI: 10.1093/bib/bbaf069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Revised: 01/09/2025] [Accepted: 02/06/2025] [Indexed: 02/25/2025] Open
Abstract
The assay for transposase-accessible chromatin with sequencing (ATAC-seq) identifies chromatin accessibility across the genome, crucial for gene expression regulating. However, bulk ATAC-seq obscures cellular heterogeneity, while single-cell ATAC-seq suffers from issues such as sparsity and costliness. To this end, we introduce DECA, a sophisticated deep learning model based on vision transformer to deconvolve cell type information from bulk chromatin accessibility profiles, utilizing single-cell ATAC-seq datasets as reference for enhanced precision and resolution. Notably, patch attention generated by DECA's multi-head attention mechanism aligns with chromatin interactions detected by Hi-C. Additionally, DECA predicted lineage-specific cell composition changes due to genetic perturbation. The chromatin accessibility signatures predicted by DECA are enriched with cell-type specific genetic variations. Ultimately, we applied DECA on pan-cancer ATAC-seq datasets and demonstrated its capability to deconvolve cell type proportions with clinical significance. Taken together, DECA deconvolves cellular proportions and predicts their chromatin accessibility profiles from bulk chromatin accessibility data, which enable exploring the gene regulatory programs in development and diseases.
Collapse
Affiliation(s)
- Shijie Luo
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China
- National Institute for Data Science in Health and Medicine, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China
| | - Ming Zhu
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China
| | - Liquan Lin
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China
| | - Jiajing Xie
- National Institute for Data Science in Health and Medicine, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China
| | - Shihao Lin
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China
| | - Ying Chen
- School of Informatics, Xiamen University, No. 4221, Xiang'an South Road, Fujian 361000, China
| | - Jiali Zhu
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China
| | - Jialiang Huang
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China
- National Institute for Data Science in Health and Medicine, Xiamen University, No. 4221, Xiang'an South Road, Xiamen, Fujian 361102, China
| |
Collapse
|
28
|
Wong KH, Rodriguez NA, Traylor-Knowles N. Exploring the Unknown: How Can We Improve Single-cell RNAseq Cell Type Annotations in Non-model Organisms? Integr Comp Biol 2024; 64:1291-1299. [PMID: 39013613 DOI: 10.1093/icb/icae112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Revised: 07/05/2024] [Accepted: 07/08/2024] [Indexed: 07/18/2024] Open
Abstract
Single-cell RNA sequencing (scRNAseq) is a powerful tool to describe cell types in multicellular organisms across the animal kingdom. In standard scRNAseq analysis pipelines, clusters of cells with similar transcriptional signatures are given cell type labels based on marker genes that infer specialized known characteristics. Since these analyses are designed for model organisms, such as humans and mice, problems arise when attempting to label cell types of distantly related, non-model species that have unique or divergent cell types. Consequently, this leads to limited discovery of novel species-specific cell types and potential mis-annotation of cell types in non-model species while using scRNAseq. To address this problem, we discuss recently published approaches that help annotate scRNAseq clusters for any non-model organism. We first suggest that annotating with an evolutionary context of cell lineages will aid in the discovery of novel cell types and provide a marker-free approach to compare cell types across distantly related species. Secondly, machine learning has greatly improved bioinformatic analyses, so we highlight some open-source programs that use reference-free approaches to annotate cell clusters. Lastly, we propose the use of unannotated genes as potential cell markers for non-model organisms, as many do not have fully annotated genomes and these data are often disregarded. Improving single-cell annotations will aid the discovery of novel cell types and enhance our understanding of non-model organisms at a cellular level. By unifying approaches to annotate cell types in non-model organisms, we can increase the confidence of cell annotation label transfer and the flexibility to discover novel cell types.
Collapse
Affiliation(s)
- Kevin H Wong
- Department of Marine Biology and Ecology, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, Florida, USA, 33149
| | - Natalia Andrade Rodriguez
- Department of Marine Biology and Ecology, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, Florida, USA, 33149
| | - Nikki Traylor-Knowles
- Department of Marine Biology and Ecology, Rosenstiel School of Marine, Atmospheric, and Earth Science, University of Miami, Miami, Florida, USA, 33149
| |
Collapse
|
29
|
Holman AR, Tran S, Destici E, Farah EN, Li T, Nelson AC, Engler AJ, Chi NC. Single-cell multi-modal integrative analyses highlight functional dynamic gene regulatory networks directing human cardiac development. CELL GENOMICS 2024; 4:100680. [PMID: 39437788 PMCID: PMC11605693 DOI: 10.1016/j.xgen.2024.100680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 08/01/2024] [Accepted: 09/23/2024] [Indexed: 10/25/2024]
Abstract
Illuminating the precise stepwise genetic programs directing cardiac development provides insights into the mechanisms of congenital heart disease and strategies for cardiac regenerative therapies. Here, we integrate in vitro and in vivo human single-cell multi-omic studies with high-throughput functional genomic screening to reveal dynamic, cardiac-specific gene regulatory networks (GRNs) and transcriptional regulators during human cardiomyocyte development. Interrogating developmental trajectories reconstructed from single-cell data unexpectedly reveal divergent cardiomyocyte lineages with distinct gene programs based on developmental signaling pathways. High-throughput functional genomic screens identify key transcription factors from inferred GRNs that are functionally relevant for cardiomyocyte lineages derived from each pathway. Notably, we discover a critical heat shock transcription factor 1 (HSF1)-mediated cardiometabolic GRN controlling cardiac mitochondrial/metabolic function and cell survival, also observed in fetal human cardiomyocytes. Overall, these multi-modal genomic studies enable the systematic discovery and validation of coordinated GRNs and transcriptional regulators controlling the development of distinct human cardiomyocyte populations.
Collapse
Affiliation(s)
- Alyssa R Holman
- Division of Cardiology, Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA; Biomedical Sciences Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
| | - Shaina Tran
- Division of Cardiology, Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Eugin Destici
- Division of Cardiology, Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Elie N Farah
- Division of Cardiology, Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Ting Li
- Division of Cardiology, Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Aileena C Nelson
- Division of Cardiology, Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Adam J Engler
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA; Institute of Engineering Medicine, University of California, San Diego, La Jolla, CA 92093, USA; Sanford Consortium for Regenerative Medicine, La Jolla, CA 92093, USA
| | - Neil C Chi
- Division of Cardiology, Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA; Institute of Engineering Medicine, University of California, San Diego, La Jolla, CA 92093, USA; Institute of Genomic Medicine, University of California, San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
30
|
Yang L, Han Y, Zhang T, Dong X, Ge J, Roy A, Zhu J, Lu T, Jeya Vandana J, de Silva N, Robertson CC, Xiang JZ, Pan C, Sun Y, Que J, Evans T, Liu C, Wang W, Naji A, Parker SCJ, Schwartz RE, Chen S. Human vascularized macrophage-islet organoids to model immune-mediated pancreatic β cell pyroptosis upon viral infection. Cell Stem Cell 2024; 31:1612-1629.e8. [PMID: 39232561 PMCID: PMC11546835 DOI: 10.1016/j.stem.2024.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 06/05/2024] [Accepted: 08/09/2024] [Indexed: 09/06/2024]
Abstract
There is a paucity of human models to study immune-mediated host damage. Here, we utilized the GeoMx spatial multi-omics platform to analyze immune cell changes in COVID-19 pancreatic autopsy samples, revealing an accumulation of proinflammatory macrophages. Single-cell RNA sequencing (scRNA-seq) analysis of human islets exposed to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) or coxsackievirus B4 (CVB4) viruses identified activation of proinflammatory macrophages and β cell pyroptosis. To distinguish viral versus proinflammatory-macrophage-mediated β cell pyroptosis, we developed human pluripotent stem cell (hPSC)-derived vascularized macrophage-islet (VMI) organoids. VMI organoids exhibited enhanced marker expression and function in both β cells and endothelial cells compared with separately cultured cells. Notably, proinflammatory macrophages within VMI organoids induced β cell pyroptosis. Mechanistic investigations highlighted TNFSF12-TNFRSF12A involvement in proinflammatory-macrophage-mediated β cell pyroptosis. This study established hPSC-derived VMI organoids as a valuable tool for studying immune-cell-mediated host damage and uncovered the mechanism of β cell damage during viral exposure.
Collapse
Affiliation(s)
- Liuliu Yang
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Center for Genomic Health, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Disease, Haihe Laboratory of Cell Ecosystem, Institute of Hematology & Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China; Tianjin Institute of Health Science, Tianjin 301600, China.
| | - Yuling Han
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Center for Genomic Health, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China; Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China; Beijing Institute for Stem Cell and Regenerative Medicine, Beijing 100101, China
| | - Tuo Zhang
- Genomic Resource Core Facility, Weill Cornell Medicine, New York, NY 10065, USA
| | - Xue Dong
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA
| | - Jian Ge
- Columbia Center for Human Development, Department of Medicine, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Aadita Roy
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA
| | - Jiajun Zhu
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Center for Genomic Health, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA
| | - Tiankun Lu
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Center for Genomic Health, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA
| | - J Jeya Vandana
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Center for Genomic Health, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA
| | - Neranjan de Silva
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Center for Genomic Health, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA
| | - Catherine C Robertson
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Jenny Z Xiang
- Genomic Resource Core Facility, Weill Cornell Medicine, New York, NY 10065, USA
| | - Chendong Pan
- Genomic Resource Core Facility, Weill Cornell Medicine, New York, NY 10065, USA
| | - Yanjie Sun
- Genomic Resource Core Facility, Weill Cornell Medicine, New York, NY 10065, USA
| | - Jianwen Que
- Columbia Center for Human Development, Department of Medicine, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Todd Evans
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Center for Genomic Health, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA
| | - Chengyang Liu
- Department of Surgery, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA
| | - Wei Wang
- Department of Surgery, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA
| | - Ali Naji
- Department of Surgery, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA
| | - Stephen C J Parker
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA; Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA; Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Robert E Schwartz
- Division of Gastroenterology and Hepatology, Department of Medicine, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Department of Physiology, Biophysics and Systems Biology, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA.
| | - Shuibing Chen
- Department of Surgery, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA; Center for Genomic Health, Weill Cornell Medicine, 1300 York Avenue, New York, NY 10065, USA.
| |
Collapse
|
31
|
Zeng Y, Ma Q, Chen J, Kong X, Chen Z, Liu H, Liu L, Qian Y, Wang X, Lu S. Single-cell sequencing: Current applications in various tuberculosis specimen types. Cell Prolif 2024; 57:e13698. [PMID: 38956399 PMCID: PMC11533074 DOI: 10.1111/cpr.13698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 05/21/2024] [Accepted: 06/07/2024] [Indexed: 07/04/2024] Open
Abstract
Tuberculosis (TB) is a chronic disease caused by Mycobacterium tuberculosis (M.tb) and responsible for millions of deaths worldwide each year. It has a complex pathogenesis that primarily affects the lungs but can also impact systemic organs. In recent years, single-cell sequencing technology has been utilized to characterize the composition and proportion of immune cell subpopulations associated with the pathogenesis of TB disease since it has a high resolution that surpasses conventional techniques. This paper reviews the current use of single-cell sequencing technologies in TB research and their application in analysing specimens from various sources of TB, primarily peripheral blood and lung specimens. The focus is on how these technologies can reveal dynamic changes in immune cell subpopulations, genes and proteins during disease progression after M.tb infection. Based on the current findings, single-cell sequencing has significant potential clinical value in the field of TB research. Next, we will focus on the real-world applications of the potential targets identified through single-cell sequencing for diagnostics, therapeutics and the development of effective vaccines.
Collapse
Affiliation(s)
- Yuqin Zeng
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| | - Quan Ma
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| | - Jinyun Chen
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| | - Xingxing Kong
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| | - Zhanpeng Chen
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| | - Huazhen Liu
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| | - Lanlan Liu
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| | - Yan Qian
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| | - Xiaomin Wang
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| | - Shuihua Lu
- National Clinical Research Center for Infectious DiseaseShenzhen Third People's HospitalShenzhenGuangdong ProvinceChina
| |
Collapse
|
32
|
Ren YY, Liu Z. Characterization of Single-Cell Cis-regulatory Elements Informs Implications for Cell Differentiation. Genome Biol Evol 2024; 16:evae241. [PMID: 39506564 PMCID: PMC11580522 DOI: 10.1093/gbe/evae241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2024] [Revised: 10/17/2024] [Accepted: 11/04/2024] [Indexed: 11/08/2024] Open
Abstract
Cis-regulatory elements govern the specific patterns and dynamics of gene expression in cells during development, which are the fundamental mechanisms behind cell differentiation. However, the genomic characteristics of single-cell cis-regulatory elements closely linked to cell differentiation during development remain unclear. To explore this, we systematically analyzed ∼250,000 putative single-cell cis-regulatory elements obtained from snATAC-seq analysis of the developing mouse cerebellum. We found that over 80% of these single-cell cis-regulatory elements show pleiotropic effects, being active in 2 or more cell types. The pleiotropic degrees of proximal and distal single-cell cis-regulatory elements are positively correlated with the density and diversity of transcription factor binding motifs and GC content. There is a negative correlation between the pleiotropic degrees of single-cell cis-regulatory elements and their distances to the nearest transcription start sites, and proximal single-cell cis-regulatory elements display higher relevance strengths than distal ones. Furthermore, both proximal and distal single-cell cis-regulatory elements related to cell differentiation exhibit enhanced sequence-level evolutionary conservation, increased density and diversity of transcription factor binding motifs, elevated GC content, and greater distances from their nearest genes. Together, our findings reveal the general genomic characteristics of putative single-cell cis-regulatory elements and provide insights into the genomic and evolutionary mechanisms by which single-cell cis-regulatory elements regulate cell differentiation during development.
Collapse
Affiliation(s)
- Ying-Ying Ren
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Key Laboratory of Genetic Evolution & Animal Models, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| | - Zhen Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Key Laboratory of Genetic Evolution & Animal Models, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
- Yunnan Key Laboratory of Biodiversity Information, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| |
Collapse
|
33
|
Tang Z, Zhou M, Zhang K, Song Q. scPerb: Predict single-cell perturbation via style transfer-based variational autoencoder. J Adv Res 2024:S2090-1232(24)00489-2. [PMID: 39486785 DOI: 10.1016/j.jare.2024.10.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 10/06/2024] [Accepted: 10/28/2024] [Indexed: 11/04/2024] Open
Abstract
INTRODUCTION Traditional methods for obtaining cellular responses after perturbation are usually labor-intensive and costly, especially when working with multiple different experimental conditions. Therefore, accurate prediction of cellular responses to perturbations is of great importance in computational biology. Existing methodologies, such as graph-based approaches, vector arithmetic, and neural networks, either mix perturbation-related variances with cell-type-specific patterns or implicitly distinguish them within black-box models. OBJECTIVES This study aims to introduce and demonstrate a novel framework, scPerb, which explicitly extracts perturbation-related variances and transfers them from unperturbed to perturbed cells to accurately predict the effect of perturbation in single-cell level. METHODS scPerb utilizes a style transfer strategy by incorporating a style encoder into the architecture of a variational autoencoder. The style encoder captures the differences in latent representations between unperturbed and perturbed cells, enabling accurate prediction of post-perturbation gene expression data. RESULTS Comprehensive comparisons with existing methods demonstrate that scPerb delivers improved performance and higher accuracy in predicting cellular responses to perturbations. Notably, scPerb outperforms other methods across multiple datasets, achieving superior R2 values of 0.98, 0.98, and 0.96 on three benchmarking datasets. CONCLUSION scPerb offers a significant advancement in predicting cellular responses by effectively separating and transferring perturbation-related variances. This framework not only enhances prediction accuracy but also provides a robust tool for computational biology, addressing the limitations of current methodologies.
Collapse
Affiliation(s)
- Zijia Tang
- Trinity College, Duke University, Durham, NC, USA
| | - Minghao Zhou
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
| | - Kai Zhang
- Department of Environmental Health Sciences, University at Albany, State University of New York School of Public Health, USA
| | - Qianqian Song
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
34
|
Wang R, Zheng Y, Zhang Z, Song K, Wu E, Zhu X, Wu TP, Ding J. MATES: a deep learning-based model for locus-specific quantification of transposable elements in single cell. Nat Commun 2024; 15:8798. [PMID: 39394211 PMCID: PMC11470080 DOI: 10.1038/s41467-024-53114-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 09/24/2024] [Indexed: 10/13/2024] Open
Abstract
Transposable elements (TEs) are crucial for genetic diversity and gene regulation. Current single-cell quantification methods often align multi-mapping reads to either 'best-mapped' or 'random-mapped' locations and categorize them at the subfamily levels, overlooking the biological necessity for accurate, locus-specific TE quantification. Moreover, these existing methods are primarily designed for and focused on transcriptomics data, which restricts their adaptability to single-cell data of other modalities. To address these challenges, here we introduce MATES, a deep-learning approach that accurately allocates multi-mapping reads to specific loci of TEs, utilizing context from adjacent read alignments flanking the TE locus. When applied to diverse single-cell omics datasets, MATES shows improved performance over existing methods, enhancing the accuracy of TE quantification and aiding in the identification of marker TEs for identified cell populations. This development facilitates the exploration of single-cell heterogeneity and gene regulation through the lens of TEs, offering an effective transposon quantification tool for the single-cell genomics community.
Collapse
Affiliation(s)
- Ruohan Wang
- School of Computer Science, McGill University, Montreal, Quebec, Canada
- Meakins-Christie Laboratories, Translational Research in Respiratory Diseases Program, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada
- Department of Medicine, McGill University, Montreal, Quebec, Canada
| | - Yumin Zheng
- Meakins-Christie Laboratories, Translational Research in Respiratory Diseases Program, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada
- Department of Medicine, McGill University, Montreal, Quebec, Canada
- Quantitative Life Sciences, Faculty of Medicine & Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Zijian Zhang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Kailu Song
- Meakins-Christie Laboratories, Translational Research in Respiratory Diseases Program, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada
- Department of Medicine, McGill University, Montreal, Quebec, Canada
- Quantitative Life Sciences, Faculty of Medicine & Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Erxi Wu
- Department of Neurosurgery, Baylor College of Medicine, Temple, TX, USA
- College of Medicine and Irma Lerma Rangel College of Pharmacy, Texas A&M University, College Station, TX, USA
- LIVESTRONG Cancer Institutes and Department of Oncology, Dell Medical School, The University of Texas at Austin, Austin, TX, USA
- Neuroscience Institute and Department of Neurosurgery, Baylor Scott & White Health, Temple, TX, USA
| | | | - Tao P Wu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
| | - Jun Ding
- School of Computer Science, McGill University, Montreal, Quebec, Canada.
- Meakins-Christie Laboratories, Translational Research in Respiratory Diseases Program, Research Institute of the McGill University Health Centre, Montreal, Quebec, Canada.
- Department of Medicine, McGill University, Montreal, Quebec, Canada.
- Quantitative Life Sciences, Faculty of Medicine & Health Sciences, McGill University, Montreal, Quebec, Canada.
- Mila-Quebec AI Institue, Montreal, Quebec, Canada.
| |
Collapse
|
35
|
Yan H, Mendieta JP, Zhang X, Marand AP, Liang Y, Luo Z, Minow MAA, Jang H, Li X, Roule T, Wagner D, Tu X, Wang Y, Jiang D, Zhong S, Huang L, Wessler SR, Schmitz RJ. Evolution of plant cell-type-specific cis-regulatory elements. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.08.574753. [PMID: 38260561 PMCID: PMC10802394 DOI: 10.1101/2024.01.08.574753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Cis-regulatory elements (CREs) are critical in regulating gene expression, and yet understanding of CRE evolution remains challenging. Here, we constructed a comprehensive single-cell atlas of chromatin accessibility in Oryza sativa, integrating data from 103,911 nuclei representing 126 discrete cell states across nine distinct organs. We used comparative genomics to compare cell-type resolved chromatin accessibility between O. sativa and 57,552 nuclei from four additional grass species (Zea mays, Sorghum bicolor, Panicum miliaceum, and Urochloa fusca). Accessible chromatin regions (ACRs) had different levels of conservation depending on the degree of cell-type specificity. We found a complex relationship between ACRs with conserved noncoding sequences, cell-type specificity, conservation, and tissue-specific switching. Additionally, we found that epidermal ACRs were less conserved compared to other cell types, potentially indicating that more rapid regulatory evolution has occurred in the L1-derived epidermal layer of these species. Finally, we identified and characterized a conserved subset of ACRs that overlapped the repressive histone modification H3K27me3, implicating them as potentially silencer-like CREs maintained by evolution. Collectively, this comparative genomics approach highlights the dynamics of plant cell-type-specific CRE evolution.
Collapse
|
36
|
Mendieta JP, Tu X, Jiang D, Yan H, Zhang X, Marand AP, Zhong S, Schmitz RJ. Investigating the cis-regulatory basis of C 3 and C 4 photosynthesis in grasses at single-cell resolution. Proc Natl Acad Sci U S A 2024; 121:e2402781121. [PMID: 39312655 PMCID: PMC11459142 DOI: 10.1073/pnas.2402781121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Accepted: 07/23/2024] [Indexed: 09/25/2024] Open
Abstract
While considerable knowledge exists about the enzymes pivotal for C4 photosynthesis, much less is known about the cis-regulation important for specifying their expression in distinct cell types. Here, we use single-cell-indexed ATAC-seq to identify cell-type-specific accessible chromatin regions (ACRs) associated with C4 enzymes for five different grass species. This study spans four C4 species, covering three distinct photosynthetic subtypes: Zea mays and Sorghum bicolor (NADP-dependent malic enzyme), Panicum miliaceum (NAD-dependent malic enzyme), Urochloa fusca (phosphoenolpyruvate carboxykinase), along with the C3 outgroup Oryza sativa. We studied the cis-regulatory landscape of enzymes essential across all C4 species and those unique to C4 subtypes, measuring cell-type-specific biases for C4 enzymes using chromatin accessibility data. Integrating these data with phylogenetics revealed diverse co-option of gene family members between species, showcasing the various paths of C4 evolution. Besides promoter proximal ACRs, we found that, on average, C4 genes have two to three distal cell-type-specific ACRs, highlighting the complexity and divergent nature of C4 evolution. Examining the evolutionary history of these cell-type-specific ACRs revealed a spectrum of conserved and novel ACRs, even among closely related species, indicating ongoing evolution of cis-regulation at these C4 loci. This study illuminates the dynamic and complex nature of cis-regulatory elements evolution in C4 photosynthesis, particularly highlighting the intricate cis-regulatory evolution of key loci. Our findings offer a valuable resource for future investigations, potentially aiding in the optimization of C3 crop performance under changing climatic conditions.
Collapse
Affiliation(s)
| | - Xiaoyu Tu
- Shanghai Collaborative Innovation Center of Agri-Seeds, Joint Center for Single-Cell Biology, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai200240, China
| | - Daiquan Jiang
- State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, SAR
| | - Haidong Yan
- Department of Genetics, University of Georgia, Athens, GA30605
| | - Xuan Zhang
- Department of Genetics, University of Georgia, Athens, GA30605
| | | | - Silin Zhong
- State Key Laboratory of Agrobiotechnology, School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, SAR
| | | |
Collapse
|
37
|
Cai S, Yin N. Single-cell transcriptome and chromatin accessibility mapping of upper lip and primary palate fusion. J Cell Mol Med 2024; 28:e70128. [PMID: 39392189 PMCID: PMC11467802 DOI: 10.1111/jcmm.70128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 08/17/2024] [Accepted: 09/20/2024] [Indexed: 10/12/2024] Open
Abstract
Cleft lip and/or primary palate (CL/P) represent a prevalent congenital malformation, the aetiology of which is highly intricate. Although it is generally accepted that the condition arises from failed fusion between the upper lip and primary palate, the precise mechanism underlying this fusion process remains enigmatic. In this study, we utilized transposase-accessible chromatin sequencing (scATAC-seq) and single-cell RNA sequencing (scRNA-seq) to interrogate lambdoidal junction tissue derived from C57BL/6J mouse embryos at critical stages of embryogenesis (10.5, 11.5 and 12.5 embryonic days). We successfully identified distinct subgroups of mesenchymal and ectodermal cells involved in the fusion process and characterized their unique transcriptional profiles. Furthermore, we conducted cell differentiation trajectory analysis, revealing a dynamic repertoire of genes that are sequentially activated or repressed during pseudotime, facilitating the transition of relevant cell types. Additionally, we employed scATAC data to identify key genes associated with the fusion process and demonstrated differential chromatin accessibility across major cell types. Finally, we constructed a dynamic intercellular communication network and predicted upstream transcriptional regulators of critical genes involved in important signalling pathways. Our findings provide a valuable resource for future studies on upper lip and primary palate development, as well as congenital defects.
Collapse
Affiliation(s)
- Sini Cai
- The Department of Cleft Lip and Palate of Plastic Surgery HospitalChinese Academy of Medical Sciences and Peking Union Medical CollegeBeijingChina
- Medical Cosmetic Center of Dermatology Hospital of Southern Medical UniversityGuangdong Provincial Dermatology HospitalGuangzhouChina
| | - Ningbei Yin
- The Department of Cleft Lip and Palate of Plastic Surgery HospitalChinese Academy of Medical Sciences and Peking Union Medical CollegeBeijingChina
| |
Collapse
|
38
|
Lee AS, Ayers LJ, Kosicki M, Chan WM, Fozo LN, Pratt BM, Collins TE, Zhao B, Rose MF, Sanchis-Juan A, Fu JM, Wong I, Zhao X, Tenney AP, Lee C, Laricchia KM, Barry BJ, Bradford VR, Jurgens JA, England EM, Lek M, MacArthur DG, Lee EA, Talkowski ME, Brand H, Pennacchio LA, Engle EC. A cell type-aware framework for nominating non-coding variants in Mendelian regulatory disorders. Nat Commun 2024; 15:8268. [PMID: 39333082 PMCID: PMC11436875 DOI: 10.1038/s41467-024-52463-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 09/04/2024] [Indexed: 09/29/2024] Open
Abstract
Unsolved Mendelian cases often lack obvious pathogenic coding variants, suggesting potential non-coding etiologies. Here, we present a single cell multi-omic framework integrating embryonic mouse chromatin accessibility, histone modification, and gene expression assays to discover cranial motor neuron (cMN) cis-regulatory elements and subsequently nominate candidate non-coding variants in the congenital cranial dysinnervation disorders (CCDDs), a set of Mendelian disorders altering cMN development. We generate single cell epigenomic profiles for ~86,000 cMNs and related cell types, identifying ~250,000 accessible regulatory elements with cognate gene predictions for ~145,000 putative enhancers. We evaluate enhancer activity for 59 elements using an in vivo transgenic assay and validate 44 (75%), demonstrating that single cell accessibility can be a strong predictor of enhancer activity. Applying our cMN atlas to 899 whole genome sequences from 270 genetically unsolved CCDD pedigrees, we achieve significant reduction in our variant search space and nominate candidate variants predicted to regulate known CCDD disease genes MAFB, PHOX2A, CHN1, and EBF3 - as well as candidates in recurrently mutated enhancers through peak- and gene-centric allelic aggregation. This work delivers non-coding variant discoveries of relevance to CCDDs and a generalizable framework for nominating non-coding variants of potentially high functional impact in other Mendelian disorders.
Collapse
Affiliation(s)
- Arthur S Lee
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA.
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Lauren J Ayers
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Michael Kosicki
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Wai-Man Chan
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Lydia N Fozo
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Brandon M Pratt
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Thomas E Collins
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Boxun Zhao
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Matthew F Rose
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Pathology, Boston Children's Hospital, Boston, MA, USA
- Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Medical Genetics Training Program, Harvard Medical School, Boston, MA, USA
| | - Alba Sanchis-Juan
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Jack M Fu
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Isaac Wong
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Xuefang Zhao
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Alan P Tenney
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Cassia Lee
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Harvard College, Cambridge, MA, USA
| | - Kristen M Laricchia
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Brenda J Barry
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Victoria R Bradford
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| | - Julie A Jurgens
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Eleina M England
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Monkol Lek
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Daniel G MacArthur
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Centre for Population Genomics, Garvan Institute of Medical Research and UNSW Sydney, Sydney, NSW, Australia
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, VIC, Australia
| | - Eunjung Alice Lee
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Michael E Talkowski
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Harrison Brand
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Pediatric Surgical Research Laboratories, Massachusetts General Hospital, Boston, MA, USA
| | - Len A Pennacchio
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Elizabeth C Engle
- Department of Neurology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
- Kirby Neurobiology Center, Boston Children's Hospital, Boston, MA, USA.
- Manton Center for Orphan Disease Research, Boston Children's Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA.
- Medical Genetics Training Program, Harvard Medical School, Boston, MA, USA.
- Department of Ophthalmology, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
39
|
Yi SV. Epigenetics Research in Evolutionary Biology: Perspectives on Timescales and Mechanisms. Mol Biol Evol 2024; 41:msae170. [PMID: 39235767 PMCID: PMC11376073 DOI: 10.1093/molbev/msae170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 08/06/2024] [Accepted: 08/08/2024] [Indexed: 09/06/2024] Open
Abstract
Epigenetics research in evolutionary biology encompasses a variety of research areas, from regulation of gene expression to inheritance of environmentally mediated phenotypes. Such divergent research foci can occasionally render the umbrella term "epigenetics" ambiguous. Here I discuss several areas of contemporary epigenetics research in the context of evolutionary biology, aiming to provide balanced views across timescales and molecular mechanisms. The importance of epigenetics in development is now being assessed in many nonmodel species. These studies not only confirm the importance of epigenetic marks in developmental processes, but also highlight the significant diversity in epigenetic regulatory mechanisms across taxa. Further, these comparative epigenomic studies have begun to show promise toward enhancing our understanding of how regulatory programs evolve. A key property of epigenetic marks is that they can be inherited along mitotic cell lineages, and epigenetic differences that occur during early development can have lasting consequences on the organismal phenotypes. Thus, epigenetic marks may play roles in short-term (within an organism's lifetime or to the next generation) adaptation and phenotypic plasticity. However, the extent to which observed epigenetic variation occurs independently of genetic influences remains uncertain, due to the widespread impact of genetics on epigenetic variation and the limited availability of comprehensive (epi)genomic resources from most species. While epigenetic marks can be inherited independently of genetic sequences in some species, there is little evidence that such "transgenerational inheritance" is a general phenomenon. Rather, molecular mechanisms of epigenetic inheritance are highly variable between species.
Collapse
Affiliation(s)
- Soojin V Yi
- Department of Ecology, Evolution and Marine Biology, University of California, Santa Barbara, Santa Barbara, CA 93106, USA
- Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, CA 93106, USA
- Neuroscience Research Institute, University of California, Santa Barbara, Santa Barbara, CA 93106, USA
| |
Collapse
|
40
|
LeRoy N, Smith J, Zheng G, Rymuza J, Gharavi E, Brown D, Zhang A, Sheffield N. Fast clustering and cell-type annotation of scATAC data using pre-trained embeddings. NAR Genom Bioinform 2024; 6:lqae073. [PMID: 38974799 PMCID: PMC11224678 DOI: 10.1093/nargab/lqae073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 04/29/2024] [Accepted: 06/20/2024] [Indexed: 07/09/2024] Open
Abstract
Data from the single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) are now widely available. One major computational challenge is dealing with high dimensionality and inherent sparsity, which is typically addressed by producing lower dimensional representations of single cells for downstream clustering tasks. Current approaches produce such individual cell embeddings directly through a one-step learning process. Here, we propose an alternative approach by building embedding models pre-trained on reference data. We argue that this provides a more flexible analysis workflow that also has computational performance advantages through transfer learning. We implemented our approach in scEmbed, an unsupervised machine-learning framework that learns low-dimensional embeddings of genomic regulatory regions to represent and analyze scATAC-seq data. scEmbed performs well in terms of clustering ability and has the key advantage of learning patterns of region co-occurrence that can be transferred to other, unseen datasets. Moreover, models pre-trained on reference data can be exploited to build fast and accurate cell-type annotation systems without the need for other data modalities. scEmbed is implemented in Python and it is available to download from GitHub. We also make our pre-trained models available on huggingface for public use. scEmbed is open source and available at https://github.com/databio/geniml. Pre-trained models from this work can be obtained on huggingface: https://huggingface.co/databio.
Collapse
Affiliation(s)
- Nathan J LeRoy
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
- Department of Biomedical Engineering, School of Medicine, University of Virginia, Charlottesville, VA 22904, USA
| | - Jason P Smith
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
- Child Health Research Center, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
| | - Guangtao Zheng
- Department of Computer Science, School of Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | - Julia Rymuza
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
| | - Erfaneh Gharavi
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
- School of Data Science, University of Virginia, Charlottesville, VA 22904, USA
| | - Donald E Brown
- School of Data Science, University of Virginia, Charlottesville, VA 22904, USA
- Department of Systems and Information Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | - Aidong Zhang
- Department of Biomedical Engineering, School of Medicine, University of Virginia, Charlottesville, VA 22904, USA
- Department of Computer Science, School of Engineering, University of Virginia, Charlottesville, VA 22908, USA
- School of Data Science, University of Virginia, Charlottesville, VA 22904, USA
| | - Nathan C Sheffield
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
- Department of Biomedical Engineering, School of Medicine, University of Virginia, Charlottesville, VA 22904, USA
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
- Child Health Research Center, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
- Department of Computer Science, School of Engineering, University of Virginia, Charlottesville, VA 22908, USA
- School of Data Science, University of Virginia, Charlottesville, VA 22904, USA
- Department of Public Health Sciences, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA
| |
Collapse
|
41
|
Read DF, Booth GT, Daza RM, Jackson DL, Gladden RG, Srivatsan SR, Ewing B, Franks JM, Spurrell CH, Gomes AR, O'Day D, Gogate AA, Martin BK, Larson H, Pfleger C, Starita L, Lin Y, Shendure J, Lin S, Trapnell C. Single-cell analysis of chromatin and expression reveals age- and sex-associated alterations in the human heart. Commun Biol 2024; 7:1052. [PMID: 39187646 PMCID: PMC11347658 DOI: 10.1038/s42003-024-06582-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 07/11/2024] [Indexed: 08/28/2024] Open
Abstract
Sex differences and age-related changes in the human heart at the tissue, cell, and molecular level have been well-documented and many may be relevant for cardiovascular disease. However, how molecular programs within individual cell types vary across individuals by age and sex remains poorly characterized. To better understand this variation, we performed single-nucleus combinatorial indexing (sci) ATAC- and RNA-Seq in human heart samples from nine donors. We identify hundreds of differentially expressed genes by age and sex and find epigenetic signatures of variation in ATAC-Seq data in this discovery cohort. We then scale up our single-cell RNA-Seq analysis by combining our data with five recently published single nucleus RNA-Seq datasets of healthy adult hearts. We find variation such as metabolic alterations by sex and immune changes by age in differential expression tests, as well as alterations in abundance of cardiomyocytes by sex and neurons with age. In addition, we compare our adult-derived ATAC-Seq profiles to analogous fetal cell types to identify putative developmental-stage-specific regulatory factors. Finally, we train predictive models of cell-type-specific RNA expression levels utilizing ATAC-Seq profiles to link distal regulatory sequences to promoters, quantifying the predictive value of a simple TF-to-expression regulatory grammar and identifying cell-type-specific TFs. Our analysis represents the largest single-cell analysis of cardiac variation by age and sex to date and provides a resource for further study of healthy cardiac variation and transcriptional regulation at single-cell resolution.
Collapse
Affiliation(s)
- David F Read
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Gregory T Booth
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Riza M Daza
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Dana L Jackson
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Rula Green Gladden
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Sanjay R Srivatsan
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Brent Ewing
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Jennifer M Franks
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | | | | | - Diana O'Day
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Aishwarya A Gogate
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
- Seattle Children's Research Institute, Seattle, WA, USA
| | - Beth K Martin
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Haleigh Larson
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Christian Pfleger
- University of Washington School of Medicine, Division of Cardiology, Seattle, WA, USA
| | - Lea Starita
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Yiing Lin
- Department of Surgery, Washington University, St Louis, MO, USA
| | - Jay Shendure
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA.
- Seattle Children's Research Institute, Seattle, WA, USA.
- Howard Hughes Medical Institute, Seattle, WA, USA.
- Allen Discovery Center for Cell Lineage Tracing, Seattle, WA, USA.
| | - Shin Lin
- University of Washington School of Medicine, Division of Cardiology, Seattle, WA, USA.
| | - Cole Trapnell
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA.
| |
Collapse
|
42
|
Luo S, Germain PL, Robinson MD, von Meyenn F. Benchmarking computational methods for single-cell chromatin data analysis. Genome Biol 2024; 25:225. [PMID: 39152456 PMCID: PMC11328424 DOI: 10.1186/s13059-024-03356-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 07/29/2024] [Indexed: 08/19/2024] Open
Abstract
BACKGROUND Single-cell chromatin accessibility assays, such as scATAC-seq, are increasingly employed in individual and joint multi-omic profiling of single cells. As the accumulation of scATAC-seq and multi-omics datasets continue, challenges in analyzing such sparse, noisy, and high-dimensional data become pressing. Specifically, one challenge relates to optimizing the processing of chromatin-level measurements and efficiently extracting information to discern cellular heterogeneity. This is of critical importance, since the identification of cell types is a fundamental step in current single-cell data analysis practices. RESULTS We benchmark 8 feature engineering pipelines derived from 5 recent methods to assess their ability to discover and discriminate cell types. By using 10 metrics calculated at the cell embedding, shared nearest neighbor graph, or partition levels, we evaluate the performance of each method at different data processing stages. This comprehensive approach allows us to thoroughly understand the strengths and weaknesses of each method and the influence of parameter selection. CONCLUSIONS Our analysis provides guidelines for choosing analysis methods for different datasets. Overall, feature aggregation, SnapATAC, and SnapATAC2 outperform latent semantic indexing-based methods. For datasets with complex cell-type structures, SnapATAC and SnapATAC2 are preferred. With large datasets, SnapATAC2 and ArchR are most scalable.
Collapse
Affiliation(s)
- Siyuan Luo
- Laboratory of Nutrition and Metabolic Epigenetics, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Pierre-Luc Germain
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
- SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
- Institute for Neuroscience, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
| | - Mark D Robinson
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland.
- SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland.
| | - Ferdinand von Meyenn
- Laboratory of Nutrition and Metabolic Epigenetics, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland.
| |
Collapse
|
43
|
Rachid Zaim S, Pebworth MP, McGrath I, Okada L, Weiss M, Reading J, Czartoski JL, Torgerson TR, McElrath MJ, Bumol TF, Skene PJ, Li XJ. MOCHA's advanced statistical modeling of scATAC-seq data enables functional genomic inference in large human cohorts. Nat Commun 2024; 15:6828. [PMID: 39122670 PMCID: PMC11316085 DOI: 10.1038/s41467-024-50612-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 07/13/2024] [Indexed: 08/12/2024] Open
Abstract
Single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) is being increasingly used to study gene regulation. However, major analytical gaps limit its utility in studying gene regulatory programs in complex diseases. In response, MOCHA (Model-based single cell Open CHromatin Analysis) presents major advances over existing analysis tools, including: 1) improving identification of sample-specific open chromatin, 2) statistical modeling of technical drop-out with zero-inflated methods, 3) mitigation of false positives in single cell analysis, 4) identification of alternative transcription-starting-site regulation, and 5) modules for inferring temporal gene regulatory networks from longitudinal data. These advances, in addition to open chromatin analyses, provide a robust framework after quality control and cell labeling to study gene regulatory programs in human disease. We benchmark MOCHA with four state-of-the-art tools to demonstrate its advances. We also construct cross-sectional and longitudinal gene regulatory networks, identifying potential mechanisms of COVID-19 response. MOCHA provides researchers with a robust analytical tool for functional genomic inference from scATAC-seq data.
Collapse
Affiliation(s)
| | | | | | - Lauren Okada
- Allen Institute for Immunology, Seattle, WA, USA
| | - Morgan Weiss
- Allen Institute for Immunology, Seattle, WA, USA
| | | | - Julie L Czartoski
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | | | - M Juliana McElrath
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | | | | | - Xiao-Jun Li
- Allen Institute for Immunology, Seattle, WA, USA.
| |
Collapse
|
44
|
Yang L, Han Y, Zhang T, Dong X, Ge J, Roy A, Zhu J, Lu T, Vandana JJ, de Silva N, Robertson CC, Xiang JZ, Pan C, Sun Y, Que J, Evans T, Liu C, Wang W, Naji A, Parker SC, Schwartz RE, Chen S. Human Vascularized Macrophage-Islet Organoids to Model Immune-Mediated Pancreatic β cell Pyroptosis upon Viral Infection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.05.606734. [PMID: 39149298 PMCID: PMC11326194 DOI: 10.1101/2024.08.05.606734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
There is a paucity of human models to study immune-mediated host damage. Here, we utilized the GeoMx spatial multi-omics platform to analyze immune cell changes in COVID-19 pancreatic autopsy samples, revealing an accumulation of proinflammatory macrophages. Single cell RNA-seq analysis of human islets exposed to SARS-CoV-2 or Coxsackievirus B4 (CVB4) viruses identified activation of proinflammatory macrophages and β cell pyroptosis. To distinguish viral versus proinflammatory macrophage-mediated β cell pyroptosis, we developed human pluripotent stem cell (hPSC)-derived vascularized macrophage-islet (VMI) organoids. VMI organoids exhibited enhanced marker expression and function in both β cells and endothelial cells compared to separately cultured cells. Notably, proinflammatory macrophages within VMI organoids induced β cell pyroptosis. Mechanistic investigations highlighted TNFSF12-TNFRSF12A involvement in proinflammatory macrophage-mediated β cell pyroptosis. This study established hPSC-derived VMI organoids as a valuable tool for studying immune cell-mediated host damage and uncovered mechanism of β cell damage during viral exposure.
Collapse
Affiliation(s)
- Liuliu Yang
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Center for Genomic Health, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Disease, Haihe Laboratory of Cell Ecosystem, Institute of Hematology & Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin 300020, China
- Tianjin Institute of Health Science, Tianjin 301600, China
| | - Yuling Han
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Center for Genomic Health, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing 100101, China
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing 100101, China
| | - Tuo Zhang
- Genomic Resource Core Facility, Weill Cornell Medicine, New York, NY 10065, USA
| | - Xue Dong
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
| | - Jian Ge
- Columbia Center for Human Development, Department of Medicine, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Aadita Roy
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
| | - Jiajun Zhu
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Center for Genomic Health, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
| | - Tiankun Lu
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Center for Genomic Health, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
| | - J. Jeya Vandana
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Center for Genomic Health, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
| | - Neranjan de Silva
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Center for Genomic Health, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
| | - Catherine C. Robertson
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Jenny Z Xiang
- Genomic Resource Core Facility, Weill Cornell Medicine, New York, NY 10065, USA
| | - Chendong Pan
- Genomic Resource Core Facility, Weill Cornell Medicine, New York, NY 10065, USA
| | - Yanjie Sun
- Genomic Resource Core Facility, Weill Cornell Medicine, New York, NY 10065, USA
| | - Jianwen Que
- Columbia Center for Human Development, Department of Medicine, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - Todd Evans
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Center for Genomic Health, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
| | - Chengyang Liu
- Department of Surgery, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA
| | - Wei Wang
- Department of Surgery, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA
| | - Ali Naji
- Department of Surgery, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA
| | - Stephen C.J. Parker
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Robert E. Schwartz
- Division of Gastroenterology and Hepatology, Department of Medicine, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Department of Physiology, Biophysics and Systems Biology, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA. New York 10021, USA
| | - Shuibing Chen
- Department of Surgery, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
- Center for Genomic Health, Weill Cornell Medicine, 1300 York Ave, New York, NY, 10065, USA
| |
Collapse
|
45
|
Yu PC, Hou D, Chang B, Liu N, Xu CH, Chen X, Hu CL, Liu T, Wang X, Zhang Q, Liu P, Jiang Y, Fei MY, Zong LJ, Zhang JY, Liu H, Chen BY, Chen SB, Wang Y, Li ZJ, Li X, Deng CH, Ren YY, Zhao M, Jiang S, Wang R, Jin J, Yang S, Xue K, Shi J, Chang CK, Shen S, Wang Z, He PC, Chen Z, Chen SJ, Sun XJ, Wang L. SMARCA5 reprograms AKR1B1-mediated fructose metabolism to control leukemogenesis. Dev Cell 2024; 59:1954-1971.e7. [PMID: 38776924 DOI: 10.1016/j.devcel.2024.04.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 03/13/2024] [Accepted: 04/30/2024] [Indexed: 05/25/2024]
Abstract
A significant variation in chromatin accessibility is an epigenetic feature of leukemia. The cause of this variation in leukemia, however, remains elusive. Here, we identify SMARCA5, a core ATPase of the imitation switch (ISWI) chromatin remodeling complex, as being responsible for aberrant chromatin accessibility in leukemia cells. We find that SMARCA5 is required to maintain aberrant chromatin accessibility for leukemogenesis and then promotes transcriptional activation of AKR1B1, an aldo/keto reductase, by recruiting transcription co-activator DDX5 and transcription factor SP1. Higher levels of AKR1B1 are associated with a poor prognosis in leukemia patients and promote leukemogenesis by reprogramming fructose metabolism. Moreover, pharmacological inhibition of AKR1B1 has been shown to have significant therapeutic effects in leukemia mice and leukemia patient cells. Thus, our findings link the aberrant chromatin state mediated by SMARCA5 to AKR1B1-mediated endogenous fructose metabolism reprogramming and shed light on the essential role of AKR1B1 in leukemogenesis, which may provide therapeutic strategies for leukemia.
Collapse
Affiliation(s)
- Peng-Cheng Yu
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China; Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Dan Hou
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Binhe Chang
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Na Liu
- Department of Hematology, Institute of Hematology, Shanghai Changhai Hospital, Naval Medical University, Shanghai 200433, China
| | - Chun-Hui Xu
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Xinchi Chen
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Cheng-Long Hu
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Ting Liu
- Key Laboratory of Pediatric Hematology & Oncology of the Ministry of Health of China, Department of Hematology & Oncology, Shanghai Children's Medical Center, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Xiaoning Wang
- Department of Hematology, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710061, China
| | - Qunling Zhang
- Department of Medical Oncology, Fudan University, Shanghai 200032, China; Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Ping Liu
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yilun Jiang
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Ming-Yue Fei
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Li-Juan Zong
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Jia-Ying Zhang
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Hui Liu
- Key Laboratory of Pediatric Hematology & Oncology of the Ministry of Health of China, Department of Hematology & Oncology, Shanghai Children's Medical Center, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Bing-Yi Chen
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Shu-Bei Chen
- School of Life Sciences & Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yong Wang
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Zi-Juan Li
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Xiya Li
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Chu-Han Deng
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yi-Yi Ren
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Muying Zhao
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Shiyu Jiang
- Department of Medical Oncology, Fudan University, Shanghai 200032, China; Department of Oncology, Shanghai Medical College, Fudan University, Shanghai 200032, China
| | - Roujia Wang
- Department of Hematology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| | - Jiacheng Jin
- Department of Hematology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| | - Shaoxin Yang
- Department of Hematology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200011, China
| | - Kai Xue
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Jun Shi
- Department of Hematology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200011, China
| | - Chun-Kang Chang
- Department of Hematology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai 200233, China
| | - Shuhong Shen
- Key Laboratory of Pediatric Hematology & Oncology of the Ministry of Health of China, Department of Hematology & Oncology, Shanghai Children's Medical Center, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Zhikai Wang
- MOE Key Laboratory for Membraneless Organelles and Cellular Dynamics, University of Science and Technology of China, Hefei 230027, China
| | - Peng-Cheng He
- Department of Hematology, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710061, China
| | - Zhu Chen
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Sai-Juan Chen
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Xiao-Jian Sun
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China; School of Life Sciences & Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Lan Wang
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China.
| |
Collapse
|
46
|
Noguchi Y, Matsui R, Suh J, Dou Y, Suzuki J. Genome-Wide Screening Approaches for Biochemical Reactions Independent of Cell Growth. Annu Rev Genomics Hum Genet 2024; 25:51-76. [PMID: 38692586 DOI: 10.1146/annurev-genom-121222-115958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2024]
Abstract
Genome-wide screening is a potent approach for comprehensively understanding the molecular mechanisms of biological phenomena. However, despite its widespread use in the past decades across various biological targets, its application to biochemical reactions with temporal and reversible biological outputs remains a formidable challenge. To uncover the molecular machinery underlying various biochemical reactions, we have recently developed the revival screening method, which combines flow cytometry-based cell sorting with library reconstruction from collected cells. Our refinements to the traditional genome-wide screening technique have proven successful in revealing the molecular machinery of biochemical reactions of interest. In this article, we elucidate the technical basis of revival screening, focusing on its application to CRISPR-Cas9 single guide RNA (sgRNA) library screening. Finally, we also discuss the future of genome-wide screening while describing recent achievements from in vitro and in vivo screening.
Collapse
Affiliation(s)
- Yuki Noguchi
- Graduate School of Biostudies, Kyoto University, Kyoto, Japan
- Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Kyoto, Japan;
| | - Risa Matsui
- Graduate School of Biostudies, Kyoto University, Kyoto, Japan
- Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Kyoto, Japan;
| | - Jaeyeon Suh
- Graduate School of Biostudies, Kyoto University, Kyoto, Japan
- Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Kyoto, Japan;
| | - Yu Dou
- Graduate School of Biostudies, Kyoto University, Kyoto, Japan
- Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Kyoto, Japan;
| | - Jun Suzuki
- Center for Integrated Biosystems, Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Agency, Kawaguchi, Japan
- Graduate School of Biostudies, Kyoto University, Kyoto, Japan
- Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Kyoto, Japan;
| |
Collapse
|
47
|
Lin Y, Pan Z, Zeng Y, Yang Y, Dai Z. Detecting novel cell type in single-cell chromatin accessibility data via open-set domain adaptation. Brief Bioinform 2024; 25:bbae370. [PMID: 39073828 DOI: 10.1093/bib/bbae370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 06/27/2024] [Accepted: 07/15/2024] [Indexed: 07/30/2024] Open
Abstract
Recent advances in single-cell technologies enable the rapid growth of multi-omics data. Cell type annotation is one common task in analyzing single-cell data. It is a challenge that some cell types in the testing set are not present in the training set (i.e. unknown cell types). Most scATAC-seq cell type annotation methods generally assign each cell in the testing set to one known type in the training set but neglect unknown cell types. Here, we present OVAAnno, an automatic cell types annotation method which utilizes open-set domain adaptation to detect unknown cell types in scATAC-seq data. Comprehensive experiments show that OVAAnno successfully identifies known and unknown cell types. Further experiments demonstrate that OVAAnno also performs well on scRNA-seq data. Our codes are available online at https://github.com/lisaber/OVAAnno/tree/master.
Collapse
Affiliation(s)
- Yuefan Lin
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510006, China
| | - Zixiang Pan
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510006, China
| | - Yuansong Zeng
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510006, China
| | - Yuedong Yang
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510006, China
| | - Zhiming Dai
- School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510006, China
| |
Collapse
|
48
|
Samuel BER, Diaz FE, Maina TW, Corbett RJ, Tuggle CK, McGill JL. Evidence of innate training in bovine γδ T cells following subcutaneous BCG administration. Front Immunol 2024; 15:1423843. [PMID: 39100669 PMCID: PMC11295143 DOI: 10.3389/fimmu.2024.1423843] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 06/20/2024] [Indexed: 08/06/2024] Open
Abstract
The Bacillus Calmette Guerin (BCG) vaccine has been shown to induce non-specific protection against diseases other than tuberculosis in vaccinated individuals, attributed to the induction of trained immunity. We have previously demonstrated that BCG administration induces innate immune training in mixed peripheral blood mononuclear cells and monocytes in calves. Gamma Delta (γδ) T cells are non-conventional T cells that exhibit innate and adaptive immune system features. They are in higher proportion in the peripheral blood of cattle than humans or rodents and play an essential role in bovine immune response to pathogens. In the current study, we determined if BCG administration induced innate immune training in bovine γδ T cells. A group of 16 pre-weaned Holstein calves (2-4 d age) were enrolled in the study and randomly assigned to vaccine and control groups (n=8/group). The vaccine group received two doses of 106 colony forming units (CFU) BCG Danish strain subcutaneously, separated by 2 weeks. The control group remained unvaccinated. Gamma delta T cells were purified from peripheral blood using magnetic cell sorting three weeks after receiving the 1st BCG dose. We observed functional changes in the γδ T cells from BCG-treated calves shown by increased IL-6 and TNF-α cytokine production in response to in vitro stimulation with Escherichia coli LPS and PAM3CSK4. ATAC-Seq analysis of 78,278 regions of open chromatin (peaks) revealed that γδ T cells from BCG-treated calves had an altered epigenetic status compared to cells from the control calves. Differentially accessible peaks (DAP) found near the promoters of innate immunity-related genes like Siglec14, Irf4, Ifna2, Lrrfip1, and Tnfrsf10d were 1 to 4-fold more accessible in cells from BCG-treated calves. MOTIF enrichment analysis of the sequences within DAPs, which explores transcription factor binding motifs (TFBM) upstream of regulatory elements, revealed TFBM for Eomes and IRF-5 were among the most enriched transcription factors. GO enrichment analysis of genes proximal to the DAPs showed enrichment of pathways such as regulation of IL-2 production, T-cell receptor signaling pathway, and other immune regulatory pathways. In conclusion, our study shows that subcutaneous BCG administration in pre-weaned calves can induce innate immune memory in the form of trained immunity in γδ T cells. This memory is associated with increased chromatin accessibility of innate immune response-related genes, thereby inducing a functional trained immune response evidenced by increased IL-6 and TNF-α cytokine production.
Collapse
Affiliation(s)
- Beulah Esther Rani Samuel
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA, United States
| | - Fabian E. Diaz
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA, United States
| | - Teresia W. Maina
- Immunology, Cargill Animal Nutrition & Health, Elk River, MN, United States
| | - Ryan J. Corbett
- Center for Data Driven Discovery, Children’s Hospital of Philadelphia, Philadelphia, PA, United States
| | | | - Jodi L. McGill
- Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA, United States
| |
Collapse
|
49
|
Wang X, Lian Q, Dong H, Xu S, Su Y, Wu X. Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data. GENOMICS, PROTEOMICS & BIOINFORMATICS 2024; 22:qzae014. [PMID: 39049508 PMCID: PMC11423854 DOI: 10.1093/gpbjnl/qzae014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Revised: 06/20/2023] [Accepted: 06/25/2023] [Indexed: 07/27/2024]
Abstract
Gene set scoring (GSS) has been routinely conducted for gene expression analysis of bulk or single-cell RNA sequencing (RNA-seq) data, which helps to decipher single-cell heterogeneity and cell type-specific variability by incorporating prior knowledge from functional gene sets. Single-cell assay for transposase accessible chromatin using sequencing (scATAC-seq) is a powerful technique for interrogating single-cell chromatin-based gene regulation, and genes or gene sets with dynamic regulatory potentials can be regarded as cell type-specific markers as if in single-cell RNA-seq (scRNA-seq). However, there are few GSS tools specifically designed for scATAC-seq, and the applicability and performance of RNA-seq GSS tools on scATAC-seq data remain to be investigated. Here, we systematically benchmarked ten GSS tools, including four bulk RNA-seq tools, five scRNA-seq tools, and one scATAC-seq method. First, using matched scATAC-seq and scRNA-seq datasets, we found that the performance of GSS tools on scATAC-seq data was comparable to that on scRNA-seq, suggesting their applicability to scATAC-seq. Then, the performance of different GSS tools was extensively evaluated using up to ten scATAC-seq datasets. Moreover, we evaluated the impact of gene activity conversion, dropout imputation, and gene set collections on the results of GSS. Results show that dropout imputation can significantly promote the performance of almost all GSS tools, while the impact of gene activity conversion methods or gene set collections on GSS performance is more dependent on GSS tools or datasets. Finally, we provided practical guidelines for choosing appropriate preprocessing methods and GSS tools in different application scenarios.
Collapse
Affiliation(s)
- Xi Wang
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China
- Department of Automation, Xiamen University, Xiamen 361005, China
| | - Qiwei Lian
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China
- Department of Automation, Xiamen University, Xiamen 361005, China
| | - Haoyu Dong
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China
| | - Shuo Xu
- Department of Automation, Xiamen University, Xiamen 361005, China
| | - Yaru Su
- College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116, China
| | - Xiaohui Wu
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China
| |
Collapse
|
50
|
Shwab EK, Gingerich DC, Man Z, Gamache J, Garrett ME, Crawford GE, Ashley-Koch AE, Serrano GE, Beach TG, Lutz MW, Chiba-Falek O. Single-nucleus multi-omics of Parkinson's disease reveals a glutamatergic neuronal subtype susceptible to gene dysregulation via alteration of transcriptional networks. Acta Neuropathol Commun 2024; 12:111. [PMID: 38956662 PMCID: PMC11218415 DOI: 10.1186/s40478-024-01803-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 05/27/2024] [Indexed: 07/04/2024] Open
Abstract
The genetic architecture of Parkinson's disease (PD) is complex and multiple brain cell subtypes are involved in the neuropathological progression of the disease. Here we aimed to advance our understanding of PD genetic complexity at a cell subtype precision level. Using parallel single-nucleus (sn)RNA-seq and snATAC-seq analyses we simultaneously profiled the transcriptomic and chromatin accessibility landscapes in temporal cortex tissues from 12 PD compared to 12 control subjects at a granular single cell resolution. An integrative bioinformatic pipeline was developed and applied for the analyses of these snMulti-omics datasets. The results identified a subpopulation of cortical glutamatergic excitatory neurons with remarkably altered gene expression in PD, including differentially-expressed genes within PD risk loci identified in genome-wide association studies (GWAS). This was the only neuronal subtype showing significant and robust overexpression of SNCA. Further characterization of this neuronal-subpopulation showed upregulation of specific pathways related to axon guidance, neurite outgrowth and post-synaptic structure, and downregulated pathways involved in presynaptic organization and calcium response. Additionally, we characterized the roles of three molecular mechanisms in governing PD-associated cell subtype-specific dysregulation of gene expression: (1) changes in cis-regulatory element accessibility to transcriptional machinery; (2) changes in the abundance of master transcriptional regulators, including YY1, SP3, and KLF16; (3) candidate regulatory variants in high linkage disequilibrium with PD-GWAS genomic variants impacting transcription factor binding affinities. To our knowledge, this study is the first and the most comprehensive interrogation of the multi-omics landscape of PD at a cell-subtype resolution. Our findings provide new insights into a precise glutamatergic neuronal cell subtype, causal genes, and non-coding regulatory variants underlying the neuropathological progression of PD, paving the way for the development of cell- and gene-targeted therapeutics to halt disease progression as well as genetic biomarkers for early preclinical diagnosis.
Collapse
Affiliation(s)
- E Keats Shwab
- Division of Translational Brain Sciences, Department of Neurology, Duke University School of Medicine, Duke University Medical Center, Durham, NC, 27710, USA
- Center for Genomic and Computational Biology, Duke University Medical Center, Durham, NC, 27708, USA
| | - Daniel C Gingerich
- Division of Translational Brain Sciences, Department of Neurology, Duke University School of Medicine, Duke University Medical Center, Durham, NC, 27710, USA
- Center for Genomic and Computational Biology, Duke University Medical Center, Durham, NC, 27708, USA
| | - Zhaohui Man
- Division of Translational Brain Sciences, Department of Neurology, Duke University School of Medicine, Duke University Medical Center, Durham, NC, 27710, USA
- Center for Genomic and Computational Biology, Duke University Medical Center, Durham, NC, 27708, USA
| | - Julia Gamache
- Division of Translational Brain Sciences, Department of Neurology, Duke University School of Medicine, Duke University Medical Center, Durham, NC, 27710, USA
- Center for Genomic and Computational Biology, Duke University Medical Center, Durham, NC, 27708, USA
| | - Melanie E Garrett
- Duke Molecular Physiology Institute, Duke University Medical Center, Durham, NC, 27701, USA
| | - Gregory E Crawford
- Center for Genomic and Computational Biology, Duke University Medical Center, Durham, NC, 27708, USA
- Division of Medical Genetics, Department of Pediatrics, Duke University Medical Center, Durham, NC, 27708, USA
- Center for Advanced Genomic Technologies, Duke University Medical Center, Durham, NC, 27708, USA
| | - Allison E Ashley-Koch
- Duke Molecular Physiology Institute, Duke University Medical Center, Durham, NC, 27701, USA
- Department of Medicine, Duke University Medical Center, Durham, NC, 27708, USA
| | - Geidy E Serrano
- Banner Sun Health Research Institute, Sun City, AZ, 85351, USA
| | - Thomas G Beach
- Banner Sun Health Research Institute, Sun City, AZ, 85351, USA
| | - Michael W Lutz
- Division of Translational Brain Sciences, Department of Neurology, Duke University School of Medicine, Duke University Medical Center, Durham, NC, 27710, USA
| | - Ornit Chiba-Falek
- Division of Translational Brain Sciences, Department of Neurology, Duke University School of Medicine, Duke University Medical Center, Durham, NC, 27710, USA.
- Center for Genomic and Computational Biology, Duke University Medical Center, Durham, NC, 27708, USA.
| |
Collapse
|