51
|
Felce C, Gorin G, Pachter L. Biophysical model for joint analysis of chromatin and RNA sequencing data. Phys Rev E 2024; 110:064405. [PMID: 39916216 DOI: 10.1103/physreve.110.064405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 11/19/2024] [Indexed: 05/07/2025]
Abstract
The assay for transposase-accessible chromatin using sequencing (ATAC-seq) can be used to identify open chromatin regions, providing complementary information to RNA-seq which measures gene expression by sequencing. Single-cell multiome methods offer the possibility of measuring both modalities simultaneously in cells, raising the question of how to analyze them jointly, and also the extent to which the information they provide is better than unregistered data, where single-cell ATAC-seq and single-cell RNA-seq are performed on the same sample, but on different cells. We propose and motivate a biophysical model for chromatin dynamics and subsequent transcription that can be used to parametrize multiome data, and use it to assess the benefits of multiome data over unregistered single-cell RNA-seq and single-cell ATAC-seq. We also show that our model provides a biophysically grounded approach to the integration of chromatin accessibility data with other modalitie, and apply the model to single-cell ATAC-seq data.
Collapse
Affiliation(s)
- Catherine Felce
- California Institute of Technology, Division of Physics, Math and Astronomy, Pasadena, California 91125, USA
| | | | - Lior Pachter
- California Institute of Technology, Division of Biology and Biological Engineering, Pasadena, California 91125, USA
| |
Collapse
|
52
|
Tian M, Tang X, Ouyang Z, Li Y, Bai X, Chen B, Yue S, Hu P, Bo X, Ren C, Chen H, Lu M. Long-range transcription factor binding sites clustered regions may mediate transcriptional regulation through phase-separation interactions in early human embryo. Comput Struct Biotechnol J 2024; 23:3514-3526. [PMID: 39435341 PMCID: PMC11492133 DOI: 10.1016/j.csbj.2024.09.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 09/19/2024] [Accepted: 09/25/2024] [Indexed: 10/23/2024] Open
Abstract
In mammals, during the post-fertilization pre-implantation phase, the expression of cell type-specific genes is crucial for normal embryonic development, which is regulated by cis-regulatory elements (CREs). TFs control gene expression by interacting with CREs. Research shows that transcription factor binding sites (TFBSs) reflect the general characteristics of the regulatory genome. Here, we identified TFBSs from chromatin accessibility data in five stages of early human embryonic development, and quantified transcription factor binding sites-clustered regions (TFCRs) and their complexity (TC). Assigning TC values to TFCRs has made it possible to assess the functionality of these regulatory elements in a more quantitative way. Our findings reveal a robust correlation between TFCR complexity and gene expression starting from the 8Cell stage, which is when the zygotic genome is activated in humans. Furthermore, we have defined long-range TFCRs (LR-TFCRs) and conjecture that LR-TFCRs may regulate gene expression through phase-separation mechanisms during the early stages of human embryonic development.
Collapse
Affiliation(s)
- Mengge Tian
- The First Affiliated Hospital of Harbin Medical University, Harbin 150001, China
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Xiaohan Tang
- The First Affiliated Hospital of Harbin Medical University, Harbin 150001, China
| | - Zhangyi Ouyang
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Yaru Li
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Xuemei Bai
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Bijia Chen
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Shutong Yue
- Academy of Military Medical Sciences, Beijing 100850, China
- Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Pengzhen Hu
- Academy of Military Medical Sciences, Beijing 100850, China
- School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, China
| | - Xiaochen Bo
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Chao Ren
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Hebing Chen
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Meisong Lu
- The First Affiliated Hospital of Harbin Medical University, Harbin 150001, China
| |
Collapse
|
53
|
Zhao Y, Yu ZM, Cui T, Li LD, Li YY, Qian FC, Zhou LW, Li Y, Fang QL, Huang XM, Zhang QY, Cai FH, Dong FJ, Shang DS, Li CQ, Wang QY. scBlood: A comprehensive single-cell accessible chromatin database of blood cells. Comput Struct Biotechnol J 2024; 23:2746-2753. [PMID: 39050785 PMCID: PMC11266868 DOI: 10.1016/j.csbj.2024.06.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 06/17/2024] [Accepted: 06/18/2024] [Indexed: 07/27/2024] Open
Abstract
The advent of single cell transposase-accessible chromatin sequencing (scATAC-seq) technology enables us to explore the genomic characteristics and chromatin accessibility of blood cells at the single-cell level. To fully make sense of the roles and regulatory complexities of blood cells, it is critical to collect and analyze these rapidly accumulating scATAC-seq datasets at a system level. Here, we present scBlood (https://bio.liclab.net/scBlood/), a comprehensive single-cell accessible chromatin database of blood cells. The current version of scBlood catalogs 770,907 blood cells and 452,247 non-blood cells from ∼400 high-quality scATAC-seq samples covering 30 tissues and 21 disease types. All data hosted on scBlood have undergone preprocessing from raw fastq files and multiple standards of quality control. Furthermore, we conducted comprehensive downstream analyses, including multi-sample integration analysis, cell clustering and annotation, differential chromatin accessibility analysis, functional enrichment analysis, co-accessibility analysis, gene activity score calculation, and transcription factor (TF) enrichment analysis. In summary, scBlood provides a user-friendly interface for searching, browsing, analyzing, visualizing, and downloading scATAC-seq data of interest. This platform facilitates insights into the functions and regulatory mechanisms of blood cells, as well as their involvement in blood-related diseases.
Collapse
Affiliation(s)
- Yu Zhao
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Zheng-Min Yu
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Ting Cui
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Li-Dong Li
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Yan-Yu Li
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing 163319, China
| | - Feng-Cui Qian
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Li-Wei Zhou
- State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Ye Li
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Qiao-Li Fang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Xue-Mei Huang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Qin-Yi Zhang
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Fu-Hong Cai
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Fu-Juan Dong
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - De-Si Shang
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Chun-Quan Li
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Maternal and Child Health Care Hospital, National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Qiu-Yu Wang
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Maternal and Child Health Care Hospital, National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| |
Collapse
|
54
|
Liu T, Long W, Cao Z, Wang Y, He CH, Zhang L, Strittmatter SM, Zhao H. CosGeneGate selects multi-functional and credible biomarkers for single-cell analysis. Brief Bioinform 2024; 26:bbae626. [PMID: 39592241 PMCID: PMC11596696 DOI: 10.1093/bib/bbae626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Revised: 10/07/2024] [Accepted: 11/14/2024] [Indexed: 11/28/2024] Open
Abstract
MOTIVATION Selecting representative genes or marker genes to distinguish cell types is an important task in single-cell sequencing analysis. Although many methods have been proposed to select marker genes, the genes selected may have redundancy and/or do not show cell-type-specific expression patterns to distinguish cell types. RESULTS Here, we present a novel model, named CosGeneGate, to select marker genes for more effective marker selections. CosGeneGate is inspired by combining the advantages of selecting marker genes based on both cell-type classification accuracy and marker gene specific expression patterns. We demonstrate the better performance of the marker genes selected by CosGeneGate for various downstream analyses than the existing methods with both public datasets and newly sequenced datasets. The non-redundant marker genes identified by CosGeneGate for major cell types and tissues in human can be found at the website as follows: https://github.com/VivLon/CosGeneGate/blob/main/marker gene list.xlsx.
Collapse
Affiliation(s)
- Tianyu Liu
- Department of Biostatistics, Yale University, New Haven, CT, 06520, United States
- Interdepartmental Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06520, United States
| | - Wenxin Long
- Department of Biostatistics, Yale University, New Haven, CT, 06520, United States
- Department of Statistics, The Pennsylvania State University, University Park, PA, 16820, United States
| | - Zhiyuan Cao
- Department of Biostatistics, Yale University, New Haven, CT, 06520, United States
- Interdepartmental Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06520, United States
- Program of Health Informatics, Yale University, New Haven, CT, 06520, United States
| | - Yuge Wang
- Department of Biostatistics, Yale University, New Haven, CT, 06520, United States
| | - Chuan Hua He
- Department of Neurology, Yale University School of Medicine, New Haven, CT, 06520, United States
| | - Le Zhang
- Department of Neurology, Yale University School of Medicine, New Haven, CT, 06520, United States
- Department of Neuroscience, Yale University School of Medicine, New Haven, CT, 06520, United States
| | - Stephen M Strittmatter
- Department of Neurology, Yale University School of Medicine, New Haven, CT, 06520, United States
- Department of Neuroscience, Yale University School of Medicine, New Haven, CT, 06520, United States
- Cellular Neuroscience, Neurodegeneration and Repair Program, Yale University School of Medicine, New Haven, CT, 06520, United States
| | - Hongyu Zhao
- Department of Biostatistics, Yale University, New Haven, CT, 06520, United States
- Interdepartmental Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06520, United States
| |
Collapse
|
55
|
Xu X, Lin Y, Yin L, Serpa PDS, Conacher B, Pacholec C, Carvallo F, Hrubec T, Farris S, Zimmerman K, Wang X, Xie H. Spatial Transcriptomics and Single-Nucleus Multi-Omics Analysis Revealing the Impact of High Maternal Folic Acid Supplementation on Offspring Brain Development. Nutrients 2024; 16:3820. [PMID: 39599606 PMCID: PMC11597041 DOI: 10.3390/nu16223820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2024] [Revised: 10/27/2024] [Accepted: 11/04/2024] [Indexed: 11/29/2024] Open
Abstract
Background: Folate, an essential vitamin B9, is crucial for diverse biological processes, including neurogenesis. Folic acid (FA) supplementation during pregnancy is a standard practice for preventing neural tube defects (NTDs). However, concerns are growing over the potential risks of excessive maternal FA intake. Objectives/Methods: Here, we employed a mouse model and spatial transcriptomic and single-nucleus multi-omics approaches to investigate the impact of high maternal FA supplementation during the periconceptional period on offspring brain development. Results: Maternal high FA supplementation affected gene pathways linked to neurogenesis and neuronal axon myelination across multiple brain regions, as well as gene expression alterations related to learning and memory in thalamic and ventricular regions. Single-nucleus multi-omics analysis revealed that maturing excitatory neurons in the dentate gyrus (DG) are particularly vulnerable to high maternal FA intake, leading to aberrant gene expressions and chromatin accessibility in pathways governing ribosomal biogenesis critical for synaptic formation. Conclusions: Our findings provide new insights into specific brain regions, cell types, gene expressions and pathways that can be affected by maternal high FA supplementation.
Collapse
Affiliation(s)
- Xiguang Xu
- Epigenomics and Computational Biology Lab, Fralin Life Sciences Institute, Virginia Tech, Blacksburg, VA 24061, USA
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061, USA
| | - Yu Lin
- Epigenomics and Computational Biology Lab, Fralin Life Sciences Institute, Virginia Tech, Blacksburg, VA 24061, USA
- Genetics, Bioinformatics and Computational Biology Program, Virginia Tech, Blacksburg, VA 24061, USA
| | - Liduo Yin
- Epigenomics and Computational Biology Lab, Fralin Life Sciences Institute, Virginia Tech, Blacksburg, VA 24061, USA
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061, USA
| | - Priscila da Silva Serpa
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061, USA
| | - Benjamin Conacher
- Epigenomics and Computational Biology Lab, Fralin Life Sciences Institute, Virginia Tech, Blacksburg, VA 24061, USA
- Genetics, Bioinformatics and Computational Biology Program, Virginia Tech, Blacksburg, VA 24061, USA
| | - Christina Pacholec
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061, USA
| | - Francisco Carvallo
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061, USA
| | - Terry Hrubec
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061, USA
- Department of Biomedical Science, E. Via College of Osteopathic Medicine-Virginia, Blacksburg, VA 24060, USA
| | - Shannon Farris
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061, USA
- Center for Neurobiology Research, Fralin Biomedical Research Institute at Virginia Tech Carilion, Roanoke, VA 24001, USA
| | - Kurt Zimmerman
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061, USA
| | - Xiaobin Wang
- Center on Early Life Origins of Disease, Department of Population, Family and Reproductive Health, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD 21205, USA
- Department of Pediatrics, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
| | - Hehuang Xie
- Epigenomics and Computational Biology Lab, Fralin Life Sciences Institute, Virginia Tech, Blacksburg, VA 24061, USA
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, VA 24061, USA
- Genetics, Bioinformatics and Computational Biology Program, Virginia Tech, Blacksburg, VA 24061, USA
- Translational Biology, Medicine, and Health Program, Virginia Tech, Blacksburg, VA 24061, USA
- School of Neuroscience, Virginia Tech, Blacksburg, VA 24061, USA
| |
Collapse
|
56
|
Lewis MW, King CM, Wisniewska K, Regner MJ, Coffey A, Kelly MR, Mendez-Giraldez R, Davis ES, Phanstiel DH, Franco HL. CRISPR Screening of Transcribed Super-Enhancers Identifies Drivers of Triple-Negative Breast Cancer Progression. Cancer Res 2024; 84:3684-3700. [PMID: 39186674 PMCID: PMC11534545 DOI: 10.1158/0008-5472.can-23-3995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 06/03/2024] [Accepted: 08/14/2024] [Indexed: 08/28/2024]
Abstract
Triple-negative breast cancer (TNBC) is the most therapeutically recalcitrant form of breast cancer, which is due in part to the paucity of targeted therapies. A systematic analysis of regulatory elements that extend beyond protein-coding genes could uncover avenues for therapeutic intervention. To this end, we analyzed the regulatory mechanisms of TNBC-specific transcriptional enhancers together with their noncoding enhancer RNA (eRNA) transcripts. The functions of the top 30 eRNA-producing super-enhancers were systematically probed using high-throughput CRISPR-interference assays coupled to RNA sequencing that enabled unbiased detection of target genes genome-wide. Generation of high-resolution Hi-C chromatin interaction maps enabled annotation of the direct target genes for each super-enhancer, which highlighted their proclivity for genes that portend worse clinical outcomes in patients with TNBC. Illustrating the utility of this dataset, deletion of an identified super-enhancer controlling the nearby PODXL gene or specific degradation of its eRNAs led to profound inhibitory effects on target gene expression, cell proliferation, and migration. Furthermore, loss of this super-enhancer suppressed tumor growth and metastasis in TNBC mouse xenograft models. Single-cell RNA sequencing and assay for transposase-accessible chromatin with high-throughput sequencing analyses demonstrated the enhanced activity of this super-enhancer within the malignant cells of TNBC tumor specimens compared with nonmalignant cell types. Collectively, this work examines several fundamental questions about how regulatory information encoded into eRNA-producing super-enhancers drives gene expression networks that underlie the biology of TNBC. Significance: Integrative analysis of eRNA-producing super-enhancers defines molecular mechanisms controlling global patterns of gene expression that regulate clinical outcomes in breast cancer, highlighting the potential of enhancers as biomarkers and therapeutic targets.
Collapse
Affiliation(s)
- Michael W. Lewis
- The Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Caitlin M. King
- The Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Kamila Wisniewska
- The Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Matthew J. Regner
- The Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Bioinformatics and Computational Biology Graduate Program, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Alisha Coffey
- The Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Michael R. Kelly
- The Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Bioinformatics and Computational Biology Graduate Program, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Raul Mendez-Giraldez
- The Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Eric S. Davis
- Bioinformatics and Computational Biology Graduate Program, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Douglas H. Phanstiel
- Bioinformatics and Computational Biology Graduate Program, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Department of Cell Biology & Physiology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Thurston Arthritis Research Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Hector L. Franco
- The Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Bioinformatics and Computational Biology Graduate Program, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- The Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Division of Clinical and Translational Cancer Research, University of Puerto Rico Comprehensive Cancer Center, San Juan, PR 00935
| |
Collapse
|
57
|
Wu B, Bennett HM, Ye X, Sridhar A, Eidenschenk C, Everett C, Nazarova EV, Chen HH, Kim IK, Deangelis M, Owen LA, Chen C, Lau J, Shi M, Lund JM, Xavier-Magalhães A, Patel N, Liang Y, Modrusan Z, Darmanis S. Overloading And unpacKing (OAK) - droplet-based combinatorial indexing for ultra-high throughput single-cell multiomic profiling. Nat Commun 2024; 15:9146. [PMID: 39443484 PMCID: PMC11499997 DOI: 10.1038/s41467-024-53227-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Accepted: 10/02/2024] [Indexed: 10/25/2024] Open
Abstract
Multiomic profiling of single cells by sequencing is a powerful technique for investigating cellular diversity. Existing droplet-based microfluidic methods produce many cell-free droplets, underutilizing bead barcodes and reagents. Combinatorial indexing on microplates is more efficient for barcoding but labor-intensive. Here we present Overloading And unpacKing (OAK), which uses a droplet-based barcoding system for initial compartmentalization followed by a second aliquoting round to achieve combinatorial indexing. We demonstrate OAK's versatility with single-cell RNA sequencing as well as paired single-nucleus RNA sequencing and accessible chromatin profiling. We further showcase OAK's performance on complex samples, including differentiated bronchial epithelial cells and primary retinal tissue. Finally, we examine transcriptomic responses of over 400,000 melanoma cells to a RAF inhibitor, belvarafenib, discovering a rare resistant cell population (0.12%). OAK's ultra-high throughput, broad compatibility, high sensitivity, and simplified procedures make it a powerful tool for large-scale molecular analysis, even for rare cells.
Collapse
Affiliation(s)
- Bing Wu
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA
| | - Hayley M Bennett
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA
| | - Xin Ye
- Department of Discovery Oncology, Genentech, South San Francisco, CA, USA
| | - Akshayalakshmi Sridhar
- Department of Human Pathobiology & OMNI Reverse Translation, Genentech, South San Francisco, CA, USA
| | - Celine Eidenschenk
- Department of Functional Genomics, Genentech, South San Francisco, CA, USA
| | - Christine Everett
- Department of Functional Genomics, Genentech, South San Francisco, CA, USA
| | | | - Hsu-Hsin Chen
- Department of Human Pathobiology & OMNI Reverse Translation, Genentech, South San Francisco, CA, USA
| | - Ivana K Kim
- Retina Service, Massachusetts Eye & Ear, Department of Ophthalmology, Harvard Medical School, Boston, MA, USA
| | - Margaret Deangelis
- Department of Ophthalmology, Ross Eye Institute; Department of Biochemistry; Neuroscience Graduate Program; Genetics, Genomics and Bioinformatics Graduate Program, Jacobs School of Medicine and Biomedical Sciences, State University of New York, University at Buffalo, Buffalo, NY, USA
| | - Leah A Owen
- Department of Ophthalmology and Visual Sciences, University of Utah School of Medicine, The University of Utah, Salt Lake City, UT, USA
| | - Cynthia Chen
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA
| | - Julia Lau
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA
| | - Minyi Shi
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA
| | - Jessica M Lund
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA
| | - Ana Xavier-Magalhães
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA
| | - Neha Patel
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA
| | - Yuxin Liang
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA
| | - Zora Modrusan
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA.
| | - Spyros Darmanis
- Department of Proteomic and Genomic Technologies, Genentech, South San Francisco, CA, USA.
| |
Collapse
|
58
|
Teo AYY, Squair JW, Courtine G, Skinnider MA. Best practices for differential accessibility analysis in single-cell epigenomics. Nat Commun 2024; 15:8805. [PMID: 39394227 PMCID: PMC11470024 DOI: 10.1038/s41467-024-53089-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 09/24/2024] [Indexed: 10/13/2024] Open
Abstract
Differential accessibility (DA) analysis of single-cell epigenomics data enables the discovery of regulatory programs that establish cell type identity and steer responses to physiological and pathophysiological perturbations. While many statistical methods to identify DA regions have been developed, the principles that determine the performance of these methods remain unclear. As a result, there is no consensus on the most appropriate statistical methods for DA analysis of single-cell epigenomics data. Here, we present a systematic evaluation of statistical methods that have been applied to identify DA regions in single-cell ATAC-seq (scATAC-seq) data. We leverage a compendium of scATAC-seq experiments with matching bulk ATAC-seq or scRNA-seq in order to assess the accuracy, bias, robustness, and scalability of each statistical method. The structure of our experiments also provides the opportunity to define best practices for the analysis of scATAC-seq data beyond DA itself. We leverage this understanding to develop an R package implementing these best practices.
Collapse
Affiliation(s)
- Alan Yue Yang Teo
- Defitech Center for Interventional Neurotherapies (.NeuroRestore), EPFL/CHUV/UNIL, Lausanne, Switzerland
- NeuroX Institute and Brain Mind Institute, School of Life Sciences, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland
| | - Jordan W Squair
- Defitech Center for Interventional Neurotherapies (.NeuroRestore), EPFL/CHUV/UNIL, Lausanne, Switzerland.
- NeuroX Institute and Brain Mind Institute, School of Life Sciences, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland.
- Department of Clinical Neuroscience, Lausanne University Hospital (CHUV) and University of Lausanne (UNIL), Lausanne, Switzerland.
| | - Gregoire Courtine
- Defitech Center for Interventional Neurotherapies (.NeuroRestore), EPFL/CHUV/UNIL, Lausanne, Switzerland.
- NeuroX Institute and Brain Mind Institute, School of Life Sciences, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland.
- Department of Clinical Neuroscience, Lausanne University Hospital (CHUV) and University of Lausanne (UNIL), Lausanne, Switzerland.
| | - Michael A Skinnider
- Defitech Center for Interventional Neurotherapies (.NeuroRestore), EPFL/CHUV/UNIL, Lausanne, Switzerland.
- NeuroX Institute and Brain Mind Institute, School of Life Sciences, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland.
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.
- Ludwig Institute for Cancer Research, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
59
|
Gabriel AAG, Racle J, Falquet M, Jandus C, Gfeller D. Robust estimation of cancer and immune cell-type proportions from bulk tumor ATAC-Seq data. eLife 2024; 13:RP94833. [PMID: 39383060 PMCID: PMC11464006 DOI: 10.7554/elife.94833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/11/2024] Open
Abstract
Assay for Transposase-Accessible Chromatin sequencing (ATAC-Seq) is a widely used technique to explore gene regulatory mechanisms. For most ATAC-Seq data from healthy and diseased tissues such as tumors, chromatin accessibility measurement represents a mixed signal from multiple cell types. In this work, we derive reliable chromatin accessibility marker peaks and reference profiles for most non-malignant cell types frequently observed in the microenvironment of human tumors. We then integrate these data into the EPIC deconvolution framework (Racle et al., 2017) to quantify cell-type heterogeneity in bulk ATAC-Seq data. Our EPIC-ATAC tool accurately predicts non-malignant and malignant cell fractions in tumor samples. When applied to a human breast cancer cohort, EPIC-ATAC accurately infers the immune contexture of the main breast cancer subtypes.
Collapse
Affiliation(s)
- Aurélie Anne-Gaëlle Gabriel
- Department of Oncology, Ludwig Institute for Cancer Research, University of LausanneLausanneSwitzerland
- Agora Cancer Research CenterLausanneSwitzerland
- Swiss Cancer Center Leman (SCCL)GenevaSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| | - Julien Racle
- Department of Oncology, Ludwig Institute for Cancer Research, University of LausanneLausanneSwitzerland
- Agora Cancer Research CenterLausanneSwitzerland
- Swiss Cancer Center Leman (SCCL)GenevaSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| | - Maryline Falquet
- Swiss Cancer Center Leman (SCCL)GenevaSwitzerland
- Ludwig Institute for Cancer Research, Lausanne BranchLausanneSwitzerland
- Department of Pathology and Immunology, Faculty of Medicine, University of GenevaGenevaSwitzerland
- Geneva Center for Inflammation ResearchGenevaSwitzerland
| | - Camilla Jandus
- Swiss Cancer Center Leman (SCCL)GenevaSwitzerland
- Ludwig Institute for Cancer Research, Lausanne BranchLausanneSwitzerland
- Department of Pathology and Immunology, Faculty of Medicine, University of GenevaGenevaSwitzerland
- Geneva Center for Inflammation ResearchGenevaSwitzerland
| | - David Gfeller
- Department of Oncology, Ludwig Institute for Cancer Research, University of LausanneLausanneSwitzerland
- Agora Cancer Research CenterLausanneSwitzerland
- Swiss Cancer Center Leman (SCCL)GenevaSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| |
Collapse
|
60
|
Xu S, Liu J, Zhang J. scACT: Accurate Cross-modality Translation via Cycle-consistent Training from Unpaired Single-cell Data. PROCEEDINGS OF THE ... ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT. ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT 2024; 2024:2722-2731. [PMID: 39628660 PMCID: PMC11611688 DOI: 10.1145/3627673.3679576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/06/2024]
Abstract
Single-cell sequencing technologies have revolutionized genomics by enabling the simultaneous profiling of various molecular modalities within individual cells. Their integration, especially cross-modality translation, offers deep insights into cellular regulatory mechanisms. Many methods have been developed for cross-modality translation, but their reliance on scarce high-quality co-assay data limits their applicability. Addressing this, we introduce scACT, a deep generative model designed to extract cross-modality biological insights from unpaired single-cell data. scACT tackles three major challenges: aligning unpaired multi-modal data via adversarial training, facilitating cross-modality translation without prior knowledge via cycle-consistent training, and enabling interpretable regulatory interconnections explorations via in-silico perturbations. To test its performance, we applied scACT on diverse single-cell datasets and found it outperformed existing methods in all three tasks. Finally, we have developed scACT as an individual open-source software package to advance single-cell omics data processing and analysis within the research community.
Collapse
Affiliation(s)
- Siwei Xu
- University of California, Irvine, Irvine, California, USA
| | - Junhao Liu
- University of California, Irvine, Irvine, California, USA
| | - Jing Zhang
- University of California, Irvine, Irvine, California, USA
| |
Collapse
|
61
|
Schlachetzki JC, Gianella S, Ouyang Z, Lana AJ, Yang X, O'Brien S, Challacombe JF, Gaskill PJ, Jordan-Sciutto KL, Chaillon A, Moore D, Achim CL, Ellis RJ, Smith DM, Glass CK. Gene expression and chromatin conformation of microglia in virally suppressed people with HIV. Life Sci Alliance 2024; 7:e202402736. [PMID: 39060113 PMCID: PMC11282357 DOI: 10.26508/lsa.202402736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 07/14/2024] [Accepted: 07/15/2024] [Indexed: 07/28/2024] Open
Abstract
The presence of HIV in sequestered reservoirs is a central impediment to a functional cure, allowing HIV to persist despite life-long antiretroviral therapy (ART), and driving a variety of comorbid conditions. Our understanding of the latent HIV reservoir in the central nervous system is incomplete, because of difficulties in accessing human central nervous system tissues. Microglia contribute to HIV reservoirs, but the molecular phenotype of HIV-infected microglia is poorly understood. We leveraged the unique "Last Gift" rapid autopsy program, in which people with HIV are closely followed until days or even hours before death. Microglial populations were heterogeneous regarding their gene expression profiles but showed similar chromatin accessibility landscapes. Despite ART, we detected occasional microglia containing cell-associated HIV RNA and HIV DNA integrated into open regions of the host's genome (∼0.005%). Microglia with detectable HIV RNA showed an inflammatory phenotype. These results demonstrate a distinct myeloid cell reservoir in the brains of people with HIV despite suppressive ART. Strategies for curing HIV and neurocognitive impairment will need to consider the myeloid compartment to be successful.
Collapse
Affiliation(s)
- Johannes Cm Schlachetzki
- Department of Cellular and Molecular Medicine, University of California San Diego, San Diego, CA, USA
- Department of Neurosciences, University of California San Diego, San Diego, CA, USA
| | - Sara Gianella
- Department of Medicine, Division of Infectious Diseases and Global Public Health, University of California San Diego, San Diego, CA, USA
| | - Zhengyu Ouyang
- Department of Cellular and Molecular Medicine, University of California San Diego, San Diego, CA, USA
| | - Addison J Lana
- Department of Cellular and Molecular Medicine, University of California San Diego, San Diego, CA, USA
| | - Xiaoxu Yang
- Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | - Sydney O'Brien
- Department of Cellular and Molecular Medicine, University of California San Diego, San Diego, CA, USA
| | - Jean F Challacombe
- Department of Cellular and Molecular Medicine, University of California San Diego, San Diego, CA, USA
| | - Peter J Gaskill
- Department of Pharmacology and Physiology, Drexel University College of Medicine, Philadelphia, PA, USA
| | - Kelly L Jordan-Sciutto
- Department of Oral Medicine, School of Dental Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Antoine Chaillon
- Department of Medicine, Division of Infectious Diseases and Global Public Health, University of California San Diego, San Diego, CA, USA
| | - David Moore
- Department of Psychiatry, University of California San Diego, San Diego, CA, USA
| | - Cristian L Achim
- Department of Pathology, University of California San Diego, San Diego, CA, USA
| | - Ronald J Ellis
- Department of Neurosciences, University of California San Diego, San Diego, CA, USA
| | - Davey M Smith
- Department of Medicine, Division of Infectious Diseases and Global Public Health, University of California San Diego, San Diego, CA, USA
| | - Christopher K Glass
- Department of Cellular and Molecular Medicine, University of California San Diego, San Diego, CA, USA
| |
Collapse
|
62
|
Wu X, Yang X, Dai Y, Zhao Z, Zhu J, Guo H, Yang R. Single-cell sequencing to multi-omics: technologies and applications. Biomark Res 2024; 12:110. [PMID: 39334490 PMCID: PMC11438019 DOI: 10.1186/s40364-024-00643-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 08/17/2024] [Indexed: 09/30/2024] Open
Abstract
Cells, as the fundamental units of life, contain multidimensional spatiotemporal information. Single-cell RNA sequencing (scRNA-seq) is revolutionizing biomedical science by analyzing cellular state and intercellular heterogeneity. Undoubtedly, single-cell transcriptomics has emerged as one of the most vibrant research fields today. With the optimization and innovation of single-cell sequencing technologies, the intricate multidimensional details concealed within cells are gradually unveiled. The combination of scRNA-seq and other multi-omics is at the forefront of the single-cell field. This involves simultaneously measuring various omics data within individual cells, expanding our understanding across a broader spectrum of dimensions. Single-cell multi-omics precisely captures the multidimensional aspects of single-cell transcriptomes, immune repertoire, spatial information, temporal information, epitopes, and other omics in diverse spatiotemporal contexts. In addition to depicting the cell atlas of normal or diseased tissues, it also provides a cornerstone for studying cell differentiation and development patterns, disease heterogeneity, drug resistance mechanisms, and treatment strategies. Herein, we review traditional single-cell sequencing technologies and outline the latest advancements in single-cell multi-omics. We summarize the current status and challenges of applying single-cell multi-omics technologies to biological research and clinical applications. Finally, we discuss the limitations and challenges of single-cell multi-omics and potential strategies to address them.
Collapse
Affiliation(s)
- Xiangyu Wu
- Department of Urology, Nanjing Drum Tower Hospital, Affiliated Hospital of Medical School, Nanjing University, 321 Zhongshan Road, Nanjing, 210008, Jiangsu, China
| | - Xin Yang
- Department of Urology, Nanjing Drum Tower Hospital, Affiliated Hospital of Medical School, Nanjing University, 321 Zhongshan Road, Nanjing, 210008, Jiangsu, China
| | - Yunhan Dai
- Medical School, Nanjing University, Nanjing, China
| | - Zihan Zhao
- Department of Urology, Nanjing Drum Tower Hospital, Affiliated Hospital of Medical School, Nanjing University, 321 Zhongshan Road, Nanjing, 210008, Jiangsu, China
| | - Junmeng Zhu
- Department of Oncology, Nanjing Drum Tower Hospital, Affiliated Hospital of Medical School, Nanjing University, Nanjing, China
| | - Hongqian Guo
- Department of Urology, Nanjing Drum Tower Hospital, Affiliated Hospital of Medical School, Nanjing University, 321 Zhongshan Road, Nanjing, 210008, Jiangsu, China.
| | - Rong Yang
- Department of Urology, Nanjing Drum Tower Hospital, Affiliated Hospital of Medical School, Nanjing University, 321 Zhongshan Road, Nanjing, 210008, Jiangsu, China.
| |
Collapse
|
63
|
Lin X, Jiang S, Gao L, Wei Z, Wang J. MultiSC: a deep learning pipeline for analyzing multiomics single-cell data. Brief Bioinform 2024; 25:bbae492. [PMID: 39376034 PMCID: PMC11458747 DOI: 10.1093/bib/bbae492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Revised: 07/22/2024] [Accepted: 09/17/2024] [Indexed: 10/09/2024] Open
Abstract
Single-cell technologies enable researchers to investigate cell functions at an individual cell level and study cellular processes with higher resolution. Several multi-omics single-cell sequencing techniques have been developed to explore various aspects of cellular behavior. Using NEAT-seq as an example, this method simultaneously obtains three kinds of omics data for each cell: gene expression, chromatin accessibility, and protein expression of transcription factors (TFs). Consequently, NEAT-seq offers a more comprehensive understanding of cellular activities in multiple modalities. However, there is a lack of tools available for effectively integrating the three types of omics data. To address this gap, we propose a novel pipeline called MultiSC for the analysis of MULTIomic Single-Cell data. Our pipeline leverages a multimodal constraint autoencoder (single-cell hierarchical constraint autoencoder) to integrate the multi-omics data during the clustering process and a matrix factorization-based model (scMF) to predict target genes regulated by a TF. Moreover, we utilize multivariate linear regression models to predict gene regulatory networks from the multi-omics data. Additional functionalities, including differential expression, mediation analysis, and causal inference, are also incorporated into the MultiSC pipeline. Extensive experiments were conducted to evaluate the performance of MultiSC. The results demonstrate that our pipeline enables researchers to gain a comprehensive view of cell activities and gene regulatory networks by fully leveraging the potential of multiomics single-cell data. By employing MultiSC, researchers can effectively integrate and analyze diverse omics data types, enhancing their understanding of cellular processes.
Collapse
Affiliation(s)
- Xiang Lin
- Department of Quantitative Health Sciences, Mayo Clinic, 13400 E Shea Blvd, Scottsdale, AZ 85259, United States
- Department of Computer Science, New Jersey Institute of Technology, 323 Dr Martin Luther King Jr Blvd, Newark, NJ 07102, United States
| | - Siqi Jiang
- Department of Computer Science, New Jersey Institute of Technology, 323 Dr Martin Luther King Jr Blvd, Newark, NJ 07102, United States
| | - Le Gao
- Department of Quantitative Health Sciences, Mayo Clinic, 13400 E Shea Blvd, Scottsdale, AZ 85259, United States
- Department of Computer Science, New Jersey Institute of Technology, 323 Dr Martin Luther King Jr Blvd, Newark, NJ 07102, United States
| | - Zhi Wei
- Department of Computer Science, New Jersey Institute of Technology, 323 Dr Martin Luther King Jr Blvd, Newark, NJ 07102, United States
| | - Junwen Wang
- Department of Quantitative Health Sciences, Mayo Clinic, 13400 E Shea Blvd, Scottsdale, AZ 85259, United States
- Center for Individualized Medicine, and Mayo Clinic Comprehensive Cancer Center, Mayo Clinic, 13400 E Shea Blvd, Scottsdale, AZ 85259, United States
- Division of Applied Oral Sciences & Community Dental Care, Faculty of Dentistry, The University of Hong Kong, 34 Hospital Road, Sai Ying Pun, Hong Kong SAR, PR China
| |
Collapse
|
64
|
Ak Ç, Sayar Z, Thibault G, Burlingame EA, Kuykendall MJ, Eng J, Chitsazan A, Chin K, Adey AC, Boniface C, Spellman PT, Thomas GV, Kopp RP, Demir E, Chang YH, Stavrinides V, Eksi SE. Multiplex imaging of localized prostate tumors reveals altered spatial organization of AR-positive cells in the microenvironment. iScience 2024; 27:110668. [PMID: 39246442 PMCID: PMC11379676 DOI: 10.1016/j.isci.2024.110668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 07/19/2024] [Accepted: 08/01/2024] [Indexed: 09/10/2024] Open
Abstract
Mapping the spatial interactions of cancer, immune, and stromal cell states presents novel opportunities for patient stratification and for advancing immunotherapy. While single-cell studies revealed significant molecular heterogeneity in prostate cancer cells, the impact of spatial stromal cell heterogeneity remains poorly understood. Here, we used cyclic immunofluorescent imaging on whole-tissue sections to uncover novel spatial associations between cancer and stromal cells in low- and high-grade prostate tumors and tumor-adjacent normal tissues. Our results provide a spatial map of single cells and recurrent cellular neighborhoods in the prostate tumor microenvironment of treatment-naive patients. We report unique populations of mast cells that show distinct spatial associations with M2 macrophages and regulatory T cells. Our results show disease-specific neighborhoods that are primarily driven by androgen receptor-positive (AR+) stromal cells and identify inflammatory gene networks active in AR+ prostate stroma.
Collapse
Affiliation(s)
- Çiğdem Ak
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
- Department of Biomedical Engineering, School of Medicine, OHSU, Portland, OR 97209, USA
| | - Zeynep Sayar
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
- Department of Biomedical Engineering, School of Medicine, OHSU, Portland, OR 97209, USA
| | - Guillaume Thibault
- Department of Biomedical Engineering, School of Medicine, OHSU, Portland, OR 97209, USA
| | - Erik A Burlingame
- Department of Biomedical Engineering, School of Medicine, OHSU, Portland, OR 97209, USA
| | - M J Kuykendall
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
| | - Jennifer Eng
- Department of Biomedical Engineering, School of Medicine, OHSU, Portland, OR 97209, USA
| | - Alex Chitsazan
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
| | - Koei Chin
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
| | - Andrew C Adey
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
- Department of Molecular and Medical Genetics, Knight Cancer Institute, OHSU, Portland, OR 97239, USA
| | - Christopher Boniface
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
| | - Paul T Spellman
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
- Department of Molecular and Medical Genetics, Knight Cancer Institute, OHSU, Portland, OR 97239, USA
| | - George V Thomas
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
- Department of Pathology & Laboratory Medicine, School of Medicine, OHSU, Portland, OR 97239, USA
| | - Ryan P Kopp
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
- Department of Urology, School of Medicine, Knight Cancer Institute, Portland, OR 97239, USA
| | - Emek Demir
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
- Division of Oncological Sciences, School of Medicine, OHSU, Portland, OR 97239, USA
| | - Young Hwan Chang
- Department of Biomedical Engineering, School of Medicine, OHSU, Portland, OR 97209, USA
| | | | - Sebnem Ece Eksi
- Cancer Early Detection Advanced Research (CEDAR), Knight Cancer Institute, OHSU, Portland, OR 97239, USA
- Department of Biomedical Engineering, School of Medicine, OHSU, Portland, OR 97209, USA
| |
Collapse
|
65
|
Wang K, Yan Y, Elgamal H, Li J, Tang C, Bai S, Xiao Z, Sei E, Lin Y, Wang J, Montalvan J, Nagi C, Thompson AM, Navin N. Single cell genome and epigenome co-profiling reveals hardwiring and plasticity in breast cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.06.611519. [PMID: 39314325 PMCID: PMC11418942 DOI: 10.1101/2024.09.06.611519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
Understanding the impact of genetic alterations on epigenomic phenotypes during breast cancer progression is challenging with unimodal measurements. Here, we report wellDA-seq, the first high-genomic resolution, high-throughput method that can simultaneously measure the whole genome and chromatin accessibility profiles of thousands of single cells. Using wellDA-seq, we profiled 22,123 single cells from 2 normal and 9 tumors breast tissues. By directly mapping the epigenomic phenotypes to genetic lineages across cancer subclones, we found evidence of both genetic hardwiring and epigenetic plasticity. In 6 estrogen-receptor positive breast cancers, we directly identified the ancestral cancer cells, and found that their epithelial cell-of-origin was Luminal Hormone Responsive cells. We also identified cell types with copy number aberrations (CNA) in normal breast tissues and discovered non-epithelial cell types in the microenvironment with CNAs in breast cancers. These data provide insights into the complex relationship between genetic alterations and epigenomic phenotypes during breast tumor evolution.
Collapse
|
66
|
Buquicchio FA, Fonseca R, Yan PK, Wang F, Evrard M, Obers A, Gutierrez JC, Raposo CJ, Belk JA, Daniel B, Zareie P, Yost KE, Qi Y, Yin Y, Nico KF, Tierney FM, Howitt MR, Lareau CA, Satpathy AT, Mackay LK. Distinct epigenomic landscapes underlie tissue-specific memory T cell differentiation. Immunity 2024; 57:2202-2215.e6. [PMID: 39043184 DOI: 10.1016/j.immuni.2024.06.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 05/07/2024] [Accepted: 06/27/2024] [Indexed: 07/25/2024]
Abstract
The memory CD8+ T cell pool contains phenotypically and transcriptionally heterogeneous subsets with specialized functions and recirculation patterns. Here, we examined the epigenetic landscape of CD8+ T cells isolated from seven non-lymphoid organs across four distinct infection models, alongside their circulating T cell counterparts. Using single-cell transposase-accessible chromatin sequencing (scATAC-seq), we found that tissue-resident memory T (TRM) cells and circulating memory T (TCIRC) cells develop along distinct epigenetic trajectories. We identified organ-specific transcriptional regulators of TRM cell development, including FOSB, FOS, FOSL1, and BACH2, and defined an epigenetic signature common to TRM cells across organs. Finally, we found that although terminal TEX cells share accessible regulatory elements with TRM cells, they are defined by TEX-specific epigenetic features absent from TRM cells. Together, this comprehensive data resource shows that TRM cell development is accompanied by dynamic transcriptome alterations and chromatin accessibility changes that direct tissue-adapted and functionally distinct T cell states.
Collapse
Affiliation(s)
- Frank A Buquicchio
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA; Gladstone-UCSF Institute of Genomic Immunology, San Francisco, CA 94158, USA
| | - Raissa Fonseca
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC 3000, Australia
| | - Patrick K Yan
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA
| | - Fangyi Wang
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA
| | - Maximilien Evrard
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC 3000, Australia
| | - Andreas Obers
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC 3000, Australia
| | - Jacob C Gutierrez
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA
| | - Colin J Raposo
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA
| | - Julia A Belk
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | - Bence Daniel
- Department of Pathology, Stanford University, Stanford, CA 94305, USA
| | - Pirooz Zareie
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC 3000, Australia
| | - Kathryn E Yost
- Department of Pathology, Stanford University, Stanford, CA 94305, USA
| | - Yanyan Qi
- Department of Pathology, Stanford University, Stanford, CA 94305, USA
| | - Yajie Yin
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA
| | - Katherine F Nico
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA
| | - Flora M Tierney
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA
| | - Michael R Howitt
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA
| | - Caleb A Lareau
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA; Parker Institute for Cancer Immunotherapy, Stanford University, Stanford, CA 94129, USA; Gladstone-UCSF Institute of Genomic Immunology, San Francisco, CA 94158, USA
| | - Ansuman T Satpathy
- Department of Pathology, Stanford University, Stanford, CA 94305, USA; Program in Immunology, Stanford University, Stanford, CA 94304, USA; Parker Institute for Cancer Immunotherapy, Stanford University, Stanford, CA 94129, USA; Gladstone-UCSF Institute of Genomic Immunology, San Francisco, CA 94158, USA.
| | - Laura K Mackay
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC 3000, Australia.
| |
Collapse
|
67
|
Kim J, Schanzer N, Singh RS, Zaman MI, Garcia-Medina JS, Proszynski J, Ganesan S, Dan Landau, Park CY, Melnick AM, Mason CE. DOGMA-seq and multimodal, single-cell analysis in acute myeloid leukemia. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2024; 390:67-108. [PMID: 39864897 DOI: 10.1016/bs.ircmb.2024.08.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2025]
Abstract
Acute myeloid leukemia (AML) is a complex cancer, yet advances in recent years from integrated genomics methods have helped improve diagnosis, treatment, and means of patient stratification. A recent example of a powerful, multimodal method is DOGMA-seq, which can measure chromatin accessibility, gene expression, and cell-surface protein levels from the same individual cell simultaneously. Previous bimodal single-cell techniques, such as CITE-seq (Cellular indexing of transcriptomes and epitopes), have only permitted the transcriptome and cell-surface protein expression measurement. DOGMA-seq, however, builds on this foundation and has implications for examining epigenomic, transcriptomic, and proteomic interactions between various cell types. This technique has the potential to be particularly useful in the study of cancers such as AML. This is because the cellular mechanisms that drive AML are rather heterogeneous and require a more complete understanding of the interplay between the genetic mutations, disruptions in RNA transcription and translation, and surface protein expression that cause these cancers to develop and evolve. This technique will hopefully contribute to a more clear and complete understanding of the growth and progression of complex cancers.
Collapse
Affiliation(s)
- JangKeun Kim
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, United States; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, United States
| | - Nathan Schanzer
- School of Medicine, New York Medical College, Valhalla, NY, United States
| | - Ruth Subhash Singh
- Department of Biomedical Engineering, Rensselaer Polytechnic Institute, Troy, NY, United States
| | - Mohammed I Zaman
- Department of Biophysics and Physiology, Stony Brook University, Stony Brook, NY, United States
| | - J Sebastian Garcia-Medina
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, United States; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, United States
| | - Jacqueline Proszynski
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, United States; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, United States
| | - Saravanan Ganesan
- Division of Hematology and Medical Oncology, Department of Medicine, Weill Cornell Medicine, New York, NY, United States; Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, United States; New York Genome Center, New York, NY, United States
| | - Dan Landau
- Division of Hematology and Medical Oncology, Department of Medicine, Weill Cornell Medicine, New York, NY, United States; Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, United States
| | | | - Ari M Melnick
- Division of Hematology and Medical Oncology, Department of Medicine, Weill Cornell Medicine, New York, NY, United States; Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, United States
| | - Christopher E Mason
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, United States; The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, United States.
| |
Collapse
|
68
|
Samaran J, Peyré G, Cantini L. scConfluence: single-cell diagonal integration with regularized Inverse Optimal Transport on weakly connected features. Nat Commun 2024; 15:7762. [PMID: 39237488 PMCID: PMC11377776 DOI: 10.1038/s41467-024-51382-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 08/06/2024] [Indexed: 09/07/2024] Open
Abstract
The abundance of unpaired multimodal single-cell data has motivated a growing body of research into the development of diagonal integration methods. However, the state-of-the-art suffers from the loss of biological information due to feature conversion and struggles with modality-specific populations. To overcome these crucial limitations, we here introduce scConfluence, a method for single-cell diagonal integration. scConfluence combines uncoupled autoencoders on the complete set of features with regularized Inverse Optimal Transport on weakly connected features. We extensively benchmark scConfluence in several single-cell integration scenarios proving that it outperforms the state-of-the-art. We then demonstrate the biological relevance of scConfluence in three applications. We predict spatial patterns for Scgn, Synpr and Olah in scRNA-smFISH integration. We improve the classification of B cells and Monocytes in highly heterogeneous scRNA-scATAC-CyTOF integration. Finally, we reveal the joint contribution of Fezf2 and apical dendrite morphology in Intra Telencephalic neurons, based on morphological images and scRNA.
Collapse
Affiliation(s)
- Jules Samaran
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France
| | - Gabriel Peyré
- CNRS and DMA de l'Ecole Normale Supérieure, CNRS, Ecole Normale Supérieure, Université PSL, Paris, France
| | - Laura Cantini
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France.
| |
Collapse
|
69
|
Trapnell C. Revealing gene function with statistical inference at single-cell resolution. Nat Rev Genet 2024; 25:623-638. [PMID: 38951690 DOI: 10.1038/s41576-024-00750-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/21/2024] [Indexed: 07/03/2024]
Abstract
Single-cell and spatial molecular profiling assays have shown large gains in sensitivity, resolution and throughput. Applying these technologies to specimens from human and model organisms promises to comprehensively catalogue cell types, reveal their lineage origins in development and discern their contributions to disease pathogenesis. Moreover, rapidly dropping costs have made well-controlled perturbation experiments and cohort studies widely accessible, illuminating mechanisms that give rise to phenotypes at the scale of the cell, the tissue and the whole organism. Interpreting the coming flood of single-cell data, much of which will be spatially resolved, will place a tremendous burden on existing computational pipelines. However, statistical concepts, models, tools and algorithms can be repurposed to solve problems now arising in genetic and molecular biology studies of development and disease. Here, I review how the questions that recent technological innovations promise to answer can be addressed by the major classes of statistical tools.
Collapse
Affiliation(s)
- Cole Trapnell
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, WA, USA.
- Allen Discovery Center for Cell Lineage Tracing, Seattle, WA, USA.
- Seattle Hub for Synthetic Biology, Seattle, WA, USA.
| |
Collapse
|
70
|
Dong C, Meng X, Zhang T, Guo Z, Liu Y, Wu P, Chen S, Zhou F, Ma Y, Xiong H, Shu S, He A. Single-cell EpiChem jointly measures drug-chromatin binding and multimodal epigenome. Nat Methods 2024; 21:1624-1633. [PMID: 39025969 PMCID: PMC11399096 DOI: 10.1038/s41592-024-02360-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Accepted: 06/25/2024] [Indexed: 07/20/2024]
Abstract
Studies of molecular and cellular functions of small-molecule inhibitors in cancer treatment, eliciting effects by targeting genome and epigenome associated proteins, requires measurement of drug-target engagement in single-cell resolution. Here we present EpiChem for in situ single-cell joint mapping of small molecules and multimodal epigenomic landscape. We demonstrate single-cell co-assays of three small molecules together with histone modifications, chromatin accessibility or target proteins in human colorectal cancer (CRC) organoids. Integrated multimodal analysis reveals diverse drug interactions in the context of chromatin states within heterogeneous CRC organoids. We further reveal drug genomic binding dynamics and adaptive epigenome across cell types after small-molecule drug treatment in CRC organoids. This method provides a unique tool to exploit the mechanisms of cell type-specific drug actions.
Collapse
Affiliation(s)
- Chao Dong
- Institute of Molecular Medicine, National Biomedical Imaging Center, College of Future Technology, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Xiaoxuan Meng
- Institute of Molecular Medicine, National Biomedical Imaging Center, College of Future Technology, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Tong Zhang
- Institute of Molecular Medicine, National Biomedical Imaging Center, College of Future Technology, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Zhifang Guo
- State Key Laboratory of Molecular Oncology, Beijing Key Laboratory of Carcinogenesis and Translational Research, Department of Lymphoma, Peking University Cancer Hospital & Institute, Beijing, China
- Peking University International Cancer Institute, Beijing, China
- Peking University-Yunnan Baiyao International Medical Research Center, Beijing, China
| | - Yaxi Liu
- Institute of Molecular Medicine, National Biomedical Imaging Center, College of Future Technology, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Peihuang Wu
- State Key Laboratory of Molecular Oncology, Beijing Key Laboratory of Carcinogenesis and Translational Research, Department of Lymphoma, Peking University Cancer Hospital & Institute, Beijing, China
| | - Shiwei Chen
- Peking University International Cancer Institute, Beijing, China
- Peking University-Yunnan Baiyao International Medical Research Center, Beijing, China
| | - Fanqi Zhou
- State Key Laboratory of Medical Molecular Biology, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China
| | - Yanni Ma
- State Key Laboratory of Medical Molecular Biology, Haihe laboratory of Cell Ecosystem, Key Laboratory of RNA and Hematopoietic Regulation, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine Peking Union Medical College, Beijing, China
| | - Haiqing Xiong
- State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Diseases, Haihe Laboratory of Cell Ecosystem, Institute of Hematology & Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin, China
- Tianjin Institutes of Health Science, Tianjin, China
| | - Shaokun Shu
- State Key Laboratory of Molecular Oncology, Beijing Key Laboratory of Carcinogenesis and Translational Research, Department of Lymphoma, Peking University Cancer Hospital & Institute, Beijing, China.
- Peking University International Cancer Institute, Beijing, China.
- Peking University-Yunnan Baiyao International Medical Research Center, Beijing, China.
| | - Aibin He
- Institute of Molecular Medicine, National Biomedical Imaging Center, College of Future Technology, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China.
- Key laboratory of Carcinogenesis and Translational Research of Ministry of Education of China, Peking University Cancer Hospital & Institute, Beijing, China.
- Peking University Chengdu Academy for Advanced Interdisciplinary Biotechnologies, Chengdu, China.
| |
Collapse
|
71
|
Stuart T. Progress in multifactorial single-cell chromatin profiling methods. Biochem Soc Trans 2024; 52:1827-1839. [PMID: 39023855 PMCID: PMC11668300 DOI: 10.1042/bst20231471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Revised: 07/01/2024] [Accepted: 07/08/2024] [Indexed: 07/20/2024]
Abstract
Chromatin states play a key role in shaping overall cellular states and fates. Building a complete picture of the functional state of chromatin in cells requires the co-detection of several distinct biochemical aspects. These span DNA methylation, chromatin accessibility, chromosomal conformation, histone posttranslational modifications, and more. While this certainly presents a challenging task, over the past few years many new and creative methods have been developed that now enable co-assay of these different aspects of chromatin at single cell resolution. This field is entering an exciting phase, where a confluence of technological improvements, decreased sequencing costs, and computational innovation are presenting new opportunities to dissect the diversity of chromatin states present in tissues, and how these states may influence gene regulation. In this review, I discuss the spectrum of current experimental approaches for multifactorial chromatin profiling, highlight some of the experimental and analytical challenges, as well as some areas for further innovation.
Collapse
Affiliation(s)
- Tim Stuart
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome, Singapore 138672, Republic of Singapore
| |
Collapse
|
72
|
Luo S, Germain PL, Robinson MD, von Meyenn F. Benchmarking computational methods for single-cell chromatin data analysis. Genome Biol 2024; 25:225. [PMID: 39152456 PMCID: PMC11328424 DOI: 10.1186/s13059-024-03356-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 07/29/2024] [Indexed: 08/19/2024] Open
Abstract
BACKGROUND Single-cell chromatin accessibility assays, such as scATAC-seq, are increasingly employed in individual and joint multi-omic profiling of single cells. As the accumulation of scATAC-seq and multi-omics datasets continue, challenges in analyzing such sparse, noisy, and high-dimensional data become pressing. Specifically, one challenge relates to optimizing the processing of chromatin-level measurements and efficiently extracting information to discern cellular heterogeneity. This is of critical importance, since the identification of cell types is a fundamental step in current single-cell data analysis practices. RESULTS We benchmark 8 feature engineering pipelines derived from 5 recent methods to assess their ability to discover and discriminate cell types. By using 10 metrics calculated at the cell embedding, shared nearest neighbor graph, or partition levels, we evaluate the performance of each method at different data processing stages. This comprehensive approach allows us to thoroughly understand the strengths and weaknesses of each method and the influence of parameter selection. CONCLUSIONS Our analysis provides guidelines for choosing analysis methods for different datasets. Overall, feature aggregation, SnapATAC, and SnapATAC2 outperform latent semantic indexing-based methods. For datasets with complex cell-type structures, SnapATAC and SnapATAC2 are preferred. With large datasets, SnapATAC2 and ArchR are most scalable.
Collapse
Affiliation(s)
- Siyuan Luo
- Laboratory of Nutrition and Metabolic Epigenetics, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
| | - Pierre-Luc Germain
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland
- SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
- Institute for Neuroscience, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
| | - Mark D Robinson
- Department of Molecular Life Sciences, University of Zurich, Zurich, Switzerland.
- SIB Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland.
| | - Ferdinand von Meyenn
- Laboratory of Nutrition and Metabolic Epigenetics, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland.
| |
Collapse
|
73
|
Rachid Zaim S, Pebworth MP, McGrath I, Okada L, Weiss M, Reading J, Czartoski JL, Torgerson TR, McElrath MJ, Bumol TF, Skene PJ, Li XJ. MOCHA's advanced statistical modeling of scATAC-seq data enables functional genomic inference in large human cohorts. Nat Commun 2024; 15:6828. [PMID: 39122670 PMCID: PMC11316085 DOI: 10.1038/s41467-024-50612-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 07/13/2024] [Indexed: 08/12/2024] Open
Abstract
Single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) is being increasingly used to study gene regulation. However, major analytical gaps limit its utility in studying gene regulatory programs in complex diseases. In response, MOCHA (Model-based single cell Open CHromatin Analysis) presents major advances over existing analysis tools, including: 1) improving identification of sample-specific open chromatin, 2) statistical modeling of technical drop-out with zero-inflated methods, 3) mitigation of false positives in single cell analysis, 4) identification of alternative transcription-starting-site regulation, and 5) modules for inferring temporal gene regulatory networks from longitudinal data. These advances, in addition to open chromatin analyses, provide a robust framework after quality control and cell labeling to study gene regulatory programs in human disease. We benchmark MOCHA with four state-of-the-art tools to demonstrate its advances. We also construct cross-sectional and longitudinal gene regulatory networks, identifying potential mechanisms of COVID-19 response. MOCHA provides researchers with a robust analytical tool for functional genomic inference from scATAC-seq data.
Collapse
Affiliation(s)
| | | | | | - Lauren Okada
- Allen Institute for Immunology, Seattle, WA, USA
| | - Morgan Weiss
- Allen Institute for Immunology, Seattle, WA, USA
| | | | - Julie L Czartoski
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | | | - M Juliana McElrath
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | | | | | - Xiao-Jun Li
- Allen Institute for Immunology, Seattle, WA, USA.
| |
Collapse
|
74
|
Liao H, Kottapalli S, Huang Y, Chaw M, Gehring J, Waltner O, Phung-Rojas M, Daza RM, Matsen FA, Trapnell C, Shendure J, Srivatsan S. Optics-free reconstruction of 2D images via DNA barcode proximity graphs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.06.606834. [PMID: 39149271 PMCID: PMC11326233 DOI: 10.1101/2024.08.06.606834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Spatial genomic technologies include imaging- and sequencing-based methods (1-3). An emerging subcategory of sequencing-based methods relies on a surface coated with coordinate-associated DNA barcodes, which are leveraged to tag endogenous nucleic acids or cells in an overlaid tissue section (4-7). However, the physical registration of DNA barcodes to spatial coordinates is challenging, necessitating either high density printing of coordinate-specific oligonucleotides or in situ sequencing/probing of randomly deposited, oligonucleotide-bearing beads. As a consequence, the surface areas available to sequencing-based spatial genomic methods are constrained by the time, labor, cost, and instrumentation required to either print, synthesize or decode a coordinate-tagged surface. To address this challenge, we developed SCOPE (Spatial reConstruction via Oligonucleotide Proximity Encoding), an optics-free, DNA microscopy (8) inspired method. With SCOPE, the relative positions of randomly deposited beads on a 2D surface are inferred from the ex situ sequencing of chimeric molecules formed from diffusing "sender" and tethered "receiver" oligonucleotides. As a first proof-of-concept, we apply SCOPE to reconstruct an asymmetric "swoosh" shape resembling the Nike logo (16.75 × 9.25 mm). Next, we use a microarray printer to encode a "color" version of the Snellen eye chart for visual acuity (17.18 × 40.97 mm), and apply SCOPE to achieve optics-free reconstruction of individual letters. Although these are early demonstrations of the concept and much work remains to be done, we envision that the optics-free, sequencing-based quantitation of the molecular proximities of DNA barcodes will enable spatial genomics in constant experimental time, across fields of view and at resolutions that are determined by sequencing depth, bead size, and diffusion kinetics, rather than the limitations of optical instruments or microarray printers.
Collapse
Affiliation(s)
- Hanna Liao
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA
| | - Sanjay Kottapalli
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Yuqi Huang
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Matthew Chaw
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Jase Gehring
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Olivia Waltner
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Melissa Phung-Rojas
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Riza M. Daza
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Seattle Hub for Synthetic Biology, Seattle, WA, USA
| | - Frederick A. Matsen
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Department of Statistics, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, Seattle, WA, USA
| | - Cole Trapnell
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Seattle Hub for Synthetic Biology, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Jay Shendure
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Seattle Hub for Synthetic Biology, Seattle, WA, USA
- Howard Hughes Medical Institute, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Sanjay Srivatsan
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| |
Collapse
|
75
|
Yun S, Noh M, Yu J, Kim HJ, Hui CC, Lee H, Son JE. Unlocking biological mechanisms with integrative functional genomics approaches. Mol Cells 2024; 47:100092. [PMID: 39019219 PMCID: PMC11345568 DOI: 10.1016/j.mocell.2024.100092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 07/01/2024] [Accepted: 07/08/2024] [Indexed: 07/19/2024] Open
Abstract
Reverse genetics offers precise functional insights into genes through the targeted manipulation of gene expression followed by phenotypic assessment. While these approaches have proven effective in model organisms such as Saccharomyces cerevisiae, large-scale genetic manipulations in human cells were historically unfeasible due to methodological limitations. However, recent advancements in functional genomics, particularly clustered regularly interspaced short palindromic repeats (CRISPR)-based screening technologies and next-generation sequencing platforms, have enabled pooled screening technologies that allow massively parallel, unbiased assessments of biological phenomena in human cells. This review provides a comprehensive overview of cutting-edge functional genomic screening technologies applicable to human cells, ranging from short hairpin RNA screens to modern CRISPR screens. Additionally, we explore the integration of CRISPR platforms with single-cell approaches to monitor gene expression, chromatin accessibility, epigenetic regulation, and chromatin architecture following genetic perturbations at the omics level. By offering an in-depth understanding of these genomic screening methods, this review aims to provide insights into more targeted and effective strategies for genomic research and personalized medicine.
Collapse
Affiliation(s)
- Sehee Yun
- Department of Life Sciences, Korea University, Seoul 02841, Korea
| | - Minsoo Noh
- Department of Life Sciences, Korea University, Seoul 02841, Korea; Department of Internal Medicine and Laboratory of Genomics and Translational Medicine, Gachon University College of Medicine, Incheon 21565, Korea
| | - Jivin Yu
- Department of Life Sciences, Korea University, Seoul 02841, Korea
| | - Hyeon-Jai Kim
- Department of Life Sciences, Korea University, Seoul 02841, Korea
| | - Chi-Chung Hui
- Program in Developmental and Stem Cell Biology, The Hospital for Sick Children, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Hunsang Lee
- Department of Life Sciences, Korea University, Seoul 02841, Korea.
| | - Joe Eun Son
- School of Food Science and Biotechnology, Kyungpook National University, Daegu 41566, Korea.
| |
Collapse
|
76
|
Zhou T, Zhang R, Jia D, Doty RT, Munday AD, Gao D, Xin L, Abkowitz JL, Duan Z, Ma J. GAGE-seq concurrently profiles multiscale 3D genome organization and gene expression in single cells. Nat Genet 2024; 56:1701-1711. [PMID: 38744973 PMCID: PMC11323187 DOI: 10.1038/s41588-024-01745-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 04/05/2024] [Indexed: 05/16/2024]
Abstract
The organization of mammalian genomes features a complex, multiscale three-dimensional (3D) architecture, whose functional significance remains elusive because of limited single-cell technologies that can concurrently profile genome organization and transcriptional activities. Here, we introduce genome architecture and gene expression by sequencing (GAGE-seq), a scalable, robust single-cell co-assay measuring 3D genome structure and transcriptome simultaneously within the same cell. Applied to mouse brain cortex and human bone marrow CD34+ cells, GAGE-seq characterized the intricate relationships between 3D genome and gene expression, showing that multiscale 3D genome features inform cell-type-specific gene expression and link regulatory elements to target genes. Integration with spatial transcriptomic data revealed in situ 3D genome variations in mouse cortex. Observations in human hematopoiesis unveiled discordant changes between 3D genome organization and gene expression, underscoring a complex, temporal interplay at the single-cell level. GAGE-seq provides a powerful, cost-effective approach for exploring genome structure and gene expression relationships at the single-cell level across diverse biological contexts.
Collapse
Affiliation(s)
- Tianming Zhou
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Ruochi Zhang
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
- Eric and Wendy Schmidt Center, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Deyong Jia
- Department of Urology, University of Washington, Seattle, WA, USA
| | - Raymond T Doty
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA
| | - Adam D Munday
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA
| | - Daniel Gao
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
- Department of Chemistry, Pomona College, Claremont, CA, USA
| | - Li Xin
- Department of Urology, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
| | - Janis L Abkowitz
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
| | - Zhijun Duan
- Division of Hematology and Oncology, Department of Medicine/Fred Hutch Cancer Center, University of Washington, Seattle, WA, USA.
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA.
| | - Jian Ma
- Ray and Stephanie Lane Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
77
|
Yang X, Mann KK, Wu H, Ding J. scCross: a deep generative model for unifying single-cell multi-omics with seamless integration, cross-modal generation, and in silico exploration. Genome Biol 2024; 25:198. [PMID: 39075536 PMCID: PMC11285326 DOI: 10.1186/s13059-024-03338-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 07/16/2024] [Indexed: 07/31/2024] Open
Abstract
Single-cell multi-omics data reveal complex cellular states, providing significant insights into cellular dynamics and disease. Yet, integration of multi-omics data presents challenges. Some modalities have not reached the robustness or clarity of established transcriptomics. Coupled with data scarcity for less established modalities and integration intricacies, these challenges limit our ability to maximize single-cell omics benefits. We introduce scCross, a tool leveraging variational autoencoders, generative adversarial networks, and the mutual nearest neighbors (MNN) technique for modality alignment. By enabling single-cell cross-modal data generation, multi-omics data simulation, and in silico cellular perturbations, scCross enhances the utility of single-cell multi-omics studies.
Collapse
Affiliation(s)
- Xiuhui Yang
- School of Software, Shandong University, 1500 Shunhua, Jinan, 250101, Shandong, China
- Meakins-Christie Laboratories, Department of Medicine, McGill University Health Centre, Montreal, H4A 3J1, QC, Canada
- Quantitative Life Sciences, Faculty of Medicine & Health Sciences, McGill University, Montreal, QC, H3G 1Y6, Canada
| | - Koren K Mann
- Department of Pharmacology and Therapeutics, McGill University, Montreal, QC, H3G 1Y6, Canada
| | - Hao Wu
- School of Software, Shandong University, 1500 Shunhua, Jinan, 250101, Shandong, China.
| | - Jun Ding
- Meakins-Christie Laboratories, Department of Medicine, McGill University Health Centre, Montreal, H4A 3J1, QC, Canada.
- Quantitative Life Sciences, Faculty of Medicine & Health Sciences, McGill University, Montreal, QC, H3G 1Y6, Canada.
- Mila-Quebec AI Institute, Montreal, QC, H2S 3H1, Canada.
| |
Collapse
|
78
|
Loers JU, Vermeirssen V. A single-cell multimodal view on gene regulatory network inference from transcriptomics and chromatin accessibility data. Brief Bioinform 2024; 25:bbae382. [PMID: 39207727 PMCID: PMC11359808 DOI: 10.1093/bib/bbae382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 06/27/2024] [Accepted: 07/23/2024] [Indexed: 09/04/2024] Open
Abstract
Eukaryotic gene regulation is a combinatorial, dynamic, and quantitative process that plays a vital role in development and disease and can be modeled at a systems level in gene regulatory networks (GRNs). The wealth of multi-omics data measured on the same samples and even on the same cells has lifted the field of GRN inference to the next stage. Combinations of (single-cell) transcriptomics and chromatin accessibility allow the prediction of fine-grained regulatory programs that go beyond mere correlation of transcription factor and target gene expression, with enhancer GRNs (eGRNs) modeling molecular interactions between transcription factors, regulatory elements, and target genes. In this review, we highlight the key components for successful (e)GRN inference from (sc)RNA-seq and (sc)ATAC-seq data exemplified by state-of-the-art methods as well as open challenges and future developments. Moreover, we address preprocessing strategies, metacell generation and computational omics pairing, transcription factor binding site detection, and linear and three-dimensional approaches to identify chromatin interactions as well as dynamic and causal eGRN inference. We believe that the integration of transcriptomics together with epigenomics data at a single-cell level is the new standard for mechanistic network inference, and that it can be further advanced with integrating additional omics layers and spatiotemporal data, as well as with shifting the focus towards more quantitative and causal modeling strategies.
Collapse
Affiliation(s)
- Jens Uwe Loers
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Corneel Heymanslaan 10, 9000 Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Zwijnaarde-Technologiepark 71, 9052 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, 9000 Ghent, Belgium
| | - Vanessa Vermeirssen
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Corneel Heymanslaan 10, 9000 Ghent, Belgium
- Department of Biomedical Molecular Biology, Ghent University, Zwijnaarde-Technologiepark 71, 9052 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, 9000 Ghent, Belgium
| |
Collapse
|
79
|
Veerappa AM, Rowley MJ, Maggio A, Beaudry L, Hawkins D, Kim A, Sethi S, Sorgen PL, Guda C. CloudATAC: a cloud-based framework for ATAC-Seq data analysis. Brief Bioinform 2024; 25:bbae090. [PMID: 39041910 PMCID: PMC11264300 DOI: 10.1093/bib/bbae090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/05/2024] [Accepted: 02/18/2024] [Indexed: 07/24/2024] Open
Abstract
Assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-seq) generates genome-wide chromatin accessibility profiles, providing valuable insights into epigenetic gene regulation at both pooled-cell and single-cell population levels. Comprehensive analysis of ATAC-seq data involves the use of various interdependent programs. Learning the correct sequence of steps needed to process the data can represent a major hurdle. Selecting appropriate parameters at each stage, including pre-analysis, core analysis, and advanced downstream analysis, is important to ensure accurate analysis and interpretation of ATAC-seq data. Additionally, obtaining and working within a limited computational environment presents a significant challenge to non-bioinformatic researchers. Therefore, we present Cloud ATAC, an open-source, cloud-based interactive framework with a scalable, flexible, and streamlined analysis framework based on the best practices approach for pooled-cell and single-cell ATAC-seq data. These frameworks use on-demand computational power and memory, scalability, and a secure and compliant environment provided by the Google Cloud. Additionally, we leverage Jupyter Notebook's interactive computing platform that combines live code, tutorials, narrative text, flashcards, quizzes, and custom visualizations to enhance learning and analysis. Further, leveraging GPU instances has significantly improved the run-time of the single-cell framework. The source codes and data are publicly available through NIH Cloud lab https://github.com/NIGMS/ATAC-Seq-and-Single-Cell-ATAC-Seq-Analysis. This manuscript describes the development of a resource module that is part of a learning platform named ``NIGMS Sandbox for Cloud-based Learning'' https://github.com/NIGMS/NIGMS-Sandbox. The overall genesis of the Sandbox is described in the editorial NIGMS Sandbox [1] at the beginning of this Supplement. This module delivers learning materials on the analysis of bulk and single-cell ATAC-seq data in an interactive format that uses appropriate cloud resources for data access and analyses.
Collapse
Affiliation(s)
| | | | - Angela Maggio
- Deloitte Consulting LLP, Health Data and AI Arlington, VA, USA
| | - Laura Beaudry
- Google Google Public Sector, Professional Services Reston, VA, USA
| | - Dale Hawkins
- Google Google Public Sector, Professional Services Reston, VA, USA
| | - Allen Kim
- Google Google Public Sector, Professional Services Reston, VA, USA
| | - Sahil Sethi
- University of Nebraska Medical Center, Omaha, NE 68105 USA
| | - Paul L Sorgen
- University of Nebraska Medical Center, Omaha, NE 68105 USA
| | | |
Collapse
|
80
|
Xu X, Lin Y, Yin L, Serpa PDS, Conacher B, Pacholac C, Carvallo F, Hrubec T, Farris S, Zimmerman K, Wang X, Xie H. Spatial Transcriptomics and Single-Nucleus Multi-omics Analysis Revealing the Impact of High Maternal Folic Acid Supplementation on Offspring Brain Development. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.12.603269. [PMID: 39071367 PMCID: PMC11275885 DOI: 10.1101/2024.07.12.603269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Folate, an essential vitamin B9, is crucial for diverse biological processes including neurogenesis. Folic acid (FA) supplementation during pregnancy is a standard practice for preventing neural tube defects (NTDs). However, concerns are growing over the potential risks of excessive maternal FA intake. Here, we employed mouse model and spatial transcriptomics and single-nucleus multi-omics approaches to investigate the impact of high maternal FA supplementation during the periconceptional period on offspring brain development. Maternal high FA supplementation affected gene pathways linked to neurogenesis and neuronal axon myelination across multiple brain regions, as well as gene expression alterations related to learning and memory in thalamic and ventricular regions. Single-nucleus multi-omics analysis revealed that maturing excitatory neurons in the dentate gyrus (DG) are particularly vulnerable to high maternal FA intake, leading to aberrant gene expressions and chromatin accessibility in pathways governing ribosomal biogenesis critical for synaptic formation. Our findings provide new insights into specific brain regions, cell types, gene expressions and pathways that can be affected by maternal high FA supplementation.
Collapse
|
81
|
Soroczynski J, Anderson LJ, Yeung JL, Rendleman JM, Oren DA, Konishi HA, Risca VI. OpenTn5: Open-Source Resource for Robust and Scalable Tn5 Transposase Purification and Characterization. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.11.602973. [PMID: 39026714 PMCID: PMC11257509 DOI: 10.1101/2024.07.11.602973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Tagmentation combines DNA fragmentation and sequencing adapter addition by leveraging the transposition activity of the bacterial cut-and-paste Tn5 transposase, to enable efficient sequencing library preparation. Here we present an open-source protocol for the generation of multi-purpose hyperactive Tn5 transposase, including its benchmarking in CUT&Tag, bulk and single-cell ATAC-seq. The OpenTn5 protocol yields multi-milligram quantities of pG-Tn5E54K, L372P protein per liter of E. coli culture, sufficient for thousands of tagmentation reactions and the enzyme retains activity in storage for more than a year.
Collapse
Affiliation(s)
- Jan Soroczynski
- Laboratory of Genome Architecture and Dynamics, The Rockefeller University, New York, NY
| | - Lauren J. Anderson
- Laboratory of Genome Architecture and Dynamics, The Rockefeller University, New York, NY
| | - Joanna L. Yeung
- Laboratory of Genome Architecture and Dynamics, The Rockefeller University, New York, NY
| | - Justin M. Rendleman
- Laboratory of Genome Architecture and Dynamics, The Rockefeller University, New York, NY
| | - Deena A. Oren
- Structural Biology Resource Center, The Rockefeller University, New York, NY
| | - Hide A. Konishi
- Laboratory of Chromosome and Cell Biology, The Rockefeller University, New York, NY
| | - Viviana I. Risca
- Laboratory of Genome Architecture and Dynamics, The Rockefeller University, New York, NY
| |
Collapse
|
82
|
Abraham E, Volmert B, Roule T, Huang L, Yu J, Williams AE, Cohen HM, Douglas A, Megill E, Morris A, Stronati E, Fueyo R, Zubillaga M, Elrod JW, Akizu N, Aguirre A, Estaras C. A Retinoic Acid:YAP1 signaling axis controls atrial lineage commitment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.11.602981. [PMID: 39026825 PMCID: PMC11257518 DOI: 10.1101/2024.07.11.602981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Vitamin A/Retinoic Acid (Vit A/RA) signaling is essential for heart development. In cardiac progenitor cells (CPCs), RA signaling induces the expression of atrial lineage genes while repressing ventricular genes, thereby promoting the acquisition of an atrial cardiomyocyte cell fate. To achieve this, RA coordinates a complex regulatory network of downstream effectors that is not fully identified. To address this gap, we applied a functional genomics approach (i.e scRNAseq and snATACseq) to untreated and RA-treated human embryonic stem cells (hESCs)-derived CPCs. Unbiased analysis revealed that the Hippo effectors YAP1 and TEAD4 are integrated with the atrial transcription factor enhancer network, and that YAP1 is necessary for activation of RA-enhancers in CPCs. Furthermore, in vivo analysis of control and conditionally YAP1 KO mouse embryos (Sox2-cre) revealed that the expression of atrial lineage genes, such as NR2F2, is compromised by YAP1 deletion in the CPCs of the second heart field. Accordingly, we found that YAP1 is required for the formation of an atrial chamber but is dispensable for the formation of a ventricle, in hESC-derived patterned cardiac organoids. Overall, our findings revealed that YAP1 is a non-canonical effector of RA signaling essential for the acquisition of atrial lineages during cardiogenesis.
Collapse
Affiliation(s)
- Elizabeth Abraham
- Department of Cardiovascular Sciences, Aging + Cardiovascular Discovery Center, Temple University, Lewis Katz School of Medicine, Philadelphia, PA, USA
| | - Brett Volmert
- Institute for Quantitative Health Science and Engineering, Division of Developmental and Stem Cell Biology, Michigan State University, East Lansing, MI, USA
| | - Thomas Roule
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Ling Huang
- Integrative Genomics and Bioinformatics Core, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Jingting Yu
- Integrative Genomics and Bioinformatics Core, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - April E Williams
- Integrative Genomics and Bioinformatics Core, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Henry M Cohen
- Department of Cardiovascular Sciences, Aging + Cardiovascular Discovery Center, Temple University, Lewis Katz School of Medicine, Philadelphia, PA, USA
| | - Aidan Douglas
- Department of Cardiovascular Sciences, Aging + Cardiovascular Discovery Center, Temple University, Lewis Katz School of Medicine, Philadelphia, PA, USA
| | - Emily Megill
- Department of Cardiovascular Sciences, Aging + Cardiovascular Discovery Center, Temple University, Lewis Katz School of Medicine, Philadelphia, PA, USA
| | - Alex Morris
- Department of Cardiovascular Sciences, Aging + Cardiovascular Discovery Center, Temple University, Lewis Katz School of Medicine, Philadelphia, PA, USA
| | - Eleonora Stronati
- Department of Child and Adolescence Psychiatry, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Raquel Fueyo
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Mikel Zubillaga
- Department of Cardiovascular Sciences, Aging + Cardiovascular Discovery Center, Temple University, Lewis Katz School of Medicine, Philadelphia, PA, USA
| | - John W Elrod
- Department of Cardiovascular Sciences, Aging + Cardiovascular Discovery Center, Temple University, Lewis Katz School of Medicine, Philadelphia, PA, USA
| | - Naiara Akizu
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Aitor Aguirre
- Institute for Quantitative Health Science and Engineering, Division of Developmental and Stem Cell Biology, Michigan State University, East Lansing, MI, USA
| | - Conchi Estaras
- Department of Cardiovascular Sciences, Aging + Cardiovascular Discovery Center, Temple University, Lewis Katz School of Medicine, Philadelphia, PA, USA
| |
Collapse
|
83
|
Liao L, Martin PCN, Kim H, Panahandeh S, Won KJ. Data enhancement in the age of spatial biology. Adv Cancer Res 2024; 163:39-70. [PMID: 39271267 DOI: 10.1016/bs.acr.2024.06.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2024]
Abstract
Unveiling the intricate interplay of cells in their native environment lies at the heart of understanding fundamental biological processes and unraveling disease mechanisms, particularly in complex diseases like cancer. Spatial transcriptomics (ST) offers a revolutionary lens into the spatial organization of gene expression within tissues, empowering researchers to study both cell heterogeneity and microenvironments in health and disease. However, current ST technologies often face limitations in either resolution or the number of genes profiled simultaneously. Integrating ST data with complementary sources, such as single-cell transcriptomics and detailed tissue staining images, presents a powerful solution to overcome these limitations. This review delves into the computational approaches driving the integration of spatial transcriptomics with other data types. By illuminating the key challenges and outlining the current algorithmic solutions, we aim to highlight the immense potential of these methods to revolutionize our understanding of cancer biology.
Collapse
Affiliation(s)
- Linbu Liao
- Biotech Research and Innovation Centre (BRIC), University of Copenhagen, Denmark; Samuel Oschin Cancer Center, Cedars-Sinai Medical Center, Los Angeles, CA, United States
| | - Patrick C N Martin
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, United States
| | - Hyobin Kim
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, United States
| | - Sanaz Panahandeh
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, United States
| | - Kyoung Jae Won
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, United States.
| |
Collapse
|
84
|
Curion F, Rich-Griffin C, Agarwal D, Ouologuem S, Rue-Albrecht K, May L, Garcia GEL, Heumos L, Thomas T, Lason W, Sims D, Theis FJ, Dendrou CA. Panpipes: a pipeline for multiomic single-cell and spatial transcriptomic data analysis. Genome Biol 2024; 25:181. [PMID: 38978088 PMCID: PMC11229213 DOI: 10.1186/s13059-024-03322-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 06/25/2024] [Indexed: 07/10/2024] Open
Abstract
Single-cell multiomic analysis of the epigenome, transcriptome, and proteome allows for comprehensive characterization of the molecular circuitry that underpins cell identity and state. However, the holistic interpretation of such datasets presents a challenge given a paucity of approaches for systematic, joint evaluation of different modalities. Here, we present Panpipes, a set of computational workflows designed to automate multimodal single-cell and spatial transcriptomic analyses by incorporating widely-used Python-based tools to perform quality control, preprocessing, integration, clustering, and reference mapping at scale. Panpipes allows reliable and customizable analysis and evaluation of individual and integrated modalities, thereby empowering decision-making before downstream investigations.
Collapse
Affiliation(s)
- Fabiola Curion
- Department of Computational Health, Institute of Computational Biology, Helmholtz Munich, Germany
- Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
| | - Charlotte Rich-Griffin
- Nuffield Department of Medicine, Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Devika Agarwal
- Nuffield Department of Medicine, Centre for Human Genetics, University of Oxford, Oxford, UK
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK
| | - Sarah Ouologuem
- Department of Computational Health, Institute of Computational Biology, Helmholtz Munich, Germany
| | - Kevin Rue-Albrecht
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Lilly May
- Department of Computational Health, Institute of Computational Biology, Helmholtz Munich, Germany
| | - Giulia E L Garcia
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK
- Doctoral Training Centre, University of Oxford, Oxford, UK
| | - Lukas Heumos
- Department of Computational Health, Institute of Computational Biology, Helmholtz Munich, Germany
- Comprehensive Pneumology Center With the CPC-M bioArchive, Helmholtz Zentrum Munich, Member of the German Center for Lung Research (DZL), Munich, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
| | - Tom Thomas
- Nuffield Department of Medicine, Centre for Human Genetics, University of Oxford, Oxford, UK
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK
- Nuffield Department of Medicine, Translational Gastroenterology Unit, University of Oxford, Oxford, UK
| | - Wojciech Lason
- Nuffield Department of Medicine, Respiratory Medicine Unit, Experimental Medicine Division, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | - David Sims
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Fabian J Theis
- Department of Computational Health, Institute of Computational Biology, Helmholtz Munich, Germany.
- Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Munich, Germany.
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany.
| | - Calliope A Dendrou
- Nuffield Department of Medicine, Centre for Human Genetics, University of Oxford, Oxford, UK.
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK.
- NIHR Oxford Biomedical Research Centre, Oxford, UK.
| |
Collapse
|
85
|
Wang X, Lian Q, Dong H, Xu S, Su Y, Wu X. Benchmarking Algorithms for Gene Set Scoring of Single-cell ATAC-seq Data. GENOMICS, PROTEOMICS & BIOINFORMATICS 2024; 22:qzae014. [PMID: 39049508 PMCID: PMC11423854 DOI: 10.1093/gpbjnl/qzae014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Revised: 06/20/2023] [Accepted: 06/25/2023] [Indexed: 07/27/2024]
Abstract
Gene set scoring (GSS) has been routinely conducted for gene expression analysis of bulk or single-cell RNA sequencing (RNA-seq) data, which helps to decipher single-cell heterogeneity and cell type-specific variability by incorporating prior knowledge from functional gene sets. Single-cell assay for transposase accessible chromatin using sequencing (scATAC-seq) is a powerful technique for interrogating single-cell chromatin-based gene regulation, and genes or gene sets with dynamic regulatory potentials can be regarded as cell type-specific markers as if in single-cell RNA-seq (scRNA-seq). However, there are few GSS tools specifically designed for scATAC-seq, and the applicability and performance of RNA-seq GSS tools on scATAC-seq data remain to be investigated. Here, we systematically benchmarked ten GSS tools, including four bulk RNA-seq tools, five scRNA-seq tools, and one scATAC-seq method. First, using matched scATAC-seq and scRNA-seq datasets, we found that the performance of GSS tools on scATAC-seq data was comparable to that on scRNA-seq, suggesting their applicability to scATAC-seq. Then, the performance of different GSS tools was extensively evaluated using up to ten scATAC-seq datasets. Moreover, we evaluated the impact of gene activity conversion, dropout imputation, and gene set collections on the results of GSS. Results show that dropout imputation can significantly promote the performance of almost all GSS tools, while the impact of gene activity conversion methods or gene set collections on GSS performance is more dependent on GSS tools or datasets. Finally, we provided practical guidelines for choosing appropriate preprocessing methods and GSS tools in different application scenarios.
Collapse
Affiliation(s)
- Xi Wang
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China
- Department of Automation, Xiamen University, Xiamen 361005, China
| | - Qiwei Lian
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China
- Department of Automation, Xiamen University, Xiamen 361005, China
| | - Haoyu Dong
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China
| | - Shuo Xu
- Department of Automation, Xiamen University, Xiamen 361005, China
| | - Yaru Su
- College of Mathematics and Computer Science, Fuzhou University, Fuzhou 350116, China
| | - Xiaohui Wu
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China
| |
Collapse
|
86
|
Rhaman MS, Ali M, Ye W, Li B. Opportunities and Challenges in Advancing Plant Research with Single-cell Omics. GENOMICS, PROTEOMICS & BIOINFORMATICS 2024; 22:qzae026. [PMID: 38996445 PMCID: PMC11423859 DOI: 10.1093/gpbjnl/qzae026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 01/12/2024] [Accepted: 01/15/2024] [Indexed: 07/14/2024]
Abstract
Plants possess diverse cell types and intricate regulatory mechanisms to adapt to the ever-changing environment of nature. Various strategies have been employed to study cell types and their developmental progressions, including single-cell sequencing methods which provide high-dimensional catalogs to address biological concerns. In recent years, single-cell sequencing technologies in transcriptomics, epigenomics, proteomics, metabolomics, and spatial transcriptomics have been increasingly used in plant science to reveal intricate biological relationships at the single-cell level. However, the application of single-cell technologies to plants is more limited due to the challenges posed by cell structure. This review outlines the advancements in single-cell omics technologies, their implications in plant systems, future research applications, and the challenges of single-cell omics in plant systems.
Collapse
Affiliation(s)
- Mohammad Saidur Rhaman
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| | - Muhammad Ali
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| | - Wenxiu Ye
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| | - Bosheng Li
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences in Weifang, Weifang 261325, China
| |
Collapse
|
87
|
Sun K, Liu X, Xu R, Liu C, Meng A, Lan X. Mapping the chromatin accessibility landscape of zebrafish embryogenesis at single-cell resolution by SPATAC-seq. Nat Cell Biol 2024; 26:1187-1199. [PMID: 38977847 DOI: 10.1038/s41556-024-01449-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 05/30/2024] [Indexed: 07/10/2024]
Abstract
Currently, the dynamic accessible elements that determine regulatory programs responsible for the unique identity and function of each cell type during zebrafish embryogenesis lack detailed study. Here we present SPATAC-seq: a split-pool ligation-based assay for transposase-accessible chromatin using sequencing. Using SPATAC-seq, we profiled chromatin accessibility in more than 800,000 individual nuclei across 20 developmental stages spanning the sphere stage to the early larval protruding mouth stage. Using this chromatin accessibility map, we identified 604 cell states and inferred their developmental relationships. We also identified 959,040 candidate cis-regulatory elements (cCREs) and delineated development-specific cCREs, as well as transcription factors defining diverse cell identities. Importantly, enhancer reporter assays confirmed that the majority of tested cCREs exhibited robust enhanced green fluorescent protein expression in restricted cell types or tissues. Finally, we explored gene regulatory programs that drive pigment and notochord cell differentiation. Our work provides a valuable open resource for exploring driver regulators of cell fate decisions in zebrafish embryogenesis.
Collapse
Affiliation(s)
- Keyong Sun
- School of Medicine, Tsinghua University, Beijing, China
- Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, Tsinghua University, Beijing, China
| | - Xin Liu
- School of Life Sciences, Tsinghua University, Beijing, China
- Tsinghua University-Peking University Center for Life Sciences, Beijing, China
| | - Runda Xu
- School of Medicine, Tsinghua University, Beijing, China
- Tsinghua University-Peking University Center for Life Sciences, Beijing, China
| | - Chang Liu
- School of Medicine, Tsinghua University, Beijing, China
| | - Anming Meng
- School of Life Sciences, Tsinghua University, Beijing, China.
- Tsinghua University-Peking University Center for Life Sciences, Beijing, China.
| | - Xun Lan
- School of Medicine, Tsinghua University, Beijing, China.
- Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, Tsinghua University, Beijing, China.
- Tsinghua University-Peking University Center for Life Sciences, Beijing, China.
- Ministry of Education Key Laboratory of Bioinformatics, Tsinghua University, Beijing, China.
| |
Collapse
|
88
|
Chang X, Zheng Y, Xu K. Single-Cell RNA Sequencing: Technological Progress and Biomedical Application in Cancer Research. Mol Biotechnol 2024; 66:1497-1519. [PMID: 37322261 PMCID: PMC11217094 DOI: 10.1007/s12033-023-00777-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Accepted: 05/23/2023] [Indexed: 06/17/2023]
Abstract
Single-cell RNA-seq (scRNA-seq) is a revolutionary technology that allows for the genomic investigation of individual cells in a population, allowing for the discovery of unusual cells associated with cancer and metastasis. ScRNA-seq has been used to discover different types of cancers with poor prognosis and medication resistance such as lung cancer, breast cancer, ovarian cancer, and gastric cancer. Besides, scRNA-seq is a promising method that helps us comprehend the biological features and dynamics of cell development, as well as other disorders. This review gives a concise summary of current scRNA-seq technology. We also explain the main technological steps involved in implementing the technology. We highlight the present applications of scRNA-seq in cancer research, including tumor heterogeneity analysis in lung cancer, breast cancer, and ovarian cancer. In addition, this review elucidates potential applications of scRNA-seq in lineage tracing, personalized medicine, illness prediction, and disease diagnosis, which reveals that scRNA-seq facilitates these events by producing genetic variations on the single-cell level.
Collapse
Affiliation(s)
- Xu Chang
- Department of Otolaryngology, Head and Neck Surgery, The Second Affiliated Hospital of Nanchang University, Nanchang University, Nanchang, 330006, Jiangxi, People's Republic of China
| | - Yunxi Zheng
- Department of Otolaryngology, Head and Neck Surgery, The Second Affiliated Hospital of Nanchang University, Nanchang University, Nanchang, 330006, Jiangxi, People's Republic of China
| | - Kai Xu
- Department of Otolaryngology, Head and Neck Surgery, The Second Affiliated Hospital of Nanchang University, Nanchang University, Nanchang, 330006, Jiangxi, People's Republic of China.
| |
Collapse
|
89
|
Frenkel M, Raman S. Discovering mechanisms of human genetic variation and controlling cell states at scale. Trends Genet 2024; 40:587-600. [PMID: 38658256 PMCID: PMC11607914 DOI: 10.1016/j.tig.2024.03.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2024] [Revised: 03/29/2024] [Accepted: 03/29/2024] [Indexed: 04/26/2024]
Abstract
Population-scale sequencing efforts have catalogued substantial genetic variation in humans such that variant discovery dramatically outpaces interpretation. We discuss how single-cell sequencing is poised to reveal genetic mechanisms at a rate that may soon approach that of variant discovery. The functional genomics toolkit is sufficiently modular to systematically profile almost any type of variation within increasingly diverse contexts and with molecularly comprehensive and unbiased readouts. As a result, we can construct deep phenotypic atlases of variant effects that span the entire regulatory cascade. The same conceptual approach to interpreting genetic variation should be applied to engineering therapeutic cell states. In this way, variant mechanism discovery and cell state engineering will become reciprocating and iterative processes towards genomic medicine.
Collapse
Affiliation(s)
- Max Frenkel
- Cellular and Molecular Biology Graduate Program, University of Wisconsin, Madison, WI, USA; Medical Scientist Training Program, University of Wisconsin School of Medicine and Public Health, Madison, WI, USA; Department of Biochemistry, University of Wisconsin, Madison, WI, USA.
| | - Srivatsan Raman
- Department of Biochemistry, University of Wisconsin, Madison, WI, USA; Department of Bacteriology, University of Wisconsin, Madison, WI, USA; Department of Chemical and Biological Engineering, University of Wisconsin, Madison, WI, USA.
| |
Collapse
|
90
|
Regner MJ, Garcia-Recio S, Thennavan A, Wisniewska K, Mendez-Giraldez R, Felsheim B, Spanheimer PM, Parker JS, Perou CM, Franco HL. Defining the Regulatory Logic of Breast Cancer Using Single-Cell Epigenetic and Transcriptome Profiling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.13.598858. [PMID: 38948758 PMCID: PMC11212881 DOI: 10.1101/2024.06.13.598858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
Annotation of the cis-regulatory elements that drive transcriptional dysregulation in cancer cells is critical to improving our understanding of tumor biology. Herein, we present a compendium of matched chromatin accessibility (scATAC-seq) and transcriptome (scRNA-seq) profiles at single-cell resolution from human breast tumors and healthy mammary tissues processed immediately following surgical resection. We identify the most likely cell-of-origin for luminal breast tumors and basal breast tumors and then introduce a novel methodology that implements linear mixed-effects models to systematically quantify associations between regions of chromatin accessibility (i.e. regulatory elements) and gene expression in malignant cells versus normal mammary epithelial cells. These data unveil regulatory elements with that switch from silencers of gene expression in normal cells to enhancers of gene expression in cancer cells, leading to the upregulation of clinically relevant oncogenes. To translate the utility of this dataset into tractable models, we generated matched scATAC-seq and scRNA-seq profiles for breast cancer cell lines, revealing, for each subtype, a conserved oncogenic gene expression program between in vitro and in vivo cells. Together, this work highlights the importance of non-coding regulatory mechanisms that underlie oncogenic processes and the ability of single-cell multi-omics to define the regulatory logic of BC cells at single-cell resolution.
Collapse
Affiliation(s)
- Matthew J. Regner
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Susana Garcia-Recio
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Aatish Thennavan
- Department of Systems Biology, UT MD Anderson Cancer Center, Houston, TX, USA, 77030
| | - Kamila Wisniewska
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Raul Mendez-Giraldez
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Brooke Felsheim
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Philip M. Spanheimer
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Department of Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Joel S. Parker
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Charles M. Perou
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Department of Pathology and Laboratory Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Hector L. Franco
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Department of Genetics, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
- Division of Clinical and Translational Cancer Research, University of Puerto Rico Comprehensive Cancer Center, San Juan, PR 00935
| |
Collapse
|
91
|
Ma Y, Guo S, Chen Y, Peng Y, Su X, Jiang H, Lin X, Zhang J. Single-nucleus chromatin landscape dataset of mouse brain development and aging. Sci Data 2024; 11:616. [PMID: 38866804 PMCID: PMC11169343 DOI: 10.1038/s41597-024-03382-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Accepted: 05/15/2024] [Indexed: 06/14/2024] Open
Abstract
The development and aging of the brain constitute a lifelong dynamic process, marked by structural and functional changes that entail highly coordinated cellular differentiation and epigenetic regulatory mechanisms. Chromatin accessibility serves as the foundational basis for genetic activity. However, the holistic and dynamic chromatin landscape that spans various brain regions throughout development and ageing remains predominantly unexplored. In this study, we employed single-nucleus ATAC-seq to generate comprehensive chromatin accessibility maps, incorporating data from 69,178 cells obtained from four distinct brain regions - namely, the olfactory bulb (OB), cerebellum (CB), prefrontal cortex (PFC), and hippocampus (HP) - across key developmental time points at 7 P, 3 M, 12 M, and 18 M. We delineated the distribution of cell types across different age stages and brain regions, providing insight into chromatin accessible regions and key transcription factors specific to different cell types. Our data contribute to understanding the epigenetic basis of the formation of different brain regions, providing a dynamic landscape and comprehensive resource for revealing gene regulatory programs during brain development and aging.
Collapse
Affiliation(s)
- Yuting Ma
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
- Hebei Industrial Technology Research Institute of Genomics in Maternal & Child Health, Clin Lab, BGI Genomics, Shijiazhuang, 050035, China
- BGI Genomics, Shenzhen, 518083, China
| | - Sicheng Guo
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
- Hebei Industrial Technology Research Institute of Genomics in Maternal & Child Health, Clin Lab, BGI Genomics, Shijiazhuang, 050035, China
- BGI Genomics, Shenzhen, 518083, China
| | - Yixi Chen
- BGI Research, Shenzhen, 518083, China
- School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
| | | | - Xi Su
- Hebei Industrial Technology Research Institute of Genomics in Maternal & Child Health, Clin Lab, BGI Genomics, Shijiazhuang, 050035, China
- BGI Genomics, Shenzhen, 518083, China
| | - Hui Jiang
- BGI Genomics, Shenzhen, 518083, China
| | - Xiumei Lin
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
- BGI Research, Shenzhen, 518083, China.
| | - Jianguo Zhang
- Hebei Industrial Technology Research Institute of Genomics in Maternal & Child Health, Clin Lab, BGI Genomics, Shijiazhuang, 050035, China.
- BGI Genomics, Shenzhen, 518083, China.
- BGI Research, Shenzhen, 518083, China.
- School of Public Health, Hebei Medical University, Shijiazhuang, 050017, China.
| |
Collapse
|
92
|
Lalanne JB, Regalado SG, Domcke S, Calderon D, Martin BK, Li X, Li T, Suiter CC, Lee C, Trapnell C, Shendure J. Multiplex profiling of developmental cis-regulatory elements with quantitative single-cell expression reporters. Nat Methods 2024; 21:983-993. [PMID: 38724692 PMCID: PMC11166576 DOI: 10.1038/s41592-024-02260-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 03/22/2024] [Indexed: 06/13/2024]
Abstract
The inability to scalably and precisely measure the activity of developmental cis-regulatory elements (CREs) in multicellular systems is a bottleneck in genomics. Here we develop a dual RNA cassette that decouples the detection and quantification tasks inherent to multiplex single-cell reporter assays. The resulting measurement of reporter expression is accurate over multiple orders of magnitude, with a precision approaching the limit set by Poisson counting noise. Together with RNA barcode stabilization via circularization, these scalable single-cell quantitative expression reporters provide high-contrast readouts, analogous to classic in situ assays but entirely from sequencing. Screening >200 regions of accessible chromatin in a multicellular in vitro model of early mammalian development, we identify 13 (8 previously uncharacterized) autonomous and cell-type-specific developmental CREs. We further demonstrate that chimeric CRE pairs generate cognate two-cell-type activity profiles and assess gain- and loss-of-function multicellular expression phenotypes from CRE variants with perturbed transcription factor binding sites. Single-cell quantitative expression reporters can be applied in developmental and multicellular systems to quantitatively characterize native, perturbed and synthetic CREs at scale, with high sensitivity and at single-cell resolution.
Collapse
Affiliation(s)
| | - Samuel G Regalado
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Silvia Domcke
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Diego Calderon
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Beth K Martin
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Xiaoyi Li
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Tony Li
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Chase C Suiter
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA
| | - Choli Lee
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Cole Trapnell
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
- Allen Discovery Center for Cell Lineage Tracing, Seattle, WA, USA
| | - Jay Shendure
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA.
- Allen Discovery Center for Cell Lineage Tracing, Seattle, WA, USA.
- Howard Hughes Medical Institute, Seattle, WA, USA.
| |
Collapse
|
93
|
Nanda AS, Wu K, Irkliyenko I, Woo B, Ostrowski MS, Clugston AS, Sayles LC, Xu L, Satpathy AT, Nguyen HG, Alejandro Sweet-Cordero E, Goodarzi H, Kasinathan S, Ramani V. Direct transposition of native DNA for sensitive multimodal single-molecule sequencing. Nat Genet 2024; 56:1300-1309. [PMID: 38724748 PMCID: PMC11176058 DOI: 10.1038/s41588-024-01748-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 04/08/2024] [Indexed: 05/23/2024]
Abstract
Concurrent readout of sequence and base modifications from long unamplified DNA templates by Pacific Biosciences of California (PacBio) single-molecule sequencing requires large amounts of input material. Here we adapt Tn5 transposition to introduce hairpin oligonucleotides and fragment (tagment) limiting quantities of DNA for generating PacBio-compatible circular molecules. We developed two methods that implement tagmentation and use 90-99% less input than current protocols: (1) single-molecule real-time sequencing by tagmentation (SMRT-Tag), which allows detection of genetic variation and CpG methylation; and (2) single-molecule adenine-methylated oligonucleosome sequencing assay by tagmentation (SAMOSA-Tag), which uses exogenous adenine methylation to add a third channel for probing chromatin accessibility. SMRT-Tag of 40 ng or more human DNA (approximately 7,000 cell equivalents) yielded data comparable to gold standard whole-genome and bisulfite sequencing. SAMOSA-Tag of 30,000-50,000 nuclei resolved single-fiber chromatin structure, CTCF binding and DNA methylation in patient-derived prostate cancer xenografts and uncovered metastasis-associated global epigenome disorganization. Tagmentation thus promises to enable sensitive, scalable and multimodal single-molecule genomics for diverse basic and clinical applications.
Collapse
Affiliation(s)
- Arjun S Nanda
- Gladstone Institute for Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, CA, USA
| | - Ke Wu
- Gladstone Institute for Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA
| | - Iryna Irkliyenko
- Gladstone Institute for Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA
| | - Brian Woo
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, CA, USA
- Helen-Diller Cancer Center, San Francisco, CA, USA
| | - Megan S Ostrowski
- Gladstone Institute for Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA
| | - Andrew S Clugston
- Helen-Diller Cancer Center, San Francisco, CA, USA
- Department of Pediatrics, University of California, San Francisco, San Francisco, CA, USA
| | - Leanne C Sayles
- Helen-Diller Cancer Center, San Francisco, CA, USA
- Department of Pediatrics, University of California, San Francisco, San Francisco, CA, USA
| | - Lingru Xu
- Helen-Diller Cancer Center, San Francisco, CA, USA
| | - Ansuman T Satpathy
- Department of Pathology, Stanford University, Stanford, CA, USA
- Parker Institute for Cancer Immunotherapy, San Francisco, CA, USA
- Gladstone-University of California, San Francisco Institute for Genomic Immunology, Gladstone Institutes, San Francisco, CA, USA
| | - Hao G Nguyen
- Helen-Diller Cancer Center, San Francisco, CA, USA
| | - E Alejandro Sweet-Cordero
- Helen-Diller Cancer Center, San Francisco, CA, USA
- Department of Pediatrics, University of California, San Francisco, San Francisco, CA, USA
| | - Hani Goodarzi
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, CA, USA
- Helen-Diller Cancer Center, San Francisco, CA, USA
- Parker Institute for Cancer Immunotherapy, San Francisco, CA, USA
- Bakar Computational Health Sciences Institute, San Francisco, CA, USA
| | - Sivakanthan Kasinathan
- Gladstone-University of California, San Francisco Institute for Genomic Immunology, Gladstone Institutes, San Francisco, CA, USA.
- Division of Rheumatology, Department of Pediatrics, Stanford University, Stanford, CA, USA.
| | - Vijay Ramani
- Gladstone Institute for Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA.
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, CA, USA.
- Helen-Diller Cancer Center, San Francisco, CA, USA.
- Bakar Computational Health Sciences Institute, San Francisco, CA, USA.
| |
Collapse
|
94
|
Gupta P, O’Neill H, Wolvetang E, Chatterjee A, Gupta I. Advances in single-cell long-read sequencing technologies. NAR Genom Bioinform 2024; 6:lqae047. [PMID: 38774511 PMCID: PMC11106032 DOI: 10.1093/nargab/lqae047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 04/18/2024] [Accepted: 04/29/2024] [Indexed: 05/24/2024] Open
Abstract
With an increase in accuracy and throughput of long-read sequencing technologies, they are rapidly being assimilated into the single-cell sequencing pipelines. For transcriptome sequencing, these techniques provide RNA isoform-level information in addition to the gene expression profiles. Long-read sequencing technologies not only help in uncovering complex patterns of cell-type specific splicing, but also offer unprecedented insights into the origin of cellular complexity and thus potentially new avenues for drug development. Additionally, single-cell long-read DNA sequencing enables high-quality assemblies, structural variant detection, haplotype phasing, resolving high-complexity regions, and characterization of epigenetic modifications. Given that significant progress has primarily occurred in single-cell RNA isoform sequencing (scRiso-seq), this review will delve into these advancements in depth and highlight the practical considerations and operational challenges, particularly pertaining to downstream analysis. We also aim to offer a concise introduction to complementary technologies for single-cell sequencing of the genome, epigenome and epitranscriptome. We conclude by identifying certain key areas of innovation that may drive these technologies further and foster more widespread application in biomedical science.
Collapse
Affiliation(s)
- Pallavi Gupta
- University of Queensland – IIT Delhi Research Academy, Hauz Khas, New Delhi 110016, India
- Australian Institute of Bioengineering and Nanotechnology (AIBN), The University of Queensland, St Lucia, QLD 4072, Australia
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi 110016, India
| | - Hannah O’Neill
- Department of Pathology, Dunedin School of Medicine, University of Otago, 58 Hanover Street, Dunedin 9054, New Zealand
| | - Ernst J Wolvetang
- Australian Institute of Bioengineering and Nanotechnology (AIBN), The University of Queensland, St Lucia, QLD 4072, Australia
| | - Aniruddha Chatterjee
- Department of Pathology, Dunedin School of Medicine, University of Otago, 58 Hanover Street, Dunedin 9054, New Zealand
| | - Ishaan Gupta
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi 110016, India
| |
Collapse
|
95
|
Aalam SMM, Nguyen LV, Ritting ML, Kannan N. Clonal tracking in cancer and metastasis. Cancer Metastasis Rev 2024; 43:639-656. [PMID: 37910295 PMCID: PMC11500829 DOI: 10.1007/s10555-023-10149-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Accepted: 10/16/2023] [Indexed: 11/03/2023]
Abstract
The eradication of many cancers has proven challenging due to the presence of functionally and genetically heterogeneous clones maintained by rare cancer stem cells (CSCs), which contribute to disease progression, treatment refractoriness, and late relapse. The characterization of functional CSC activity has necessitated the development of modern clonal tracking strategies. This review describes viral-based and CRISPR-Cas9-based cellular barcoding, lineage tracing, and imaging-based approaches. DNA-based cellular barcoding technology is emerging as a powerful and robust strategy that has been widely applied to in vitro and in vivo model systems, including patient-derived xenograft models. This review also highlights the potential of these methods for use in the clinical and drug discovery contexts and discusses the important insights gained from such approaches.
Collapse
Affiliation(s)
| | - Long Viet Nguyen
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
- Department of Medicine, University of Toronto, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Megan L Ritting
- Department of Laboratory Medicine and Pathology, Mayo Clinic, 200 First St SW, Rochester, MN, 55905, USA
| | - Nagarajan Kannan
- Department of Laboratory Medicine and Pathology, Mayo Clinic, 200 First St SW, Rochester, MN, 55905, USA.
- Mayo Clinic Comprehensive Cancer Center, Mayo Clinic, Rochester, MN, USA.
- Center for Regenerative Biotherapeutics, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|
96
|
Román ÁC, Benítez DA, Díaz-Pizarro A, Del Valle-Del Pino N, Olivera-Gómez M, Cumplido-Laso G, Carvajal-González JM, Mulero-Navarro S. Next generation sequencing technologies to address aberrant mRNA translation in cancer. NAR Cancer 2024; 6:zcae024. [PMID: 38751936 PMCID: PMC11094761 DOI: 10.1093/narcan/zcae024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 04/30/2024] [Accepted: 05/06/2024] [Indexed: 05/18/2024] Open
Abstract
In this review, we explore the transformative impact of next generation sequencing technologies in the realm of translatomics (the study of how translational machinery acts on a genome-wide scale). Despite the expectation of a direct correlation between mRNA and protein content, the complex regulatory mechanisms that affect this relationship remark the limitations of standard RNA-seq approaches. Then, the review characterizes crucial techniques such as polysome profiling, ribo-seq, trap-seq, proximity-specific ribosome profiling, rnc-seq, tcp-seq, qti-seq and scRibo-seq. All these methods are summarized within the context of cancer research, shedding light on their applications in deciphering aberrant translation in cancer cells. In addition, we encompass databases and bioinformatic tools essential for researchers that want to address translatome analysis in the context of cancer biology.
Collapse
Affiliation(s)
- Ángel-Carlos Román
- Departamento de Bioquímica y Biología Molecular y Genética, Universidad de Extremadura. Avda. de Elvas s/n, 06071 Badajoz, Spain
| | - Dixan A Benítez
- Departamento de Bioquímica y Biología Molecular y Genética, Universidad de Extremadura. Avda. de Elvas s/n, 06071 Badajoz, Spain
| | - Alba Díaz-Pizarro
- Departamento de Bioquímica y Biología Molecular y Genética, Universidad de Extremadura. Avda. de Elvas s/n, 06071 Badajoz, Spain
| | - Nuria Del Valle-Del Pino
- Departamento de Bioquímica y Biología Molecular y Genética, Universidad de Extremadura. Avda. de Elvas s/n, 06071 Badajoz, Spain
| | - Marcos Olivera-Gómez
- Departamento de Bioquímica y Biología Molecular y Genética, Universidad de Extremadura. Avda. de Elvas s/n, 06071 Badajoz, Spain
| | - Guadalupe Cumplido-Laso
- Departamento de Bioquímica y Biología Molecular y Genética, Universidad de Extremadura. Avda. de Elvas s/n, 06071 Badajoz, Spain
| | - Jose M Carvajal-González
- Departamento de Bioquímica y Biología Molecular y Genética, Universidad de Extremadura. Avda. de Elvas s/n, 06071 Badajoz, Spain
| | - Sonia Mulero-Navarro
- Departamento de Bioquímica y Biología Molecular y Genética, Universidad de Extremadura. Avda. de Elvas s/n, 06071 Badajoz, Spain
| |
Collapse
|
97
|
Lyons A, Brown J, Davenport KM. Single-Cell Sequencing Technology in Ruminant Livestock: Challenges and Opportunities. Curr Issues Mol Biol 2024; 46:5291-5306. [PMID: 38920988 PMCID: PMC11202421 DOI: 10.3390/cimb46060316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 05/20/2024] [Accepted: 05/25/2024] [Indexed: 06/27/2024] Open
Abstract
Advancements in single-cell sequencing have transformed the genomics field by allowing researchers to delve into the intricate cellular heterogeneity within tissues at greater resolution. While single-cell omics are more widely applied in model organisms and humans, their use in livestock species is just beginning. Studies in cattle, sheep, and goats have already leveraged single-cell and single-nuclei RNA-seq as well as single-cell and single-nuclei ATAC-seq to delineate cellular diversity in tissues, track changes in cell populations and gene expression over developmental stages, and characterize immune cell populations important for disease resistance and resilience. Although challenges exist for the use of this technology in ruminant livestock, such as the precise annotation of unique cell populations and spatial resolution of cells within a tissue, there is vast potential to enhance our understanding of the cellular and molecular mechanisms underpinning traits essential for healthy and productive livestock. This review intends to highlight the insights gained from published single-cell omics studies in cattle, sheep, and goats, particularly those with publicly accessible data. Further, this manuscript will discuss the challenges and opportunities of this technology in ruminant livestock and how it may contribute to enhanced profitability and sustainability of animal agriculture in the future.
Collapse
|
98
|
Li L, Sun J, Fu Y, Changrob S, McGrath JJC, Wilson PC. A hybrid demultiplexing strategy that improves performance and robustness of cell hashing. Brief Bioinform 2024; 25:bbae254. [PMID: 38828640 PMCID: PMC11145454 DOI: 10.1093/bib/bbae254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 04/27/2024] [Accepted: 05/08/2024] [Indexed: 06/05/2024] Open
Abstract
Cell hashing, a nucleotide barcode-based method that allows users to pool multiple samples and demultiplex in downstream analysis, has gained widespread popularity in single-cell sequencing due to its compatibility, simplicity, and cost-effectiveness. Despite these advantages, the performance of this method remains unsatisfactory under certain circumstances, especially in experiments that have imbalanced sample sizes or use many hashtag antibodies. Here, we introduce a hybrid demultiplexing strategy that increases accuracy and cell recovery in multi-sample single-cell experiments. This approach correlates the results of cell hashing and genetic variant clustering, enabling precise and efficient cell identity determination without additional experimental costs or efforts. In addition, we developed HTOreader, a demultiplexing tool for cell hashing that improves the accuracy of cut-off calling by avoiding the dominance of negative signals in experiments with many hashtags or imbalanced sample sizes. When compared to existing methods using real-world datasets, this hybrid approach and HTOreader consistently generate reliable results with increased accuracy and cell recovery.
Collapse
Affiliation(s)
- Lei Li
- Gale and Ira Drukier Institute for Children’s Health, Weill Cornell Medicine, 413 E. 69th Street, New York, NY 10021, United States
- Center for Applied Bioinformatics, St. Jude Children’s Research Hospital, 262 Danny Thomas Place, Memphis, TN 38105, United States
| | - Jiayi Sun
- Gale and Ira Drukier Institute for Children’s Health, Weill Cornell Medicine, 413 E. 69th Street, New York, NY 10021, United States
| | - Yanbin Fu
- Gale and Ira Drukier Institute for Children’s Health, Weill Cornell Medicine, 413 E. 69th Street, New York, NY 10021, United States
| | - Siriruk Changrob
- Gale and Ira Drukier Institute for Children’s Health, Weill Cornell Medicine, 413 E. 69th Street, New York, NY 10021, United States
| | - Joshua J C McGrath
- Gale and Ira Drukier Institute for Children’s Health, Weill Cornell Medicine, 413 E. 69th Street, New York, NY 10021, United States
| | - Patrick C Wilson
- Gale and Ira Drukier Institute for Children’s Health, Weill Cornell Medicine, 413 E. 69th Street, New York, NY 10021, United States
| |
Collapse
|
99
|
Kotliar M, Kartashov A, Barski A. Accelerating Single-Cell Sequencing Data Analysis with SciDAP: A User-Friendly Approach. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.28.582604. [PMID: 38464095 PMCID: PMC10925325 DOI: 10.1101/2024.02.28.582604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Single-cell (sc) RNA, ATAC and Multiome sequencing became powerful tools for uncovering biological and disease mechanisms. Unfortunately, manual analysis of sc data presents multiple challenges due to large data volumes and complexity of configuration parameters. This complexity, as well as not being able to reproduce a computational environment, affects the reproducibility of analysis results. The Scientific Data Analysis Platform (https://SciDAP.com) allows biologists without computational expertise to analyze sequencing-based data using portable and reproducible pipelines written in Common Workflow Language (CWL). Our suite of computational pipelines addresses the most common needs in scRNA-Seq, scATAC-Seq and scMultiome data analysis. When executed on SciDAP, it offers a user-friendly alternative to manual data processing, eliminating the need for coding expertise. In this protocol, we describe the use of SciDAP to analyze scMultiome data. Similar approaches can be used for analysis of scRNA-Seq, scATAC-Seq and scVDJ-Seq datasets.
Collapse
Affiliation(s)
- Michael Kotliar
- Division of Allergy and Immunology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
| | | | - Artem Barski
- Division of Allergy and Immunology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- University of Cincinnati College of Medicine, Cincinnati, OH, 45267, USA
- Datirium, LLC, Cincinnati, OH, USA
| |
Collapse
|
100
|
Hong SC, Muyas F, Cortés-Ciriano I, Hormoz S. scAI-SNP: a method for inferring ancestry from single-cell data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.14.594208. [PMID: 38798590 PMCID: PMC11118306 DOI: 10.1101/2024.05.14.594208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Collaborative efforts, such as the Human Cell Atlas, are rapidly accumulating large amounts of single-cell data. To ensure that single-cell atlases are representative of human genetic diversity, we need to determine the ancestry of the donors from whom single-cell data are generated. Self-reporting of race and ethnicity, although important, can be biased and is not always available for the datasets already collected. Here, we introduce scAI-SNP, a tool to infer ancestry directly from single-cell genomics data. To train scAI-SNP, we identified 4.5 million ancestry-informative single-nucleotide polymorphisms (SNPs) in the 1000 Genomes Project dataset across 3201 individuals from 26 population groups. For a query single-cell data set, scAI-SNP uses these ancestry-informative SNPs to compute the contribution of each of the 26 population groups to the ancestry of the donor from whom the cells were obtained. Using diverse single-cell data sets with matched whole-genome sequencing data, we show that scAI-SNP is robust to the sparsity of single-cell data, can accurately and consistently infer ancestry from samples derived from diverse types of tissues and cancer cells, and can be applied to different modalities of single-cell profiling assays, such as single-cell RNA-seq and single-cell ATAC-seq. Finally, we argue that ensuring that single-cell atlases represent diverse ancestry, ideally alongside race and ethnicity, is ultimately important for improved and equitable health outcomes by accounting for human diversity.
Collapse
Affiliation(s)
- Sung Chul Hong
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA 02215, USA
| | - Francesc Muyas
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Isidro Cortés-Ciriano
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Sahand Hormoz
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA 02215, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| |
Collapse
|