1
|
Zhou H, Clark E, Guan D, Lagarrigue S, Fang L, Cheng H, Tuggle CK, Kapoor M, Wang Y, Giuffra E, Egidy G. Comparative Genomics and Epigenomics of Transcriptional Regulation. Annu Rev Anim Biosci 2025; 13:73-98. [PMID: 39565835 DOI: 10.1146/annurev-animal-111523-102217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2024]
Abstract
Transcriptional regulation in response to diverse physiological cues involves complicated biological processes. Recent initiatives that leverage whole genome sequencing and annotation of regulatory elements significantly contribute to our understanding of transcriptional gene regulation. Advances in the data sets available for comparative genomics and epigenomics can identify evolutionarily constrained regulatory variants and shed light on noncoding elements that influence transcription in different tissues and developmental stages across species. Most epigenomic data, however, are generated from healthy subjects at specific developmental stages. To bridge the genotype-phenotype gap, future research should focus on generating multidimensional epigenomic data under diverse physiological conditions. Farm animal species offer advantages in terms of feasibility, cost, and experimental design for such integrative analyses in comparison to humans. Deep learning modeling and cutting-edge technologies in sequencing and functional screening and validation also provide great promise for better understanding transcriptional regulation in this dynamic field.
Collapse
Affiliation(s)
- Huaijun Zhou
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | - Emily Clark
- The Roslin Institute, University of Edinburgh, Edinburgh, Midlothian, United Kingdom;
| | - Dailu Guan
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | | | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark;
| | - Hao Cheng
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | | | - Muskan Kapoor
- Department of Animal Science, Iowa State University, Ames, Iowa, USA; ,
| | - Ying Wang
- Department of Animal Science, University of California, Davis, California, USA; , , ,
| | | | - Giorgia Egidy
- GABI, AgroParisTech, INRAE, Jouy-en-Josas, France; ,
| |
Collapse
|
2
|
Madrigal A, Lu T, Soto LM, Najafabadi HS. A unified model for interpretable latent embedding of multi-sample, multi-condition single-cell data. Nat Commun 2024; 15:6573. [PMID: 39097589 PMCID: PMC11298001 DOI: 10.1038/s41467-024-50963-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2023] [Accepted: 07/23/2024] [Indexed: 08/05/2024] Open
Abstract
Single-cell analysis across multiple samples and conditions requires quantitative modeling of the interplay between the continuum of cell states and the technical and biological sources of sample-to-sample variability. We introduce GEDI, a generative model that identifies latent space variations in multi-sample, multi-condition single-cell datasets and attributes them to sample-level covariates. GEDI enables cross-sample cell state mapping on par with state-of-the-art integration methods, cluster-free differential gene expression analysis along the continuum of cell states, and machine learning-based prediction of sample characteristics from single-cell data. GEDI can also incorporate gene-level prior knowledge to infer pathway and regulatory network activities in single cells. Finally, GEDI extends all these concepts to previously unexplored modalities that require joint consideration of dual measurements, such as the joint analysis of exon inclusion/exclusion reads to model alternative cassette exon splicing, or spliced/unspliced reads to model the mRNA stability landscapes of single cells.
Collapse
Affiliation(s)
- Ariel Madrigal
- Department of Human Genetics, McGill University, Montreal, QC, H3A 0C7, Canada
- Victor P. Dahdaleh Institute of Genomic Medicine, Montreal, QC, H3A 0G1, Canada
| | - Tianyuan Lu
- Lady Davis Institute for Medical Research, Montreal, QC, H3T 1E2, Canada
- Department of Statistical Sciences, University of Toronto, Toronto, ON, M5S 1A1, Canada
- Department of Population Health Sciences, University of Wisconsin-Madison, Madison, WI, 53726, USA
| | - Larisa M Soto
- Department of Human Genetics, McGill University, Montreal, QC, H3A 0C7, Canada
- Victor P. Dahdaleh Institute of Genomic Medicine, Montreal, QC, H3A 0G1, Canada
| | - Hamed S Najafabadi
- Department of Human Genetics, McGill University, Montreal, QC, H3A 0C7, Canada.
- Victor P. Dahdaleh Institute of Genomic Medicine, Montreal, QC, H3A 0G1, Canada.
- McGill Centre for RNA Sciences, McGill University, Montreal, Canada.
| |
Collapse
|
3
|
Moakley DF, Campbell M, Anglada-Girotto M, Feng H, Califano A, Au E, Zhang C. Reverse engineering neuron type-specific and type-orthogonal splicing-regulatory networks using single-cell transcriptomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.13.597128. [PMID: 38915499 PMCID: PMC11195221 DOI: 10.1101/2024.06.13.597128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Cell type-specific alternative splicing (AS) enables differential gene isoform expression between diverse neuron types with distinct identities and functions. Current studies linking individual RNA-binding proteins (RBPs) to AS in a few neuron types underscore the need for holistic modeling. Here, we use network reverse engineering to derive a map of the neuron type-specific AS regulatory landscape from 133 mouse neocortical cell types defined by single-cell transcriptomes. This approach reliably inferred the regulons of 350 RBPs and their cell type-specific activities. Our analysis revealed driving factors delineating neuronal identities, among which we validated Elavl2 as a key RBP for MGE-specific splicing in GABAergic interneurons using an in vitro ESC differentiation system. We also identified a module of exons and candidate regulators specific for long- and short-projection neurons across multiple neuronal classes. This study provides a resource for elucidating splicing regulatory programs that drive neuronal molecular diversity, including those that do not align with gene expression-based classifications.
Collapse
Affiliation(s)
- Daniel F Moakley
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
- Center for Motor Neuron Biology and Disease, Columbia University, New York, NY 10032, USA
| | - Melissa Campbell
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
- Center for Motor Neuron Biology and Disease, Columbia University, New York, NY 10032, USA
- Present address: Department of Neurosciences, University of California, San Diego, USA
| | - Miquel Anglada-Girotto
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- Present address: Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Huijuan Feng
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
- Center for Motor Neuron Biology and Disease, Columbia University, New York, NY 10032, USA
- Present address: Department of Biostatistics and Computational Biology, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Andrea Califano
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
| | - Edmund Au
- Department of Pathology and Cell Biology, Columbia University, New York, NY 10032, USA
- Columbia Translational Neuroscience Initiative Scholar, New York, NY 10032, USA
| | - Chaolin Zhang
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
- Center for Motor Neuron Biology and Disease, Columbia University, New York, NY 10032, USA
| |
Collapse
|
4
|
Schilling K. Revisiting the development of cerebellar inhibitory interneurons in the light of single-cell genetic analyses. Histochem Cell Biol 2024; 161:5-27. [PMID: 37940705 PMCID: PMC10794478 DOI: 10.1007/s00418-023-02251-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/12/2023] [Indexed: 11/10/2023]
Abstract
The present review aims to provide a short update of our understanding of the inhibitory interneurons of the cerebellum. While these cells constitute but a minority of all cerebellar neurons, their functional significance is increasingly being recognized. For one, inhibitory interneurons of the cerebellar cortex are now known to constitute a clearly more diverse group than their traditional grouping as stellate, basket, and Golgi cells suggests, and this diversity is now substantiated by single-cell genetic data. The past decade or so has also provided important information about interneurons in cerebellar nuclei. Significantly, developmental studies have revealed that the specification and formation of cerebellar inhibitory interneurons fundamentally differ from, say, the cortical interneurons, and define a mode of diversification critically dependent on spatiotemporally patterned external signals. Last, but not least, in the past years, dysfunction of cerebellar inhibitory interneurons could also be linked with clinically defined deficits. I hope that this review, however fragmentary, may stimulate interest and help focus research towards understanding the cerebellum.
Collapse
Affiliation(s)
- Karl Schilling
- Anatomisches Institut - Anatomie und Zellbiologie, Rheinische Friedrich-Wilhelms-Universität Bonn, Nussallee 10, 53115, Bonn, Germany.
| |
Collapse
|
5
|
Kozlova A, Sarygina E, Deinichenko K, Radko S, Ptitsyn K, Khmeleva S, Kurbatov L, Spirin P, Prassolov V, Ilgisonis E, Lisitsa A, Ponomarenko E. Comparison of Alternative Splicing Landscapes Revealed by Long-Read Sequencing in Hepatocyte-Derived HepG2 and Huh7 Cultured Cells and Human Liver Tissue. BIOLOGY 2023; 12:1494. [PMID: 38132320 PMCID: PMC10740679 DOI: 10.3390/biology12121494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 11/17/2023] [Accepted: 11/25/2023] [Indexed: 12/23/2023]
Abstract
The long-read RNA sequencing developed by Oxford Nanopore Technologies provides a direct quantification of transcript isoforms, thereby making it possible to present alternative splicing (AS) profiles as arrays of single splice variants with different abundances. Additionally, AS profiles can be presented as arrays of genes characterized by the degree of alternative splicing (the DAS-the number of detected splice variants per gene). Here, we successfully utilized the DAS to reveal biological pathways influenced by the alterations in AS in human liver tissue and the hepatocyte-derived malignant cell lines HepG2 and Huh7, thus employing the mathematical algorithm of gene set enrichment analysis. Furthermore, analysis of the AS profiles as abundances of single splice variants by using the graded tissue specificity index τ provided the selection of the groups of genes expressing particular splice variants specifically in liver tissue, HepG2 cells, and Huh7 cells. The majority of these splice variants were translated into proteins products and appeal to be in focus regarding further insights into the mechanisms underlying cell malignization. The used metrics are intrinsically suitable for transcriptome-wide AS profiling using long-read sequencing.
Collapse
Affiliation(s)
- Anna Kozlova
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Elizaveta Sarygina
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Kseniia Deinichenko
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Sergey Radko
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Konstantin Ptitsyn
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Svetlana Khmeleva
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Leonid Kurbatov
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Pavel Spirin
- Department of Cancer Cell Biology, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilova 32, 119991 Moscow, Russia; (P.S.); (V.P.)
| | - Vladimir Prassolov
- Department of Cancer Cell Biology, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilova 32, 119991 Moscow, Russia; (P.S.); (V.P.)
| | - Ekaterina Ilgisonis
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Andrey Lisitsa
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| | - Elena Ponomarenko
- Institute of Biomedical Chemistry, Pogodinskaya Street 10, 119121 Moscow, Russia (S.R.)
| |
Collapse
|
6
|
Wong ACH, Wong JJL, Rasko JEJ, Schmitz U. SpliceWiz: interactive analysis and visualization of alternative splicing in R. Brief Bioinform 2023; 25:bbad468. [PMID: 38152981 PMCID: PMC10753292 DOI: 10.1093/bib/bbad468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 10/31/2023] [Accepted: 11/23/2023] [Indexed: 12/29/2023] Open
Abstract
Alternative splicing (AS) is a crucial mechanism for regulating gene expression and isoform diversity in eukaryotes. However, the analysis and visualization of AS events from RNA sequencing data remains challenging. Most tools require a certain level of computer literacy and the available means of visualizing AS events, such as coverage and sashimi plots, have limitations and can be misleading. To address these issues, we present SpliceWiz, an R package with an interactive Shiny interface that allows easy and efficient AS analysis and visualization at scale. A novel normalization algorithm is implemented to aggregate splicing levels within sample groups, thereby allowing group differences in splicing levels to be accurately visualized. The tool also offers downstream gene ontology enrichment analysis, highlighting ASEs belonging to functional pathways of interest. SpliceWiz is optimized for speed and efficiency and introduces a new file format for coverage data storage that is more efficient than BigWig. Alignment files are processed orders of magnitude faster than other R-based AS analysis tools and on par with command-line tools. Overall, SpliceWiz streamlines AS analysis, enabling reliable identification of functionally relevant AS events for further characterization. SpliceWiz is a Bioconductor package and is also available on GitHub (https://github.com/alexchwong/SpliceWiz).
Collapse
Affiliation(s)
- Alex C H Wong
- Gene and Stem Cell Therapy Program, Centenary Institute, the University of Sydney, Camperdown, NSW 2050, Australia
- Epigenetics and RNA Biology Laboratory, School of Medical Sciences, Faculty of Medicine and Health, the University of Sydney, Camperdown, NSW 2050, Australia
- Faculty of Medicine and Health, the University of Sydney, Camperdown, NSW 2050, Australia
| | - Justin J-L Wong
- Epigenetics and RNA Biology Laboratory, School of Medical Sciences, Faculty of Medicine and Health, the University of Sydney, Camperdown, NSW 2050, Australia
- Faculty of Medicine and Health, the University of Sydney, Camperdown, NSW 2050, Australia
| | - John E J Rasko
- Gene and Stem Cell Therapy Program, Centenary Institute, the University of Sydney, Camperdown, NSW 2050, Australia
- Faculty of Medicine and Health, the University of Sydney, Camperdown, NSW 2050, Australia
- Cell and Molecular Therapies, Royal Prince Alfred Hospital, Camperdown, NSW 2050, Australia
| | - Ulf Schmitz
- Biomedical Sciences and Molecular Biology, James Cook University, Townsville, QLD 4810, Australia
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD 4810, Australia
| |
Collapse
|
7
|
Joglekar A, Foord C, Jarroux J, Pollard S, Tilgner HU. From words to complete phrases: insight into single-cell isoforms using short and long reads. Transcription 2023; 14:92-104. [PMID: 37314295 PMCID: PMC10807471 DOI: 10.1080/21541264.2023.2213514] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 04/24/2023] [Accepted: 05/07/2023] [Indexed: 06/15/2023] Open
Abstract
The profiling of gene expression patterns to glean biological insights from single cells has become commonplace over the last few years. However, this approach overlooks the transcript contents that can differ between individual cells and cell populations. In this review, we describe early work in the field of single-cell short-read sequencing as well as full-length isoforms from single cells. We then describe recent work in single-cell long-read sequencing wherein some transcript elements have been observed to work in tandem. Based on earlier work in bulk tissue, we motivate the study of combination patterns of other RNA variables. Given that we are still blind to some aspects of isoform biology, we suggest possible future avenues such as CRISPR screens which can further illuminate the function of RNA variables in distinct cell populations.
Collapse
Affiliation(s)
- Anoushka Joglekar
- Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA
- Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
| | - Careen Foord
- Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA
- Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
| | - Julien Jarroux
- Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA
- Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
| | - Shaun Pollard
- Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA
- Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
| | - Hagen U Tilgner
- Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA
- Center for Neurogenetics, Weill Cornell Medicine, New York, NY, USA
| |
Collapse
|