201
|
Melé M, Mattioli K, Mallard W, Shechner DM, Gerhardinger C, Rinn JL. Chromatin environment, transcriptional regulation, and splicing distinguish lincRNAs and mRNAs. Genome Res 2016; 27:27-37. [PMID: 27927715 PMCID: PMC5204342 DOI: 10.1101/gr.214205.116] [Citation(s) in RCA: 178] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 11/09/2016] [Indexed: 12/29/2022]
Abstract
While long intergenic noncoding RNAs (lincRNAs) and mRNAs share similar biogenesis pathways, these transcript classes differ in many regards. LincRNAs are less evolutionarily conserved, less abundant, and more tissue-specific, suggesting that their pre- and post-transcriptional regulation is different from that of mRNAs. Here, we perform an in-depth characterization of the features that contribute to lincRNA regulation in multiple human cell lines. We find that lincRNA promoters are depleted of transcription factor (TF) binding sites, yet enriched for some specific factors such as GATA and FOS relative to mRNA promoters. Surprisingly, we find that H3K9me3—a histone modification typically associated with transcriptional repression—is more enriched at the promoters of active lincRNA loci than at those of active mRNAs. Moreover, H3K9me3-marked lincRNA genes are more tissue-specific. The most discriminant differences between lincRNAs and mRNAs involve splicing. LincRNAs are less efficiently spliced, which cannot be explained by differences in U1 binding or the density of exonic splicing enhancers but may be partially attributed to lower U2AF65 binding and weaker splicing-related motifs. Conversely, the stability of lincRNAs and mRNAs is similar, differing only with regard to the location of stabilizing protein binding sites. Finally, we find that certain transcriptional properties are correlated with higher evolutionary conservation in both DNA and RNA motifs and are enriched in lincRNAs that have been functionally characterized.
Collapse
Affiliation(s)
- Marta Melé
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts 02138, USA.,Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - Kaia Mattioli
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts 02138, USA.,Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA.,Department of Biological and Biomedical Sciences, Harvard University, Boston, Massachusetts 02115, USA
| | - William Mallard
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts 02138, USA.,Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - David M Shechner
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts 02138, USA.,Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - Chiara Gerhardinger
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts 02138, USA.,Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - John L Rinn
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, Massachusetts 02138, USA.,Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA.,Department of Pathology, Beth Israel Deaconess Medical Center, Boston, Massachusetts 02215, USA
| |
Collapse
|
202
|
Georgomanolis T, Sofiadis K, Papantonis A. Cutting a Long Intron Short: Recursive Splicing and Its Implications. Front Physiol 2016; 7:598. [PMID: 27965595 PMCID: PMC5126111 DOI: 10.3389/fphys.2016.00598] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Accepted: 11/16/2016] [Indexed: 11/13/2022] Open
Abstract
Over time eukaryotic genomes have evolved to host genes carrying multiple exons separated by increasingly larger intronic, mostly non-protein-coding, sequences. Initially, little attention was paid to these intronic sequences, as they were considered not to contain regulatory information. However, advances in molecular biology, sequencing, and computational tools uncovered that numerous segments within these genomic elements do contribute to the regulation of gene expression. Introns are differentially removed in a cell type-specific manner to produce a range of alternatively-spliced transcripts, and many span tens to hundreds of kilobases. Recent work in human and fruitfly tissues revealed that long introns are extensively processed cotranscriptionally and in a stepwise manner, before their two flanking exons are spliced together. This process, called "recursive splicing," often involves non-canonical splicing elements positioned deep within introns, and different mechanisms for its deployment have been proposed. Still, the very existence and widespread nature of recursive splicing offers a new regulatory layer in the transcript maturation pathway, which may also have implications in human disease.
Collapse
Affiliation(s)
- Theodore Georgomanolis
- Chromatin Systems Biology Laboratory, Center for Molecular Medicine, University of Cologne Cologne, Germany
| | - Konstantinos Sofiadis
- Chromatin Systems Biology Laboratory, Center for Molecular Medicine, University of Cologne Cologne, Germany
| | - Argyris Papantonis
- Chromatin Systems Biology Laboratory, Center for Molecular Medicine, University of Cologne Cologne, Germany
| |
Collapse
|
203
|
Alpert T, Herzel L, Neugebauer KM. Perfect timing: splicing and transcription rates in living cells. WILEY INTERDISCIPLINARY REVIEWS-RNA 2016; 8. [PMID: 27873472 DOI: 10.1002/wrna.1401] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/20/2016] [Revised: 09/12/2016] [Accepted: 09/26/2016] [Indexed: 12/27/2022]
Abstract
An important step toward understanding gene regulation is the elucidation of the time necessary for the completion of individual steps. Measurement of reaction rates can reveal potential nodes for regulation. For example, measurements of in vivo transcription elongation rates reveal regulation by DNA sequence, gene architecture, and chromatin. Pre-mRNA splicing is regulated by transcription elongation rates and vice versa, yet the rates of RNA processing reactions remain largely elusive. Since the 1980s, numerous model systems and approaches have been used to determine the precise timing of splicing in vivo. Because splicing can be co-transcriptional, the position of Pol II when splicing is detected has been used as a proxy for time by some investigators. In addition to these 'distance-based' measurements, 'time-based' measurements have been possible through live cell imaging, metabolic labeling of RNA, and gene induction. Yet splicing rates can be convolved by the time it takes for transcription, spliceosome assembly and spliceosome disassembly. The variety of assays and systems used has, perhaps not surprisingly, led to reports of widely differing splicing rates in vivo. Recently, single molecule RNA-seq has indicated that splicing occurs more quickly than previously deduced. Here we comprehensively review these findings and discuss evidence that splicing and transcription rates are closely coordinated, facilitating the efficiency of gene expression. On the other hand, introduction of splicing delays through as yet unknown mechanisms provide opportunity for regulation. More work is needed to understand how cells optimize the rates of gene expression for a range of biological conditions. WIREs RNA 2017, 8:e1401. doi: 10.1002/wrna.1401 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Tara Alpert
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Lydia Herzel
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Karla M Neugebauer
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| |
Collapse
|
204
|
Integrative classification of human coding and noncoding genes through RNA metabolism profiles. Nat Struct Mol Biol 2016; 24:86-96. [PMID: 27870833 DOI: 10.1038/nsmb.3325] [Citation(s) in RCA: 130] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Accepted: 10/18/2016] [Indexed: 12/26/2022]
Abstract
Pervasive transcription of the human genome results in a heterogeneous mix of coding RNAs and long noncoding RNAs (lncRNAs). Only a small fraction of lncRNAs have demonstrated regulatory functions, thus making functional lncRNAs difficult to distinguish from nonfunctional transcriptional byproducts. This difficulty has resulted in numerous competing human lncRNA classifications that are complicated by a steady increase in the number of annotated lncRNAs. To address these challenges, we quantitatively examined transcription, splicing, degradation, localization and translation for coding and noncoding human genes. We observed that annotated lncRNAs had lower synthesis and higher degradation rates than mRNAs and discovered mechanistic differences explaining slower lncRNA splicing. We grouped genes into classes with similar RNA metabolism profiles, containing both mRNAs and lncRNAs to varying extents. These classes exhibited distinct RNA metabolism, different evolutionary patterns and differential sensitivity to cellular RNA-regulatory pathways. Our classification provides an alternative to genomic context-driven annotations of lncRNAs.
Collapse
|
205
|
Kornienko AE, Vlatkovic I, Neesen J, Barlow DP, Pauler FM. A human haploid gene trap collection to study lncRNAs with unusual RNA biology. RNA Biol 2016; 13:196-220. [PMID: 26670263 PMCID: PMC4829315 DOI: 10.1080/15476286.2015.1110676] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Many thousand long non-coding (lnc) RNAs are mapped in the human genome. Time consuming studies using reverse genetic approaches by post-transcriptional knock-down or genetic modification of the locus demonstrated diverse biological functions for a few of these transcripts. The Human Gene Trap Mutant Collection in haploid KBM7 cells is a ready-to-use tool for studying protein-coding gene function. As lncRNAs show remarkable differences in RNA biology compared to protein-coding genes, it is unclear if this gene trap collection is useful for functional analysis of lncRNAs. Here we use the uncharacterized LOC100288798 lncRNA as a model to answer this question. Using public RNA-seq data we show that LOC100288798 is ubiquitously expressed, but inefficiently spliced. The minor spliced LOC100288798 isoforms are exported to the cytoplasm, whereas the major unspliced isoform is nuclear localized. This shows that LOC100288798 RNA biology differs markedly from typical mRNAs. De novo assembly from RNA-seq data suggests that LOC100288798 extends 289kb beyond its annotated 3' end and overlaps the downstream SLC38A4 gene. Three cell lines with independent gene trap insertions in LOC100288798 were available from the KBM7 gene trap collection. RT-qPCR and RNA-seq confirmed successful lncRNA truncation and its extended length. Expression analysis from RNA-seq data shows significant deregulation of 41 protein-coding genes upon LOC100288798 truncation. Our data shows that gene trap collections in human haploid cell lines are useful tools to study lncRNAs, and identifies the previously uncharacterized LOC100288798 as a potential gene regulator.
Collapse
Affiliation(s)
- Aleksandra E Kornienko
- a CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3 , 1090 Vienna , Austria
| | - Irena Vlatkovic
- a CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3 , 1090 Vienna , Austria.,b Institute of Medical Genetics, Medical University of Vienna, Währingerstrasse 10 , 1090 Vienna , Austria
| | - Jürgen Neesen
- b Institute of Medical Genetics, Medical University of Vienna, Währingerstrasse 10 , 1090 Vienna , Austria
| | - Denise P Barlow
- a CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3 , 1090 Vienna , Austria
| | - Florian M Pauler
- a CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT 25.3 , 1090 Vienna , Austria
| |
Collapse
|
206
|
Stellaris® RNA Fluorescence In Situ Hybridization for the Simultaneous Detection of Immature and Mature Long Noncoding RNAs in Adherent Cells. Methods Mol Biol 2016; 1402:119-134. [PMID: 26721487 DOI: 10.1007/978-1-4939-3378-5_10] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
RNA fluorescence in situ hybridization (FISH), long an indispensable tool for the detection and localization of RNA, is becoming an increasingly important complement to other gene expression analysis methods. Especially important for long noncoding RNAs (lncRNAs), RNA FISH adds the ability to distinguish between primary and mature lncRNA transcripts and thus to segregate the site of synthesis from the site of action.We detail a streamlined RNA FISH protocol for the simultaneous imaging of multiple primary and mature mRNA and lncRNA gene products and RNA variants in fixed mammalian cells. The technique makes use of fluorescently pre-labeled, short DNA oligonucleotides (circa 20 nucleotides in length), pooled into sets of up to 48 individual probes. The overall binding of multiple oligonucleotides to the same RNA target results in fluorescent signals that reveal clusters of RNAs or single RNA molecules as punctate spots without the need for enzymatic signal amplification. Visualization of these punctate signals, through the use of wide-field fluorescence microscopy, enables the counting of single transcripts down to one copy per cell. Additionally, by using probe sets with spectrally distinct fluorophores, multiplex analysis of gene-specific RNAs, or RNA variants, can be achieved. The presented examples illustrate how this method can add temporospatial information between the transcription event and both the location and the endurance of the mature lncRNA. We also briefly discuss post-processing of images and spot counting to demonstrate the capabilities of this method for the statistical analysis of RNA molecules per cell. This information can be utilized to determine both overall gene expression levels and cell-to-cell gene expression variation.
Collapse
|
207
|
The Tetraodon nigroviridis reference transcriptome: developmental transition, length retention and microsynteny of long non-coding RNAs in a compact vertebrate genome. Sci Rep 2016; 6:33210. [PMID: 27628538 PMCID: PMC5024134 DOI: 10.1038/srep33210] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2016] [Accepted: 07/28/2016] [Indexed: 01/03/2023] Open
Abstract
Pufferfish such as fugu and tetraodon carry the smallest genomes among all vertebrates and are ideal for studying genome evolution. However, comparative genomics using these species is hindered by the poor annotation of their genomes. We performed RNA sequencing during key stages of maternal to zygotic transition of Tetraodon nigroviridis and report its first developmental transcriptome. We assembled 61,033 transcripts (23,837 loci) representing 80% of the annotated gene models and 3816 novel coding transcripts from 2667 loci. We demonstrate the similarities of gene expression profiles between pufferfish and zebrafish during maternal to zygotic transition and annotated 1120 long non-coding RNAs (lncRNAs) many of which differentially expressed during development. The promoters for 60% of the assembled transcripts result validated by CAGE-seq. Despite the extreme compaction of the tetraodon genome and the dramatic loss of transposons, the length of lncRNA exons remain comparable to that of other vertebrates and a small set of lncRNAs appears enriched for transposable elements suggesting a selective pressure acting on lncRNAs length and composition. Finally, a set of lncRNAs are microsyntenic between teleost and vertebrates, which indicates potential regulatory interactions between lncRNAs and their flanking coding genes. Our work provides a fundamental molecular resource for vertebrate comparative genomics and embryogenesis studies.
Collapse
|
208
|
Abstract
Transcription and splicing are fundamental steps in gene expression. These processes have been studied intensively over the past four decades, and very recent findings are challenging some of the formerly established ideas. In particular, splicing was shown to occur much faster than previously thought, with the first spliced products observed as soon as splice junctions emerge from RNA polymerase II (Pol II). Splicing was also found coupled to a specific phosphorylation pattern of Pol II carboxyl-terminal domain (CTD), suggesting a new layer of complexity in the CTD code. Moreover, phosphorylation of the CTD may be scarcer than expected, and other post-translational modifications of the CTD are emerging with unanticipated roles in gene expression regulation.
Collapse
Affiliation(s)
- Noélia Custódio
- a Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa , Lisboa , Portugal
| | - Maria Carmo-Fonseca
- a Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa , Lisboa , Portugal
| |
Collapse
|
209
|
Ferreira PG, Oti M, Barann M, Wieland T, Ezquina S, Friedländer MR, Rivas MA, Esteve-Codina A, Rosenstiel P, Strom TM, Lappalainen T, Guigó R, Sammeth M. Sequence variation between 462 human individuals fine-tunes functional sites of RNA processing. Sci Rep 2016; 6:32406. [PMID: 27617755 PMCID: PMC5019111 DOI: 10.1038/srep32406] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Accepted: 08/03/2016] [Indexed: 12/23/2022] Open
Abstract
Recent advances in the cost-efficiency of sequencing technologies enabled the combined DNA- and RNA-sequencing of human individuals at the population-scale, making genome-wide investigations of the inter-individual genetic impact on gene expression viable. Employing mRNA-sequencing data from the Geuvadis Project and genome sequencing data from the 1000 Genomes Project we show that the computational analysis of DNA sequences around splice sites and poly-A signals is able to explain several observations in the phenotype data. In contrast to widespread assessments of statistically significant associations between DNA polymorphisms and quantitative traits, we developed a computational tool to pinpoint the molecular mechanisms by which genetic markers drive variation in RNA-processing, cataloguing and classifying alleles that change the affinity of core RNA elements to their recognizing factors. The in silico models we employ further suggest RNA editing can moonlight as a splicing-modulator, albeit less frequently than genomic sequence diversity. Beyond existing annotations, we demonstrate that the ultra-high resolution of RNA-Seq combined from 462 individuals also provides evidence for thousands of bona fide novel elements of RNA processing-alternative splice sites, introns, and cleavage sites-which are often rare and lowly expressed but in other characteristics similar to their annotated counterparts.
Collapse
Affiliation(s)
- Pedro G. Ferreira
- Bioinformatics and Genomics, Center for Genomic Regulation (CRG), 08003 Barcelona, Catalonia, Spain
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland
- Instituto de Investigação e Inovação em Saúde, (i3S) Universidade do Porto, 4200-625 Porto, Portugal
- Institute of Molecular Pathology and Immunology (IPATIMUP), University of Porto, 4200-625 Porto, Portugal
| | - Martin Oti
- Institute of Biophysics Carlos Chagas Filho (IBCCF), Federal University of Rio de Janeiro (UFRJ), 21941-902 Rio de Janeiro, Brazil
| | - Matthias Barann
- Institute of Clinical Molecular Biology, Christians-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| | - Thomas Wieland
- Institute of Human Genetics, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Suzana Ezquina
- Center for Human Genome and Stem-cell research (HUG-CELL), University of São Paulo (USP), 05508090 São Paulo, Brazil
| | - Marc R. Friedländer
- Science for Life Laboratory, Stockholm University, Box 1031, 17121 Solna, Sweden
| | - Manuel A. Rivas
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, United Kingdom
| | - Anna Esteve-Codina
- Centre Nacional d’Anàlisi Genòmica, 08028 Barcelona, Catalonia, Spain
- Center for Research in Agricultural Genomics (CRAG), Autonome University of Barcelona, 08193 Bellaterra, Catalonia, Spain
| | - Philip Rosenstiel
- Institute of Clinical Molecular Biology, Christians-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
| | - Tim M Strom
- Institute of Human Genetics, Helmholtz Center Munich, 85764 Neuherberg, Germany
- Institute of Human Genetics, Technische Universität München, 81675 Munich, Germany
| | - Tuuli Lappalainen
- Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland
- Institute for Genetics and Genomics in Geneva (iGE3), University of Geneva, 1211 Geneva, Switzerland
- Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| | - Roderic Guigó
- Bioinformatics and Genomics, Center for Genomic Regulation (CRG), 08003 Barcelona, Catalonia, Spain
- Pompeu Fabra University (UPF), 08003 Barcelona, Catalonia, Spain
| | - Michael Sammeth
- Bioinformatics and Genomics, Center for Genomic Regulation (CRG), 08003 Barcelona, Catalonia, Spain
- Institute of Biophysics Carlos Chagas Filho (IBCCF), Federal University of Rio de Janeiro (UFRJ), 21941-902 Rio de Janeiro, Brazil
- National Center of Scientific Computing (LNCC), 2233-6000 Petrópolis, Rio de Janeiro, Brazil
| |
Collapse
|
210
|
Stegeman R, Spreacker PJ, Swanson SK, Stephenson R, Florens L, Washburn MP, Weake VM. The Spliceosomal Protein SF3B5 is a Novel Component of Drosophila SAGA that Functions in Gene Expression Independent of Splicing. J Mol Biol 2016; 428:3632-49. [PMID: 27185460 PMCID: PMC5011000 DOI: 10.1016/j.jmb.2016.05.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Revised: 04/19/2016] [Accepted: 05/08/2016] [Indexed: 12/16/2022]
Abstract
The interaction between splicing factors and the transcriptional machinery provides an intriguing link between the coupled processes of transcription and splicing. Here, we show that the two components of the SF3B complex, SF3B3 and SF3B5, that form part of the U2 small nuclear ribonucleoprotein particle (snRNP) are also subunits of the Spt-Ada-Gcn5 acetyltransferase (SAGA) transcriptional coactivator complex in Drosophila melanogaster. Whereas SF3B3 had previously been identified as a human SAGA subunit, SF3B5 had not been identified as a component of SAGA in any species. We show that SF3B3 and SF3B5 bind to SAGA independent of RNA and interact with multiple SAGA subunits including Sgf29 and Spt7 in a yeast two-hybrid assay. Through analysis of sf3b5 mutant flies, we show that SF3B5 is necessary for proper development and cell viability but not for histone acetylation. Although SF3B5 does not appear to function in SAGA's histone-modifying activities, SF3B5 is still required for expression of a subset of SAGA-regulated genes independent of splicing. Thus, our data support an independent function of SF3B5 in SAGA's transcription coactivator activity that is separate from its role in splicing.
Collapse
Affiliation(s)
- Rachel Stegeman
- Department of Biochemistry, Purdue University, West Lafayette, IN 47907, USA
| | - Peyton J Spreacker
- Department of Biochemistry, Purdue University, West Lafayette, IN 47907, USA
| | - Selene K Swanson
- Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, MO 64110, USA
| | - Robert Stephenson
- Department of Biochemistry, Purdue University, West Lafayette, IN 47907, USA
| | - Laurence Florens
- Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, MO 64110, USA
| | - Michael P Washburn
- Stowers Institute for Medical Research, 1000 E. 50th St., Kansas City, MO 64110, USA; Department of Pathology and Laboratory Medicine, University of Kansas Medical Center, 3901 Rainbow Boulevard, Kansas City, KS 66160, USA
| | - Vikki M Weake
- Department of Biochemistry, Purdue University, West Lafayette, IN 47907, USA; Purdue University Center for Cancer Research, Purdue University, West Lafayette, IN 47907, USA.
| |
Collapse
|
211
|
Hollander D, Naftelberg S, Lev-Maor G, Kornblihtt AR, Ast G. How Are Short Exons Flanked by Long Introns Defined and Committed to Splicing? Trends Genet 2016; 32:596-606. [PMID: 27507607 DOI: 10.1016/j.tig.2016.07.003] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2016] [Revised: 07/19/2016] [Accepted: 07/22/2016] [Indexed: 11/19/2022]
Abstract
The splice sites (SSs) delimiting an intron are brought together in the earliest step of spliceosome assembly yet it remains obscure how SS pairing occurs, especially when introns are thousands of nucleotides long. Splicing occurs in vivo in mammals within minutes regardless of intron length, implying that SS pairing can instantly follow transcription. Also, factors required for SS pairing, such as the U1 small nuclear ribonucleoprotein (snRNP) and U2AF65, associate with RNA polymerase II (RNAPII), while nucleosomes preferentially bind exonic sequences and associate with U2 snRNP. Based on recent publications, we assume that the 5' SS-bound U1 snRNP can remain tethered to RNAPII until complete synthesis of the downstream intron and exon. An additional U1 snRNP then binds the downstream 5' SS, whereas the RNAPII-associated U2AF65 binds the upstream 3' SS to facilitate SS pairing along with exon definition. Next, the nucleosome-associated U2 snRNP binds the branch site to advance splicing complex assembly. This may explain how RNAPII and chromatin are involved in spliceosome assembly and how introns lengthened during evolution with a relatively minimal compromise in splicing.
Collapse
Affiliation(s)
- Dror Hollander
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | - Shiran Naftelberg
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | - Galit Lev-Maor
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel
| | - Alberto R Kornblihtt
- IFIBYNE-UBA-CONICET and Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Pabellón II, C1428EHA Buenos Aires, Argentina
| | - Gil Ast
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel.
| |
Collapse
|
212
|
Ahn JH, Rechsteiner A, Strome S, Kelly WG. A Conserved Nuclear Cyclophilin Is Required for Both RNA Polymerase II Elongation and Co-transcriptional Splicing in Caenorhabditis elegans. PLoS Genet 2016; 12:e1006227. [PMID: 27541139 PMCID: PMC4991786 DOI: 10.1371/journal.pgen.1006227] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Accepted: 07/08/2016] [Indexed: 01/22/2023] Open
Abstract
The elongation phase of transcription by RNA Polymerase II (Pol II) involves numerous events that are tightly coordinated, including RNA processing, histone modification, and chromatin remodeling. RNA splicing factors are associated with elongating Pol II, and the interdependent coupling of splicing and elongation has been documented in several systems. Here we identify a conserved, multi-domain cyclophilin family member, SIG-7, as an essential factor for both normal transcription elongation and co-transcriptional splicing. In embryos depleted for SIG-7, RNA levels for over a thousand zygotically expressed genes are substantially reduced, Pol II becomes significantly reduced at the 3' end of genes, marks of transcription elongation are reduced, and unspliced mRNAs accumulate. Our findings suggest that SIG-7 plays a central role in both Pol II elongation and co-transcriptional splicing and may provide an important link for their coordination and regulation.
Collapse
Affiliation(s)
- Jeong H. Ahn
- Biology Department, Emory University, Atlanta, Georgia, United States of America
- Program in Genetics and Molecular Biology, Emory University, Atlanta, Georgia, United States of America
| | - Andreas Rechsteiner
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, Santa Cruz, California
| | - Susan Strome
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, Santa Cruz, California
| | - William G. Kelly
- Biology Department, Emory University, Atlanta, Georgia, United States of America
| |
Collapse
|
213
|
Abstract
Our genome is protected from the introduction of mutations by high fidelity replication and an extensive network of DNA damage response and repair mechanisms. However, the expression of our genome, via RNA and protein synthesis, allows for more diversity in translating genetic information. In addition, the splicing process has become less stringent over evolutionary time allowing for a substantial increase in the diversity of transcripts generated. The result is a diverse transcriptome and proteome that harbor selective advantages over a more tightly regulated system. Here, we describe mechanisms in place that both safeguard the genome and promote translational diversity, with emphasis on post-transcriptional RNA processing.
Collapse
Affiliation(s)
- Brian Magnuson
- Department of Radiation Oncology, University of Michigan Comprehensive Cancer Center, and Translational Oncology Program, University of Michigan, Ann Arbor, USA; Department of Environmental Health Sciences, School of Public Health, University of Michigan, Ann Arbor, USA
| | - Karan Bedi
- Department of Radiation Oncology, University of Michigan Comprehensive Cancer Center, and Translational Oncology Program, University of Michigan, Ann Arbor, USA
| | - Mats Ljungman
- Department of Radiation Oncology, University of Michigan Comprehensive Cancer Center, and Translational Oncology Program, University of Michigan, Ann Arbor, USA; Department of Environmental Health Sciences, School of Public Health, University of Michigan, Ann Arbor, USA.
| |
Collapse
|
214
|
Lopez-Maestre H, Brinza L, Marchet C, Kielbassa J, Bastien S, Boutigny M, Monnin D, Filali AE, Carareto CM, Vieira C, Picard F, Kremer N, Vavre F, Sagot MF, Lacroix V. SNP calling from RNA-seq data without a reference genome: identification, quantification, differential analysis and impact on the protein sequence. Nucleic Acids Res 2016; 44:e148. [PMID: 27458203 PMCID: PMC5100560 DOI: 10.1093/nar/gkw655] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2015] [Accepted: 07/11/2016] [Indexed: 11/14/2022] Open
Abstract
SNPs (Single Nucleotide Polymorphisms) are genetic markers whose precise identification is a prerequisite for association studies. Methods to identify them are currently well developed for model species, but rely on the availability of a (good) reference genome, and therefore cannot be applied to non-model species. They are also mostly tailored for whole genome (re-)sequencing experiments, whereas in many cases, transcriptome sequencing can be used as a cheaper alternative which already enables to identify SNPs located in transcribed regions. In this paper, we propose a method that identifies, quantifies and annotates SNPs without any reference genome, using RNA-seq data only. Individuals can be pooled prior to sequencing, if not enough material is available from one individual. Using pooled human RNA-seq data, we clarify the precision and recall of our method and discuss them with respect to other methods which use a reference genome or an assembled transcriptome. We then validate experimentally the predictions of our method using RNA-seq data from two non-model species. The method can be used for any species to annotate SNPs and predict their impact on the protein sequence. We further enable to test for the association of the identified SNPs with a phenotype of interest.
Collapse
Affiliation(s)
- Hélène Lopez-Maestre
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France.,EPI ERABLE - Inria Grenoble, Rhône-Alpes
| | - Lilia Brinza
- PT Génomique et Transcriptomique, BIOASTER, Lyon, France
| | - Camille Marchet
- Université de Rennes, F-35000 Rennes; équipe GenScale, IRISA, Rennes
| | - Janice Kielbassa
- Synergie-Lyon-Cancer, Universite Lyon 1, Centre Leon Berard, Lyon, France
| | - Sylvère Bastien
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France.,EPI ERABLE - Inria Grenoble, Rhône-Alpes
| | - Mathilde Boutigny
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France.,EPI ERABLE - Inria Grenoble, Rhône-Alpes
| | - David Monnin
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France
| | - Adil El Filali
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France
| | - Claudia Marcia Carareto
- Department of Biology, UNESP - São Paulo State University, São José do Rio Preto, São Paulo, Brazil
| | - Cristina Vieira
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France.,EPI ERABLE - Inria Grenoble, Rhône-Alpes
| | - Franck Picard
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France
| | - Natacha Kremer
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France
| | - Fabrice Vavre
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France.,EPI ERABLE - Inria Grenoble, Rhône-Alpes
| | - Marie-France Sagot
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France.,EPI ERABLE - Inria Grenoble, Rhône-Alpes
| | - Vincent Lacroix
- Université de Lyon, F-69000, Lyon; Université Lyon 1; CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622 Villeurbanne, France .,EPI ERABLE - Inria Grenoble, Rhône-Alpes
| |
Collapse
|
215
|
TALE-directed local modulation of H3K9 methylation shapes exon recognition. Sci Rep 2016; 6:29961. [PMID: 27439481 PMCID: PMC4954949 DOI: 10.1038/srep29961] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Accepted: 06/28/2016] [Indexed: 12/12/2022] Open
Abstract
In search for the function of local chromatin environment on pre-mRNA processing we established a new tool, which allows for the modification of chromatin using a targeted approach. Using Transcription Activator-Like Effector domains fused to histone modifying enzymes (TALE-HME), we show locally restricted alteration of histone methylation modulates the splicing of target exons. We provide evidence that a local increase in H3K9 di- and trimethylation promotes inclusion of the target alternative exon, while demethylation by JMJD2D leads to exon skipping. We further demonstrate that H3K9me3 is localized on internal exons genome-wide suggesting a general role in splicing. Consistently, targeting of the H3K9 demethylase to a weak constitutive exon reduced co-transcriptional splicing. Together our data show H3K9 methylation within the gene body is a factor influencing recognition of both constitutive and alternative exons.
Collapse
|
216
|
RNA polymerase II promoter-proximal pausing in mammalian long non-coding genes. Genomics 2016; 108:64-77. [PMID: 27432546 DOI: 10.1016/j.ygeno.2016.07.003] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2016] [Revised: 07/11/2016] [Accepted: 07/14/2016] [Indexed: 01/13/2023]
Abstract
Mammalian genomes encode a large number of non-coding RNAs (ncRNAs) that greatly exceed mRNA genes. While the physiological and pathological roles of ncRNAs have been increasingly understood, the mechanisms of regulation of ncRNA expression are less clear. Here, our genomic study has shown that a significant number of long non-coding RNAs (lncRNAs, >1000 nucleotides) harbor RNA polymerase II (Pol II) engaged with the transcriptional start site. A pausing and transcriptional elongation factor for protein-coding genes, tripartite motif-containing 28 (TRIM28) regulates the transcription of a subset of lncRNAs in mammalian cells. In addition, the majority of lncRNAs in human and murine cells regulated by Pol II promoter-proximal pausing appear to function in stimulus-inducible biological pathways. Our findings suggest an important role of Pol II pausing for the transcription of mammalian lncRNA genes.
Collapse
|
217
|
Ni T, Yang W, Han M, Zhang Y, Shen T, Nie H, Zhou Z, Dai Y, Yang Y, Liu P, Cui K, Zeng Z, Tian Y, Zhou B, Wei G, Zhao K, Peng W, Zhu J. Global intron retention mediated gene regulation during CD4+ T cell activation. Nucleic Acids Res 2016; 44:6817-29. [PMID: 27369383 PMCID: PMC5001615 DOI: 10.1093/nar/gkw591] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2015] [Accepted: 06/17/2016] [Indexed: 01/02/2023] Open
Abstract
T cell activation is a well-established model for studying cellular responses to exogenous stimulation. Using strand-specific RNA-seq, we observed that intron retention is prevalent in polyadenylated transcripts in resting CD4+ T cells and is significantly reduced upon T cell activation. Several lines of evidence suggest that intron-retained transcripts are less stable than fully spliced transcripts. Strikingly, the decrease in intron retention (IR) levels correlate with the increase in steady-state mRNA levels. Further, the majority of the genes upregulated in activated T cells are accompanied by a significant reduction in IR. Of these 1583 genes, 185 genes are predominantly regulated at the IR level, and highly enriched in the proteasome pathway, which is essential for proper T cell proliferation and cytokine release. These observations were corroborated in both human and mouse CD4+ T cells. Our study revealed a novel post-transcriptional regulatory mechanism that may potentially contribute to coordinated and/or quick cellular responses to extracellular stimuli such as an acute infection.
Collapse
Affiliation(s)
- Ting Ni
- State Key Laboratory of Genetic Engineering & MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, P.R. China
| | - Wenjing Yang
- Systems Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Miao Han
- State Key Laboratory of Genetic Engineering & MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, P.R. China
| | - Yubo Zhang
- Systems Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Ting Shen
- State Key Laboratory of Genetic Engineering & MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, P.R. China
| | - Hongbo Nie
- State Key Laboratory of Genetic Engineering & MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, P.R. China
| | - Zhihui Zhou
- Department of Immunology, Tongji University School of Medicine, Shanghai 200092, P.R. China
| | - Yalei Dai
- Department of Immunology, Tongji University School of Medicine, Shanghai 200092, P.R. China
| | - Yanqin Yang
- Systems Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Poching Liu
- Systems Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Kairong Cui
- Systems Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Zhouhao Zeng
- Department of Physics, George Washington University, Washington, DC 20052, USA
| | - Yi Tian
- Department of Physics, George Washington University, Washington, DC 20052, USA Institute of Immunology, PLA, Third Military Medical University, Chongqing 400038, P.R. China
| | - Bin Zhou
- State Key Laboratory of Genetic Engineering & MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, P.R. China
| | - Gang Wei
- State Key Laboratory of Genetic Engineering & MOE Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, P.R. China
| | - Keji Zhao
- Systems Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Weiqun Peng
- Department of Physics, George Washington University, Washington, DC 20052, USA
| | - Jun Zhu
- Systems Biology Center, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
218
|
Carlevaro-Fita J, Rahim A, Guigó R, Vardy LA, Johnson R. Cytoplasmic long noncoding RNAs are frequently bound to and degraded at ribosomes in human cells. RNA (NEW YORK, N.Y.) 2016; 22:867-82. [PMID: 27090285 PMCID: PMC4878613 DOI: 10.1261/rna.053561.115] [Citation(s) in RCA: 169] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2015] [Accepted: 03/01/2016] [Indexed: 05/03/2023]
Abstract
Recent footprinting studies have made the surprising observation that long noncoding RNAs (lncRNAs) physically interact with ribosomes. However, these findings remain controversial, and the overall proportion of cytoplasmic lncRNAs involved is unknown. Here we make a global, absolute estimate of the cytoplasmic and ribosome-associated population of stringently filtered lncRNAs in a human cell line using polysome profiling coupled to spike-in normalized microarray analysis. Fifty-four percent of expressed lncRNAs are detected in the cytoplasm. The majority of these (70%) have >50% of their cytoplasmic copies associated with polysomal fractions. These interactions are lost upon disruption of ribosomes by puromycin. Polysomal lncRNAs are distinguished by a number of 5' mRNA-like features, including capping and 5'UTR length. On the other hand, nonpolysomal "free cytoplasmic" lncRNAs have more conserved promoters and a wider range of expression across cell types. Exons of polysomal lncRNAs are depleted of endogenous retroviral insertions, suggesting a role for repetitive elements in lncRNA localization. Finally, we show that blocking of ribosomal elongation results in stabilization of many associated lncRNAs. Together these findings suggest that the ribosome is the default destination for the majority of cytoplasmic long noncoding RNAs and may play a role in their degradation.
Collapse
Affiliation(s)
- Joana Carlevaro-Fita
- Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain Institut Hospital del Mar d'Investigacions Mèdiques (IMIM), 08003 Barcelona, Spain
| | - Anisa Rahim
- A*STAR Institute of Medical Biology, Singapore 138648, Singapore
| | - Roderic Guigó
- Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain Institut Hospital del Mar d'Investigacions Mèdiques (IMIM), 08003 Barcelona, Spain
| | - Leah A Vardy
- A*STAR Institute of Medical Biology, Singapore 138648, Singapore School of Biological Sciences, Nanyang Technological University, 637551 Singapore
| | - Rory Johnson
- Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain Institut Hospital del Mar d'Investigacions Mèdiques (IMIM), 08003 Barcelona, Spain
| |
Collapse
|
219
|
Mor A, White A, Zhang K, Thompson M, Esparza M, Muñoz-Moreno R, Koide K, Lynch KW, García-Sastre A, Fontoura BM. Influenza virus mRNA trafficking through host nuclear speckles. Nat Microbiol 2016; 1:16069. [PMID: 27572970 PMCID: PMC4917225 DOI: 10.1038/nmicrobiol.2016.69] [Citation(s) in RCA: 81] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2015] [Accepted: 04/20/2016] [Indexed: 12/26/2022]
Abstract
Influenza A virus is a human pathogen with a genome composed of eight viral RNA segments that replicate in the nucleus. Two viral mRNAs are alternatively spliced. The unspliced M1 mRNA is translated into the matrix M1 protein, while the ion channel M2 protein is generated after alternative splicing. These proteins are critical mediators of viral trafficking and budding. We show that the influenza virus uses nuclear speckles to promote post-transcriptional splicing of its M1 mRNA. We assign previously unknown roles for the viral NS1 protein and cellular factors to an intranuclear trafficking pathway that targets the viral M1 mRNA to nuclear speckles, mediates splicing at these nuclear bodies and exports the spliced M2 mRNA from the nucleus. Given that nuclear speckles are storage sites for splicing factors, which leave these sites to splice cellular pre-mRNAs at transcribing genes, we reveal a functional subversion of nuclear speckles to promote viral gene expression.
Collapse
Affiliation(s)
- Amir Mor
- Department of Cell Biology, University of Texas Southwestern Medical Center, Dallas, TX 75390-9039, USA
| | - Alexander White
- Department of Cell Biology, University of Texas Southwestern Medical Center, Dallas, TX 75390-9039, USA
| | - Ke Zhang
- Department of Cell Biology, University of Texas Southwestern Medical Center, Dallas, TX 75390-9039, USA
| | - Matthew Thompson
- Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, PA 19104-6059, USA
| | - Matthew Esparza
- Department of Cell Biology, University of Texas Southwestern Medical Center, Dallas, TX 75390-9039, USA
| | - Raquel Muñoz-Moreno
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Global Health and Emerging Pathogens Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Kazunori Koide
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Kristen W. Lynch
- Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, PA 19104-6059, USA
| | - Adolfo García-Sastre
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Global Health and Emerging Pathogens Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Medicine, Division of Infectious Diseases, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Beatriz M.A. Fontoura
- Department of Cell Biology, University of Texas Southwestern Medical Center, Dallas, TX 75390-9039, USA
| |
Collapse
|
220
|
Jimeno-González S, Reyes JC. Chromatin structure and pre-mRNA processing work together. Transcription 2016; 7:63-8. [PMID: 27028548 PMCID: PMC4984687 DOI: 10.1080/21541264.2016.1168507] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Revised: 03/14/2016] [Accepted: 03/14/2016] [Indexed: 10/22/2022] Open
Abstract
Chromatin is the natural context for transcription elongation. However, the elongating RNA polymerase II (RNAPII) is forced to pause by the positioned nucleosomes present in gene bodies. Here, we briefly discuss the current results suggesting that those pauses could serve as a mechanism to coordinate transcription elongation with pre-mRNA processing. Further, histone post-translational modifications have been found to regulate the recruitment of factors involved in pre-mRNA processing. This view highlights the important regulatory role of the chromatin context in the whole process of the mature mRNA synthesis.
Collapse
Affiliation(s)
- Silvia Jimeno-González
- Centro Andaluz de Biología Molecular y Medicina Regenerativa (CABIMER), Consejo Superior de Investigaciones Científicas (CSIC), Seville, Spain
| | - José C. Reyes
- Centro Andaluz de Biología Molecular y Medicina Regenerativa (CABIMER), Consejo Superior de Investigaciones Científicas (CSIC), Seville, Spain
| |
Collapse
|
221
|
Jain S, Thakkar N, Chhatai J, Pal Bhadra M, Bhadra U. Long non-coding RNA: Functional agent for disease traits. RNA Biol 2016; 14:522-535. [PMID: 27229269 DOI: 10.1080/15476286.2016.1172756] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
In recent years, long non-coding RNAs (lncRNAs) have attracted the attention of researchers with their involvement in all facets of life. LncRNAs are transcripts of more than 200 nucleotides which lack defined protein coding potential. Although they do not code for proteins, a large number of them are involved in regulating gene expression and translation. The presence of numerous lncRNAs in the human genome has prompted us to investigate the contribution of these molecules to human biology and medicine. In this review, we present the potential role of lncRNAs interlinked to different human diseases and genetic disorders. We also describe their role in cellular differentiation and aging and discuss their potential importance as biomarkers and as therapeutic agents.
Collapse
Affiliation(s)
- Sriyans Jain
- a Functional Genomics and Gene Silencing Group , CSIR- Center for Cellular and Molecular Biology , Hyderabad , India
| | - Nirav Thakkar
- a Functional Genomics and Gene Silencing Group , CSIR- Center for Cellular and Molecular Biology , Hyderabad , India
| | - Jagamohan Chhatai
- a Functional Genomics and Gene Silencing Group , CSIR- Center for Cellular and Molecular Biology , Hyderabad , India
| | - Manika Pal Bhadra
- b Centre for Chemical Biology , Indian Institute for Chemical Technology , Hyderabad , India
| | - Utpal Bhadra
- a Functional Genomics and Gene Silencing Group , CSIR- Center for Cellular and Molecular Biology , Hyderabad , India
| |
Collapse
|
222
|
Towards understanding pre-mRNA splicing mechanisms and the role of SR proteins. Gene 2016; 587:107-19. [PMID: 27154819 DOI: 10.1016/j.gene.2016.04.057] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2016] [Accepted: 04/30/2016] [Indexed: 01/04/2023]
Abstract
Alternative pre-mRNA splicing provides a source of vast protein diversity by removing non-coding sequences (introns) and accurately linking different exonic regions in the correct reading frame. The regulation of alternative splicing is essential for various cellular functions in both pathological and physiological conditions. In eukaryotic cells, this process is commonly used to increase proteomic diversity and to control gene expression either co- or post-transcriptionally. Alternative splicing occurs within a megadalton-sized, multi-component machine consisting of RNA and proteins; during the splicing process, this complex undergoes dynamic changes via RNA-RNA, protein-protein and RNA-protein interactions. Co-transcriptional splicing functionally integrates the transcriptional machinery, thereby enabling the two processes to influence one another, whereas post-transcriptional splicing facilitates the coupling of RNA splicing with post-splicing events. This review addresses the structural aspects of spliceosomes and the mechanistic implications of their stepwise assembly on the regulation of pre-mRNA splicing. Moreover, the role of phosphorylation-based, signal-induced changes in the regulation of the splicing process is demonstrated.
Collapse
|
223
|
Rutenberg-Schoenberg M, Sexton AN, Simon MD. The Properties of Long Noncoding RNAs That Regulate Chromatin. Annu Rev Genomics Hum Genet 2016; 17:69-94. [PMID: 27147088 DOI: 10.1146/annurev-genom-090314-024939] [Citation(s) in RCA: 66] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Beyond coding for proteins, RNA molecules have well-established functions in the posttranscriptional regulation of gene expression. Less clear are the upstream roles of RNA in regulating transcription and chromatin-based processes in the nucleus. RNA is transcribed in the nucleus, so it is logical that RNA could play diverse and broad roles that would impact human physiology. Indeed, this idea is supported by well-established examples of noncoding RNAs that affect chromatin structure and function. There has been dramatic growth in studies focused on the nuclear roles of long noncoding RNAs (lncRNAs). Although little is known about the biochemical mechanisms of these lncRNAs, there is a developing consensus regarding the challenges of defining lncRNA function and mechanism. In this review, we examine the definition, discovery, functions, and mechanisms of lncRNAs. We emphasize areas where challenges remain and where consensus among laboratories has underscored the exciting ways in which human lncRNAs may affect chromatin biology.
Collapse
Affiliation(s)
- Michael Rutenberg-Schoenberg
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06511; , , .,Chemical Biology Institute, Yale University, West Haven, Connecticut 06516
| | - Alec N Sexton
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06511; , , .,Chemical Biology Institute, Yale University, West Haven, Connecticut 06516
| | - Matthew D Simon
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06511; , , .,Chemical Biology Institute, Yale University, West Haven, Connecticut 06516
| |
Collapse
|
224
|
Single molecule approaches for quantifying transcription and degradation rates in intact mammalian tissues. Methods 2016; 98:134-142. [DOI: 10.1016/j.ymeth.2015.11.015] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2015] [Revised: 11/15/2015] [Accepted: 11/19/2015] [Indexed: 11/23/2022] Open
|
225
|
Fiszbein A, Giono LE, Quaglino A, Berardino BG, Sigaut L, von Bilderling C, Schor IE, Enriqué Steinberg JH, Rossi M, Pietrasanta LI, Caramelo JJ, Srebrow A, Kornblihtt AR. Alternative Splicing of G9a Regulates Neuronal Differentiation. Cell Rep 2016; 14:2797-808. [PMID: 26997278 DOI: 10.1016/j.celrep.2016.02.063] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Revised: 01/25/2016] [Accepted: 02/12/2016] [Indexed: 01/08/2023] Open
Abstract
Chromatin modifications are critical for the establishment and maintenance of differentiation programs. G9a, the enzyme responsible for histone H3 lysine 9 dimethylation in mammalian euchromatin, exists as two isoforms with differential inclusion of exon 10 (E10) through alternative splicing. We find that the G9a methyltransferase is required for differentiation of the mouse neuronal cell line N2a and that E10 inclusion increases during neuronal differentiation of cultured cells, as well as in the developing mouse brain. Although E10 inclusion greatly stimulates overall H3K9me2 levels, it does not affect G9a catalytic activity. Instead, E10 increases G9a nuclear localization. We show that the G9a E10(+) isoform is necessary for neuron differentiation and regulates the alternative splicing pattern of its own pre-mRNA, enhancing E10 inclusion. Overall, our findings indicate that by regulating its own alternative splicing, G9a promotes neuron differentiation and creates a positive feedback loop that reinforces cellular commitment to differentiation.
Collapse
Affiliation(s)
- Ana Fiszbein
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE-CONICET), Ciudad Universitaria Pabellón II, C1428EHA Buenos Aires, Argentina
| | - Luciana E Giono
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE-CONICET), Ciudad Universitaria Pabellón II, C1428EHA Buenos Aires, Argentina
| | - Ana Quaglino
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE-CONICET), Ciudad Universitaria Pabellón II, C1428EHA Buenos Aires, Argentina
| | - Bruno G Berardino
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria Pabellón II, C1428EHA Buenos Aires, Argentina
| | - Lorena Sigaut
- Departamento de Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and IFIBA-CONICET, Cuidad Universitaria Pabellón I, C1428EHA Buenos Aires, Argentina
| | - Catalina von Bilderling
- Departamento de Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and IFIBA-CONICET, Cuidad Universitaria Pabellón I, C1428EHA Buenos Aires, Argentina
| | - Ignacio E Schor
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE-CONICET), Ciudad Universitaria Pabellón II, C1428EHA Buenos Aires, Argentina
| | - Juliana H Enriqué Steinberg
- Instituto de Investigación en Biomedicina de Buenos Aires CONICET, Partner Institute of the Max Planck Society, C1425FQD Buenos Aires, Argentina
| | - Mario Rossi
- Instituto de Investigación en Biomedicina de Buenos Aires CONICET, Partner Institute of the Max Planck Society, C1425FQD Buenos Aires, Argentina
| | - Lía I Pietrasanta
- Departamento de Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and IFIBA-CONICET, Cuidad Universitaria Pabellón I, C1428EHA Buenos Aires, Argentina; Centro de Microscopías Avanzadas, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Cuidad Universitaria, C1428EHA Buenos Aires, Argentina
| | - Julio J Caramelo
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria Pabellón II, C1428EHA Buenos Aires, Argentina; Fundación Instituto Leloir, C1405BWE Buenos Aires, Argentina
| | - Anabella Srebrow
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE-CONICET), Ciudad Universitaria Pabellón II, C1428EHA Buenos Aires, Argentina
| | - Alberto R Kornblihtt
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires and Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE-CONICET), Ciudad Universitaria Pabellón II, C1428EHA Buenos Aires, Argentina.
| |
Collapse
|
226
|
Meng Y, Yi X, Li X, Hu C, Wang J, Bai L, Czajkowsky DM, Shao Z. The non-coding RNA composition of the mitotic chromosome by 5'-tag sequencing. Nucleic Acids Res 2016; 44:4934-46. [PMID: 27016738 PMCID: PMC4889943 DOI: 10.1093/nar/gkw195] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 03/15/2016] [Indexed: 12/16/2022] Open
Abstract
Mitotic chromosomes are one of the most commonly recognized sub-cellular structures in eukaryotic cells. Yet basic information necessary to understand their structure and assembly, such as their composition, is still lacking. Recent proteomic studies have begun to fill this void, identifying hundreds of RNA-binding proteins bound to mitotic chromosomes. However, by contrast, there are only two RNA species (U3 snRNA and rRNA) that are known to be associated with the mitotic chromosome, suggesting that there are many mitotic chromosome-associated RNAs (mCARs) not yet identified. Here, using a targeted protocol based on 5'-tag sequencing to profile the mammalian mCAR population, we report the identification of 1279 mCARs, the majority of which are ncRNAs, including lncRNAs that exhibit greater conservation across 60 vertebrate species than the entire population of lncRNAs. There is also a significant enrichment of snoRNAs and specific SINE RNAs. Finally, ∼40% of the mCARs are presently unannotated, many of which are as abundant as the annotated mCARs, suggesting that there are also many novel ncRNAs in the mCARs. Overall, the mCARs identified here, together with the previous proteomic and genomic data, constitute the first comprehensive catalogue of the molecular composition of the eukaryotic mitotic chromosomes.
Collapse
Affiliation(s)
- Yicong Meng
- Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xianfu Yi
- School of Biomedical Engineering, Tianjin Medical University, Tianjin 300070, China
| | - Xinhui Li
- Bio-ID Center, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Chuansheng Hu
- Bio-ID Center, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Ju Wang
- School of Biomedical Engineering, Tianjin Medical University, Tianjin 300070, China
| | - Ling Bai
- Bio-ID Center, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Daniel M Czajkowsky
- Bio-ID Center, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Zhifeng Shao
- Bio-ID Center, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China State Key Laboratory of Oncogenes & Related Genes, Shanghai Jiao Tong University, Shanghai 200240, China
| |
Collapse
|
227
|
Song L, Sabunciyan S, Florea L. CLASS2: accurate and efficient splice variant annotation from RNA-seq reads. Nucleic Acids Res 2016; 44:e98. [PMID: 26975657 PMCID: PMC4889935 DOI: 10.1093/nar/gkw158] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Accepted: 02/28/2016] [Indexed: 11/29/2022] Open
Abstract
Next generation sequencing of cellular RNA is making it possible to characterize genes and alternative splicing in unprecedented detail. However, designing bioinformatics tools to accurately capture splicing variation has proven difficult. Current programs can find major isoforms of a gene but miss lower abundance variants, or are sensitive but imprecise. CLASS2 is a novel open source tool for accurate genome-guided transcriptome assembly from RNA-seq reads based on the model of splice graph. An extension of our program CLASS, CLASS2 jointly optimizes read patterns and the number of supporting reads to score and prioritize transcripts, implemented in a novel, scalable and efficient dynamic programming algorithm. When compared against reference programs, CLASS2 had the best overall accuracy and could detect up to twice as many splicing events with precision similar to the best reference program. Notably, it was the only tool to produce consistently reliable transcript models for a wide range of applications and sequencing strategies, including ribosomal RNA-depleted samples. Lightweight and multi-threaded, CLASS2 requires <3GB RAM and can analyze a 350 million read set within hours, and can be widely applied to transcriptomics studies ranging from clinical RNA sequencing, to alternative splicing analyses, and to the annotation of new genomes.
Collapse
Affiliation(s)
- Li Song
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD 21205, USA Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Sarven Sabunciyan
- Department of Pediatrics, Johns Hopkins School of Medicine, Baltimore, MD 21287, USA
| | - Liliana Florea
- Center for Computational Biology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD 21205, USA Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD 21205, USA
| |
Collapse
|
228
|
Wang HLV, Chekanova JA. Small RNAs: essential regulators of gene expression and defenses against environmental stresses in plants. WILEY INTERDISCIPLINARY REVIEWS-RNA 2016; 7:356-81. [PMID: 26924473 DOI: 10.1002/wrna.1340] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2015] [Revised: 12/28/2015] [Accepted: 12/30/2015] [Indexed: 12/18/2022]
Abstract
Eukaryotic genomes produce thousands of diverse small RNAs (smRNAs), which play vital roles in regulating gene expression in all conditions, including in survival of biotic and abiotic environmental stresses. SmRNA pathways intersect with most of the pathways regulating different steps in the life of a messenger RNA (mRNA), starting from transcription and ending at mRNA decay. SmRNAs function in both nuclear and cytoplasmic compartments; the regulation of mRNA stability and translation in the cytoplasm and the epigenetic regulation of gene expression in the nucleus are the main and best-known modes of smRNA action. However, recent evidence from animal systems indicates that smRNAs and RNA interference (RNAi) also participate in the regulation of alternative pre-mRNA splicing, one of the most crucial steps in the fast, efficient global reprogramming of gene expression required for survival under stress. Emerging evidence from bioinformatics studies indicates that a specific class of plant smRNAs, induced by various abiotic stresses, the sutr-siRNAs, has the potential to target regulatory regions within introns and thus may act in the regulation of splicing in response to stresses. This review summarizes the major types of plant smRNAs in the context of their mechanisms of action and also provides examples of their involvement in regulation of gene expression in response to environmental cues and developmental stresses. In addition, we describe current advances in our understanding of how smRNAs function in the regulation of pre-mRNA splicing. WIREs RNA 2016, 7:356-381. doi: 10.1002/wrna.1340 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Hsiao-Lin V Wang
- School of Biological Sciences, University of Missouri-Kansas City, Kansas City, MO, USA
| | - Julia A Chekanova
- School of Biological Sciences, University of Missouri-Kansas City, Kansas City, MO, USA
| |
Collapse
|
229
|
Abstract
RNA splicing represents a post-transcriptional mechanism to generate multiple functional RNAs or proteins from a single transcript. The evolution of RNA splicing is a prime example of the Darwinian function follows form concept. A mutation that leads to a new mRNA (form) that encodes for a new functional protein (function) is likely to be retained, and this way, the genome has gradually evolved to encode for genes with multiple isoforms, thereby creating an enormously diverse transcriptome. Advances in technologies to characterize RNA populations have led to a better understanding of RNA processing in health and disease. In the heart, alternative splicing is increasingly being recognized as an important layer of post-transcriptional gene regulation. Moreover, the recent identification of several cardiac splice factors, such as RNA-binding motif protein 20 and SF3B1, not only provided important insight into the mechanisms underlying alternative splicing but also revealed how these splicing factors impact functional properties of the heart. Here, we review our current knowledge of alternative splicing in the heart, with a particular focus on the major and minor spliceosome, the factors controlling RNA splicing, and the role of alternative splicing in cardiac development and disease.
Collapse
Affiliation(s)
- Maarten M.G. van den Hoogenhof
- From the Department of Experimental Cardiology, Academic Medical Center, University of Amsterdam, Amsterdam, The Netherlands
| | - Yigal M. Pinto
- From the Department of Experimental Cardiology, Academic Medical Center, University of Amsterdam, Amsterdam, The Netherlands
| | - Esther E. Creemers
- From the Department of Experimental Cardiology, Academic Medical Center, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
230
|
Long non-coding RNAs display higher natural expression variation than protein-coding genes in healthy humans. Genome Biol 2016; 17:14. [PMID: 26821746 PMCID: PMC4731934 DOI: 10.1186/s13059-016-0873-8] [Citation(s) in RCA: 111] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 01/06/2016] [Indexed: 02/06/2023] Open
Abstract
Background Long non-coding RNAs (lncRNAs) are increasingly implicated as gene regulators and may ultimately be more numerous than protein-coding genes in the human genome. Despite large numbers of reported lncRNAs, reference annotations are likely incomplete due to their lower and tighter tissue-specific expression compared to mRNAs. An unexplored factor potentially confounding lncRNA identification is inter-individual expression variability. Here, we characterize lncRNA natural expression variability in human primary granulocytes. Results We annotate granulocyte lncRNAs and mRNAs in RNA-seq data from 10 healthy individuals, identifying multiple lncRNAs absent from reference annotations, and use this to investigate three known features (higher tissue-specificity, lower expression, and reduced splicing efficiency) of lncRNAs relative to mRNAs. Expression variability was examined in seven individuals sampled three times at 1- or more than 1-month intervals. We show that lncRNAs display significantly more inter-individual expression variability compared to mRNAs. We confirm this finding in two independent human datasets by analyzing multiple tissues from the GTEx project and lymphoblastoid cell lines from the GEUVADIS project. Using the latter dataset we also show that including more human donors into the transcriptome annotation pipeline allows identification of an increasing number of lncRNAs, but minimally affects mRNA gene number. Conclusions A comprehensive annotation of lncRNAs is known to require an approach that is sensitive to low and tight tissue-specific expression. Here we show that increased inter-individual expression variability is an additional general lncRNA feature to consider when creating a comprehensive annotation of human lncRNAs or proposing their use as prognostic or disease markers. Electronic supplementary material The online version of this article (doi:10.1186/s13059-016-0873-8) contains supplementary material, which is available to authorized users.
Collapse
|
231
|
Abstract
Most transcriptome studies involve sequencing and quantification of steady-state mRNA by isolating and sequencing poly (A) RNA. Although this type of sequencing data is informative to determine steady-state mRNA levels it does not provide information on transcriptional output and thus may not always reflect changes in transcriptional regulation of gene expression. Furthermore, sequencing poly (A) RNA may miss transcribed regions of the genome not usually modified by polyadenylation which includes many long noncoding RNAs. Here, we describe nuclear-RNA sequencing (nucRNA-seq) which investigates the transcriptional landscape through sequencing and quantification of nuclear RNAs which are both unspliced and spliced transcripts for protein-coding genes and nuclear-retained long noncoding RNAs.
Collapse
|
232
|
Marina RJ, Sturgill D, Bailly MA, Thenoz M, Varma G, Prigge MF, Nanan KK, Shukla S, Haque N, Oberdoerffer S. TET-catalyzed oxidation of intragenic 5-methylcytosine regulates CTCF-dependent alternative splicing. EMBO J 2015; 35:335-55. [PMID: 26711177 DOI: 10.15252/embj.201593235] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Accepted: 11/25/2015] [Indexed: 01/09/2023] Open
Abstract
Intragenic 5-methylcytosine and CTCF mediate opposing effects on pre-mRNA splicing: CTCF promotes inclusion of weak upstream exons through RNA polymerase II pausing, whereas 5-methylcytosine evicts CTCF, leading to exon exclusion. However, the mechanisms governing dynamic DNA methylation at CTCF-binding sites were unclear. Here, we reveal the methylcytosine dioxygenases TET1 and TET2 as active regulators of CTCF-mediated alternative splicing through conversion of 5-methylcytosine to its oxidation derivatives. 5-hydroxymethylcytosine and 5-carboxylcytosine are enriched at an intragenic CTCF-binding sites in the CD45 model gene and are associated with alternative exon inclusion. Reduced TET levels culminate in increased 5-methylcytosine, resulting in CTCF eviction and exon exclusion. In vitro analyses establish the oxidation derivatives are not sufficient to stimulate splicing, but efficiently promote CTCF association. We further show genomewide that reciprocal exchange of 5-hydroxymethylcytosine and 5-methylcytosine at downstream CTCF-binding sites is a general feature of alternative splicing in naïve and activated CD4(+) T cells. These findings significantly expand our current concept of the pre-mRNA "splicing code" to include dynamic intragenic DNA methylation catalyzed by the TET proteins.
Collapse
Affiliation(s)
- Ryan J Marina
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| | - David Sturgill
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| | - Marc A Bailly
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| | - Morgan Thenoz
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| | - Garima Varma
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| | - Maria F Prigge
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| | - Kyster K Nanan
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| | - Sanjeev Shukla
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| | - Nazmul Haque
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| | - Shalini Oberdoerffer
- Center for Cancer Research, Laboratory of Receptor Biology and Gene Expression, National Cancer Institute, Bethesda, MD, USA
| |
Collapse
|
233
|
Barrass JD, Reid JEA, Huang Y, Hector RD, Sanguinetti G, Beggs JD, Granneman S. Transcriptome-wide RNA processing kinetics revealed using extremely short 4tU labeling. Genome Biol 2015; 16:282. [PMID: 26679539 PMCID: PMC4699367 DOI: 10.1186/s13059-015-0848-1] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2015] [Accepted: 11/30/2015] [Indexed: 11/26/2022] Open
Abstract
BACKGROUND RNA levels detected at steady state are the consequence of multiple dynamic processes within the cell. In addition to synthesis and decay, transcripts undergo processing. Metabolic tagging with a nucleotide analog is one way of determining the relative contributions of synthesis, decay and conversion processes globally. RESULTS By improving 4-thiouracil labeling of RNA in Saccharomyces cerevisiae we were able to isolate RNA produced during as little as 1 minute, allowing the detection of nascent pervasive transcription. Nascent RNA labeled for 1.5, 2.5 or 5 minutes was isolated and analyzed by reverse transcriptase-quantitative polymerase chain reaction and RNA sequencing. High kinetic resolution enabled detection and analysis of short-lived non-coding RNAs as well as intron-containing pre-mRNAs in wild-type yeast. From these data we measured the relative stability of pre-mRNA species with different high turnover rates and investigated potential correlations with sequence features. CONCLUSIONS Our analysis of non-coding RNAs reveals a highly significant association between non-coding RNA stability, transcript length and predicted secondary structure. Our quantitative analysis of the kinetics of pre-mRNA splicing in yeast reveals that ribosomal protein transcripts are more efficiently spliced if they contain intron secondary structures that are predicted to be less stable. These data, in combination with previous results, indicate that there is an optimal range of stability of intron secondary structures that allows for rapid splicing.
Collapse
Affiliation(s)
- J David Barrass
- Wellcome Trust Centre for Cell Biology, University of Edinburgh, Edinburgh, EH9 3BF, UK
| | - Jane E A Reid
- Wellcome Trust Centre for Cell Biology, University of Edinburgh, Edinburgh, EH9 3BF, UK
| | - Yuanhua Huang
- School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, UK
| | - Ralph D Hector
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Edinburgh, EH9 3BF, UK
- Present Address: Institute of Neuroscience and Psychology, University of Glasgow, Glasgow, G12 8QB, UK
| | - Guido Sanguinetti
- School of Informatics, University of Edinburgh, Edinburgh, EH8 9AB, UK
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Edinburgh, EH9 3BF, UK
| | - Jean D Beggs
- Wellcome Trust Centre for Cell Biology, University of Edinburgh, Edinburgh, EH9 3BF, UK.
| | - Sander Granneman
- Centre for Synthetic and Systems Biology (SynthSys), University of Edinburgh, Edinburgh, EH9 3BF, UK.
| |
Collapse
|
234
|
Naftelberg S, Schor IE, Ast G, Kornblihtt AR. Regulation of alternative splicing through coupling with transcription and chromatin structure. Annu Rev Biochem 2015; 84:165-98. [PMID: 26034889 DOI: 10.1146/annurev-biochem-060614-034242] [Citation(s) in RCA: 323] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Alternative precursor messenger RNA (pre-mRNA) splicing plays a pivotal role in the flow of genetic information from DNA to proteins by expanding the coding capacity of genomes. Regulation of alternative splicing is as important as regulation of transcription to determine cell- and tissue-specific features, normal cell functioning, and responses of eukaryotic cells to external cues. Its importance is confirmed by the evolutionary conservation and diversification of alternative splicing and the fact that its deregulation causes hereditary disease and cancer. This review discusses the multiple layers of cotranscriptional regulation of alternative splicing in which chromatin structure, DNA methylation, histone marks, and nucleosome positioning play a fundamental role in providing a dynamic scaffold for interactions between the splicing and transcription machineries. We focus on evidence for how the kinetics of RNA polymerase II (RNAPII) elongation and the recruitment of splicing factors and adaptor proteins to chromatin components act in coordination to regulate alternative splicing.
Collapse
Affiliation(s)
- Shiran Naftelberg
- Sackler Medical School, Tel Aviv University, Tel Aviv 69978, Israel;
| | | | | | | |
Collapse
|
235
|
Bahar Halpern K, Caspi I, Lemze D, Levy M, Landen S, Elinav E, Ulitsky I, Itzkovitz S. Nuclear Retention of mRNA in Mammalian Tissues. Cell Rep 2015; 13:2653-62. [PMID: 26711333 PMCID: PMC4700052 DOI: 10.1016/j.celrep.2015.11.036] [Citation(s) in RCA: 203] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Revised: 09/15/2015] [Accepted: 11/10/2015] [Indexed: 12/30/2022] Open
Abstract
mRNA is thought to predominantly reside in the cytoplasm, where it is translated and eventually degraded. Although nuclear retention of mRNA has a regulatory potential, it is considered extremely rare in mammals. Here, to explore the extent of mRNA retention in metabolic tissues, we combine deep sequencing of nuclear and cytoplasmic RNA fractions with single-molecule transcript imaging in mouse beta cells, liver, and gut. We identify a wide range of protein-coding genes for which the levels of spliced polyadenylated mRNA are higher in the nucleus than in the cytoplasm. These include genes such as the transcription factor ChREBP, Nlrp6, Glucokinase, and Glucagon receptor. We demonstrate that nuclear retention of mRNA can efficiently buffer cytoplasmic transcript levels from noise that emanates from transcriptional bursts. Our study challenges the view that transcripts predominantly reside in the cytoplasm and reveals a role of the nucleus in dampening gene expression noise. Genome-wide catalog of nuclear and cytoplasmic mRNA in mouse tissues Spliced, polyadenylated mRNA is retained in the nucleus for many protein-coding genes Retained genes include ChREBP and liver Nlrp6, co-localized with nuclear speckles Nuclear retention of mRNA reduces cytoplasmic gene expression noise
Collapse
Affiliation(s)
- Keren Bahar Halpern
- Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Inbal Caspi
- Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Doron Lemze
- Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Maayan Levy
- Department of Immunology, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Shanie Landen
- Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Eran Elinav
- Department of Immunology, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Igor Ulitsky
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Shalev Itzkovitz
- Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot 76100, Israel.
| |
Collapse
|
236
|
Gazzoli I, Pulyakhina I, Verwey NE, Ariyurek Y, Laros JFJ, 't Hoen PAC, Aartsma-Rus A. Non-sequential and multi-step splicing of the dystrophin transcript. RNA Biol 2015; 13:290-305. [PMID: 26670121 PMCID: PMC4829307 DOI: 10.1080/15476286.2015.1125074] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The dystrophin protein encoding DMD gene is the longest human gene. The 2.2 Mb long human dystrophin transcript takes 16 hours to be transcribed and is co-transcriptionally spliced. It contains long introns (24 over 10kb long, 5 over 100kb long) and the heterogeneity in intron size makes it an ideal transcript to study different aspects of the human splicing process. Splicing is a complex process and much is unknown regarding the splicing of long introns in human genes. Here, we used ultra-deep transcript sequencing to characterize splicing of the dystrophin transcripts in 3 different human skeletal muscle cell lines, and explored the order of intron removal and multi-step splicing. Coverage and read pair analyses showed that around 40% of the introns were not always removed sequentially. Additionally, for the first time, we report that non-consecutive intron removal resulted in 3 or more joined exons which are flanked by unspliced introns and we defined these joined exons as an exon block. Lastly, computational and experimental data revealed that, for the majority of dystrophin introns, multistep splicing events are used to splice out a single intron. Overall, our data show for the first time in a human transcript, that multi-step intron removal is a general feature of mRNA splicing.
Collapse
Affiliation(s)
- Isabella Gazzoli
- a Department of Human Genetics , Leiden University Medical Center , Leiden , the Netherlands
| | - Irina Pulyakhina
- a Department of Human Genetics , Leiden University Medical Center , Leiden , the Netherlands
| | - Nisha E Verwey
- a Department of Human Genetics , Leiden University Medical Center , Leiden , the Netherlands
| | - Yavuz Ariyurek
- b Leiden Genome Technology Center, Leiden University Medical Center , Leiden , The Netherlands
| | - Jeroen F J Laros
- a Department of Human Genetics , Leiden University Medical Center , Leiden , the Netherlands.,b Leiden Genome Technology Center, Leiden University Medical Center , Leiden , The Netherlands
| | - Peter A C 't Hoen
- a Department of Human Genetics , Leiden University Medical Center , Leiden , the Netherlands
| | - Annemieke Aartsma-Rus
- a Department of Human Genetics , Leiden University Medical Center , Leiden , the Netherlands
| |
Collapse
|
237
|
Identifying transcription start sites and active enhancer elements using BruUV-seq. Sci Rep 2015; 5:17978. [PMID: 26656874 PMCID: PMC4675984 DOI: 10.1038/srep17978] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2015] [Accepted: 11/10/2015] [Indexed: 01/12/2023] Open
Abstract
BruUV-seq utilizes UV light to introduce transcription-blocking DNA lesions randomly in the genome prior to bromouridine-labeling and deep sequencing of nascent RNA. By inhibiting transcription elongation, but not initiation, pre-treatment with UV light leads to a redistribution of transcription reads resulting in the enhancement of nascent RNA signal towards the 5′-end of genes promoting the identification of transcription start sites (TSSs). Furthermore, transcripts associated with arrested RNA polymerases are protected from 3′–5′ degradation and thus, unstable transcripts such as putative enhancer RNA (eRNA) are dramatically increased. Validation of BruUV-seq against GRO-cap that identifies capped run-on transcripts showed that most BruUV-seq peaks overlapped with GRO-cap signal over both TSSs and enhancer elements. Finally, BruUV-seq identified putative enhancer elements induced by tumor necrosis factor (TNF) treatment concomitant with expression of nearby TNF-induced genes. Taken together, BruUV-seq is a powerful new approach for identifying TSSs and active enhancer elements genome-wide in intact cells.
Collapse
|
238
|
Carey LB. RNA polymerase errors cause splicing defects and can be regulated by differential expression of RNA polymerase subunits. eLife 2015; 4. [PMID: 26652005 PMCID: PMC4868539 DOI: 10.7554/elife.09945] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Accepted: 10/26/2015] [Indexed: 12/26/2022] Open
Abstract
Errors during transcription may play an important role in determining cellular phenotypes: the RNA polymerase error rate is >4 orders of magnitude higher than that of DNA polymerase and errors are amplified >1000-fold due to translation. However, current methods to measure RNA polymerase fidelity are low-throughout, technically challenging, and organism specific. Here I show that changes in RNA polymerase fidelity can be measured using standard RNA sequencing protocols. I find that RNA polymerase is error-prone, and these errors can result in splicing defects. Furthermore, I find that differential expression of RNA polymerase subunits causes changes in RNA polymerase fidelity, and that coding sequences may have evolved to minimize the effect of these errors. These results suggest that errors caused by RNA polymerase may be a major source of stochastic variability at the level of single cells.
Collapse
Affiliation(s)
- Lucas B Carey
- Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
239
|
Abstract
The human transcriptome is composed of a vast RNA population that undergoes further diversification by splicing. Detecting specific splice sites in this large sequence pool is the responsibility of the major and minor spliceosomes in collaboration with numerous splicing factors. This complexity makes splicing susceptible to sequence polymorphisms and deleterious mutations. Indeed, RNA mis-splicing underlies a growing number of human diseases with substantial societal consequences. Here, we provide an overview of RNA splicing mechanisms followed by a discussion of disease-associated errors, with an emphasis on recently described mutations that have provided new insights into splicing regulation. We also discuss emerging strategies for splicing-modulating therapy.
Collapse
Affiliation(s)
- Marina M Scotti
- Department of Molecular Genetics and Microbiology, Center for NeuroGenetics and the Genetics Institute, University of Florida, College of Medicine, Gainesville, Florida 32610-3610 USA
| | - Maurice S Swanson
- Department of Molecular Genetics and Microbiology, Center for NeuroGenetics and the Genetics Institute, University of Florida, College of Medicine, Gainesville, Florida 32610-3610 USA
| |
Collapse
|
240
|
Niemelä EH, Frilander MJ. Regulation of gene expression through inefficient splicing of U12-type introns. RNA Biol 2015; 11:1325-9. [PMID: 25692230 PMCID: PMC4615840 DOI: 10.1080/15476286.2014.996454] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
U12-type introns are a rare class of nuclear introns that are removed by a dedicated U12-dependent spliceosome and are thought to regulate the expression of their target genes owing through their slower splicing reaction. Recent genome-wide studies on the splicing of U12-type introns are now providing new insights on the biological significance of this parallel splicing machinery. The new studies cover multiple different organisms and experimental systems, including human patient cells with mutations in the components of the minor spliceosome, zebrafish with similar mutations and various experimentally manipulated human cells and Arabidopsis plants. Here, we will discuss the potential implications of these studies on the understanding of the mechanism and regulation of the minor spliceosome, as well as their medical implications.
Collapse
Affiliation(s)
- Elina H Niemelä
- a Institute of Biotechnology; Genome Biology Research Program ; University of Helsinki ; Helsinki , Finland
| | | |
Collapse
|
241
|
Curado J, Iannone C, Tilgner H, Valcárcel J, Guigó R. Promoter-like epigenetic signatures in exons displaying cell type-specific splicing. Genome Biol 2015; 16:236. [PMID: 26498677 PMCID: PMC4619081 DOI: 10.1186/s13059-015-0797-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2015] [Accepted: 10/05/2015] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND Pre-mRNA splicing occurs mainly co-transcriptionally, and both nucleosome density and histone modifications have been proposed to play a role in splice site recognition and regulation. However, the extent and mechanisms behind this interplay remain poorly understood. RESULTS We use transcriptomic and epigenomic data generated by the ENCODE project to investigate the association between chromatin structure and alternative splicing. We find a strong and significant positive association between H3K9ac, H3K27ac, H3K4me3, epigenetic marks characteristic of active promoters, and exon inclusion in a small but well-defined class of exons, representing approximately 4 % of all regulated exons. These exons are systematically maintained at comparatively low levels of inclusion across cell types, but their inclusion is significantly enhanced in particular cell types when in physical proximity to active promoters. CONCLUSION Histone modifications and other chromatin features that activate transcription can be co-opted to participate in the regulation of the splicing of exons that are in physical proximity to promoter regions.
Collapse
Affiliation(s)
- Joao Curado
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain
- Graduate program in Areas of Basic and Applied Biology, Abel Salazar Biomedical Sciences Institute, University of Porto, 4099-003, Porto, Portugal
| | - Camilla Iannone
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain
- Universitat Pompeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain
| | - Hagen Tilgner
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain
- Department of Genetics, Stanford University, 300 Pasteur Dr., Stanford, CA, 94305-5120, USA
| | - Juan Valcárcel
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain
- Universitat Pompeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain
- Institució Catalana de Recerca i Estudis Avançats, Pg Lluis Companys 23, 08010, Barcelona, Catalonia, Spain
| | - Roderic Guigó
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain.
- Universitat Pompeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain.
| |
Collapse
|
242
|
Abstract
Splicing is a predominantly co-transcriptional process that has been shown to be tightly coupled to transcription. Chromatin structure is a key factor that mediates this functional coupling. In light of recent evidence that shows the importance of higher order chromatin organization in the coordination and regulation of gene expression, we discuss here the possible roles of long-range chromatin organization in splicing and alternative splicing regulation.
Collapse
Affiliation(s)
- Luciana I Gómez Acuña
- a Laboratorio de Fisiología y Biología Molecular, Departamento de Fisiología, Biología Molecular y Celular, IFIBYNE-CONICET, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires; Buenos Aires, Argentina
| | | |
Collapse
|
243
|
Yang L. Splicing noncoding RNAs from the inside out. WILEY INTERDISCIPLINARY REVIEWS-RNA 2015; 6:651-60. [PMID: 26424453 PMCID: PMC5054931 DOI: 10.1002/wrna.1307] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 08/12/2015] [Accepted: 08/21/2015] [Indexed: 12/21/2022]
Abstract
Eukaryotic precursor-messenger RNAs (pre-mRNAs) undergo splicing to remove intragenic regions (introns) and ligate expressed regions (exons) together. Unlike exons in the mature messenger RNAs (mRNAs) that are used for translation, introns that are spliced out of pre-mRNAs were generally believed to lack function and to be degraded. However, recent studies have revealed that a large group of spliced introns can escape complete degradation and are processed to generate noncoding RNAs (ncRNAs), including different types of small RNAs, long-noncoding RNAs, and circular RNAs. Strikingly, exonic sequences can be also back-spliced from pre-mRNAs to form stable circular RNAs. Together, the findings that ncRNAs can be spliced out of mRNA precursors not only expand the ever-growing repertoire of ncRNAs that originate from different genomic regions, but also reveal the unexpected transcriptomic complexity and functional capacity of eukaryotic genomes.
Collapse
Affiliation(s)
- Li Yang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology; CAS Center for Excellence in Brain Science and Intelligence Technology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China.,School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| |
Collapse
|
244
|
Abstract
Two opposing models have been proposed to describe the function of the MYC oncoprotein in shaping cellular transcriptomes: one posits that MYC amplifies transcription at all active loci; the other that MYC differentially controls discrete sets of genes, the products of which affect global transcript levels. Here, we argue that differential gene regulation by MYC is the sole unifying model that is consistent with all available data. Among other effects, MYC endows cells with physiological and metabolic changes that have the potential to feed back on global RNA production, processing and turnover. The field is progressing steadily towards a full characterization of the MYC-regulated genes and pathways that mediate these biological effects and - by the same token - endow MYC with its pervasive oncogenic potential.
Collapse
Affiliation(s)
- Theresia R Kress
- Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia (IIT) and Department of Experimental Oncology, European Institute of Oncology (IEO), Via Adamello 16, 20139 Milan, Italy
| | - Arianna Sabò
- Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia (IIT) and Department of Experimental Oncology, European Institute of Oncology (IEO), Via Adamello 16, 20139 Milan, Italy
| | - Bruno Amati
- Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia (IIT) and Department of Experimental Oncology, European Institute of Oncology (IEO), Via Adamello 16, 20139 Milan, Italy
- Department of Experimental Oncology, European Institute of Oncology (IEO), Via Adamello 16, 20139 Milan, Italy
| |
Collapse
|
245
|
Petrillo E, Godoy Herz MA, Barta A, Kalyna M, Kornblihtt AR. Let there be light: regulation of gene expression in plants. RNA Biol 2015; 11:1215-20. [PMID: 25590224 PMCID: PMC4615654 DOI: 10.4161/15476286.2014.972852] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Gene expression regulation relies on a variety of molecular mechanisms affecting different steps of a messenger RNA (mRNA) life: transcription, processing, splicing, alternative splicing, transport, translation, storage and decay. Light induces massive reprogramming of gene expression in plants. Differences in alternative splicing patterns in response to environmental stimuli suggest that alternative splicing plays an important role in plant adaptation to changing life conditions. In a recent publication, our laboratories showed that light regulates alternative splicing of a subset of Arabidopsis genes encoding proteins involved in RNA processing by chloroplast retrograde signals. The light effect on alternative splicing is also observed in roots when the communication with the photosynthetic tissues is not interrupted, suggesting that a signaling molecule travels through the plant. These results point at alternative splicing regulation by retrograde signals as an important mechanism for plant adaptation to their environment.
Collapse
Key Words
- DBMIB, 2,5-dibromo-3-methyl-6-isopropyl-benzoquinone
- DCMU, 3-(3,4-dichlorophenyl)-1,1-dimethylurea
- PQ, plastoquinone
- PS, photosystem
- Pol II, RNA polymerase II
- RNA
- ROS, reactive oxygen species
- alternative splicing
- chloroplast
- light
- mRNA, messenger RNA
- photoreceptors
- retrograde signaling
Collapse
Affiliation(s)
- Ezequiel Petrillo
- a Max F. Perutz Laboratories ; Medical University of Vienna ; Vienna , Austria
| | | | | | | | | |
Collapse
|
246
|
Hensman J, Papastamoulis P, Glaus P, Honkela A, Rattray M. Fast and accurate approximate inference of transcript expression from RNA-seq data. Bioinformatics 2015; 31:3881-9. [PMID: 26315907 PMCID: PMC4673974 DOI: 10.1093/bioinformatics/btv483] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Accepted: 08/07/2015] [Indexed: 11/25/2022] Open
Abstract
Motivation: Assigning RNA-seq reads to their transcript of origin is a fundamental task in transcript expression estimation. Where ambiguities in assignments exist due to transcripts sharing sequence, e.g. alternative isoforms or alleles, the problem can be solved through probabilistic inference. Bayesian methods have been shown to provide accurate transcript abundance estimates compared with competing methods. However, exact Bayesian inference is intractable and approximate methods such as Markov chain Monte Carlo and Variational Bayes (VB) are typically used. While providing a high degree of accuracy and modelling flexibility, standard implementations can be prohibitively slow for large datasets and complex transcriptome annotations. Results: We propose a novel approximate inference scheme based on VB and apply it to an existing model of transcript expression inference from RNA-seq data. Recent advances in VB algorithmics are used to improve the convergence of the algorithm beyond the standard Variational Bayes Expectation Maximization algorithm. We apply our algorithm to simulated and biological datasets, demonstrating a significant increase in speed with only very small loss in accuracy of expression level estimation. We carry out a comparative study against seven popular alternative methods and demonstrate that our new algorithm provides excellent accuracy and inter-replicate consistency while remaining competitive in computation time. Availability and implementation: The methods were implemented in R and C++, and are available as part of the BitSeq project at github.com/BitSeq. The method is also available through the BitSeq Bioconductor package. The source code to reproduce all simulation results can be accessed via github.com/BitSeq/BitSeqVB_benchmarking. Contact:james.hensman@sheffield.ac.uk or panagiotis.papastamoulis@manchester.ac.uk or Magnus.Rattray@manchester.ac.uk Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- James Hensman
- Sheffield Institute for Translational Neuroscience (SITraN), Sheffield, UK
| | | | - Peter Glaus
- School of Computer Science, The University of Manchester, Manchester, UK and
| | - Antti Honkela
- Helsinki Institute for Information Technology (HIIT), Department of Computer Science, University of Helsinki, Helsinki, Finland
| | | |
Collapse
|
247
|
Engelhardt J, Stadler PF. Evolution of the unspliced transcriptome. BMC Evol Biol 2015; 15:166. [PMID: 26289325 PMCID: PMC4546029 DOI: 10.1186/s12862-015-0437-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Accepted: 07/29/2015] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Despite their abundance, unspliced EST data have received little attention as a source of information on non-coding RNAs. Very little is know, therefore, about the genomic distribution of unspliced non-coding transcripts and their relationship with the much better studied regularly spliced products. In particular, their evolution has remained virtually unstudied. RESULTS We systematically study the evidence on unspliced transcripts available in EST annotation tracks for human and mouse, comprising 104,980 and 66,109 unspliced EST clusters, respectively. Roughly one third of these are located totally inside introns of known genes (TINs) and another third overlaps exonic regions (PINs). Eleven percent are "intergenic", far away from any annotated gene. Direct evidence for the independent transcription of many PINs and TINs is obtained from CAGE tag and chromatin data. We predict more than 2000 3'UTR-associated RNA candidates for each human and mouse. Fifteen to twenty percent of the unspliced EST cluster are conserved between human and mouse. With the exception of TINs, the sequences of unspliced EST clusters evolve significantly slower than genomic background. Furthermore, like spliced lincRNAs, they show highly tissue-specific expression patterns. CONCLUSIONS Unspliced long non-coding RNAs are an important, rapidly evolving, component of mammalian transcriptomes. Their analysis is complicated by their preferential association with complex transcribed loci that usually also harbor a plethora of spliced transcripts. Unspliced EST data, although typically disregarded in transcriptome analysis, can be used to gain insights into this rarely investigated transcriptome component. The frequently postulated connection between lack of splicing and nuclear retention and the surprising overlap of chromatin-associated transcripts suggests that this class of transcripts might be involved in chromatin organization and possibly other mechanisms of epigenetic control.
Collapse
Affiliation(s)
- Jan Engelhardt
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Haertelstraße 16-18, Leipzig, D-04107, Germany.
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Haertelstraße 16-18, Leipzig, D-04107, Germany.
- Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103, Germany.
- Fraunhofer Institut for Cell Therapy and Immunology, Perlickstraße 1, Leipzig, D-04103, Germany.
- Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, Vienna, A-1090, Austria.
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg, 1870, Denmark.
- Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, 87501, NM, USA.
| |
Collapse
|
248
|
Chromatin, DNA structure and alternative splicing. FEBS Lett 2015; 589:3370-8. [PMID: 26296319 DOI: 10.1016/j.febslet.2015.08.002] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Revised: 07/31/2015] [Accepted: 08/04/2015] [Indexed: 02/07/2023]
Abstract
Coupling of transcription and alternative splicing via regulation of the transcriptional elongation rate is a well-studied phenomenon. Template features that act as roadblocks for the progression of RNA polymerase II comprise histone modifications and variants, DNA-interacting proteins and chromatin compaction. These may affect alternative splicing decisions by inducing pauses or decreasing elongation rate that change the time-window for splicing regulatory sequences to be recognized. Herein we discuss the evidence supporting the influence of template structural modifications on transcription and splicing, and provide insights about possible roles of non-B DNA conformations on the regulation of alternative splicing.
Collapse
|
249
|
Warns JA, Davie JR, Dhasarathy A. Connecting the dots: chromatin and alternative splicing in EMT. Biochem Cell Biol 2015; 94:12-25. [PMID: 26291837 DOI: 10.1139/bcb-2015-0053] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Nature has devised sophisticated cellular machinery to process mRNA transcripts produced by RNA Polymerase II, removing intronic regions and connecting exons together, to produce mature RNAs. This process, known as splicing, is very closely linked to transcription. Alternative splicing, or the ability to produce different combinations of exons that are spliced together from the same genomic template, is a fundamental means of regulating protein complexity. Similar to transcription, both constitutive and alternative splicing can be regulated by chromatin and its associated factors in response to various signal transduction pathways activated by external stimuli. This regulation can vary between different cell types, and interference with these pathways can lead to changes in splicing, often resulting in aberrant cellular states and disease. The epithelial to mesenchymal transition (EMT), which leads to cancer metastasis, is influenced by alternative splicing events of chromatin remodelers and epigenetic factors such as DNA methylation and non-coding RNAs. In this review, we will discuss the role of epigenetic factors including chromatin, chromatin remodelers, DNA methyltransferases, and microRNAs in the context of alternative splicing, and discuss their potential involvement in alternative splicing during the EMT process.
Collapse
Affiliation(s)
- Jessica A Warns
- a Department of Basic Sciences, University of North Dakota School of Medicine and Health Sciences, 501 N. Columbia Road Stop 9061, Grand Forks, ND 58202-9061, USA
| | - James R Davie
- b Children's Hospital Research Institute of Manitoba, John Buhler Research Centre, Winnipeg, Manitoba R3E 3P4, Canada
| | - Archana Dhasarathy
- a Department of Basic Sciences, University of North Dakota School of Medicine and Health Sciences, 501 N. Columbia Road Stop 9061, Grand Forks, ND 58202-9061, USA
| |
Collapse
|
250
|
Native elongating transcript sequencing reveals human transcriptional activity at nucleotide resolution. Cell 2015; 161:541-554. [PMID: 25910208 DOI: 10.1016/j.cell.2015.03.010] [Citation(s) in RCA: 270] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2014] [Revised: 11/26/2014] [Accepted: 02/18/2015] [Indexed: 01/12/2023]
Abstract
Major features of transcription by human RNA polymerase II (Pol II) remain poorly defined due to a lack of quantitative approaches for visualizing Pol II progress at nucleotide resolution. We developed a simple and powerful approach for performing native elongating transcript sequencing (NET-seq) in human cells that globally maps strand-specific Pol II density at nucleotide resolution. NET-seq exposes a mode of antisense transcription that originates downstream and converges on transcription from the canonical promoter. Convergent transcription is associated with a distinctive chromatin configuration and is characteristic of lower-expressed genes. Integration of NET-seq with genomic footprinting data reveals stereotypic Pol II pausing coincident with transcription factor occupancy. Finally, exons retained in mature transcripts display Pol II pausing signatures that differ markedly from skipped exons, indicating an intrinsic capacity for Pol II to recognize exons with different processing fates. Together, human NET-seq exposes the topography and regulatory complexity of human gene expression.
Collapse
|