451
|
Brosius J. The persistent contributions of RNA to eukaryotic gen(om)e architecture and cellular function. Cold Spring Harb Perspect Biol 2014; 6:a016089. [PMID: 25081515 DOI: 10.1101/cshperspect.a016089] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Currently, the best scenario for earliest forms of life is based on RNA molecules as they have the proven ability to catalyze enzymatic reactions and harbor genetic information. Evolutionary principles valid today become apparent in such models already. Furthermore, many features of eukaryotic genome architecture might have their origins in an RNA or RNA/protein (RNP) world, including the onset of a further transition, when DNA replaced RNA as the genetic bookkeeper of the cell. Chromosome maintenance, splicing, and regulatory function via RNA may be deeply rooted in the RNA/RNP worlds. Mostly in eukaryotes, conversion from RNA to DNA is still ongoing, which greatly impacts the plasticity of extant genomes. Raw material for novel genes encoding protein or RNA, or parts of genes including regulatory elements that selection can act on, continues to enter the evolutionary lottery.
Collapse
Affiliation(s)
- Jürgen Brosius
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
| |
Collapse
|
452
|
Retrotransposons in pluripotent cells: Impact and new roles in cellular plasticity. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2014; 1849:417-26. [PMID: 25042909 DOI: 10.1016/j.bbagrm.2014.07.007] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2014] [Revised: 07/08/2014] [Accepted: 07/10/2014] [Indexed: 12/27/2022]
Abstract
Transposable Elements are pieces of DNA able to mobilize from one location to another within genomes. Although they constitute more than 50% of the human genome, they have been classified as selfish DNA, with the only mission to spread within genomes and generate more copies of themselves that will ensure their presence over generations. Despite their remarkable prevalence, only a minor group of transposable elements remain active in the human genome and can sporadically be associated with the generation of a genetic disorder due to their ongoing mobility. Most of the transposable elements identified in the human genome corresponded to fixed insertions that no longer move in genomes. As selfish DNA, transposable element insertions accumulate in cell types where genetic information can be passed to the next generation. Indeed, work from different laboratories has demonstrated that the main heritable load of TE accumulation in humans occurs during early embryogenesis. Thus, active transposable elements have a clear impact on our pluripotent genome. However, recent findings suggest that the main proportion of fixed non-mobile transposable elements might also have emerging roles in cellular plasticity. In this concise review, we provide an overview of the impact of currently active transposable elements in our pluripotent genome and further discuss new roles of transposable elements (active or not) in regulating pluripotency. This article is part of a Special Issue entitled: Stress as a fundamental theme in cell plasticity.
Collapse
|
453
|
Lv J, Huang Z, Liu H, Liu H, Cui W, Li B, He H, Guo J, Liu Q, Zhang Y, Wu Q. Identification and characterization of long intergenic non-coding RNAs related to mouse liver development. Mol Genet Genomics 2014; 289:1225-35. [PMID: 25012394 DOI: 10.1007/s00438-014-0882-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2013] [Accepted: 06/20/2014] [Indexed: 01/23/2023]
Abstract
Long non-coding RNAs (lncRNAs) have been studied extensively over the last few years. Liver is an important organ that plays a crucial role in glucose metabolism and homeostasis; however, there are few reports of the identification and functional characterization of lncRNAs with important roles in liver development. Therefore, it is necessary to systematically identify lncRNAs that are involved in liver development. In this paper, we assembled the transcriptome using published RNA-seq data across three mouse liver developmental stages and identified 4,882 putative long intergenic non-coding RNAs (lincRNAs) expressed in at least one of the investigated stages. Combining these with Ensembl lincRNAs, we established a reference catalog of 6,602 transcribed lincRNAs in the mouse liver. We then analyzed all the lincRNAs in this reference catalog systematically and revealed that liver lincRNAs carry different genomic signatures from protein-coding genes, while the putative lincRNAs are generally comparable with known Ensembl lincRNAs. In addition, putative lincRNAs are functionally associated with essential biological processes, including RNA splicing, protein localization and fatty acid metabolic process, implying that they may play an important role in regulating liver development. The validation of selected lincRNAs that are specifically expressed in developing liver tissues further suggested the effectiveness of our approach. Our study shows that lincRNAs that are differentially expressed during three liver developmental stages could have important regulatory roles in liver development. The identified putative lincRNAs are a valuable resource for further functional studies.
Collapse
Affiliation(s)
- Jie Lv
- School of Life Science and Technology, State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin, 150001, China
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
454
|
Zhi H, Ning S, Li X, Li Y, Wu W, Li X. A novel reannotation strategy for dissecting DNA methylation patterns of human long intergenic non-coding RNAs in cancers. Nucleic Acids Res 2014; 42:8258-70. [PMID: 25013169 PMCID: PMC4117791 DOI: 10.1093/nar/gku575] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Despite growing consensus that long intergenic non-coding ribonucleic acids (lincRNAs) are modulators of cancer, the knowledge about the deoxyribonucleic acid (DNA) methylation patterns of lincRNAs in cancers remains limited. In this study, we constructed DNA methylation profiles for 4629 tumors and 705 normal tissue samples from 20 different types of human cancer by reannotating data of DNA methylation arrays. We found that lincRNAs had different promoter methylation patterns in cancers. We classified 2461 lincRNAs into two categories and three subcategories, according to their promoter methylation patterns in tumors. LincRNAs with resistant methylation patterns in tumors had conserved transcriptional regulation regions and were ubiquitously expressed across normal tissues. By integrating cancer subtype data and patient clinical information, we identified lincRNAs with promoter methylation patterns that were associated with cancer status, subtype or prognosis for several cancers. Network analysis of aberrantly methylated lincRNAs in cancers showed that lincRNAs with aberrant methylation patterns might be involved in cancer development and progression. The methylated and demethylated lincRNAs identified in this study provide novel insights for developing cancer biomarkers and potential therapeutic targets.
Collapse
Affiliation(s)
- Hui Zhi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Shangwei Ning
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Xiang Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Yuyun Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Wei Wu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Xia Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| |
Collapse
|
455
|
Ha H, Song J, Wang S, Kapusta A, Feschotte C, Chen KC, Xing J. A comprehensive analysis of piRNAs from adult human testis and their relationship with genes and mobile elements. BMC Genomics 2014; 15:545. [PMID: 24981367 PMCID: PMC4094622 DOI: 10.1186/1471-2164-15-545] [Citation(s) in RCA: 93] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2014] [Accepted: 06/20/2014] [Indexed: 11/11/2022] Open
Abstract
Background Piwi-interacting RNAs (piRNAs) are a recently discovered class of small non-coding RNAs whose best-understood function is to repress mobile element (ME) activity in animal germline. To date, nearly all piRNA studies have been conducted in model organisms and little is known about piRNA diversity, target specificity and biological function in human. Results Here we performed high-throughput sequencing of piRNAs from three human adult testis samples. We found that more than 81% of the ~17 million putative piRNAs mapped to ~6,000 piRNA-producing genomic clusters using a relaxed definition of clusters. A set of human protein-coding genes produces a relatively large amount of putative piRNAs from their 3’UTRs, and are significantly enriched for certain biological processes, suggestive of non-random sampling by the piRNA biogenesis machinery. Up to 16% of putative piRNAs mapped to a few hundred annotated long non-coding RNA (lncRNA) genes, suggesting that some lncRNA genes can act as piRNA precursors. Among major ME families, young families of LTR and endogenous retroviruses have a greater association with putative piRNAs than other MEs. In addition, piRNAs preferentially mapped to specific regions in the consensus sequences of several ME (sub)families and some piRNA mapping peaks showed patterns consistent with the “ping-pong” cycle of piRNA targeting and amplification. Conclusions Overall our data provide a comprehensive analysis and improved annotation of human piRNAs in adult human testes and shed new light into the relationship of piRNAs with protein-coding genes, lncRNAs, and mobile genetic elements in human. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-545) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Jinchuan Xing
- Department of Genetics, The State University of New Jersey, Piscataway, NJ 08854, USA.
| |
Collapse
|
456
|
Johnson R, Guigó R. The RIDL hypothesis: transposable elements as functional domains of long noncoding RNAs. RNA (NEW YORK, N.Y.) 2014; 20:959-76. [PMID: 24850885 PMCID: PMC4114693 DOI: 10.1261/rna.044560.114] [Citation(s) in RCA: 205] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Our genome contains tens of thousands of long noncoding RNAs (lncRNAs), many of which are likely to have genetic regulatory functions. It has been proposed that lncRNA are organized into combinations of discrete functional domains, but the nature of these and their identification remain elusive. One class of sequence elements that is enriched in lncRNA is represented by transposable elements (TEs), repetitive mobile genetic sequences that have contributed widely to genome evolution through a process termed exaptation. Here, we link these two concepts by proposing that exonic TEs act as RNA domains that are essential for lncRNA function. We term such elements Repeat Insertion Domains of LncRNAs (RIDLs). A growing number of RIDLs have been experimentally defined, where TE-derived fragments of lncRNA act as RNA-, DNA-, and protein-binding domains. We propose that these reflect a more general phenomenon of exaptation during lncRNA evolution, where inserted TE sequences are repurposed as recognition sites for both protein and nucleic acids. We discuss a series of genomic screens that may be used in the future to systematically discover RIDLs. The RIDL hypothesis has the potential to explain how functional evolution can keep pace with the rapid gene evolution observed in lncRNA. More practically, TE maps may in the future be used to predict lncRNA function.
Collapse
Affiliation(s)
- Rory Johnson
- Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
- Institut Hospital del Mar d'Investigacions Mèdiques (IMIM), 08003 Barcelona, Spain
- Corresponding authorE-mail
| | - Roderic Guigó
- Centre for Genomic Regulation (CRG), 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
- Institut Hospital del Mar d'Investigacions Mèdiques (IMIM), 08003 Barcelona, Spain
| |
Collapse
|
457
|
Takehana Y, Matsuda M, Myosho T, Suster ML, Kawakami K, Shin-I T, Kohara Y, Kuroki Y, Toyoda A, Fujiyama A, Hamaguchi S, Sakaizumi M, Naruse K. Co-option of Sox3 as the male-determining factor on the Y chromosome in the fish Oryzias dancena. Nat Commun 2014; 5:4157. [PMID: 24948391 DOI: 10.1038/ncomms5157] [Citation(s) in RCA: 186] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2014] [Accepted: 05/19/2014] [Indexed: 12/21/2022] Open
Abstract
Sex chromosomes harbour a primary sex-determining signal that triggers sexual development of the organism. However, diverse sex chromosome systems have been evolved in vertebrates. Here we use positional cloning to identify the sex-determining locus of a medaka-related fish, Oryzias dancena, and find that the locus on the Y chromosome contains a cis-regulatory element that upregulates neighbouring Sox3 expression in developing gonad. Sex-reversed phenotypes in Sox3(Y) transgenic fish, and Sox3(Y) loss-of-function mutants all point to its critical role in sex determination. Furthermore, we demonstrate that Sox3 initiates testicular differentiation by upregulating expression of downstream Gsdf, which is highly conserved in fish sex differentiation pathways. Our results not only provide strong evidence for the independent recruitment of Sox3 to male determination in distantly related vertebrates, but also provide direct evidence that a novel sex determination pathway has evolved through co-option of a transcriptional regulator potentially interacted with a conserved downstream component.
Collapse
Affiliation(s)
- Yusuke Takehana
- 1] Laboratory of Bioresources, National Institute for Basic Biology, Okazaki 444-8585, Japan [2] Department of Basic Biology, the Graduate University for Advanced Studies (SOKENDAI), Okazaki 444-8585, Japan
| | - Masaru Matsuda
- Center for Bioscience Research and Education, Utsunomiya University, Utsunomiya, Tochigi 321-8505, Japan
| | - Taijun Myosho
- Institute of Science and Technology, Niigata University, Niigata 950-2181, Japan
| | - Maximiliano L Suster
- 1] Neural Circuits and Behaviour Group, Uni Research AS, Bergen 5008, Norway [2] Division of Molecular and Developmental Biology, National Institute of Genetics, Mishima 411-8540, Japan
| | - Koichi Kawakami
- 1] Division of Molecular and Developmental Biology, National Institute of Genetics, Mishima 411-8540, Japan [2] Department of Genetics, the Graduate University for Advanced Studies (SOKENDAI), Mishima 411-8540, Japan
| | - Tadasu Shin-I
- Center for Genetic Resource Information, National Institute of Genetics, Mishima 411-8540, Japan
| | - Yuji Kohara
- Center for Genetic Resource Information, National Institute of Genetics, Mishima 411-8540, Japan
| | - Yoko Kuroki
- Department of Integrative Genomics, Tohoku Medical Megabank Organization, Sendai 980-8573, Japan
| | - Atsushi Toyoda
- 1] Department of Genetics, the Graduate University for Advanced Studies (SOKENDAI), Mishima 411-8540, Japan [2] Comparative Genomics Laboratory, National Institute of Genetics, Mishima 411-8540, Japan
| | - Asao Fujiyama
- 1] Department of Genetics, the Graduate University for Advanced Studies (SOKENDAI), Mishima 411-8540, Japan [2] Comparative Genomics Laboratory, National Institute of Genetics, Mishima 411-8540, Japan [3] National Institute of Informatics, Tokyo 101-8430, Japan
| | - Satoshi Hamaguchi
- Institute of Science and Technology, Niigata University, Niigata 950-2181, Japan
| | - Mitsuru Sakaizumi
- Institute of Science and Technology, Niigata University, Niigata 950-2181, Japan
| | - Kiyoshi Naruse
- 1] Laboratory of Bioresources, National Institute for Basic Biology, Okazaki 444-8585, Japan [2] Department of Basic Biology, the Graduate University for Advanced Studies (SOKENDAI), Okazaki 444-8585, Japan
| |
Collapse
|
458
|
Zhang B, Gunawardane L, Niazi F, Jahanbani F, Chen X, Valadkhan S. A novel RNA motif mediates the strict nuclear localization of a long noncoding RNA. Mol Cell Biol 2014; 34:2318-29. [PMID: 24732794 PMCID: PMC4054287 DOI: 10.1128/mcb.01673-13] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2013] [Revised: 01/06/2014] [Accepted: 04/03/2014] [Indexed: 12/14/2022] Open
Abstract
The ubiquitous presence of long noncoding RNAs (lncRNAs) in eukaryotes points to the importance of understanding how their sequences impact function. As many lncRNAs regulate nuclear events and thus must localize to nuclei, we analyzed the sequence requirements for nuclear localization in an intergenic lncRNA named BORG (BMP2-OP1-responsive gene), which is both spliced and polyadenylated but is strictly localized in nuclei. Subcellular localization of BORG was not dependent on the context or level of its expression or decay but rather depended on the sequence of the mature, spliced transcript. Mutational analyses indicated that nuclear localization of BORG was mediated through a novel RNA motif consisting of the pentamer sequence AGCCC with sequence restrictions at positions -8 (T or A) and -3 (G or C) relative to the first nucleotide of the pentamer. Mutation of the motif to a scrambled sequence resulted in complete loss of nuclear localization, while addition of even a single copy of the motif to a cytoplasmically localized RNA was sufficient to impart nuclear localization. Further, the presence of this motif in other cellular RNAs showed a direct correlation with nuclear localization, suggesting that the motif may act as a general nuclear localization signal for cellular RNAs.
Collapse
Affiliation(s)
- Bing Zhang
- Department of Biochemistry, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Lalith Gunawardane
- Department of Biochemistry, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Farshad Niazi
- Department of Biochemistry, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Fereshteh Jahanbani
- Department of Biochemistry, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Xin Chen
- Department of Biochemistry, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| | - Saba Valadkhan
- Department of Biochemistry, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA
| |
Collapse
|
459
|
Marques AC, Ponting CP. Intergenic lncRNAs and the evolution of gene expression. Curr Opin Genet Dev 2014; 27:48-53. [PMID: 24852186 DOI: 10.1016/j.gde.2014.03.009] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2014] [Revised: 03/19/2014] [Accepted: 03/19/2014] [Indexed: 11/24/2022]
Abstract
Eukaryote genomes encode a surprisingly large number of noncoding transcripts. Around two-thirds of human transcribed loci do not encode protein, and many are intergenic and produce long (>200 nucleotides) noncoding RNAs (lncRNAs). Extensive analyses using comparative genomics and transcriptomics approaches have established that lncRNA sequence and transcription tend to turn over rapidly during evolution. Our appreciation of the biological roles of lncRNAs, based only on a handful of transcripts with well-characterized functions, is that lncRNAs have diverse roles in regulating gene expression. These proposed roles together with their rapid rates of evolution suggest that lncRNAs could contribute to the divergent expression patterns observed among species and potentially to the origin of new traits.
Collapse
Affiliation(s)
- Ana C Marques
- MRC Functional Genomics Unit, University of Oxford, South Parks Road, OX1 3QX, UK; University of Oxford, Department of Physiology, Anatomy and Genetics, South Parks Road, OX1 3QX, UK.
| | - Chris P Ponting
- MRC Functional Genomics Unit, University of Oxford, South Parks Road, OX1 3QX, UK; University of Oxford, Department of Physiology, Anatomy and Genetics, South Parks Road, OX1 3QX, UK
| |
Collapse
|
460
|
Upton KR, Faulkner GJ. Blood from 'junk': the LTR chimeric transcript Pu.2 promotes erythropoiesis. Mob DNA 2014; 5:15. [PMID: 24839466 PMCID: PMC4023170 DOI: 10.1186/1759-8753-5-15] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2014] [Accepted: 05/02/2014] [Indexed: 11/25/2022] Open
Abstract
Transposable elements (TEs) are a prominent feature of most eukaryotic genomes. Despite rapidly accumulating evidence for the role of TE-driven insertional mutagenesis and structural variation in genome evolution, few clear examples of individual TEs impacting biology via perturbed gene regulation are available. A recent report describes the discovery of an alternative promoter for the murine erythroid transcription factor Pu.1. This promoter is located in an ORR1A0 long terminal repeat (LTR) retrotransposon intronic to Pu.1 and is regulated by the Krüppel-like factors KLF1 and KLF3. Expression of the resultant chimeric transcript, called Pu.2, spontaneously induces erythroid differentiation in vitro. These experiments illustrate how transcription factor binding sites spread by retrotransposition have the potential to impact networks encoding key biological processes in the host genome.
Collapse
Affiliation(s)
- Kyle R Upton
- Mater Research Institute - University of Queensland, TRI Building, 4102 Brisbane, QLD, Australia
| | - Geoffrey J Faulkner
- Mater Research Institute - University of Queensland, TRI Building, 4102 Brisbane, QLD, Australia ; School of Biomedical Sciences, University of Queensland, 4072 Brisbane, QLD, Australia
| |
Collapse
|
461
|
Abstract
Discoveries over the past decade portend a paradigm shift in molecular biology. Evidence suggests that RNA is not only functional as a messenger between DNA and protein but also involved in the regulation of genome organization and gene expression, which is increasingly elaborate in complex organisms. Regulatory RNA seems to operate at many levels; in particular, it plays an important part in the epigenetic processes that control differentiation and development. These discoveries suggest a central role for RNA in human evolution and ontogeny. Here, we review the emergence of the previously unsuspected world of regulatory RNA from a historical perspective.
Collapse
Affiliation(s)
- Kevin V Morris
- School of Biotechnology and Biomedical Sciences, University of New South Wales, Sydney, NSW 2052, Australia; and Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, California 92037, USA
| | - John S Mattick
- Garvan Institute of Medical Research, 384 Victoria Street, Darlinghurst, NSW 2010, Australia; the School of Biotechnology and Biomedical Sciences, and St. Vincent's Clinical School, University of New South Wales, Sydney, NSW 2052, Australia
| |
Collapse
|
462
|
Shapiro JA. Epigenetic control of mobile DNA as an interface between experience and genome change. Front Genet 2014; 5:87. [PMID: 24795749 PMCID: PMC4007016 DOI: 10.3389/fgene.2014.00087] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2014] [Accepted: 04/01/2014] [Indexed: 12/29/2022] Open
Abstract
Mobile DNA in the genome is subject to RNA-targeted epigenetic control. This control regulates the activity of transposons, retrotransposons and genomic proviruses. Many different life history experiences alter the activities of mobile DNA and the expression of genetic loci regulated by nearby insertions. The same experiences induce alterations in epigenetic formatting and lead to trans-generational modifications of genome expression and stability. These observations lead to the hypothesis that epigenetic formatting directed by non-coding RNA provides a molecular interface between life history events and genome alteration.
Collapse
Affiliation(s)
- James A. Shapiro
- Department of Biochemistry and Molecular Biology, University of ChicagoChicago, IL, USA
| |
Collapse
|
463
|
|
464
|
Neguembor MV, Jothi M, Gabellini D. Long noncoding RNAs, emerging players in muscle differentiation and disease. Skelet Muscle 2014; 4:8. [PMID: 24685002 PMCID: PMC3973619 DOI: 10.1186/2044-5040-4-8] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2014] [Accepted: 03/11/2014] [Indexed: 12/18/2022] Open
Abstract
The vast majority of the mammalian genome is transcribed giving rise to many different types of noncoding RNAs. Among them, long noncoding RNAs are the most numerous and functionally versatile class. Indeed, the lncRNA repertoire might be as rich as the proteome. LncRNAs have emerged as key regulators of gene expression at multiple levels. They play important roles in the regulation of development, differentiation and maintenance of cell identity and they also contribute to disease. In this review, we present recent advances in the biology of lncRNAs in muscle development and differentiation. We will also discuss the contribution of lncRNAs to muscle disease with a particular focus on Duchenne and facioscapulohumeral muscular dystrophies.
Collapse
Affiliation(s)
| | | | - Davide Gabellini
- Dulbecco Telethon Institute at San Raffaele Scientific Institute, Division of Regenerative Medicine, Stem cells, and Gene therapy, DIBIT2, 5A3, Via Olgettina 58, 20132 Milano, Italy.
| |
Collapse
|
465
|
Lu X, Sachs F, Ramsay L, Jacques PÉ, Göke J, Bourque G, Ng HH. The retrovirus HERVH is a long noncoding RNA required for human embryonic stem cell identity. Nat Struct Mol Biol 2014; 21:423-5. [PMID: 24681886 DOI: 10.1038/nsmb.2799] [Citation(s) in RCA: 300] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2013] [Accepted: 02/28/2014] [Indexed: 12/18/2022]
Abstract
Human endogenous retrovirus subfamily H (HERVH) is a class of transposable elements expressed preferentially in human embryonic stem cells (hESCs). Here, we report that the long terminal repeats of HERVH function as enhancers and that HERVH is a nuclear long noncoding RNA required to maintain hESC identity. Furthermore, HERVH is associated with OCT4, coactivators and Mediator subunits. Together, these results uncover a new role of species-specific transposable elements in hESCs.
Collapse
Affiliation(s)
- Xinyi Lu
- Gene Regulation Laboratory, Genome Institute of Singapore, Singapore
| | - Friedrich Sachs
- 1] Gene Regulation Laboratory, Genome Institute of Singapore, Singapore. [2] Department of Biochemistry, National University of Singapore, Singapore
| | - LeeAnn Ramsay
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
| | - Pierre-Étienne Jacques
- 1] Gene Regulation Laboratory, Genome Institute of Singapore, Singapore. [2] Département de Biologie, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Jonathan Göke
- Gene Regulation Laboratory, Genome Institute of Singapore, Singapore
| | - Guillaume Bourque
- 1] Department of Human Genetics, McGill University, Montréal, Québec, Canada. [2] Génome Québec Innovation Center, McGill University, Montréal, Québec, Canada
| | - Huck-Hui Ng
- 1] Gene Regulation Laboratory, Genome Institute of Singapore, Singapore. [2] Department of Biochemistry, National University of Singapore, Singapore. [3] Department of Biological Sciences, National University of Singapore, Singapore. [4] School of Biological Sciences, Nanyang Technological University, Singapore
| |
Collapse
|
466
|
Hrdlickova B, de Almeida RC, Borek Z, Withoff S. Genetic variation in the non-coding genome: Involvement of micro-RNAs and long non-coding RNAs in disease. Biochim Biophys Acta Mol Basis Dis 2014; 1842:1910-1922. [PMID: 24667321 DOI: 10.1016/j.bbadis.2014.03.011] [Citation(s) in RCA: 182] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2013] [Revised: 03/05/2014] [Accepted: 03/16/2014] [Indexed: 12/31/2022]
Abstract
It has been found that the majority of disease-associated genetic variants identified by genome-wide association studies are located outside of protein-coding regions, where they seem to affect regions that control transcription (promoters, enhancers) and non-coding RNAs that also can influence gene expression. In this review, we focus on two classes of non-coding RNAs that are currently a major focus of interest: micro-RNAs and long non-coding RNAs. We describe their biogenesis, suggested mechanism of action, and discuss how these non-coding RNAs might be affected by disease-associated genetic alterations. The discovery of these alterations has already contributed to a better understanding of the etiopathology of human diseases and yielded insight into the function of these non-coding RNAs. We also provide an overview of available databases, bioinformatics tools, and high-throughput techniques that can be used to study the mechanism of action of individual non-coding RNAs. This article is part of a Special Issue entitled: From Genome to Function.
Collapse
Affiliation(s)
- Barbara Hrdlickova
- Department of Genetics, University of Groningen, University Medical Center Groningen, The Netherlands
| | | | - Zuzanna Borek
- Department of Genetics, University of Groningen, University Medical Center Groningen, The Netherlands
| | - Sebo Withoff
- Department of Genetics, University of Groningen, University Medical Center Groningen, The Netherlands
| |
Collapse
|
467
|
Hall LL, Carone DM, Gomez AV, Kolpa HJ, Byron M, Mehta N, Fackelmayer FO, Lawrence JB. Stable C0T-1 repeat RNA is abundant and is associated with euchromatic interphase chromosomes. Cell 2014; 156:907-19. [PMID: 24581492 PMCID: PMC4023122 DOI: 10.1016/j.cell.2014.01.042] [Citation(s) in RCA: 149] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2013] [Revised: 10/15/2013] [Accepted: 01/23/2014] [Indexed: 10/25/2022]
Abstract
Recent studies recognize a vast diversity of noncoding RNAs with largely unknown functions, but few have examined interspersed repeat sequences, which constitute almost half our genome. RNA hybridization in situ using C0T-1 (highly repeated) DNA probes detects surprisingly abundant euchromatin-associated RNA comprised predominantly of repeat sequences (C0T-1 RNA), including LINE-1. C0T-1-hybridizing RNA strictly localizes to the interphase chromosome territory in cis and remains stably associated with the chromosome territory following prolonged transcriptional inhibition. The C0T-1 RNA territory resists mechanical disruption and fractionates with the nonchromatin scaffold but can be experimentally released. Loss of repeat-rich, stable nuclear RNAs from euchromatin corresponds to aberrant chromatin distribution and condensation. C0T-1 RNA has several properties similar to XIST chromosomal RNA but is excluded from chromatin condensed by XIST. These findings impact two "black boxes" of genome science: the poorly understood diversity of noncoding RNA and the unexplained abundance of repetitive elements.
Collapse
Affiliation(s)
- Lisa L Hall
- Department of Cell and Developmental Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - Dawn M Carone
- Department of Cell and Developmental Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - Alvin V Gomez
- Department of Cell and Developmental Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - Heather J Kolpa
- Department of Cell and Developmental Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - Meg Byron
- Department of Cell and Developmental Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - Nitish Mehta
- Department of Cell and Developmental Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - Frank O Fackelmayer
- Laboratory of Epigenetics and Chromosome Biology, Department of Biomedical Research, Institute for Molecular Biology and Biotechnology, Foundation for Research and Technology Hellas, 45110 Ioannina, Greece
| | - Jeanne B Lawrence
- Department of Cell and Developmental Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA.
| |
Collapse
|
468
|
Li L, Eichten SR, Shimizu R, Petsch K, Yeh CT, Wu W, Chettoor AM, Givan SA, Cole RA, Fowler JE, Evans MMS, Scanlon MJ, Yu J, Schnable PS, Timmermans MCP, Springer NM, Muehlbauer GJ. Genome-wide discovery and characterization of maize long non-coding RNAs. Genome Biol 2014; 15:R40. [PMID: 24576388 PMCID: PMC4053991 DOI: 10.1186/gb-2014-15-2-r40] [Citation(s) in RCA: 319] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2013] [Accepted: 02/27/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Long non-coding RNAs (lncRNAs) are transcripts that are 200 bp or longer, do not encode proteins, and potentially play important roles in eukaryotic gene regulation. However, the number, characteristics and expression inheritance pattern of lncRNAs in maize are still largely unknown. RESULTS By exploiting available public EST databases, maize whole genome sequence annotation and RNA-seq datasets from 30 different experiments, we identified 20,163 putative lncRNAs. Of these lncRNAs, more than 90% are predicted to be the precursors of small RNAs, while 1,704 are considered to be high-confidence lncRNAs. High confidence lncRNAs have an average transcript length of 463 bp and genes encoding them contain fewer exons than annotated genes. By analyzing the expression pattern of these lncRNAs in 13 distinct tissues and 105 maize recombinant inbred lines, we show that more than 50% of the high confidence lncRNAs are expressed in a tissue-specific manner, a result that is supported by epigenetic marks. Intriguingly, the inheritance of lncRNA expression patterns in 105 recombinant inbred lines reveals apparent transgressive segregation, and maize lncRNAs are less affected by cis- than by trans-genetic factors. CONCLUSIONS We integrate all available transcriptomic datasets to identify a comprehensive set of maize lncRNAs, provide a unique annotation resource of the maize genome and a genome-wide characterization of maize lncRNAs, and explore the genetic control of their expression using expression quantitative trait locus mapping.
Collapse
|
469
|
De novo assembly and transcriptome analysis of contrasting sugarcane varieties. PLoS One 2014; 9:e88462. [PMID: 24523899 PMCID: PMC3921171 DOI: 10.1371/journal.pone.0088462] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Accepted: 01/07/2014] [Indexed: 02/01/2023] Open
Abstract
Sugarcane is an important crop and a major source of sugar and alcohol. In this study, we performed de novo assembly and transcriptome annotation for six sugarcane genotypes involved in bi-parental crosses. The de novo assembly of the sugarcane transcriptome was performed using short reads generated using the Illumina RNA-Seq platform. We produced more than 400 million reads, which were assembled into 72,269 unigenes. Based on a similarity search, the unigenes showed significant similarity to more than 28,788 sorghum proteins, including a set of 5,272 unigenes that are not present in the public sugarcane EST databases; many of these unigenes are likely putative undescribed sugarcane genes. From this collection of unigenes, a large number of molecular markers were identified, including 5,106 simple sequence repeats (SSRs) and 708,125 single-nucleotide polymorphisms (SNPs). This new dataset will be a useful resource for future genetic and genomic studies in this species.
Collapse
|
470
|
Schoorlemmer J, Pérez-Palacios R, Climent M, Guallar D, Muniesa P. Regulation of Mouse Retroelement MuERV-L/MERVL Expression by REX1 and Epigenetic Control of Stem Cell Potency. Front Oncol 2014; 4:14. [PMID: 24567914 PMCID: PMC3915180 DOI: 10.3389/fonc.2014.00014] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2013] [Accepted: 01/21/2014] [Indexed: 11/17/2022] Open
Abstract
About half of the mammalian genome is occupied by DNA sequences that originate from transposable elements. Retrotransposons can modulate gene expression in different ways and, particularly retrotransposon-derived long terminal repeats, profoundly shape expression of both surrounding and distant genomic loci. This is especially important in pre-implantation development, during which extensive reprograming of the genome takes place and cells pass through totipotent and pluripotent states. At this stage, the main mechanism responsible for retrotransposon silencing, i.e., DNA methylation, is inoperative. A particular retrotransposon called muERV-L/MERVL is expressed during pre-implantation stages and contributes to the plasticity of mouse embryonic stem cells. This review will focus on the role of MERVL-derived sequences as controlling elements of gene expression specific for pre-implantation development, two-cell stage-specific gene expression, and stem cell pluripotency, the epigenetic mechanisms that control their expression, and the contributions of the pluripotency marker REX1 and the related Yin Yang 1 family of transcription factors to this regulation process.
Collapse
Affiliation(s)
- Jon Schoorlemmer
- Regenerative Medicine Program, Instituto Aragonés de Ciencias de la Salud , Zaragoza , Spain ; ARAID Foundation , Zaragoza , Spain
| | - Raquel Pérez-Palacios
- Regenerative Medicine Program, Instituto Aragonés de Ciencias de la Salud , Zaragoza , Spain
| | - María Climent
- Departamento de Anatomía, Embriología y Genética Animal, Facultad de Veterinaria, Universidad de Zaragoza , Zaragoza , Spain
| | - Diana Guallar
- Regenerative Medicine Program, Instituto Aragonés de Ciencias de la Salud , Zaragoza , Spain
| | - Pedro Muniesa
- Departamento de Anatomía, Embriología y Genética Animal, Facultad de Veterinaria, Universidad de Zaragoza , Zaragoza , Spain
| |
Collapse
|
471
|
Guennewig B, Cooper AA. The Central Role of Noncoding RNA in the Brain. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2014; 116:153-94. [DOI: 10.1016/b978-0-12-801105-8.00007-2] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
472
|
Bazak L, Haviv A, Barak M, Jacob-Hirsch J, Deng P, Zhang R, Isaacs FJ, Rechavi G, Li JB, Eisenberg E, Levanon EY. A-to-I RNA editing occurs at over a hundred million genomic sites, located in a majority of human genes. Genome Res 2013; 24:365-76. [PMID: 24347612 PMCID: PMC3941102 DOI: 10.1101/gr.164749.113] [Citation(s) in RCA: 464] [Impact Index Per Article: 38.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
RNA molecules transmit the information encoded in the genome and generally reflect its content. Adenosine-to-inosine (A-to-I) RNA editing by ADAR proteins converts a genomically encoded adenosine into inosine. It is known that most RNA editing in human takes place in the primate-specific Alu sequences, but the extent of this phenomenon and its effect on transcriptome diversity are not yet clear. Here, we analyzed large-scale RNA-seq data and detected ∼1.6 million editing sites. As detection sensitivity increases with sequencing coverage, we performed ultradeep sequencing of selected Alu sequences and showed that the scope of editing is much larger than anticipated. We found that virtually all adenosines within Alu repeats that form double-stranded RNA undergo A-to-I editing, although most sites exhibit editing at only low levels (<1%). Moreover, using high coverage sequencing, we observed editing of transcripts resulting from residual antisense expression, doubling the number of edited sites in the human genome. Based on bioinformatic analyses and deep targeted sequencing, we estimate that there are over 100 million human Alu RNA editing sites, located in the majority of human genes. These findings set the stage for exploring how this primate-specific massive diversification of the transcriptome is utilized.
Collapse
Affiliation(s)
- Lily Bazak
- Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan 52900, Israel
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
473
|
Patnala R, Lee SH, Dahlstrom JE, Ohms S, Chen L, Dheen ST, Rangasamy D. Inhibition of LINE-1 retrotransposon-encoded reverse transcriptase modulates the expression of cell differentiation genes in breast cancer cells. Breast Cancer Res Treat 2013; 143:239-53. [PMID: 24337508 PMCID: PMC3889873 DOI: 10.1007/s10549-013-2812-7] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2013] [Accepted: 12/04/2013] [Indexed: 12/31/2022]
Abstract
Long Interspersed Elements (L1 elements) are biologically active retrotransposons that are capable of autonomous replication using their own reverse transcriptase (RT) enzyme. Expression of the normally repressed RT has been implicated in cancer cell growth. However, at present, little is known about the expression of L1-encoded RT activity or the molecular changes that are associated with RT activity in the development of breast cancer. Here, we report that RT activity is widespread in breast cancer cells. The expression of RT protein decreased markedly in breast cancer cells after treatment with the antiretroviral drug, efavirenz. While the majority of cells showed a significant reduction in proliferation, inhibition of RT was also accompanied by cell-specific differences in morphology. MCF7 cells displayed elongated microtubule extensions that adhered tightly to their substrate, while a large fraction of the T47D cells that we studied formed long filopodia projections. These morphological changes were reversible upon cessation of RT inhibition, confirming their dependence on RT activity. We also carried out gene expression profiling with microarrays and determined the genes that were differentially expressed during the process of cellular differentiation. Genes involved in proliferation, cell migration, and invasive activity were repressed in RT-inhibited cells. Concomitantly, genes involved in cell projection, formation of vacuolar membranes, and cell-to-cell junctions were significantly upregulated in RT-inhibited cells. qRT-PCR examination of the mRNA expression of these genes in additional cell lines yielded close correlation between their differential expression and the degree of cellular differentiation. Our study demonstrates that the inhibition of L1-encoded RT can reduce the rate of proliferation and promote differentiation of breast cancer cells. Together, these results provide a direct functional link between the expression of L1 retrotransposons and the development of breast cancer.
Collapse
Affiliation(s)
- Radhika Patnala
- John Curtin School of Medical Research, The Australian National University, Canberra, ACT, 2601, Australia
| | | | | | | | | | | | | |
Collapse
|
474
|
Fatica A, Bozzoni I. Long non-coding RNAs: new players in cell differentiation and development. Nat Rev Genet 2013; 15:7-21. [PMID: 24296535 DOI: 10.1038/nrg3606] [Citation(s) in RCA: 2304] [Impact Index Per Article: 192.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Genomes of multicellular organisms are characterized by the pervasive expression of different types of non-coding RNAs (ncRNAs). Long ncRNAs (lncRNAs) belong to a novel heterogeneous class of ncRNAs that includes thousands of different species. lncRNAs have crucial roles in gene expression control during both developmental and differentiation processes, and the number of lncRNA species increases in genomes of developmentally complex organisms, which highlights the importance of RNA-based levels of control in the evolution of multicellular organisms. In this Review, we describe the function of lncRNAs in developmental processes, such as in dosage compensation, genomic imprinting, cell differentiation and organogenesis, with a particular emphasis on mammalian development.
Collapse
Affiliation(s)
- Alessandro Fatica
- Department of Biology and Biotechnology "Charles Darwin", Sapienza University of Rome, Piazzale Aldo Moro 5, 00185 Rome, Italy
| | - Irene Bozzoni
- 1] Department of Biology and Biotechnology "Charles Darwin", Sapienza University of Rome, Piazzale Aldo Moro 5, 00185 Rome, Italy. [2] Institute of Molecular Biology and Pathology of the National Research Council, Piazzale Aldo Moro 5, 00185 Rome, Italy. [3] Istituto Pasteur Fondazione Cenci Bolognetti, Piazzale Aldo Moro 5, 00185 Rome, Italy. [4] Center for Life Nano Science @Sapienza, Istituto Italiano di Tecnologia, Sapienza University of Rome, Viale Regina Elena 291, 00161 Rome, Italy
| |
Collapse
|
475
|
Singh DK, Prasanth KV. Functional insights into the role of nuclear-retained long noncoding RNAs in gene expression control in mammalian cells. Chromosome Res 2013; 21:695-711. [PMID: 24233053 PMCID: PMC3951185 DOI: 10.1007/s10577-013-9391-7] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The mammalian genome harbors thousands of long noncoding RNA (lncRNA) genes. Recent studies have indicated the involvement of several of these lncRNAs in the regulation of gene expression. lncRNAs play crucial roles in various biological processes ranging from epigenetic gene regulation, transcriptional control,to post-transcriptional regulation. lncRNAs are localized in various subcellular compartments, and major proportion of these are retained in the cell nucleus and could be broadly classified as nuclear-retained lncRNAs (nrRNAs). Based on the identified functions,members of the nrRNAs execute diverse roles, including providing architectural support to the hierarchical subnuclear organization and influencing the recruitment of chromatin modifier factors to specific chromatin sites. In this review, we will summarize the recently described roles of mammalian nrRNAs in controlling gene expression by influencing chromatin organization, transcription,pre-mRNA processing, nuclear organization, and their involvement in disease.
Collapse
Affiliation(s)
- Deepak K. Singh
- Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL-61801, USA
| | - Kannanganattu V. Prasanth
- Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL-61801, USA
| |
Collapse
|
476
|
Marques AC, Hughes J, Graham B, Kowalczyk MS, Higgs DR, Ponting CP. Chromatin signatures at transcriptional start sites separate two equally populated yet distinct classes of intergenic long noncoding RNAs. Genome Biol 2013; 14:R131. [PMID: 24289259 PMCID: PMC4054604 DOI: 10.1186/gb-2013-14-11-r131] [Citation(s) in RCA: 131] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 11/29/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mammalian transcriptomes contain thousands of long noncoding RNAs (lncRNAs). Some lncRNAs originate from intragenic enhancers which, when active, behave as alternative promoters producing transcripts that are processed using the canonical signals of their host gene. We have followed up this observation by analyzing intergenic lncRNAs to determine the extent to which they might also originate from intergenic enhancers. RESULTS We integrated high-resolution maps of transcriptional initiation and transcription to annotate a conservative set of intergenic lncRNAs expressed in mouse erythroblasts. We subclassified intergenic lncRNAs according to chromatin status at transcriptional initiation regions, defined by relative levels of histone H3K4 mono- and trimethylation. These transcripts are almost evenly divided between those arising from enhancer-associated (elncRNA) or promoter-associated (plncRNA) elements. These two classes of 5' capped and polyadenylated RNA transcripts are indistinguishable with regard to their length, number of exons or transcriptional orientation relative to their closest neighboring gene. Nevertheless, elncRNAs are more tissue-restricted, less highly expressed and less well conserved during evolution. Of considerable interest, we found that expression of elncRNAs, but not plncRNAs, is associated with enhanced expression of neighboring protein-coding genes during erythropoiesis. CONCLUSIONS We have determined globally the sites of initiation of intergenic lncRNAs in erythroid cells, allowing us to distinguish two similarly abundant classes of transcripts. Different correlations between the levels of elncRNAs, plncRNAs and expression of neighboring genes suggest that functional lncRNAs from the two classes may play contrasting roles in regulating the transcript abundance of local or distal loci.
Collapse
|
477
|
Young JM, Whiddon JL, Yao Z, Kasinathan B, Snider L, Geng LN, Balog J, Tawil R, van der Maarel SM, Tapscott SJ. DUX4 binding to retroelements creates promoters that are active in FSHD muscle and testis. PLoS Genet 2013; 9:e1003947. [PMID: 24278031 PMCID: PMC3836709 DOI: 10.1371/journal.pgen.1003947] [Citation(s) in RCA: 118] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2013] [Accepted: 09/25/2013] [Indexed: 01/19/2023] Open
Abstract
The human double-homeodomain retrogene DUX4 is expressed in the testis and epigenetically repressed in somatic tissues. Facioscapulohumeral muscular dystrophy (FSHD) is caused by mutations that decrease the epigenetic repression of DUX4 in somatic tissues and result in mis-expression of this transcription factor in skeletal muscle. DUX4 binds sites in the human genome that contain a double-homeobox sequence motif, including sites in unique regions of the genome as well as many sites in repetitive elements. Using ChIP-seq and RNA-seq on myoblasts transduced with DUX4 we show that DUX4 binds and activates transcription of mammalian apparent LTR-retrotransposons (MaLRs), endogenous retrovirus (ERVL and ERVK) elements, and pericentromeric satellite HSATII sequences. Some DUX4-activated MaLR and ERV elements create novel promoters for genes, long non-coding RNAs, and antisense transcripts. Many of these novel transcripts are expressed in FSHD muscle cells but not control cells, and thus might contribute to FSHD pathology. For example, HEY1, a repressor of myogenesis, is activated by DUX4 through a MaLR promoter. DUX4-bound motifs, including those in repetitive elements, show evolutionary conservation and some repeat-initiated transcripts are expressed in healthy testis, the normal expression site of DUX4, but more rarely in other somatic tissues. Testis expression patterns are known to have evolved rapidly in mammals, but the mechanisms behind this rapid change have not yet been identified: our results suggest that mobilization of MaLR and ERV elements during mammalian evolution altered germline gene expression patterns through transcriptional activation by DUX4. Our findings demonstrate a role for DUX4 and repetitive elements in mammalian germline evolution and in FSHD muscular dystrophy. Transposable elements (TEs) are found in most genomes, and many TEs create extra copies of themselves in new genomic locations by a process called retrotransposition. TEs are often thought of as genomic parasites that must be suppressed, because retrotransposition can cause great harm to their host organism. However, during evolution, the functions encoded by TEs have sometimes been co-opted to the advantage of the host genome as novel genes or as gene regulatory regions. We studied a human transcription factor called DUX4 that is normally expressed in testis and repressed in muscle. Sometimes muscle repression fails, causing the disease facioscapulohumeral muscular dystrophy (FSHD). We find that DUX4 binds many TE types and can activate their transcription. Some activated TEs have been co-opted as novel promoters for human genes. DUX4's activation of these genes via TEs might be important in the biology of normal testis and may contribute to the FSHD disease process. Our findings raise the possibility that DUX4 and TEs co-evolved, as TEs may have hijacked DUX4 to aid their retrotransposition while DUX4 may have utilized TEs to modify its transcriptional network in the evolving germline.
Collapse
Affiliation(s)
- Janet M. Young
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- * E-mail: (JMY); (SJT)
| | - Jennifer L. Whiddon
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Zizhen Yao
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Bhavatharini Kasinathan
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- School of Medicine, University of Washington, Seattle, Washington, United States of America
| | - Lauren Snider
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Linda N. Geng
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- School of Medicine, University of Washington, Seattle, Washington, United States of America
| | - Judit Balog
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Rabi Tawil
- Department of Neurology, University of Rochester Medical Center, Rochester, New York, United States of America
| | | | - Stephen J. Tapscott
- Division of Human Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- * E-mail: (JMY); (SJT)
| |
Collapse
|
478
|
He S, Gu W, Li Y, Zhu H. ANRIL/CDKN2B-AS shows two-stage clade-specific evolution and becomes conserved after transposon insertions in simians. BMC Evol Biol 2013; 13:247. [PMID: 24225082 PMCID: PMC3831594 DOI: 10.1186/1471-2148-13-247] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2013] [Accepted: 11/08/2013] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Many long non-coding RNA (lncRNA) genes identified in mammals have multiple exons and functional domains, allowing them to bind to polycomb proteins, DNA methyltransferases, and specific DNA sequences to regulate genome methylation. Little is known about the origin and evolution of lncRNAs. ANRIL/CDKN2B-AS consists of 19 exons on human chromosome 9p21 and regulates the expression of three cyclin-dependent kinase inhibitors (CDKN2A/ARF/CDKN2B). RESULTS ANRIL/CDKN2B-AS originated in placental mammals, obtained additional exons during mammalian evolution but gradually lost them during rodent evolution, and reached 19 exons only in simians. ANRIL lacks splicing signals in mammals. In simians, multiple transposons were inserted and transformed into exons of the ANRIL gene, after which ANRIL became highly conserved. A further survey reveals that multiple transposons exist in many lncRNAs. CONCLUSIONS ANRIL shows a two-stage, clade-specific evolutionary process and is fully developed only in simians. The domestication of multiple transposons indicates an impressive pattern of "evolutionary tinkering" and is likely to be important for ANRIL's structure and function. The evolution of lncRNAs and that of transposons may be highly co-opted in primates. Many lncRNAs may be functional only in simians.
Collapse
Affiliation(s)
| | | | | | - Hao Zhu
- Bioinformatics Section, School of Basic Medical Sciences, Southern Medical University, Shatai Road, Guangzhou 510515, China.
| |
Collapse
|
479
|
Lv J, Liu H, Huang Z, Su J, He H, Xiu Y, Zhang Y, Wu Q. Long non-coding RNA identification over mouse brain development by integrative modeling of chromatin and genomic features. Nucleic Acids Res 2013; 41:10044-61. [PMID: 24038472 PMCID: PMC3905897 DOI: 10.1093/nar/gkt818] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
In silico prediction of genomic long non-coding RNAs (lncRNAs) is prerequisite to the construction and elucidation of non-coding regulatory network. Chromatin modifications marked by chromatin regulators are important epigenetic features, which can be captured by prevailing high-throughput approaches such as ChIP sequencing. We demonstrate that the accuracy of lncRNA predictions can be greatly improved when incorporating high-throughput chromatin modifications over mouse embryonic stem differentiation toward adult Cerebellum by logistic regression with LASSO regularization. The discriminating features include H3K9me3, H3K27ac, H3K4me1, open reading frames and several repeat elements. Importantly, chromatin information is suggested to be complementary to genomic sequence information, highlighting the importance of an integrated model. Applying integrated model, we obtain a list of putative lncRNAs based on uncharacterized fragments from transcriptome assembly. We demonstrate that the putative lncRNAs have regulatory roles in vicinity of known gene loci by expression and Gene Ontology enrichment analysis. We also show that the lncRNA expression specificity can be efficiently modeled by the chromatin data with same developmental stage. The study not only supports the biological hypothesis that chromatin can regulate expression of tissue-specific or developmental stage-specific lncRNAs but also reveals the discriminating features between lncRNA and coding genes, which would guide further lncRNA identifications and characterizations.
Collapse
Affiliation(s)
- Jie Lv
- School of Life Science and Technology, State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin 150001, China and College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | | | | | | | | | | | | | | |
Collapse
|
480
|
Abstract
Transposable elements (TEs) make up 50% of the human genome and are usually considered a mutational burden. A new study uses signatures of DNA hypomethylation to identify tissue-specific enhancers within TEs, providing fresh evidence that mobile DNA has a non-negligible role in genome regulation and evolution.
Collapse
|