1
|
Xie YY, Wen B, Bai MZ, Guo YY. De Novo Creation of Two Novel Spliceosomal Introns of RECG1 by Intronization of Formerly Exonic Sequences in Orchidaceae. J Mol Evol 2025; 93:267-277. [PMID: 40202594 DOI: 10.1007/s00239-025-10242-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2024] [Accepted: 03/21/2025] [Indexed: 04/10/2025]
Abstract
Spliceosomal introns are a key characteristic of eukaryotic genes. However, the origins and mechanisms of new spliceosomal introns remain elusive, and definitive case studies documenting intron creation are still limited. This study examined the RECG1 genes of 49 land plants, including 21 orchids and 28 non-orchid species. Sequence comparison revealed that the fourth intron of Gastrodia and Platanthera (Orchidaceae) is a newly gained spliceosomal intron, originating from the intronization of former exonic sequences. This intronization event was accompanied by the creation of novel recognizable GT/AG splice sites. In contrast, other orchid species lack the corresponding splice sites in the counterpart regions. Moreover, the secondary and tertiary protein structures implied that the intronization events do not affect the protein function. Given the diverse trophic modes of the two genera, we infer that relaxed selection may have contributed to the fluidity of gene structures. This study provides a typical example of de novo lineage-specific intron creation via intronization in orchids supported by multiple lines of evidence, and the two intronization events occurred independently in the same gene. This research enhances our understanding of gene evolution in orchids and provides valuable insights that may assist the annotation of structurally complex genes.
Collapse
Affiliation(s)
- Yuan-Yuan Xie
- College of Plant Protection, Henan Agricultural University, Zhengzhou, China
| | - Bin Wen
- College of Plant Protection, Henan Agricultural University, Zhengzhou, China
| | - Ming-Zhu Bai
- College of Plant Protection, Henan Agricultural University, Zhengzhou, China
| | - Yan-Yan Guo
- College of Plant Protection, Henan Agricultural University, Zhengzhou, China.
| |
Collapse
|
2
|
Senn KA, Hoskins AA. Mechanisms and regulation of spliceosome-mediated pre-mRNA splicing in Saccharomyces cerevisiae. WILEY INTERDISCIPLINARY REVIEWS. RNA 2024; 15:e1866. [PMID: 38972853 PMCID: PMC11585973 DOI: 10.1002/wrna.1866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 05/19/2024] [Accepted: 05/28/2024] [Indexed: 07/09/2024]
Abstract
Pre-mRNA splicing, the removal of introns and ligation of flanking exons, is a crucial step in eukaryotic gene expression. The spliceosome, a macromolecular complex made up of five small nuclear RNAs (snRNAs) and dozens of proteins, assembles on introns via a complex pathway before catalyzing the two transesterification reactions necessary for splicing. All of these steps have the potential to be highly regulated to ensure correct mRNA isoform production for proper cellular function. While Saccharomyces cerevisiae (yeast) has a limited set of intron-containing genes, many of these genes are highly expressed, resulting in a large number of transcripts in a cell being spliced. As a result, splicing regulation is of critical importance for yeast. Just as in humans, yeast splicing can be influenced by protein components of the splicing machinery, structures and properties of the pre-mRNA itself, or by the action of trans-acting factors. It is likely that further analysis of the mechanisms and pathways of splicing regulation in yeast can reveal general principles applicable to other eukaryotes. This article is categorized under: RNA Processing > Splicing Mechanisms RNA Processing > Splicing Regulation/Alternative Splicing.
Collapse
Affiliation(s)
- Katherine Anne Senn
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Aaron A. Hoskins
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
- Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA
| |
Collapse
|
3
|
Mizrahi R, Ostersetzer-Biran O. Mitochondrial RNA Helicases: Key Players in the Regulation of Plant Organellar RNA Splicing and Gene Expression. Int J Mol Sci 2024; 25:5502. [PMID: 38791540 PMCID: PMC11122041 DOI: 10.3390/ijms25105502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 05/06/2024] [Accepted: 05/09/2024] [Indexed: 05/26/2024] Open
Abstract
Mitochondrial genomes of land plants are large and exhibit a complex mode of gene organization and expression, particularly at the post-transcriptional level. The primary organellar transcripts in plants undergo extensive maturation steps, including endo- and/or exo-nucleolytic cleavage, RNA-base modifications (mostly C-to-U deaminations) and both 'cis'- and 'trans'-splicing events. These essential processing steps rely on the activities of a large set of nuclear-encoded factors. RNA helicases serve as key players in RNA metabolism, participating in the regulation of transcription, mRNA processing and translation. They unwind RNA secondary structures and facilitate the formation of ribonucleoprotein complexes crucial for various stages of gene expression. Furthermore, RNA helicases are involved in RNA metabolism by modulating pre-mRNA maturation, transport and degradation processes. These enzymes are, therefore, pivotal in RNA quality-control mechanisms, ensuring the fidelity and efficiency of RNA processing and turnover in plant mitochondria. This review summarizes the significant roles played by helicases in regulating the highly dynamic processes of mitochondrial transcription, RNA processing and translation in plants. We further discuss recent advancements in understanding how dysregulation of mitochondrial RNA helicases affects the splicing of organellar genes, leading to respiratory dysfunctions, and consequently, altered growth, development and physiology of land plants.
Collapse
Affiliation(s)
| | - Oren Ostersetzer-Biran
- Department of Plant and Environmental Sciences, The Hebrew University of Jerusalem, Edmond J. Safra Campus—Givat Ram, Jerusalem 9190401, Israel
| |
Collapse
|
4
|
Hoh C, Salzberg SL. Discovering Intron Gain Events in Humans through Large-Scale Evolutionary Comparisons. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.02.592247. [PMID: 38746259 PMCID: PMC11092651 DOI: 10.1101/2024.05.02.592247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The rapid growth in the number of sequenced genomes makes it possible to search for the appearance of entirely new introns in the human lineage. In this study, we compared the genomic sequences for 19,120 human protein-coding genes to a collection of 3493 vertebrate genomes, mapping the patterns of intron alignments onto a phylogenetic tree. This mapping allowed us to trace many intron gain events to precise locations in the tree, corresponding to distinct points in evolutionary history. We discovered 584 intron gain events, all of them relatively recent, in 514 distinct human genes. Among these events, we explored the hypothesis that intronization was the mechanism responsible for intron gain. Intronization events were identified by locating instances where human introns correspond to exonic sequences in homologous vertebrate genes. Although apparently rare, we found three compelling cases of intronization, and for each of those we compared the human protein sequence and structure to homologous genes that lack the introns.
Collapse
Affiliation(s)
- Celine Hoh
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
| | - Steven L Salzberg
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21211, USA
- Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205, USA
| |
Collapse
|
5
|
Zhang C, Yang Q, Zhang X, Zhang X, Yu T, Wu Y, Fang Y, Xue D. Genome-Wide Identification of the HMA Gene Family and Expression Analysis under Cd Stress in Barley. PLANTS (BASEL, SWITZERLAND) 2021; 10:1849. [PMID: 34579382 PMCID: PMC8468745 DOI: 10.3390/plants10091849] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 08/31/2021] [Accepted: 09/03/2021] [Indexed: 02/07/2023]
Abstract
In recent years, cadmium (Cd) pollution in soil has increased with increasing industrial activities, which has restricted crop growth and agricultural development. The heavy metal ATPase (HMA) gene family contributes to heavy metal stress resistance in plants. In this study, 21 HMA genes (HvHMAs) were identified in barley (Hordeumvulgare L., Hv) using bioinformatics methods. Based on phylogenetic analysis and domain distribution, barley HMA genes were divided into five groups (A-E), and complete analyses were performed in terms of physicochemical properties, structural characteristics, conserved domains, and chromosome localization. The expression pattern analysis showed that most HvHMA genes were expressed in barley and exhibited tissue specificity. According to the fragments per kilobase of exon per million fragments values in shoots from seedlings at the 10 cm shoot stage (LEA) and phylogenetic analysis, five HvHMA genes were selected for expression analysis under Cd stress. Among the five HvHMA genes, three (HvHMA1, HvHMA3, and HvHMA4) were upregulated and two (HvHMA2 and HvHMA6) were downregulated following Cd treatments. This study serves as a foundation for clarifying the functions of HvHMA proteins in the heavy metal stress resistance of barley.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Yunxia Fang
- College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou 311121, China; (C.Z.); (Q.Y.); (X.Z.); (X.Z.); (T.Y.); (Y.W.)
| | - Dawei Xue
- College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou 311121, China; (C.Z.); (Q.Y.); (X.Z.); (X.Z.); (T.Y.); (Y.W.)
| |
Collapse
|
6
|
Zhang X, Cvetkovska M, Morgan-Kiss R, Hüner NPA, Smith DR. Draft genome sequence of the Antarctic green alga Chlamydomonas sp. UWO241. iScience 2021; 24:102084. [PMID: 33644715 PMCID: PMC7887394 DOI: 10.1016/j.isci.2021.102084] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 12/08/2020] [Accepted: 01/14/2021] [Indexed: 11/22/2022] Open
Abstract
Antarctica is home to an assortment of psychrophilic algae, which have evolved various survival strategies for coping with their frigid environments. Here, we explore Antarctic psychrophily by examining the ∼212 Mb draft nuclear genome of the green alga Chlamydomonas sp. UWO241, which resides within the water column of a perennially ice-covered, hypersaline lake. Like certain other Antarctic algae, UWO241 encodes a large number (≥37) of ice-binding proteins, putatively originating from horizontal gene transfer. Even more striking, UWO241 harbors hundreds of highly similar duplicated genes involved in diverse cellular processes, some of which we argue are aiding its survival in the Antarctic via gene dosage. Gene and partial gene duplication appear to be an ongoing phenomenon within UWO241, one which might be mediated by retrotransposons. Ultimately, we consider how such a process could be associated with adaptation to extreme environments but explore potential non-adaptive hypotheses as well. Chlamydomonas sp. UWO241 is a green alga originating from Lake Bonney, Antarctica We present a draft nuclear genome sequence of UWO241 (∼212 Mb). The UWO genome contains hundreds of highly similar duplicated genes These duplicates, we argue, might be involved in cold adaptation
Collapse
Affiliation(s)
- Xi Zhang
- Department of Biology, University of Western Ontario, London, ON N6A 5B7, Canada
| | - Marina Cvetkovska
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| | | | - Norman P A Hüner
- Department of Biology, University of Western Ontario, London, ON N6A 5B7, Canada
| | - David Roy Smith
- Department of Biology, University of Western Ontario, London, ON N6A 5B7, Canada
| |
Collapse
|
7
|
Hagen R, Vitali V, Catania F. Cross-Generational Effects and Non-random Developmental Response to Temperature Variation in Paramecium. Front Cell Dev Biol 2020; 8:584219. [PMID: 33195230 PMCID: PMC7606892 DOI: 10.3389/fcell.2020.584219] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 09/30/2020] [Indexed: 12/16/2022] Open
Abstract
Unicellular organisms such as ciliates are largely neglected in research on adaptive developmental plasticity, although their nuclear dualism offers ideal circumstances to study development outside an embryonic context. Here, we gain first insights into the ability of the ciliate Paramecium to develop potentially adaptive phenotypic changes in response to early-life adversity. We show that, upon exposure to unconventional culture temperatures, germ line-to-soma differentiation gives rise to coordinated molecular changes that may help attune the number of functional gene copies to the new external conditions. The non-random somatic heterogeneity that developmental plasticity generates is largely epigenetically controlled, shaped by the parental experience, and may prompt a stress response. These findings establish Paramecium as a new model system to study the molecular basis and evolutionary significance of developmental plasticity. In echoing previous indications in mammals, they call for an incorporation of intergenerational effects in adaptation studies.
Collapse
Affiliation(s)
- Rebecca Hagen
- Department of Biology, Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Valerio Vitali
- Department of Biology, Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Francesco Catania
- Department of Biology, Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| |
Collapse
|
8
|
First Come, First Served: Sui Generis Features of the First Intron. PLANTS 2020; 9:plants9070911. [PMID: 32707681 PMCID: PMC7411622 DOI: 10.3390/plants9070911] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Revised: 07/15/2020] [Accepted: 07/17/2020] [Indexed: 02/07/2023]
Abstract
Most of the transcribed genes in eukaryotic cells are interrupted by intervening sequences called introns that are co-transcriptionally removed from nascent messenger RNA through the process of splicing. In Arabidopsis, 79% of genes contain introns and more than 60% of intron-containing genes undergo alternative splicing (AS), which ostensibly is considered to increase protein diversity as one of the intrinsic mechanisms for fitness to the varying environment or the internal developmental program. In addition, recent findings have prevailed in terms of overlooked intron functions. Here, we review recent progress in the underlying mechanisms of intron function, in particular by focusing on unique features of the first intron that is located in close proximity to the transcription start site. The distinct deposition of epigenetic marks and nucleosome density on the first intronic DNA sequence, the impact of the first intron on determining the transcription start site and elongation of its own expression (called intron-mediated enhancement, IME), translation control in 5′-UTR, and the new mechanism of the trans-acting function of the first intron in regulating gene expression at the post-transcriptional level are summarized.
Collapse
|
9
|
Talkish J, Igel H, Perriman RJ, Shiue L, Katzman S, Munding EM, Shelansky R, Donohue JP, Ares M. Rapidly evolving protointrons in Saccharomyces genomes revealed by a hungry spliceosome. PLoS Genet 2019; 15:e1008249. [PMID: 31437148 PMCID: PMC6726248 DOI: 10.1371/journal.pgen.1008249] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2019] [Revised: 09/04/2019] [Accepted: 06/15/2019] [Indexed: 12/14/2022] Open
Abstract
Introns are a prevalent feature of eukaryotic genomes, yet their origins and contributions to genome function and evolution remain mysterious. In budding yeast, repression of the highly transcribed intron-containing ribosomal protein genes (RPGs) globally increases splicing of non-RPG transcripts through reduced competition for the spliceosome. We show that under these “hungry spliceosome” conditions, splicing occurs at more than 150 previously unannotated locations we call protointrons that do not overlap known introns. Protointrons use a less constrained set of splice sites and branchpoints than standard introns, including in one case AT-AC in place of GT-AG. Protointrons are not conserved in all closely related species, suggesting that most are not under positive selection and are fated to disappear. Some are found in non-coding RNAs (e. g. CUTs and SUTs), where they may contribute to the creation of new genes. Others are found across boundaries between noncoding and coding sequences, or within coding sequences, where they offer pathways to the creation of new protein variants, or new regulatory controls for existing genes. We define protointrons as (1) nonconserved intron-like sequences that are (2) infrequently spliced, and importantly (3) are not currently understood to contribute to gene expression or regulation in the way that standard introns function. A very few protointrons in S. cerevisiae challenge this classification by their increased splicing frequency and potential function, consistent with the proposed evolutionary process of “intronization”, whereby new standard introns are created. This snapshot of intron evolution highlights the important role of the spliceosome in the expansion of transcribed genomic sequence space, providing a pathway for the rare events that may lead to the birth of new eukaryotic genes and the refinement of existing gene function. The protein coding information in eukaryotic genes is broken by intervening sequences called introns that are removed from RNA during transcription by a large protein-RNA complex called the spliceosome. Where introns come from and how the spliceosome contributes to genome evolution are open questions. In this study, we find more than 150 new places in the yeast genome that are recognized by the spliceosome and spliced out as introns. Since they appear to have arisen very recently in evolution by sequence drift and do not appear to contribute to gene expression or its regulation, we call these protointrons. Protointrons are found in both protein-coding and non-coding RNAs and are not efficiently removed by the splicing machinery. Although most protointrons are not conserved and will likely disappear as evolution proceeds, a few are spliced more efficiently, and are located where they might begin to play functional roles in gene expression, as predicted by the proposed process of intronization. The challenge now is to understand how spontaneously appearing splicing events like protointrons might contribute to the creation of new genes, new genetic controls, and new protein isoforms as genomes evolve.
Collapse
Affiliation(s)
- Jason Talkish
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Haller Igel
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Rhonda J. Perriman
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Lily Shiue
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Sol Katzman
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Elizabeth M. Munding
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Robert Shelansky
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - John Paul Donohue
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Manuel Ares
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
- * E-mail:
| |
Collapse
|
10
|
Talkish J, Igel H, Perriman RJ, Shiue L, Katzman S, Munding EM, Shelansky R, Donohue JP, Ares M. Rapidly evolving protointrons in Saccharomyces genomes revealed by a hungry spliceosome. PLoS Genet 2019; 15:e1008249. [PMID: 31437148 DOI: 10.1101/515197] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2019] [Revised: 09/04/2019] [Accepted: 06/15/2019] [Indexed: 05/28/2023] Open
Abstract
Introns are a prevalent feature of eukaryotic genomes, yet their origins and contributions to genome function and evolution remain mysterious. In budding yeast, repression of the highly transcribed intron-containing ribosomal protein genes (RPGs) globally increases splicing of non-RPG transcripts through reduced competition for the spliceosome. We show that under these "hungry spliceosome" conditions, splicing occurs at more than 150 previously unannotated locations we call protointrons that do not overlap known introns. Protointrons use a less constrained set of splice sites and branchpoints than standard introns, including in one case AT-AC in place of GT-AG. Protointrons are not conserved in all closely related species, suggesting that most are not under positive selection and are fated to disappear. Some are found in non-coding RNAs (e. g. CUTs and SUTs), where they may contribute to the creation of new genes. Others are found across boundaries between noncoding and coding sequences, or within coding sequences, where they offer pathways to the creation of new protein variants, or new regulatory controls for existing genes. We define protointrons as (1) nonconserved intron-like sequences that are (2) infrequently spliced, and importantly (3) are not currently understood to contribute to gene expression or regulation in the way that standard introns function. A very few protointrons in S. cerevisiae challenge this classification by their increased splicing frequency and potential function, consistent with the proposed evolutionary process of "intronization", whereby new standard introns are created. This snapshot of intron evolution highlights the important role of the spliceosome in the expansion of transcribed genomic sequence space, providing a pathway for the rare events that may lead to the birth of new eukaryotic genes and the refinement of existing gene function.
Collapse
Affiliation(s)
- Jason Talkish
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Haller Igel
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Rhonda J Perriman
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Lily Shiue
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Sol Katzman
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Elizabeth M Munding
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Robert Shelansky
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - John Paul Donohue
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| | - Manuel Ares
- Center for Molecular Biology of RNA, Department of Molecular, Cell & Developmental Biology, University of California, Santa Cruz, Santa Cruz, California, United States of America
| |
Collapse
|
11
|
Venters CC, Oh JM, Di C, So BR, Dreyfuss G. U1 snRNP Telescripting: Suppression of Premature Transcription Termination in Introns as a New Layer of Gene Regulation. Cold Spring Harb Perspect Biol 2019; 11:11/2/a032235. [PMID: 30709878 DOI: 10.1101/cshperspect.a032235] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Recent observations showed that nascent RNA polymerase II transcripts, pre-mRNAs, and noncoding RNAs are highly susceptible to premature 3'-end cleavage and polyadenylation (PCPA) from numerous intronic cryptic polyadenylation signals (PASs). The importance of this in gene regulation was not previously appreciated as PASs, despite their prevalence, were thought to be active in terminal exons at gene ends. Unexpectedly, antisense oligonucleotide interference with U1 snRNA base-pairing to 5' splice sites, which is necessary for U1 snRNP's (U1) function in splicing, caused widespread PCPA in metazoans. This uncovered U1's PCPA suppression activity, termed telescripting, as crucial for full-length transcription in thousands of vertebrate genes, providing a general role in transcription elongation control. Progressive intron-size expansion in metazoan evolution greatly increased PCPA vulnerability and dependence on U1 telescripting. We describe how these observations unfolded and discuss U1 telescripting's role in shaping the transcriptome.
Collapse
Affiliation(s)
- Christopher C Venters
- Howard Hughes Medical Institute, Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104
| | - Jung-Min Oh
- Howard Hughes Medical Institute, Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104
| | - Chao Di
- Howard Hughes Medical Institute, Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104
| | - Byung Ran So
- Howard Hughes Medical Institute, Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104
| | - Gideon Dreyfuss
- Howard Hughes Medical Institute, Department of Biochemistry and Biophysics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104
| |
Collapse
|
12
|
Kini RM. Accelerated evolution of toxin genes: Exonization and intronization in snake venom disintegrin/metalloprotease genes. Toxicon 2018; 148:16-25. [PMID: 29634956 DOI: 10.1016/j.toxicon.2018.04.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Revised: 03/21/2018] [Accepted: 04/01/2018] [Indexed: 12/20/2022]
Abstract
Toxin genes in animals undergo accelerated evolution compared to non-toxin genes to be effective and competitive in prey capture, as well as to enhance their predator defense. Several mechanisms have been proposed to explain this unusual phenomenon. These include (a) frequent mutations in exons compared to introns and nonsynonymous substitutions in exons; (b) high frequency of point mutations are due to the presence of more unstable triplets in exons compared to introns; (c) Accelerated Segment Switch in Exons to alter Targeting (ASSET); (d) Rapid Accumulation of Variations in Exposed Residues (RAVERs); (e) alteration in intron-exon boundary; (f) deletion of exon; and (g) loss/gain of domains through recombination. By systematic analyses of snake venom disintegrin/metalloprotease genes, I describe a new mechanism in the evolution of these genes through exonization and intronization. In the evolution of RTS/KTS disintegrins, a new exon (10a) is formed in intron 10 of the disintegrin/metalloprotease gene. Unlike more than 90% new exons that are from repetitive elements in introns, exon 10a originated from a non-repetitive element. To incorporate exon 10a, part of the exon 11 is intronized to retain the open reading frame. This is the first case of simultaneous exonization and intronization within a single gene. This new mechanism alters the function of toxins through drastic changes to the molecular surface via insertion of new exons and deletion of exons.
Collapse
Affiliation(s)
- R Manjunatha Kini
- Protein Science Laboratory, Department of Biological Sciences, Faculty of Science, National University of Singapore, Singapore, 117543, Singapore.
| |
Collapse
|
13
|
U1 snRNP telescripting regulates a size-function-stratified human genome. Nat Struct Mol Biol 2017; 24:993-999. [PMID: 28967884 DOI: 10.1038/nsmb.3473] [Citation(s) in RCA: 81] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2016] [Accepted: 08/28/2017] [Indexed: 12/16/2022]
Abstract
U1 snRNP (U1) functions in splicing introns and telescripting, which suppresses premature cleavage and polyadenylation (PCPA). Using U1 inhibition in human cells, we show that U1 telescripting is selectively required for sustaining long-distance transcription elongation in introns of large genes (median 39 kb). Evidence of widespread PCPA in the same locations in normal tissues reveals that large genes incur natural transcription attrition. Underscoring the importance of U1 telescripting as a gene-size-based mRNA-regulation mechanism, small genes were not sensitive to PCPA, and the spliced-mRNA productivity of ∼1,000 small genes (median 6.8 kb) increased upon U1 inhibition. Notably, these small, upregulated genes were enriched in functions related to acute stimuli and cell-survival response, whereas genes subject to PCPA were enriched in cell-cycle progression and developmental functions. This gene size-function polarization increased in metazoan evolution by enormous intron expansion. We propose that telescripting adds an overarching layer of regulation to size-function-stratified genomes, leveraged by selective intron expansion to rapidly shift gene expression priorities.
Collapse
|
14
|
Catania F. From intronization to intron loss: How the interplay between mRNA-associated processes can shape the architecture and the expression of eukaryotic genes. Int J Biochem Cell Biol 2017; 91:136-144. [PMID: 28673893 DOI: 10.1016/j.biocel.2017.06.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Revised: 06/25/2017] [Accepted: 06/30/2017] [Indexed: 12/29/2022]
Abstract
Transcription-coupled processes such as capping, splicing, and cleavage/polyadenylation participate in the journey from genes to proteins. Although they are traditionally thought to serve only as steps in the generation of mature mRNAs, a synthesis of available data indicates that these processes could also act as a driving force for the evolution of eukaryotic genes. A theoretical framework for how mRNA-associated processes may shape gene structure and expression has recently been proposed. Factors that promote splicing and cleavage/polyadenylation in this framework compete for access to overlapping or neighboring signals throughout the transcription cycle. These antagonistic interactions allow mechanisms for intron gain and splice site recognition as well as common trends in eukaryotic gene structure and expression to be coherently integrated. Here, I extend this framework further. Observations that largely (but not exclusively) revolve around the formation of DNA-RNA hybrid structures, called R loops, and promoter directionality are integrated. Additionally, the interplay between splicing factors and cleavage/polyadenylation factors is theorized to also affect the formation of intragenic DNA double-stranded breaks thereby contributing to intron loss. The most notable prediction in this proposition is that RNA molecules can mediate intron loss by serving as a template to repair DNA double-stranded breaks. The framework presented here leverages a vast body of empirical observations, logically extending previous suggestions, and generating verifiable predictions to further substantiate the view that the intracellular environment plays an active role in shaping the structure and the expression of eukaryotic genes.
Collapse
Affiliation(s)
- Francesco Catania
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstraße 1, 48149 Münster, Germany.
| |
Collapse
|
15
|
Exploring the Impact of Cleavage and Polyadenylation Factors on Pre-mRNA Splicing Across Eukaryotes. G3-GENES GENOMES GENETICS 2017; 7:2107-2114. [PMID: 28500052 PMCID: PMC5499120 DOI: 10.1534/g3.117.041483] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
In human, mouse, and Drosophila, the spliceosomal complex U1 snRNP (U1) protects transcripts from premature cleavage and polyadenylation at proximal intronic polyadenylation signals (PAS). These U1-mediated effects preserve transcription integrity, and are known as telescripting. The watchtower role of U1 throughout transcription is clear. What is less clear is whether cleavage and polyadenylation factors (CPFs) are simply patrolled or if they might actively antagonize U1 recruitment. In addressing this question, we found that, in the introns of human, mouse, and Drosophila, and of 14 other eukaryotes, including multi- and single-celled species, the conserved AATAAA PAS—a major target for CPFs—is selected against. This selective pressure, approximated using DNA strand asymmetry, is detected for peripheral and internal introns alike. Surprisingly, it is more pronounced within—rather than outside—the action range of telescripting, and particularly intense in the vicinity of weak 5′ splice sites. Our study uncovers a novel feature of eukaryotic genes: that the AATAAA PAS is universally counter-selected in spliceosomal introns. This pattern implies that CPFs may attempt to access introns at any time during transcription. However, natural selection operates to minimize this access. By corroborating and extending previous work, our study further indicates that CPF access to intronic PASs might perturb the recruitment of U1 to the adjacent 5′ splice sites. These results open the possibility that CPFs may impact the splicing process across eukaryotes.
Collapse
|
16
|
Casola C, Betrán E. The Genomic Impact of Gene Retrocopies: What Have We Learned from Comparative Genomics, Population Genomics, and Transcriptomic Analyses? Genome Biol Evol 2017; 9:1351-1373. [PMID: 28605529 PMCID: PMC5470649 DOI: 10.1093/gbe/evx081] [Citation(s) in RCA: 56] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/18/2017] [Indexed: 02/07/2023] Open
Abstract
Gene duplication is a major driver of organismal evolution. Gene retroposition is a mechanism of gene duplication whereby a gene's transcript is used as a template to generate retroposed gene copies, or retrocopies. Intriguingly, the formation of retrocopies depends upon the enzymatic machinery encoded by retrotransposable elements, genomic parasites occurring in the majority of eukaryotes. Most retrocopies are depleted of the regulatory regions found upstream of their parental genes; therefore, they were initially considered transcriptionally incompetent gene copies, or retropseudogenes. However, examples of functional retrocopies, or retrogenes, have accumulated since the 1980s. Here, we review what we have learned about retrocopies in animals, plants and other eukaryotic organisms, with a particular emphasis on comparative and population genomic analyses complemented with transcriptomic datasets. In addition, these data have provided information about the dynamics of the different "life cycle" stages of retrocopies (i.e., polymorphic retrocopy number variants, fixed retropseudogenes and retrogenes) and have provided key insights into the retroduplication mechanisms, the patterns and evolutionary forces at work during the fixation process and the biological function of retrogenes. Functional genomic and transcriptomic data have also revealed that many retropseudogenes are transcriptionally active and a biological role has been experimentally determined for many. Finally, we have learned that not only non-long terminal repeat retroelements but also long terminal repeat retroelements play a role in the emergence of retrocopies across eukaryotes. This body of work has shown that mRNA-mediated duplication represents a widespread phenomenon that produces an array of new genes that contribute to organismal diversity and adaptation.
Collapse
Affiliation(s)
- Claudio Casola
- Department of Ecosystem Science and Management, Texas A&M University, TX
| | - Esther Betrán
- Department of Biology, University of Texas at Arlington, Arlington, TX
| |
Collapse
|
17
|
Wan PJ, Yuan SY, Wang WX, Chen X, Lai FX, Fu Q. A Genome-Wide Identification and Analysis of the Basic Helix-Loop-Helix Transcription Factors in Brown Planthopper, Nilaparvata lugens. Genes (Basel) 2016; 7:genes7110100. [PMID: 27869716 PMCID: PMC5126786 DOI: 10.3390/genes7110100] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2016] [Revised: 10/11/2016] [Accepted: 10/19/2016] [Indexed: 11/17/2022] Open
Abstract
The basic helix-loop-helix (bHLH) transcription factors in insects play essential roles in multiple developmental processes including neurogenesis, sterol metabolism, circadian rhythms, organogenesis and formation of olfactory sensory neurons. The identification and function analysis of bHLH family members of the most destructive insect pest of rice, Nilaparvata lugens, may provide novel tools for pest management. Here, a genome-wide survey for bHLH sequences identified 60 bHLH sequences (NlbHLHs) encoded in the draft genome of N. lugens. Phylogenetic analysis of the bHLH domains successfully classified these genes into 40 bHLH families in group A (25), B (14), C (10), D (1), E (8) and F (2). The number of NlbHLHs with introns is higher than many other insect species, and the average intron length is shorter than those of Acyrthosiphon pisum. High number of ortholog families of NlbHLHs was found suggesting functional conversation for these proteins. Compared to other insect species studied, N. lugens has the highest number of bHLH members. Furthermore, gene duplication events of SREBP, Kn(col), Tap, Delilah, Sim, Ato and Crp were found in N. lugens. In addition, a putative full set of NlbHLH genes is defined and compared with another insect species. Thus, our classification of these NlbHLH members provides a platform for further investigations of bHLH protein functions in the regulation of N. lugens, and of insects in general.
Collapse
Affiliation(s)
- Pin-Jun Wan
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou 310006, China.
| | - San-Yue Yuan
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou 310006, China.
| | - Wei-Xia Wang
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou 310006, China.
| | - Xu Chen
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou 310006, China.
| | - Feng-Xiang Lai
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou 310006, China.
| | - Qiang Fu
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou 310006, China.
| |
Collapse
|
18
|
Li S, Yamada M, Han X, Ohler U, Benfey PN. High-Resolution Expression Map of the Arabidopsis Root Reveals Alternative Splicing and lincRNA Regulation. Dev Cell 2016; 39:508-522. [PMID: 27840108 DOI: 10.1016/j.devcel.2016.10.012] [Citation(s) in RCA: 190] [Impact Index Per Article: 21.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Revised: 08/15/2016] [Accepted: 10/17/2016] [Indexed: 12/31/2022]
Abstract
The extent to which alternative splicing and long intergenic noncoding RNAs (lincRNAs) contribute to the specialized functions of cells within an organ is poorly understood. We generated a comprehensive dataset of gene expression from individual cell types of the Arabidopsis root. Comparisons across cell types revealed that alternative splicing tends to remove parts of coding regions from a longer, major isoform, providing evidence for a progressive mechanism of splicing. Cell-type-specific intron retention suggested a possible origin for this common form of alternative splicing. Coordinated alternative splicing across developmental stages pointed to a role in regulating differentiation. Consistent with this hypothesis, distinct isoforms of a transcription factor were shown to control developmental transitions. lincRNAs were generally lowly expressed at the level of individual cell types, but co-expression clusters provided clues as to their function. Our results highlight insights gained from analysis of expression at the level of individual cell types.
Collapse
Affiliation(s)
- Song Li
- Department of Biology and HHMI, Duke University, Durham, NC 27708, USA
| | - Masashi Yamada
- Department of Biology and HHMI, Duke University, Durham, NC 27708, USA
| | - Xinwei Han
- Department of Biology and HHMI, Duke University, Durham, NC 27708, USA
| | - Uwe Ohler
- Department of Biostatistics & Bioinformatics, Duke University, Durham, NC 27710, USA; Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, 13125 Berlin, Germany.
| | - Philip N Benfey
- Department of Biology and HHMI, Duke University, Durham, NC 27708, USA.
| |
Collapse
|
19
|
Abstract
We examine exon junctions near apparent amino acid insertions and deletions in alignments of orthologous protein-coding genes. In 1,917 ortholog families across nine oomycete genomes, 10–20% of introns are near an alignment gap, indicating at first sight that splice-site displacements are frequent. We designed a robust algorithmic procedure for the delineation of intron-containing homologous regions, and combined it with a parsimony-based reconstruction of intron loss, gain, and splice-site shift events on a phylogeny. The reconstruction implies that 12% of introns underwent an acceptor-site shift, and 10% underwent a donor-site shift. In order to offset gene annotation problems, we amended the procedure with the reannotation of intron boundaries using alignment evidence. The corresponding reconstruction involves much fewer intron gain and splice-site shift events. The frequency of acceptor- and donor-side shifts drops to 4% and 3%, respectively, which are not much different from what one would expect by random codon insertions and deletions. In other words, gaps near exon junctions are mostly artifacts of gene annotation rather than evidence of sliding intron boundaries. Our study underscores the importance of using well-supported gene structure annotations in comparative studies. When transcription evidence is not available, we propose a robust ancestral reconstruction procedure that corrects misannotated intron boundaries using sequence alignments. The results corroborate the view that boundary shifts and complete intron sliding are only accidental in eukaryotic genome evolution and have a negligible impact on protein diversity.
Collapse
Affiliation(s)
- Steven Sêton Bocco
- Department of Biochemistry and Molecular Medicine, University of Montréal, Montréal, Canada
| | - Miklós Csűrös
- Department of Computer Science and Operations Research, University of Montréal, Montréal, Canada Institute of Genetics, Biological Research Centre, Hungarian Academy of Sciences, Szeged, Hungary
| |
Collapse
|
20
|
mRNA-Associated Processes and Their Influence on Exon-Intron Structure in Drosophila melanogaster. G3-GENES GENOMES GENETICS 2016; 6:1617-26. [PMID: 27172210 PMCID: PMC4889658 DOI: 10.1534/g3.116.029231] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
mRNA-associated processes and gene structure in eukaryotes are typically treated as separate research subjects. Here, we bridge this separation and leverage the extensive multidisciplinary work on Drosophila melanogaster to examine the roles that capping, splicing, cleavage/polyadenylation, and telescripting (i.e., the protection of nascent transcripts from premature cleavage/polyadenylation by the splicing factor U1) might play in shaping exon-intron architecture in protein-coding genes. Our findings suggest that the distance between subsequent internal 5′ splice sites (5′ss) in Drosophila genes is constrained such that telescripting effects are maximized, in theory, and thus nascent transcripts are less vulnerable to premature termination. Exceptionally weak 5′ss and constraints on intron-exon size at the gene 5′ end also indicate that capping might enhance the recruitment of U1 and, in turn, promote telescripting at this location. Finally, a positive correlation between last exon length and last 5′ss strength suggests that optimal donor splice sites in the proximity of the pre-mRNA tail may inhibit the processing of downstream polyadenylation signals more than weak donor splice sites do. These findings corroborate and build upon previous experimental and computational studies on Drosophila genes. They support the possibility, hitherto scantly explored, that mRNA-associated processes impose significant constraints on the evolution of eukaryotic gene structure.
Collapse
|
21
|
Ferro D, Lepennetier G, Catania F. Cis-acting signals modulate the efficiency of programmed DNA elimination in Paramecium tetraurelia. Nucleic Acids Res 2015; 43:8157-68. [PMID: 26304543 PMCID: PMC4787833 DOI: 10.1093/nar/gkv843] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2015] [Accepted: 08/01/2015] [Indexed: 12/12/2022] Open
Abstract
In Paramecium, the regeneration of a functional somatic genome at each sexual event relies on the elimination of thousands of germline DNA sequences, known as Internal Eliminated Sequences (IESs), from the zygotic nuclear DNA. Here, we provide evidence that IESs’ length and sub-terminal bases jointly modulate IES excision by affecting DNA conformation in P. tetraurelia. Our study reveals an excess of complementary base pairing between IESs’ sub-terminal and contiguous sites, suggesting that IESs may form DNA loops prior to cleavage. The degree of complementary base pairing between IESs’ sub-terminal sites (termed Cin-score) is positively associated with IES length and is shaped by natural selection. Moreover, it escalates abruptly when IES length exceeds 45 nucleotides (nt), indicating that only sufficiently large IESs may form loops. Finally, we find that IESs smaller than 46 nt are favored targets of the cellular surveillance systems, presumably because of their relatively inefficient excision. Our findings extend the repertoire of cis-acting determinants for IES recognition/excision and provide unprecedented insights into the distinct selective pressures that operate on IESs and somatic DNA regions. This information potentially moves current models of IES evolution and of mechanisms of IES recognition/excision forward.
Collapse
Affiliation(s)
- Diana Ferro
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstrasse 1, 48149 Münster, Germany
| | - Gildas Lepennetier
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstrasse 1, 48149 Münster, Germany
| | - Francesco Catania
- Institute for Evolution and Biodiversity, University of Münster, Hüfferstrasse 1, 48149 Münster, Germany
| |
Collapse
|
22
|
Catania F, Schmitz J. On the path to genetic novelties: insights from programmed DNA elimination and RNA splicing. WILEY INTERDISCIPLINARY REVIEWS-RNA 2015; 6:547-61. [PMID: 26140477 DOI: 10.1002/wrna.1293] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2015] [Revised: 04/29/2015] [Accepted: 06/06/2015] [Indexed: 12/17/2022]
Abstract
Understanding how genetic novelties arise is a central goal of evolutionary biology. To this end, programmed DNA elimination and RNA splicing deserve special consideration. While programmed DNA elimination reshapes genomes by eliminating chromatin during organismal development, RNA splicing rearranges genetic messages by removing intronic regions during transcription. Small RNAs help to mediate this class of sequence reorganization, which is not error-free. It is this imperfection that makes programmed DNA elimination and RNA splicing excellent candidates for generating evolutionary novelties. Leveraging a number of these two processes' mechanistic and evolutionary properties, which have been uncovered over the past years, we present recently proposed models and empirical evidence for how splicing can shape the structure of protein-coding genes in eukaryotes. We also chronicle a number of intriguing similarities between the processes of programmed DNA elimination and RNA splicing, and highlight the role that the variation in the population-genetic environment may play in shaping their target sequences.
Collapse
Affiliation(s)
- Francesco Catania
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology (ZMBE), University of Münster, Münster, Germany
| |
Collapse
|
23
|
Unmasking alternative splicing inside protein-coding exons defines exitrons and their role in proteome plasticity. Genome Res 2015; 25:995-1007. [PMID: 25934563 PMCID: PMC4484396 DOI: 10.1101/gr.186585.114] [Citation(s) in RCA: 120] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 04/30/2015] [Indexed: 11/25/2022]
Abstract
Alternative splicing (AS) diversifies transcriptomes and proteomes and is widely recognized as a key mechanism for regulating gene expression. Previously, in an analysis of intron retention events in Arabidopsis, we found unusual AS events inside annotated protein-coding exons. Here, we also identify such AS events in human and use these two sets to analyse their features, regulation, functional impact, and evolutionary origin. As these events involve introns with features of both introns and protein-coding exons, we name them exitrons (exonic introns). Though exitrons were detected as a subset of retained introns, they are clearly distinguishable, and their splicing results in transcripts with different fates. About half of the 1002 Arabidopsis and 923 human exitrons have sizes of multiples of 3 nucleotides (nt). Splicing of these exitrons results in internally deleted proteins and affects protein domains, disordered regions, and various post-translational modification sites, thus broadly impacting protein function. Exitron splicing is regulated across tissues, in response to stress and in carcinogenesis. Intriguingly, annotated intronless genes can be also alternatively spliced via exitron usage. We demonstrate that at least some exitrons originate from ancestral coding exons. Based on our findings, we propose a “splicing memory” hypothesis whereby upon intron loss imprints of former exon borders defined by vestigial splicing regulatory elements could drive the evolution of exitron splicing. Altogether, our studies show that exitron splicing is a conserved strategy for increasing proteome plasticity in plants and animals, complementing the repertoire of AS events.
Collapse
|
24
|
Niklas KJ, Bondos SE, Dunker AK, Newman SA. Rethinking gene regulatory networks in light of alternative splicing, intrinsically disordered protein domains, and post-translational modifications. Front Cell Dev Biol 2015; 3:8. [PMID: 25767796 PMCID: PMC4341551 DOI: 10.3389/fcell.2015.00008] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Accepted: 01/26/2015] [Indexed: 11/16/2022] Open
Abstract
Models for genetic regulation and cell fate specification characteristically assume that gene regulatory networks (GRNs) are essentially deterministic and exhibit multiple stable states specifying alternative, but pre-figured cell fates. Mounting evidence shows, however, that most eukaryotic precursor RNAs undergo alternative splicing (AS) and that the majority of transcription factors contain intrinsically disordered protein (IDP) domains whose functionalities are context dependent as well as subject to post-translational modification (PTM). Consequently, many transcription factors do not have fixed cis-acting regulatory targets, and developmental determination by GRNs alone is untenable. Modeling these phenomena requires a multi-scale approach to explain how GRNs operationally interact with the intra- and intercellular environments. Evidence shows that AS, IDP, and PTM complicate gene expression and act synergistically to facilitate and promote time- and cell-specific protein modifications involved in cell signaling and cell fate specification and thereby disrupt a strict deterministic GRN-phenotype mapping. The combined effects of AS, IDP, and PTM give proteomes physiological plasticity, adaptive responsiveness, and developmental versatility without inefficiently expanding genome size. They also help us understand how protein functionalities can undergo major evolutionary changes by buffering mutational consequences.
Collapse
Affiliation(s)
- Karl J Niklas
- Plant Biology Section, School of Integrative Plant Science, Cornell University Ithaca, NY, USA
| | - Sarah E Bondos
- Department of Molecular and Cellular Medicine, Texas A&M Health Science Center College Station, TX, USA
| | - A Keith Dunker
- Center for Computational Biology and Bioinformatics, School of Medicine, Indiana University Indianapolis, IN, USA
| | - Stuart A Newman
- Department of Cell Biology and Anatomy, New York Medical College Valhalla, NY, USA
| |
Collapse
|
25
|
Zhan L, Meng Q, Chen R, Yue Y, Jin Y. Origin and evolution of a new retained intron on the vulcan gene in Drosophila melanogaster subgroup species. Genome 2015; 57:567-72. [PMID: 25723758 DOI: 10.1139/gen-2014-0132] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Although numerous intron gains have been discovered, the mechanisms of intron creation have proven to be elusive. Previous study revealed that the vulcan gene of Drosophila melanogaster contained four exons in its coding region. In the current study, a newly created intron (Intron L) was identified on exon 2 of vulcan in D. melanogaster by comparing expression sequence tags. The RT-PCR experiment revealed that Intron L was associated with intron retention, in which two alternative transcripts of the gene differ by the inclusion or removal of an intron. It was found that Intron L was created by intronization of exonic sequence, and its donor and acceptor splice sites were created by synonymous mutation, leading to the origin of a new vulcan protein that is 22 amino acids shorter than the previously reported vulcan protein. Moreover, to track the origin of Intron L, 36 orthologous genes of species of Drosophila were cloned or annotated, and phylogenetic analysis was carried out. It indicated that the common ancestor of D. melangaster subgroup species created Intron L about 15 million years ago.
Collapse
Affiliation(s)
- Leilei Zhan
- Institute of Biochemistry, College of Life Sciences, Zhejiang University (Zijingang Campus), Hangzhou, Zhejiang, ZJ310058, P.R. of China
| | | | | | | | | |
Collapse
|
26
|
Wang H, Devos KM, Bennetzen JL. Recurrent loss of specific introns during angiosperm evolution. PLoS Genet 2014; 10:e1004843. [PMID: 25474210 PMCID: PMC4256211 DOI: 10.1371/journal.pgen.1004843] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2013] [Accepted: 10/22/2014] [Indexed: 11/18/2022] Open
Abstract
Numerous instances of presence/absence variations for introns have been documented in eukaryotes, and some cases of recurrent loss of the same intron have been suggested. However, there has been no comprehensive or phylogenetically deep analysis of recurrent intron loss. Of 883 cases of intron presence/absence variation that we detected in five sequenced grass genomes, 93 were confirmed as recurrent losses and the rest could be explained by single losses (652) or single gains (118). No case of recurrent intron gain was observed. Deep phylogenetic analysis often indicated that apparent intron gains were actually numerous independent losses of the same intron. Recurrent loss exhibited extreme non-randomness, in that some introns were removed independently in many lineages. The two larger genomes, maize and sorghum, were found to have a higher rate of both recurrent loss and overall loss and/or gain than foxtail millet, rice or Brachypodium. Adjacent introns and small introns were found to be preferentially lost. Intron loss genes exhibited a high frequency of germ line or early embryogenesis expression. In addition, flanking exon A+T-richness and intron TG/CG ratios were higher in retained introns. This last result suggests that epigenetic status, as evidenced by a loss of methylated CG dinucleotides, may play a role in the process of intron loss. This study provides the first comprehensive analysis of recurrent intron loss, makes a series of novel findings on the patterns of recurrent intron loss during the evolution of the grass family, and provides insight into the molecular mechanism(s) underlying intron loss. The spliceosomal introns are nucleotide sequences that interrupt coding regions of eukaryotic genes and are removed by RNA splicing after transcription. Recent studies have reported several examples of possible recurrent intron loss or gain, i.e., introns that are independently removed from or inserted into the identical sites more than once in an investigated phylogeny. However, the frequency, evolutionary patterns or other characteristics of recurrent intron turnover remain unknown. We provide results for the first comprehensive analysis of recurrent intron turnover within a plant family and show that recurrent intron loss represents a considerable portion of all intron losses identified and intron loss events far outnumber intron gain events. We also demonstrate that recurrent intron loss is non-random, affecting only a small number of introns that are repeatedly lost, and that different lineages show significantly different rates of intron loss. Our results suggest a possible role of DNA methylation in the process of intron loss. Moreover, this study provides strong support for the model of intron loss by reverse transcriptase mediated conversion of genes by their processed mRNA transcripts.
Collapse
Affiliation(s)
- Hao Wang
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
| | - Katrien M. Devos
- Department of Crop and Soil Sciences, and Department of Plant Biology, University of Georgia, Athens, Georgia, United States of America
| | - Jeffrey L. Bennetzen
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
- Germplasm Bank of Wild Species in Southwestern China, Kunming Institute of Botany, Kunming, Yunnan, P.R. China
- * E-mail:
| |
Collapse
|
27
|
Tohidi R, Idris IB, Panandam JM, Bejo MH. The effects of polymorphisms in IL-2, IFN-γ, TGF-β2, IgL, TLR-4, MD-2, and iNOS genes on resistance to Salmonella enteritidis in indigenous chickens. Avian Pathol 2014; 41:605-12. [PMID: 23237374 DOI: 10.1080/03079457.2012.739680] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Salmonella Enteritidis is a major cause of food poisoning worldwide, and poultry products are the main source of S. Enteritidis contamination for humans. Among the numerous strategies for disease control, improving genetic resistance to S. Enteritidis has been the most effective approach. We investigated the association between S. Enteritidis burden in the caecum, spleen, and liver of young indigenous chickens and seven candidate genes, selected on the basis of their critical roles in immunological functions. The genes included those encoding interleukin 2 (IL-2), interferon-γ (IFN-γ), transforming growth factor β2 (TGF-β2), immunoglobulin light chain (IgL), toll-like receptor 4 (TLR-4), myeloid differentiation protein 2 (MD-2), and inducible nitric oxide synthase (iNOS). Two Malaysian indigenous chicken breeds were used as sustainable genetic sources of alleles that are resistant to salmonellosis. The polymerase chain reaction restriction fragment-length polymorphism technique was used to genotype the candidate genes. Three different genotypes were observed in all of the candidate genes, except for MD-2. All of the candidate genes showed the Hardy-Weinberg equilibrium for the two populations. The IL-2-MnlI polymorphism was associated with S. Enteritidis burden in the caecum and spleen. The TGF-β2-RsaI, TLR-4-Sau 96I, and iNOS-AluI polymorphisms were associated with the caecum S. Enteritidis load. The other candidate genes were not associated with S. Enteritidis load in any organ. The results indicate that the IL-2, TGF-β2, TLR-4, and iNOS genes are potential candidates for use in selection programmes for increasing genetic resistance against S. Enteritidis in Malaysian indigenous chickens.
Collapse
Affiliation(s)
- Reza Tohidi
- Department of Animal Science, Faculty of Agriculture, Universiti Putra Malaysia, 43400, Serdang, Selangor, Malaysia
| | | | | | | |
Collapse
|
28
|
Yan H, Jiang C, Li X, Sheng L, Dong Q, Peng X, Li Q, Zhao Y, Jiang H, Cheng B. PIGD: a database for intronless genes in the Poaceae. BMC Genomics 2014; 15:832. [PMID: 25270086 PMCID: PMC4195894 DOI: 10.1186/1471-2164-15-832] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 09/24/2014] [Indexed: 01/31/2023] Open
Abstract
Background Intronless genes are a feature of prokaryotes; however, they are widespread and unequally distributed among eukaryotes and represent an important resource to study the evolution of gene architecture. Although many databases on exons and introns exist, there is currently no cohesive database that collects intronless genes in plants into a single database. Description In this study, we present the Poaceae Intronless Genes Database (PIGD), a user-friendly web interface to explore information on intronless genes from different plants. Five Poaceae species, Sorghum bicolor, Zea mays, Setaria italica, Panicum virgatum and Brachypodium distachyon, are included in the current release of PIGD. Gene annotations and sequence data were collected and integrated from different databases. The primary focus of this study was to provide gene descriptions and gene product records. In addition, functional annotations, subcellular localization prediction and taxonomic distribution are reported. PIGD allows users to readily browse, search and download data. BLAST and comparative analyses are also provided through this online database, which is available at http://pigd.ahau.edu.cn/. Conclusion PIGD provides a solid platform for the collection, integration and analysis of intronless genes in the Poaceae. As such, this database will be useful for subsequent bio-computational analysis in comparative genomics and evolutionary studies.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Beijiu Cheng
- Key Laboratory of Crop Biology of Anhui Province, Anhui Agricultural University, Hefei 230036, China.
| |
Collapse
|
29
|
Buchmann JP, Löytynoja A, Wicker T, Schulman AH. Analysis of CACTA transposases reveals intron loss as major factor influencing their exon/intron structure in monocotyledonous and eudicotyledonous hosts. Mob DNA 2014; 5:24. [PMID: 25206928 PMCID: PMC4158355 DOI: 10.1186/1759-8753-5-24] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Accepted: 08/18/2014] [Indexed: 01/20/2023] Open
Abstract
Background CACTA elements are DNA transposons and are found in numerous organisms. Despite their low activity, several thousand copies can be identified in many genomes. CACTA elements transpose using a ‘cut-and-paste’ mechanism, which is facilitated by a DDE transposase. DDE transposases from CACTA elements contain, despite their conserved function, different exon numbers among various CACTA families. While earlier studies analyzed the ancestral history of the DDE transposases, no studies have examined exon loss and gain with a view of mechanisms that could drive the changes. Results We analyzed 64 transposases from different CACTA families among monocotyledonous and eudicotyledonous host species. The annotation of the exon/intron boundaries showed a range from one to six exons. A robust multiple sequence alignment of the 64 transposases based on their protein sequences was created and used for phylogenetic analysis, which revealed eight different clades. We observed that the exon numbers in CACTA transposases are not specific for a host genome. We found that ancient CACTA lineages diverged before the divergence of monocotyledons and eudicotyledons. Most exon/intron boundaries were found in three distinct regions among all the transposases, grouping 63 conserved intron/exon boundaries. Conclusions We propose a model for the ancestral CACTA transposase gene, which consists of four exons, that predates the divergence of the monocotyledons and eudicotyledons. Based on this model, we propose pathways of intron loss or gain to explain the observed variation in exon numbers. While intron loss appears to have prevailed, a putative case of intron gain was nevertheless observed.
Collapse
Affiliation(s)
- Jan P Buchmann
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, PO Box 65, FIN-00014 Helsinki, Finland ; Present address: Marie Bashir Institute for Infectious Diseases and Biosecurity, Charles Perkins Center, University of Sydney, Sydney NSW 2006, Australia
| | - Ari Löytynoja
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, PO Box 65, FIN-00014 Helsinki, Finland
| | - Thomas Wicker
- Institute of Plant Biology, University of Zurich, Zollikerstrasse 107, Zurich, Switzerland
| | - Alan H Schulman
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, PO Box 65, FIN-00014 Helsinki, Finland ; Biotechnology and Food Research, MTT Agrifood Research Finland, Myllytie 1, FIN-31600 Jokioinen, Finland
| |
Collapse
|
30
|
Catania F, McGrath CL, Doak TG, Lynch M. Spliced DNA sequences in the Paramecium germline: their properties and evolutionary potential. Genome Biol Evol 2013; 5:1200-11. [PMID: 23737328 PMCID: PMC3698930 DOI: 10.1093/gbe/evt087] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Despite playing a crucial role in germline-soma differentiation, the evolutionary significance of developmentally regulated genome rearrangements (DRGRs) has received scant attention. An example of DRGR is DNA splicing, a process that removes segments of DNA interrupting genic and/or intergenic sequences. Perhaps, best known for shaping immune-system genes in vertebrates, DNA splicing plays a central role in the life of ciliated protozoa, where thousands of germline DNA segments are eliminated after sexual reproduction to regenerate a functional somatic genome. Here, we identify and chronicle the properties of 5,286 sequences that putatively undergo DNA splicing (i.e., internal eliminated sequences [IESs]) across the genomes of three closely related species of the ciliate Paramecium (P. tetraurelia, P. biaurelia, and P. sexaurelia). The study reveals that these putative IESs share several physical characteristics. Although our results are consistent with excision events being largely conserved between species, episodes of differential IES retention/excision occur, may have a recent origin, and frequently involve coding regions. Our findings indicate interconversion between somatic--often coding--DNA sequences and noncoding IESs, and provide insights into the role of DNA splicing in creating potentially functional genetic innovation.
Collapse
Affiliation(s)
- Francesco Catania
- Institute for Evolution and Biodiversity, University of Münster, Germany
| | | | | | | |
Collapse
|
31
|
Zhu T, Niu DK. Mechanisms of intron loss and gain in the fission yeast Schizosaccharomyces. PLoS One 2013; 8:e61683. [PMID: 23613904 PMCID: PMC3629103 DOI: 10.1371/journal.pone.0061683] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2013] [Accepted: 03/13/2013] [Indexed: 11/24/2022] Open
Abstract
The fission yeast, Schizosaccharomyces pombe, is an important model species with a low intron density. Previous studies showed extensive intron losses during its evolution. To test the models of intron loss and gain in fission yeasts, we conducted a comparative genomic analysis in four Schizosaccharomyces species. Both intronization and de-intronization were observed, although both were at a low frequency. A de-intronization event was caused by a degenerative mutation in the branch site. Four cases of imprecise intron losses were identified, indicating that genomic deletion is not a negligible mechanism of intron loss. Most intron losses were precise deletions of introns, and were significantly biased to the 3′ sides of genes. Adjacent introns tended to be lost simultaneously. These observations indicated that the main force shaping the exon-intron structures of fission yeasts was precise intron losses mediated by reverse transcriptase. We found two cases of intron gains caused by tandem genomic duplication, but failed to identify the mechanisms for the majority of the intron gain events observed. In addition, we found that intron-lost and intron-gained genes had certain similar features, such as similar Gene Ontology categories and expression levels.
Collapse
Affiliation(s)
- Tao Zhu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Deng-Ke Niu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, China
- * E-mail:
| |
Collapse
|
32
|
Catania F, Lynch M. A simple model to explain evolutionary trends of eukaryotic gene architecture and expression: how competition between splicing and cleavage/polyadenylation factors may affect gene expression and splice-site recognition in eukaryotes. Bioessays 2013; 35:561-70. [PMID: 23568225 DOI: 10.1002/bies.201200127] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Enormous phylogenetic variation exists in the number and sizes of introns in protein-coding genes. Although some consideration has been given to the underlying role of the population-genetic environment in defining such patterns, the influence of the intracellular environment remains virtually unexplored. Drawing from observations on interactions between co-transcriptional processes involved in splicing and mRNA 3'-end formation, a mechanistic model is proposed for splice-site recognition that challenges the commonly accepted intron- and exon-definition models. Under the suggested model, splicing factors that outcompete 3'-end processing factors for access to intronic binding sites concurrently favor the recruitment of 3'-end processing factors at the pre-mRNA tail. This hypothesis sheds new light on observations such as the intron-mediated enhancement of gene expression and the negative correlation between intron length and levels of gene expression.
Collapse
Affiliation(s)
- Francesco Catania
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany.
| | | |
Collapse
|
33
|
Koonin EV, Csuros M, Rogozin IB. Whence genes in pieces: reconstruction of the exon-intron gene structures of the last eukaryotic common ancestor and other ancestral eukaryotes. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 4:93-105. [PMID: 23139082 DOI: 10.1002/wrna.1143] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
In eukaryotes, protein-coding sequences are interrupted by non-coding sequences known as introns. During mRNA maturation, introns are excised by the spliceosome and the coding regions, exons, are spliced to form the mature coding region. The intron densities widely differ between eukaryotic lineages, from 6 to 7 introns per kb of coding sequence in vertebrates, some invertebrates and green plants, to only a few introns across the entire genome in many unicellular eukaryotes. Evolutionary reconstructions using maximum likelihood methods suggest intron-rich ancestors for each major group of eukaryotes. For the last common ancestor of animals, the highest intron density of all extant and extinct eukaryotes was inferred, at 120-130% of the human intron density. Furthermore, an intron density within 53-74% of the human values was inferred for the last eukaryotic common ancestor. Accordingly, evolution of eukaryotic genes in all lines of descent involved primarily intron loss, with substantial gain only at the bases of several branches including plants and animals. These conclusions have substantial biological implications indicating that the common ancestor of all modern eukaryotes was a complex organism with a gene architecture resembling those in multicellular organisms. Alternative splicing most likely initially appeared as an inevitable result of splicing errors and only later was employed to generate structural and functional diversification of proteins.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information NLM/NIH, Bethesda, MD, USA.
| | | | | |
Collapse
|
34
|
Yenerall P, Zhou L. Identifying the mechanisms of intron gain: progress and trends. Biol Direct 2012; 7:29. [PMID: 22963364 PMCID: PMC3443670 DOI: 10.1186/1745-6150-7-29] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2012] [Accepted: 08/22/2012] [Indexed: 12/22/2022] Open
Abstract
Abstract Continued improvements in Next-Generation DNA/RNA sequencing coupled with advances in gene annotation have provided researchers access to a plethora of annotated genomes. Subsequent analyses of orthologous gene structures have identified numerous intron gain and loss events that have occurred both recently and in the very distant past. This research has afforded exceptional insight into the temporal and lineage-specific rates of intron gain and loss among various species throughout evolution. Numerous studies have also attempted to identify the molecular mechanisms of intron gain and loss. However, even after considerable effort, very little is known about these processes. In particular, the mechanism(s) of intron gain have proven exceptionally enigmatic and remain topics of considerable debate. Currently, there exists no definitive consensus as to what mechanism(s) may generate introns. Because many introns are known to affect gene expression, it is necessary to understand the molecular process(es) by which introns may be gained. Here we review the seven most commonly purported mechanisms of intron gain and, when possible, summarize molecular evidence for or against the occurrence of each of these mechanisms. Furthermore, we catalogue indirect evidence that supports the occurrence of each mechanism. Finally, because these proposed mechanisms fail to explain the mechanistic origin of many recently gained introns, we also look at trends that may aid researchers in identifying other potential mechanism(s) of intron gain. Reviewers This article was reviewed by Eugene Koonin, Scott Roy (nominated by W. Ford Doolittle), and John Logsdon.
Collapse
Affiliation(s)
- Paul Yenerall
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | | |
Collapse
|
35
|
Liu A, Wang Y, Dang C, Zhang D, Song H, Yao Q, Chen K. A genome-wide identification and analysis of the basic helix-loop-helix transcription factors in the ponerine ant, Harpegnathos saltator. BMC Evol Biol 2012; 12:165. [PMID: 22938134 PMCID: PMC3527142 DOI: 10.1186/1471-2148-12-165] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2012] [Accepted: 08/20/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The basic helix-loop-helix (bHLH) transcription factors and their homologs form a superfamily that plays essential roles in transcriptional networks of multiple developmental processes. bHLH family members have been identified in over 20 organisms, including fruit fly, zebrafish, human and mouse. RESULT In this study, we conducted a genome-wide survey for bHLH sequences, and identified 57 bHLH sequences encoded in complete genome sequence of the ponerine ant, Harpegnathos saltator. Phylogenetic analysis of the bHLH domain sequences classified these genes into 38 bHLH families with 23, 14, 10, 1, 8 and 1 members in group A, B, C, D, E and F, respectively. The number of PabHLHs (ponerine ant bHLHs) with introns is higher than many other insect species, and they are found to have introns with average lengths only inferior to those of pea aphid. In addition, two H. saltator bHLHs named PaCrp1 and PaSide locate on two separate contigs in the genome. CONCLUSIONS A putative full set of PabHLH genes is comparable with other insect species and genes encoding Oligo, MyoRb and Figα were not found in genomes of all insect species of which bHLH family members have been identified. Moreover, in-family phylogenetic analyses indicate that the PabHLH genes are more closely related with Apis mellifera than others. The present study will serve as a solid foundation for further investigations into the structure and function of bHLH proteins in the regulation of H. saltator development.
Collapse
Affiliation(s)
- Ake Liu
- Institute of Life Sciences, Jiangsu University, 301 Xuefu Road, Zhenjiang, 212013, P. R., China
| | - Yong Wang
- School of Food and Biological Engineering, Jiangsu University, 301 Xuefu Road, Zhenjiang 212013, P. R., China
| | - Chunwang Dang
- Institute of Life Sciences, Jiangsu University, 301 Xuefu Road, Zhenjiang, 212013, P. R., China
| | - Debao Zhang
- Institute of Life Sciences, Jiangsu University, 301 Xuefu Road, Zhenjiang, 212013, P. R., China
| | - Huifang Song
- Institute of Life Sciences, Jiangsu University, 301 Xuefu Road, Zhenjiang, 212013, P. R., China
| | - Qin Yao
- Institute of Life Sciences, Jiangsu University, 301 Xuefu Road, Zhenjiang, 212013, P. R., China
| | - Keping Chen
- Institute of Life Sciences, Jiangsu University, 301 Xuefu Road, Zhenjiang, 212013, P. R., China
| |
Collapse
|
36
|
Rogozin IB, Carmel L, Csuros M, Koonin EV. Origin and evolution of spliceosomal introns. Biol Direct 2012; 7:11. [PMID: 22507701 PMCID: PMC3488318 DOI: 10.1186/1745-6150-7-11] [Citation(s) in RCA: 245] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 03/15/2012] [Indexed: 12/31/2022] Open
Abstract
Evolution of exon-intron structure of eukaryotic genes has been a matter of long-standing, intensive debate. The introns-early concept, later rebranded ‘introns first’ held that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. The introns-late concept held that introns emerged only in eukaryotes and new introns have been accumulating continuously throughout eukaryotic evolution. Analysis of orthologous genes from completely sequenced eukaryotic genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists, suggesting that many ancestral introns have persisted since the last eukaryotic common ancestor (LECA). Reconstructions of intron gain and loss using the growing collection of genomes of diverse eukaryotes and increasingly advanced probabilistic models convincingly show that the LECA and the ancestors of each eukaryotic supergroup had intron-rich genes, with intron densities comparable to those in the most intron-rich modern genomes such as those of vertebrates. The subsequent evolution in most lineages of eukaryotes involved primarily loss of introns, with only a few episodes of substantial intron gain that might have accompanied major evolutionary innovations such as the origin of metazoa. The original invasion of self-splicing Group II introns, presumably originating from the mitochondrial endosymbiont, into the genome of the emerging eukaryote might have been a key factor of eukaryogenesis that in particular triggered the origin of endomembranes and the nucleus. Conversely, splicing errors gave rise to alternative splicing, a major contribution to the biological complexity of multicellular eukaryotes. There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns. Thus, the introns-first scenario is not supported by any evidence but exon-intron structure of protein-coding genes appears to have evolved concomitantly with the eukaryotic cell, and introns were a major factor of evolution throughout the history of eukaryotes. This article was reviewed by I. King Jordan, Manuel Irimia (nominated by Anthony Poole), Tobias Mourier (nominated by Anthony Poole), and Fyodor Kondrashov. For the complete reports, see the Reviewers’ Reports section.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information NLM/NIH, 8600 Rockville Pike, Bldg, 38A, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
37
|
Andrulis ED. Theory of the origin, evolution, and nature of life. Life (Basel) 2011; 2:1-105. [PMID: 25382118 PMCID: PMC4187144 DOI: 10.3390/life2010001] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2011] [Revised: 12/10/2011] [Accepted: 12/13/2011] [Indexed: 12/22/2022] Open
Abstract
Life is an inordinately complex unsolved puzzle. Despite significant theoretical progress, experimental anomalies, paradoxes, and enigmas have revealed paradigmatic limitations. Thus, the advancement of scientific understanding requires new models that resolve fundamental problems. Here, I present a theoretical framework that economically fits evidence accumulated from examinations of life. This theory is based upon a straightforward and non-mathematical core model and proposes unique yet empirically consistent explanations for major phenomena including, but not limited to, quantum gravity, phase transitions of water, why living systems are predominantly CHNOPS (carbon, hydrogen, nitrogen, oxygen, phosphorus, and sulfur), homochirality of sugars and amino acids, homeoviscous adaptation, triplet code, and DNA mutations. The theoretical framework unifies the macrocosmic and microcosmic realms, validates predicted laws of nature, and solves the puzzle of the origin and evolution of cellular life in the universe.
Collapse
Affiliation(s)
- Erik D Andrulis
- Department of Molecular Biology and Microbiology, Case Western Reserve University School of Medicine, Wood Building, W212, Cleveland, OH 44106, USA.
| |
Collapse
|
38
|
Yenerall P, Krupa B, Zhou L. Mechanisms of intron gain and loss in Drosophila. BMC Evol Biol 2011; 11:364. [PMID: 22182367 PMCID: PMC3296678 DOI: 10.1186/1471-2148-11-364] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2011] [Accepted: 12/19/2011] [Indexed: 12/02/2022] Open
Abstract
Background It is widely accepted that orthologous genes have lost or gained introns throughout evolution. However, the specific mechanisms that generate these changes have proved elusive. Introns are known to affect nearly every level of gene expression. Therefore, understanding their mechanism of evolution after their initial fixation in eukaryotes is pertinent to understanding the means by which organisms develop greater regulation and complexity. Results To investigate possible mechanisms of intron gain and loss, we identified 189 intron gain and 297 intron loss events among 11 Drosophila species. We then investigated these events for signatures of previously proposed mechanisms of intron gain and loss. This work constitutes the first comprehensive study into the specific mechanisms that may generate intron gains and losses in Drosophila. We report evidence of intron gain via transposon insertion; the first intron loss that may have occurred via non-homologous end joining; intron gains via the repair of a double strand break; evidence of intron sliding; and evidence that internal or 5' introns may not frequently be deleted via the self-priming of reverse transcription during mRNA-mediated intron loss. Our data also suggest that the transcription process may promote or result in intron gain. Conclusion Our findings support the occurrence of intron gain via transposon insertion, repair of double strand breaks, as well as intron loss via non-homologous end joining. Furthermore, our data suggest that intron gain may be enabled by or due to transcription, and we shed further light on the exact mechanism of mRNA-mediated intron loss.
Collapse
Affiliation(s)
- Paul Yenerall
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | | | | |
Collapse
|
39
|
Extensive intron gain in the ancestor of placental mammals. Biol Direct 2011; 6:59. [PMID: 22112745 PMCID: PMC3257199 DOI: 10.1186/1745-6150-6-59] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2011] [Accepted: 11/23/2011] [Indexed: 01/29/2023] Open
Abstract
Background Genome-wide studies of intron dynamics in mammalian orthologous genes have found convincing evidence for loss of introns but very little for intron turnover. Similarly, large-scale analysis of intron dynamics in a few vertebrate genomes has identified only intron losses and no gains, indicating that intron gain is an extremely rare event in vertebrate evolution. These studies suggest that the intron-rich genomes of vertebrates do not allow intron gain. The aim of this study was to search for evidence of de novo intron gain in domesticated genes from an analysis of their exon/intron structures. Results A phylogenomic approach has been used to analyse all domesticated genes in mammals and chordates that originated from the coding parts of transposable elements. Gain of introns in domesticated genes has been reconstructed on well established mammalian, vertebrate and chordate phylogenies, and examined as to where and when the gain events occurred. The locations, sizes and amounts of de novo introns gained in the domesticated genes during the evolution of mammals and chordates has been analyzed. A significant amount of intron gain was found only in domesticated genes of placental mammals, where more than 70 cases were identified. De novo gained introns show clear positional bias, since they are distributed mainly in 5' UTR and coding regions, while 3' UTR introns are very rare. In the coding regions of some domesticated genes up to 8 de novo gained introns have been found. Intron densities in Eutheria-specific domesticated genes and in older domesticated genes that originated early in vertebrates are lower than those for normal mammalian and vertebrate genes. Surprisingly, the majority of intron gains have occurred in the ancestor of placentals. Conclusions This study provides the first evidence for numerous intron gains in the ancestor of placental mammals and demonstrates that adequate taxon sampling is crucial for reconstructing intron evolution. The findings of this comprehensive study slightly challenge the current view on the evolutionary stasis in intron dynamics during the last 100 - 200 My. Domesticated genes could constitute an excellent system on which to analyse the mechanisms of intron gain in placental mammals. Reviewers: this article was reviewed by Dan Graur, Eugene V. Koonin and Jürgen Brosius.
Collapse
|
40
|
Shawar SM, Al-Drees MA, Ramadan AR, Ali NH, Alfadhli SM. The Arabic allele: a single base pair substitution activates a 10-base downstream cryptic splice acceptor site in exon 12 of LDLR and severely decreases LDLR expression in two unrelated Arab families with familial hypercholesterolemia. Atherosclerosis 2011; 220:429-36. [PMID: 22129472 DOI: 10.1016/j.atherosclerosis.2011.10.045] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/23/2011] [Revised: 10/14/2011] [Accepted: 10/30/2011] [Indexed: 10/15/2022]
Abstract
Familial hypercholesterolemia (FH) is a monogenic autosomal dominant disorder caused by defects in LDLR. Few reports describe FH mutations among Arabs. We describe a mutation in LDLR of two unrelated Arab families. We investigated 19 patients using DNA sequencing, RFLP, and real-time (RT) PCR. DNA sequencing showed a base pair substitution (c.1706-2 A>T) in the splice acceptor site of LDLR intron 11. Our results were confirmed by RFLP on 2% agarose gel. In silico analysis predicted a new cryptic splice site downstream of the original position generating a 10-base deletion from the beginning of exon 12; (c.1706-1715del.ATCTCCTCAG). cDNA sequencing of exon 12 confirmed the computational analysis. The deletion was visualized on 4% agarose gel. The deletion generates a frameshift and a premature termination codon (c.1991-1993; p.(Asp569Valfs*93). RT-PCR revealed that LDLR mRNA is 9.3%±6.5 and 17.9%±8.0 for FH homozygote and heterozygote individuals respectively, compared to a healthy family control. We predict a class II LDLR mutation that leads to a truncated receptor missing exons 14-18. We called this mutation "the Arabic allele". We expect a significant contribution of this mutation to the prevalence of FH among Arabs. Also, we propose that the severe down regulation of LDLR mRNA expression is due to nonsense-mediated-decay.
Collapse
Affiliation(s)
- Said M Shawar
- Biotechnology Program, Arabian Gulf University, Manama, Bahrain.
| | | | | | | | | |
Collapse
|
41
|
Cohen NE, Shen R, Carmel L. The role of reverse transcriptase in intron gain and loss mechanisms. Mol Biol Evol 2011; 29:179-86. [PMID: 21804076 DOI: 10.1093/molbev/msr192] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Intron density is highly variable across eukaryotic species. It seems that different lineages have experienced considerably different levels of intron gain and loss events, but the reasons for this are not well known. A large number of mechanisms for intron loss and gain have been suggested, and most of them have at least some level of indirect support. We therefore figured out that the variability in intron density can be a reflection of the fact that different mechanisms are active in different lineages. Quite a number of these putative mechanisms, both for intron loss and for intron gain, postulate that the enzyme reverse transcriptase (RT) has a key role in the process. In this paper, we lay out three predictions whose approval or falsification gives indication for the involvement of RT in intron gain and loss processes. Testing these predictions requires data on the intron gain and loss rates of individual genes along different branches of the eukaryotic phylogenetic tree. So far, such rates could not be computed, and hence, these predictions could not be rigorously evaluated. Here, we use a maximum likelihood algorithm that we have devised in the past, Evolutionary Reconstruction by Expectation Maximization, which allows the estimation of such rates. Using this algorithm, we computed the intron loss and gain rates of more than 300 genes in each branch of the phylogenetic tree of 19 eukaryotic species. Based on that we found only little support for RT activity in intron gain. In contrast, we suggest that RT-mediated intron loss is a mechanism that is very efficient in removing introns, and thus, its levels of activity may be a major determinant of intron number. Moreover, we found that intron gain and loss rates are negatively correlated in intron-poor species but are positively correlated for intron-rich species. One explanation to this is that intron gain and loss mechanisms in intron-rich species (like metazoans) share a common mechanistic component, albeit not a RT.
Collapse
Affiliation(s)
- Noa E Cohen
- Department of Genetics, The Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem, Jerusalem, Israel
| | | | | |
Collapse
|
42
|
Ragg H. Intron creation and DNA repair. Cell Mol Life Sci 2011; 68:235-42. [PMID: 20853128 PMCID: PMC11115024 DOI: 10.1007/s00018-010-0532-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2010] [Revised: 09/07/2010] [Accepted: 09/07/2010] [Indexed: 10/19/2022]
Abstract
The genesis of the exon-intron patterns of eukaryotic genes persists as one of the most enigmatic questions in molecular genetics. In particular, the origin and mechanisms responsible for creation of spliceosomal introns have remained controversial. Now the issue appears to have taken a turn. The formation of novel introns in eukaryotes, including some vertebrate lineages, is not as rare as commonly assumed. Moreover, introns appear to have been gained in parallel at closely spaced sites and even repeatedly at the same position. Based on these discoveries, novel hypotheses of intron creation have been developed. The new concepts posit that DNA repair processes are a major source of intron formation. Here, after summarizing the current views of intron gain mechanisms, I review findings in support of the DNA repair hypothesis that provides a global mechanistic scenario for intron creation. Some implications on our perception of the mosaic structure of eukaryotic genes are also discussed.
Collapse
Affiliation(s)
- Hermann Ragg
- Department of Biotechnology, University of Bielefeld, Germany.
| |
Collapse
|
43
|
Zhang LY, Yang YF, Niu DK. Evaluation of models of the mechanisms underlying intron loss and gain in Aspergillus fungi. J Mol Evol 2010; 71:364-73. [PMID: 20862581 DOI: 10.1007/s00239-010-9391-6] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2010] [Accepted: 09/08/2010] [Indexed: 11/26/2022]
Abstract
Although intron loss and gain have been widely observed, their mechanisms are still to be determined. In four Aspergillus genomes, we found 204 cases of intron loss and 84 cases of intron gain. Using this data, we tested common hypotheses of intron loss or gain. Statistical analysis showed that adjacent introns tend to be lost simultaneously and small introns were preferentially lost, supporting the model of mRNA-mediated intron loss. The lost introns reside in internal regions of genes, which is inconsistent with the traditional version of the model (partial length cDNAs are reverse transcribed from 3' ends of mRNAs), but consistent with an alternate version (partial length cDNAs are produced by self-primed reverse transcription). The latter version was not supported by examination of the abundance of T-rich segments in mRNAs. Preferential loss of internal introns might be explained by highly efficient recombination at internal regions of genes. Among the 84 cases of intron gain, we found a significantly higher frequency of short direct repeats near exon-intron boundary than in conserved introns, supporting the double-strand break repair model. We also found possible source sequences for two cases of intron gain, one by gene conversion and one by insertion of a mitochondrial sequence during double-strand break repair. Source sequences for most gained introns could not be identified and the possible reasons were discussed. In the four Aspergillus genomes studied, we did not find evidence of frequent parallel intron gains.
Collapse
Affiliation(s)
- Lei-Ying Zhang
- MOE Key Laboratory for Biodiversity Sciences and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | | | | |
Collapse
|
44
|
Liang X, Fomenko DE, Hua D, Kaya A, Gladyshev VN. Diversity of protein and mRNA forms of mammalian methionine sulfoxide reductase B1 due to intronization and protein processing. PLoS One 2010; 5:e11497. [PMID: 20634897 PMCID: PMC2901347 DOI: 10.1371/journal.pone.0011497] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2010] [Accepted: 06/02/2010] [Indexed: 01/01/2023] Open
Abstract
Background Methionine sulfoxide reductases (Msrs) are repair enzymes that protect proteins from oxidative stress by catalyzing stereospecific reduction of oxidized methionine residues. MsrB1 is a selenocysteine-containing cytosolic/nuclear Msr with high expression in liver and kidney. Principal Findings Here, we identified differences in MsrB1 gene structure among mammals. Human MsrB1 gene consists of four, whereas the corresponding mouse gene of five exons, due to occurrence of an additional intron that flanks the stop signal and covers a large part of the 3′-UTR. This intron evolved in a subset of rodents through intronization of exonic sequences, whereas the human gene structure represents the ancestral form. In mice, both splice forms were detected in liver, kidney, brain and heart with the five-exon form being the major form. We found that both mRNA forms were translated and supported efficient selenocysteine insertion into MsrB1. In addition, MsrB1 occurs in two protein forms that migrate as 14 and 5 kDa proteins. We found that each mRNA splice form generated both protein forms. The abundance of the 5 kDa form was not influenced by protease inhibitors, replacement of selenocysteine in the active site or mutation of amino acids in the cleavage site. However, mutation of cysteines that coordinate a structural zinc decreased the levels of 5 and 14 kDa forms, suggesting importance of protein structure for biosynthesis and/stability of these forms. Conclusions This study characterized unexpected diversity of protein and mRNA forms of mammalian selenoprotein MsrB1.
Collapse
Affiliation(s)
- Xinwen Liang
- Department of Biochemistry and Redox Biology Center, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America
| | - Dmitri E. Fomenko
- Department of Biochemistry and Redox Biology Center, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America
| | - Deame Hua
- Department of Biochemistry and Redox Biology Center, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America
| | - Alaattin Kaya
- Department of Biochemistry and Redox Biology Center, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America
| | - Vadim N. Gladyshev
- Department of Biochemistry and Redox Biology Center, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America
- Division of Genetics, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
45
|
The role of transposable elements in the evolution of non-mammalian vertebrates and invertebrates. Genome Biol 2010; 11:R59. [PMID: 20525173 PMCID: PMC2911107 DOI: 10.1186/gb-2010-11-6-r59] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2010] [Revised: 04/27/2010] [Accepted: 06/02/2010] [Indexed: 01/29/2023] Open
Abstract
Background Transposable elements (TEs) have played an important role in the diversification and enrichment of mammalian transcriptomes through various mechanisms such as exonization and intronization (the birth of new exons/introns from previously intronic/exonic sequences, respectively), and insertion into first and last exons. However, no extensive analysis has compared the effects of TEs on the transcriptomes of mammals, non-mammalian vertebrates and invertebrates. Results We analyzed the influence of TEs on the transcriptomes of five species, three invertebrates and two non-mammalian vertebrates. Compared to previously analyzed mammals, there were lower levels of TE introduction into introns, significantly lower numbers of exonizations originating from TEs and a lower percentage of TE insertion within the first and last exons. Although the transcriptomes of vertebrates exhibit significant levels of exonization of TEs, only anecdotal cases were found in invertebrates. In vertebrates, as in mammals, the exonized TEs are mostly alternatively spliced, indicating that selective pressure maintains the original mRNA product generated from such genes. Conclusions Exonization of TEs is widespread in mammals, less so in non-mammalian vertebrates, and very low in invertebrates. We assume that the exonization process depends on the length of introns. Vertebrates, unlike invertebrates, are characterized by long introns and short internal exons. Our results suggest that there is a direct link between the length of introns and exonization of TEs and that this process became more prevalent following the appearance of mammals.
Collapse
|
46
|
Nonsense-mediated decay enables intron gain in Drosophila. PLoS Genet 2010; 6:e1000819. [PMID: 20107520 PMCID: PMC2809761 DOI: 10.1371/journal.pgen.1000819] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2009] [Accepted: 12/18/2009] [Indexed: 12/03/2022] Open
Abstract
Intron number varies considerably among genomes, but despite their fundamental importance, the mutational mechanisms and evolutionary processes underlying the expansion of intron number remain unknown. Here we show that Drosophila, in contrast to most eukaryotic lineages, is still undergoing a dramatic rate of intron gain. These novel introns carry significantly weaker splice sites that may impede their identification by the spliceosome. Novel introns are more likely to encode a premature termination codon (PTC), indicating that nonsense-mediated decay (NMD) functions as a backup for weak splicing of new introns. Our data suggest that new introns originate when genomic insertions with weak splice sites are hidden from selection by NMD. This mechanism reduces the sequence requirement imposed on novel introns and implies that the capacity of the spliceosome to recognize weak splice sites was a prerequisite for intron gain during eukaryotic evolution. The surprising observation 30 years ago that genes are interrupted by non-coding introns changed our view of gene architecture. Intron number varies dramatically among species; ranging from nine introns/gene in humans to less than one in some simple eukyarotes. Here we ask where new introns come from and how they are maintained in a population. We find that novel introns do not arise from pre-existing introns, although the mechanisms that generate novel introns remain unclear. We also show that novel introns carry only weak signals for their identification and removal, and therefore depend on nonsense-mediated decay (NMD). NMD maintains RNA quality control by degrading transcripts that have not been spliced properly. We propose that NMD shelters novel introns from natural selection. This increases the likelihood that a novel intron will rise in frequency and be maintained within a population, thus increasing the rate of intron gain.
Collapse
|
47
|
Tandem repeats modify the structure of human genes hosted in segmental duplications. Genome Biol 2009; 10:R137. [PMID: 19954527 PMCID: PMC2812944 DOI: 10.1186/gb-2009-10-12-r137] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2009] [Revised: 10/08/2009] [Accepted: 12/02/2009] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Recently duplicated genes are often subject to genomic rearrangements that can lead to the development of novel gene structures. Here we specifically investigated the effect of variations in internal tandem repeats (ITRs) on the gene structure of human paralogs located in segmental duplications. RESULTS We found that around 7% of the primate-specific genes located within duplicated regions of the genome contain variable tandem repeats. These genes are members of large groups of recently duplicated paralogs that are often polymorphic in the human population. Half of the identified ITRs occur within coding exons and may be either kept or spliced out from the mature transcript. When ITRs reside within exons, they encode variable amino acid repeats. When located at exon-intron boundaries, ITRs can generate alternative splicing patterns through the formation of novel introns. CONCLUSIONS Our study shows that variation in the number of ITRs impacts on recently duplicated genes by modifying their coding sequence, splicing pattern, and tissue expression. The resulting effect is the production of a variety of primate-specific proteins, which mostly differ in number and sequence of amino acid repeats.
Collapse
|
48
|
Zhu Z, Zhang Y, Long M. Extensive structural renovation of retrogenes in the evolution of the Populus genome. PLANT PHYSIOLOGY 2009; 151:1943-51. [PMID: 19789289 PMCID: PMC2785971 DOI: 10.1104/pp.109.142984] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Retroposition, as an important copy mechanism for generating new genes, was believed to play a negligible role in plants. As a representative dicot, the genomic sequences of Populus (poplar; Populus trichocarpa) provide an opportunity to investigate this issue. We identified 106 retrogenes and found the majority (89%) of them are associated with functional signatures in sequence evolution, transcription, and (or) translation. Remarkably, examination of gene structures revealed extensive structural renovation of these retrogenes: we identified 18 (17%) of them undergoing either chimerization to form new chimerical genes and (or) intronization (transformation into intron sequences of previously exonic sequences) to generate new intron-containing genes. Such a change might occur at a high speed, considering eight out of 18 such cases occurred recently after divergence between Arabidopsis (Arabidopsis thaliana) and Populus. This pattern also exists in Arabidopsis, with 15 intronized retrogenes occurring after the divergence between Arabidopsis and papaya (Carica papaya). Thus, the frequency of intronization in dicots revealed its importance as a mechanism in the evolution of exon-intron structure. In addition, we also examined the potential impact of the Populus nascent sex determination system on the chromosomal distribution of retrogenes and did not observe any significant effects of the extremely young sex chromosomes.
Collapse
|
49
|
Ubiquitous internal gene duplication and intron creation in eukaryotes. Proc Natl Acad Sci U S A 2009; 106:20818-23. [PMID: 19926850 DOI: 10.1073/pnas.0911093106] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Duplication of genomic segments provides a primary resource for the origin of evolutionary novelties. However, most previous studies have focused on duplications of complete protein-coding genes, whereas little is known about the significance of duplication segments that are entirely internal to genes. Our examination of six fully sequenced genomes reveals that internal duplications of gene segments occur at a high frequency (0.001-0.013 duplications/gene per million years), similar to that of complete gene duplications, such that 8-17% of the genes in a genome carry duplicated intronic and/or exonic regions. At least 7-30% of such genes have acquired novel introns, either because a prior intron in the same gene has been duplicated, or more commonly, because a spatial change has activated a latent splice site. These results strongly suggest a major evolutionary role for internal gene duplications in the origin of genomic novelties, particularly as a mechanism for intron gain.
Collapse
|
50
|
An Overview of the Introns-First Theory. J Mol Evol 2009; 69:527-40. [DOI: 10.1007/s00239-009-9279-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2009] [Accepted: 09/08/2009] [Indexed: 10/20/2022]
|