1
|
Dynamic Transcriptional Landscape of Grass Carp (Ctenopharyngodon idella) Reveals Key Transcriptional Features Involved in Fish Development. Int J Mol Sci 2022; 23:ijms231911547. [PMID: 36232849 PMCID: PMC9569805 DOI: 10.3390/ijms231911547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 09/23/2022] [Accepted: 09/23/2022] [Indexed: 11/17/2022] Open
Abstract
A high-quality baseline transcriptome is a valuable resource for developmental research as well as a useful reference for other studies. We gathered 41 samples representing 11 tissues/organs from 22 important developmental time points within 197 days of fertilization of grass carp eggs in order to systematically examine the role of lncRNAs and alternative splicing in fish development. We created a high-quality grass carp baseline transcriptome with a completeness of up to 93.98 percent by combining strand-specific RNA sequencing and single-molecule real-time RNA sequencing technologies, and we obtained temporal expression profiles of 33,055 genes and 77,582 transcripts during development and tissue differentiation. A family of short interspersed elements was preferentially expressed at the early stage of zygotic activation in grass carp, and its possible regulatory components were discovered through analysis. Additionally, after thoroughly analyzing alternative splicing events, we discovered that retained intron (RI) alternative splicing events change significantly in both zygotic activation and tissue differentiation. During zygotic activation, we also revealed the precise regulatory characteristics of the underlying functional RI events.
Collapse
|
2
|
Vassetzky NS, Kosushkin SA, Korchagin VI, Ryskov AP. New Ther1-derived SINE Squam3 in scaled reptiles. Mob DNA 2021; 12:10. [PMID: 33752750 PMCID: PMC7983390 DOI: 10.1186/s13100-021-00238-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 02/25/2021] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND SINEs comprise a significant part of animal genomes and are used to study the evolution of diverse taxa. Despite significant advances in SINE studies in vertebrates and higher eukaryotes in general, their own evolution is poorly understood. RESULTS We have discovered and described in detail a new Squam3 SINE specific for scaled reptiles (Squamata). The subfamilies of this SINE demonstrate different distribution in the genomes of squamates, which together with the data on similar SINEs in the tuatara allowed us to propose a scenario of their evolution in the context of reptilian evolution. CONCLUSIONS Ancestral SINEs preserved in small numbers in most genomes can give rise to taxa-specific SINE families. Analysis of this aspect of SINEs can shed light on the history and mechanisms of SINE variation in reptilian genomes.
Collapse
Affiliation(s)
- Nikita S Vassetzky
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia.
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991, Russia.
| | - Sergei A Kosushkin
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, 119991, Russia
| | - Vitaly I Korchagin
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Alexey P Ryskov
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia
| |
Collapse
|
3
|
Nishihara H. Retrotransposons spread potential cis-regulatory elements during mammary gland evolution. Nucleic Acids Res 2020; 47:11551-11562. [PMID: 31642473 PMCID: PMC7145552 DOI: 10.1093/nar/gkz1003] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2018] [Revised: 10/14/2019] [Accepted: 10/17/2019] [Indexed: 12/18/2022] Open
Abstract
Acquisition of cis-elements is a major driving force for rewiring a gene regulatory network. Several kinds of transposable elements (TEs), mostly retrotransposons that propagate via a copy-and-paste mechanism, are known to possess transcription factor binding motifs and have provided source sequences for enhancers/promoters. However, it remains largely unknown whether retrotransposons have spread the binding sites of master regulators of morphogenesis and accelerated cis-regulatory expansion involved in common mammalian morphological features during evolution. Here, I demonstrate that thousands of binding sites for estrogen receptor α (ERα) and three related pioneer factors (FoxA1, GATA3 and AP2γ) that are essential regulators of mammary gland development arose from a spreading of the binding motifs by retrotransposons. The TE-derived functional elements serve primarily as distal enhancers and are enriched around genes associated with mammary gland morphogenesis. The source TEs occurred via a two-phased expansion consisting of mainly L2/MIR in a eutherian ancestor and endogenous retrovirus 1 (ERV1) in simian primates and murines. Thus the build-up of potential sources for cis-elements by retrotransposons followed by their frequent utilization by the host (co-option/exaptation) may have a general accelerating effect on both establishing and diversifying a gene regulatory network, leading to morphological innovation.
Collapse
Affiliation(s)
- Hidenori Nishihara
- Department of Life Science and Technology, Tokyo Institute of Technology, 4259-S2-17, Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| |
Collapse
|
4
|
Seibt KM, Schmidt T, Heitkam T. The conserved 3' Angio-domain defines a superfamily of short interspersed nuclear elements (SINEs) in higher plants. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 101:681-699. [PMID: 31610059 DOI: 10.1111/tpj.14567] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 09/13/2019] [Accepted: 09/17/2019] [Indexed: 06/10/2023]
Abstract
Repetitive sequences are ubiquitous components of eukaryotic genomes affecting genome size and evolution as well as gene regulation. Among them, short interspersed nuclear elements (SINEs) are non-coding retrotransposons usually shorter than 1000 bp. They contain only few short conserved structural motifs, in particular an internal promoter derived from cellular RNAs and a mostly AT-rich 3' tail, whereas the remaining regions are highly variable. SINEs emerge and vanish during evolution, and often diversify into numerous families and subfamilies that are usually specific for only a limited number of species. In contrast, at the 3' end of multiple plant SINEs we detected the highly conserved 'Angio-domain'. This 37 bp segment defines the Angio-SINE superfamily, which encompasses 24 plant SINE families widely distributed across 13 orders within the plant kingdom. We retrieved 28 433 full-length Angio-SINE copies from genome assemblies of 46 plant species, frequently located in genes. Compensatory mutations in and adjacent to the Angio-domain imply selective restraints maintaining its RNA structure. Angio-SINE families share segmental sequence similarities, indicating a modular evolution with strong Angio-domain preservation. We suggest that the conserved domain contributes to the evolutionary success of Angio-SINEs through either structural interactions between SINE RNA and proteins increasing their transpositional efficiency, or by enhancing their accumulation in genes.
Collapse
Affiliation(s)
- Kathrin M Seibt
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| | - Thomas Schmidt
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| | - Tony Heitkam
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| |
Collapse
|
5
|
Luchetti A, Lomiento M, Mantovani B. Riding the Wave: The SINE-Specific V Highly-Conserved Domain Spread into Mammalian Genomes Exploiting the Replication Burst of the MER6 DNA Transposon. Int J Mol Sci 2019; 20:ijms20225607. [PMID: 31717545 PMCID: PMC6887750 DOI: 10.3390/ijms20225607] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 11/05/2019] [Accepted: 11/06/2019] [Indexed: 02/06/2023] Open
Abstract
Transposable elements are widely distributed within genomes where they may significantly impact their evolution and cell functions. Short interspersed elements (SINEs) are non-autonomous, fast-evolving elements, but some of them carry a highly conserved domain (HCD), whose sequence remained substantially unchanged throughout the metazoan evolution. SINEs carrying the HCD called V are absent in amniote genomes, but V-like sequences were found within the miniature inverted-repeat transposable element (MITE) MER6 in Homo sapiens. In the present work, the genomic distribution and evolution of MER6 are investigated, in order to reconstruct the origin of human V domain and to envisage its possible functional role. The analysis of 85 tetrapod genomes revealed that MER6 and its variant MER6A are found in primates, while only the MER6A variant was found in bats and eulipotyphlans. These MITEs appeared no longer active, in line with literature data on mammalian DNA transposons. Moreover, they appeared to have originated from a Mariner element found in turtles and from a V-SINE from bony fishes. MER6 insertions were found within genes and conserved in mRNAs: in line with previous hypothesis on functional role of HCDs, the MER6 V domain may be important for cell function also in mammals.
Collapse
Affiliation(s)
- Andrea Luchetti
- Department of Biological, Geological and Environmental Sciences, University of Bologna, 40126 Bologna, Italy;
- Correspondence: ; Tel.: +39-051-209-4165
| | - Mariana Lomiento
- Sant’Orsola Malpighi Hospital, University of Bologna, 40138 Bologna Italy;
| | - Barbara Mantovani
- Department of Biological, Geological and Environmental Sciences, University of Bologna, 40126 Bologna, Italy;
| |
Collapse
|
6
|
Kojima KK. LINEs Contribute to the Origins of Middle Bodies of SINEs besides 3' Tails. Genome Biol Evol 2018; 10:370-379. [PMID: 29325122 PMCID: PMC5786205 DOI: 10.1093/gbe/evy008] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/08/2018] [Indexed: 01/06/2023] Open
Abstract
Short interspersed elements (SINEs), which are nonautonomous transposable elements, require the transposition machinery of long interspersed elements (LINEs) to mobilize. SINEs are composed of two or more independently originating parts. The 5′ region is called the “head” and is derived mainly from small RNAs, and the 3′ region (“tail”) originates from the 3′ region of LINEs and is responsible for being recognized by counterpart LINE proteins. The origin of the middle “body” of SINEs is enigmatic, although significant sequence similarities among SINEs from very diverse species have been observed. Here, a systematic analysis of the similarities among SINEs and LINEs deposited on Repbase, a comprehensive database of eukaryotic repeat sequences was performed. Three primary findings are described: 1) The 5′ regions of only two clades of LINEs, RTE and Vingi, were revealed to have contributed to the middle parts of SINEs; 2) The linkage of the 5′ and 3′ parts of LINEs can be lost due to occasional tail exchange of SINEs; and 3) The previously proposed Ceph-domain was revealed to be a fusion of a CORE-domain and a 5′ part of RTE clade of LINE. Based on these findings, a hypothesis that the 5′ parts of bipartite nonautonomous LINEs, which possess only the 5′ and 3′ regions of the original LINEs, can contribute to the undefined middle part of SINEs is proposed.
Collapse
Affiliation(s)
- Kenji K Kojima
- Department of Life Sciences, National Cheng Kung University, Tainan, Taiwan.,Genetic Information Research Institute, Mountain View, California
| |
Collapse
|
7
|
Sun Y, Zhang H, Kazemian M, Troy JM, Seward C, Lu X, Stubbs L. ZSCAN5B and primate-specific paralogs bind RNA polymerase III genes and extra-TFIIIC (ETC) sites to modulate mitotic progression. Oncotarget 2018; 7:72571-72592. [PMID: 27732952 PMCID: PMC5340127 DOI: 10.18632/oncotarget.12508] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2016] [Accepted: 09/20/2016] [Indexed: 11/25/2022] Open
Abstract
Mammalian genomes contain hundreds of genes transcribed by RNA Polymerase III (Pol III), encoding noncoding RNAs and especially the tRNAs specialized to carry specific amino acids to the ribosome for protein synthesis. In addition to this well-known function, tRNAs and their genes (tDNAs) serve a variety of other critical cellular functions. For example, tRNAs and other Pol III transcripts can be cleaved to yield small RNAs with potent regulatory activities. Furthermore, from yeast to mammals, active tDNAs and related “extra-TFIIIC” (ETC) loci provide the DNA scaffolds for the most ancient known mechanism of three-dimensional chromatin architecture. Here we identify the ZSCAN5 TF family - including mammalian ZSCAN5B and its primate-specific paralogs - as proteins that occupy mammalian Pol III promoters and ETC sites. We show that ZSCAN5B binds with high specificity to a conserved subset of Pol III genes in human and mouse. Furthermore, primate-specific ZSCAN5A and ZSCAN5D also bind Pol III genes, although ZSCAN5D preferentially localizes to MIR SINE- and LINE2-associated ETC sites. ZSCAN5 genes are expressed in proliferating cell populations and are cell-cycle regulated, and siRNA knockdown experiments suggested a cooperative role in regulation of mitotic progression. Consistent with this prediction, ZSCAN5A knockdown led to increasing numbers of cells in mitosis and the appearance of cells. Together, these data implicate the role of ZSCAN5 genes in regulation of Pol III genes and nearby Pol II loci, ultimately influencing cell cycle progression and differentiation in a variety of tissues.
Collapse
Affiliation(s)
- Younguk Sun
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Huimin Zhang
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Majid Kazemian
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Laboratory of Molecular Immunology and the Immunology Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Joseph M Troy
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Illinois Informatics Program, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Christopher Seward
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Xiaochen Lu
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Lisa Stubbs
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Department of Cell and Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| |
Collapse
|
8
|
Suh A, Bachg S, Donnellan S, Joseph L, Brosius J, Kriegs JO, Schmitz J. De-novo emergence of SINE retroposons during the early evolution of passerine birds. Mob DNA 2017; 8:21. [PMID: 29255493 PMCID: PMC5729268 DOI: 10.1186/s13100-017-0104-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Accepted: 11/29/2017] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Passeriformes ("perching birds" or passerines) make up more than half of all extant bird species. The genome of the zebra finch, a passerine model organism for vocal learning, was noted previously to contain thousands of short interspersed elements (SINEs), a group of retroposons that is abundant in mammalian genomes but considered largely inactive in avian genomes. RESULTS Here we resolve the deep phylogenetic relationships of passerines using presence/absence patterns of SINEs. The resultant retroposon-based phylogeny provides a powerful and independent corroboration of previous sequence-based analyses. Notably, SINE activity began in the common ancestor of Eupasseres (passerines excluding the New Zealand wrens Acanthisittidae) and ceased before the rapid diversification of oscine passerines (suborder Passeri - songbirds). Furthermore, we find evidence for very recent SINE activity within suboscine passerines (suborder Tyranni), following the emergence of a SINE via acquisition of a different tRNA head as we suggest through template switching. CONCLUSIONS We propose that the early evolution of passerines was unusual among birds in that it was accompanied by de-novo emergence and activity of SINEs. Their genomic and transcriptomic impact warrants further study in the light of the massive diversification of passerines.
Collapse
Affiliation(s)
- Alexander Suh
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
- Department of Evolutionary Biology (EBC), Uppsala University, SE-75236 Uppsala, Sweden
| | - Sandra Bachg
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
| | - Stephen Donnellan
- South Australian Museum, Adelaide, SA 5000 Australia
- School of Biological Sciences, The University of Adelaide, Adelaide, 5005 Australia
| | - Leo Joseph
- Australian National Wildlife Collection, CSIRO National Research Collections Australia, Canberra, ACT 2601 Australia
| | - Jürgen Brosius
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
- Brandenburg Medical School (MHB), D-16816 Neuruppin, Germany
| | - Jan Ole Kriegs
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
- LWL-Museum für Naturkunde, Westfälisches Landesmuseum mit Planetarium, D-48161 Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
| |
Collapse
|
9
|
Luchetti A, Plazzi F, Mantovani B. Evolution of Two Short Interspersed Elements in Callorhinchus milii (Chondrichthyes, Holocephali) and Related Elements in Sharks and the Coelacanth. Genome Biol Evol 2017; 9:3824762. [PMID: 28505260 PMCID: PMC5499810 DOI: 10.1093/gbe/evx094] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/11/2017] [Indexed: 12/11/2022] Open
Abstract
Short interspersed elements (SINEs) are non-autonomous retrotransposons. Although they usually show fast evolutionary rates, in some instances highly conserved domains (HCDs) have been observed in elements with otherwise divergent sequences and from distantly related species. Here, we document the life history of two HCD-SINE families in the elephant shark Callorhinchus milii, one specific to the holocephalan lineage (CmiSINEs) and another one (SacSINE1-CM) with homologous elements in sharks and the coelacanth (SacSINE1s, LmeSINE1s). The analyses of their relationships indicated that these elements share the same 3′-tail, which would have allowed both elements to rise to high copy number by exploiting the C. milii L2-2_CM long interspersed element (LINE) enzymes. Molecular clock analysis on SINE activity in C. milii genome evidenced two replication bursts occurring right after two major events in the holocephalan evolution: the end-Permian mass extinction and the radiation of modern Holocephali. Accordingly, the same analysis on the coelacanth homologous elements, LmeSINE1, identified a replication wave close to the split age of the two extant Latimeria species. The genomic distribution of the studied SINEs pointed out contrasting results: some elements were preferentially sorted out from gene regions, but accumulated in flanking regions, while others appear more conserved within genes. Moreover, data from the C. milii transcriptome suggest that these SINEs could be involved in miRNA biogenesis and may be targets for miRNA-based regulation.
Collapse
Affiliation(s)
- Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali - Università di Bologna, Italy
| | - Federico Plazzi
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali - Università di Bologna, Italy
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali - Università di Bologna, Italy
| |
Collapse
|
10
|
Luchetti A, Mantovani B. Rare horizontal transmission does not hide long-term inheritance of SINE highly conserved domains in the metazoan evolution. Curr Zool 2016; 62:667-674. [PMID: 29491954 PMCID: PMC5804259 DOI: 10.1093/cz/zow095] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Accepted: 08/05/2016] [Indexed: 12/27/2022] Open
Abstract
Transposable elements (TEs) are self-replicating, mobile DNA sequences which constitute a significant fraction of eukaryotic genomes. They are generally considered selfish DNA, as their replication and random insertion may have deleterious effects on genome functionalities, although some beneficial effects and evolutionary potential have been recognized. Short interspersed elements (SINEs) are non-autonomous TEs with a modular structure: a small RNA-related head, a body, and a long interspersed element-related tail. Despite their high turnover rate and de novo emergence, the body may retain highly conserved domains (HCDs) shared among divergent SINE families: in metazoans, at least nine HCD-SINEs have been recognized. Data mining on public molecular databases allowed the retrieval of 16 new HCD-SINE families from cnidarian, molluscs, arthropods, and vertebrates. Tracking the ancestry of HCDs on the metazoan phylogeny revealed that some of them date back to the Radiata–Bilateria split. Moreover, phylogenetic and age versus divergence analyses of the most ancient HCDs suggested that long-term vertical inheritance is the rule, with few horizontal transfer events. We suggest that the evolutionary conservation of HCDs may be linked to their potential to serve as recombination hotspots. This indirectly affects host genomes by maintaining active and diverse SINE lineages, whose insertions may impact (either positively or negatively) on the evolution of the genome.
Collapse
Affiliation(s)
- Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali-Università di Bologna, Via Selmi 3, Bologna 40126, Italy
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali-Università di Bologna, Via Selmi 3, Bologna 40126, Italy
| |
Collapse
|
11
|
Luchetti A, Šatović E, Mantovani B, Plohl M. RUDI, a short interspersed element of the V-SINE superfamily widespread in molluscan genomes. Mol Genet Genomics 2016; 291:1419-29. [PMID: 26987730 DOI: 10.1007/s00438-016-1194-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 02/29/2016] [Indexed: 01/28/2023]
Abstract
Short interspersed elements (SINEs) are non-autonomous retrotransposons that are widespread in eukaryotic genomes. They exhibit a chimeric sequence structure consisting of a small RNA-related head, an anonymous body and an AT-rich tail. Although their turnover and de novo emergence is rapid, some SINE elements found in distantly related species retain similarity in certain core segments (or highly conserved domains, HCD). We have characterized a new SINE element named RUDI in the bivalve molluscs Ruditapes decussatus and R. philippinarum and found this element to be widely distributed in the genomes of a number of mollusc species. An unexpected structural feature of RUDI is the HCD domain type V, which was first found in non-amniote vertebrate SINEs and in the SINE from one cnidarian species. In addition to the V domain, the overall sequence conservation pattern of RUDI elements resembles that found in ancient AmnSINE (~310 Myr old) and Au SINE (~320 Myr old) families, suggesting that RUDI might be among the most ancient SINE families. Sequence conservation suggests a monophyletic origin of RUDI. Nucleotide variability and phylogenetic analyses suggest long-term vertical inheritance combined with at least one horizontal transfer event as the most parsimonious explanation for the observed taxonomic distribution.
Collapse
Affiliation(s)
- Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy.
| | - Eva Šatović
- Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Miroslav Plohl
- Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| |
Collapse
|
12
|
Nishihara H, Plazzi F, Passamonti M, Okada N. MetaSINEs: Broad Distribution of a Novel SINE Superfamily in Animals. Genome Biol Evol 2016; 8:528-39. [PMID: 26872770 PMCID: PMC4824008 DOI: 10.1093/gbe/evw029] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
SINEs (short interspersed elements) are transposable elements that typically originate independently in each taxonomic clade (order/family). However, some SINE families share a highly similar central sequence and are thus categorized as a SINE superfamily. Although only four SINE superfamilies (CORE-SINEs, V-SINEs, DeuSINEs, and Ceph-SINEs) have been reported so far, it is expected that new SINE superfamilies would be discovered by deep exploration of new SINEs in metazoan genomes. Here we describe 15 SINEs, among which 13 are novel, that have a similar 66-bp central region and therefore constitute a new SINE superfamily, MetaSINEs. MetaSINEs are distributed from fish to cnidarians, suggesting their common evolutionary origin at least 640 Ma. Because the 3′ tails of MetaSINEs are variable, these SINEs most likely survived by changing their partner long interspersed elements for retrotransposition during evolution. Furthermore, we examined the presence of members of other SINE superfamilies in bivalve genomes and characterized eight new SINEs belonging to the CORE-SINEs, V-SINEs, and DeuSINEs, in addition to the MetaSINEs. The broad distribution of bivalve SINEs suggests that at least three SINEs originated in the common ancestor of Bivalvia. Our comparative analysis of the central domains of the SINEs revealed that, in each superfamily, only a restricted region is shared among all of its members. Because the functions of the central domains of the SINE superfamilies remain unknown, such structural information of SINE superfamilies will be useful for future experimental and comparative analyses to reveal why they have been retained in metazoan genomes during evolution.
Collapse
Affiliation(s)
- Hidenori Nishihara
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-Ku, Yokohama, Kanagawa, Japan
| | - Federico Plazzi
- Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
| | - Marco Passamonti
- Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
| | - Norihiro Okada
- Department of Life Sciences, National Cheng Kung University, Tainan, Taiwan Foundation for Advancement of International Science, Tsukuba, Japan
| |
Collapse
|
13
|
Matetovici I, Sajgo S, Ianc B, Ochis C, Bulzu P, Popescu O, Damert A. Mobile Element Evolution Playing Jigsaw - SINEs in Gastropod and Bivalve Mollusks. Genome Biol Evol 2016; 8:253-70. [PMID: 26739168 PMCID: PMC4758252 DOI: 10.1093/gbe/evv257] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
SINEs (Short INterspersed Elements) are widely distributed among eukaryotes. Some SINE families are organized in superfamilies characterized by a shared central domain. These central domains are conserved across species, classes, and even phyla. Here we report the identification of two novel such superfamilies in the genomes of gastropod and bivalve mollusks. The central conserved domain of the first superfamily is present in SINEs in Caenogastropoda and Vetigastropoda as well as in all four subclasses of Bivalvia. We designated the domain MESC (Romanian for MElc-snail and SCoica-mussel) because it appears to be restricted to snails and mussels. The second superfamily is restricted to Caenogastropoda. Its central conserved domain-Snail-is related to the Nin-DC domain. Furthermore, we provide evidence that a 40-bp subdomain of the SINE V-domain is conserved in SINEs in mollusks and arthropods. It is predicted to form a stable stem-loop structure that is preserved in the context of the overall SINE RNA secondary structure in invertebrates. Our analysis also recovered short retrotransposons with a Long INterspersed Element (LINE)-derived 5' end. These share the body and/or the tail with transfer RNA (tRNA)-derived SINEs within and across species. Finally, we identified CORE SINEs in gastropods and bivalves-extending the distribution range of this superfamily.
Collapse
Affiliation(s)
- Irina Matetovici
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Cluj-Napoca, Romania Present address: Institute of Tropical Medicine, Unit of Veterinary Protozoology, Antwerpen, Belgium
| | - Szilard Sajgo
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Cluj-Napoca, Romania Present address: Danish Research Institute of Translational Neuroscience, Nordic EMBL Partnership for Molecular Medicine, DANDRITE, Aarhus University, Aarhus, Denmark
| | - Bianca Ianc
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Cluj-Napoca, Romania
| | - Cornelia Ochis
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Cluj-Napoca, Romania
| | - Paul Bulzu
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Cluj-Napoca, Romania
| | | | - Annette Damert
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Cluj-Napoca, Romania
| |
Collapse
|
14
|
Polyadenylation of RNA transcribed from mammalian SINEs by RNA polymerase III: Complex requirements for nucleotide sequences. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2015; 1859:355-65. [PMID: 26700565 DOI: 10.1016/j.bbagrm.2015.12.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 09/08/2015] [Revised: 12/09/2015] [Accepted: 12/11/2015] [Indexed: 01/08/2023]
Abstract
It is generally accepted that only transcripts synthesized by RNA polymerase II (e.g., mRNA) were subject to AAUAAA-dependent polyadenylation. However, we previously showed that RNA transcribed by RNA polymerase III (pol III) from mouse B2 SINE could be polyadenylated in an AAUAAA-dependent manner. Many species of mammalian SINEs end with the pol III transcriptional terminator (TTTTT) and contain hexamers AATAAA in their A-rich tail. Such SINEs were united into Class T(+), whereas SINEs lacking the terminator and AATAAA sequences were classified as T(-). Here we studied the structural features of SINE pol III transcripts that are necessary for their polyadenylation. Eight and six SINE families from classes T(+) and T(-), respectively, were analyzed. The replacement of AATAAA with AACAAA in T(+) SINEs abolished the RNA polyadenylation. Interestingly, insertion of the polyadenylation signal (AATAAA) and pol III transcription terminator in T(-) SINEs did not result in polyadenylation. The detailed analysis of three T(+) SINEs (B2, DIP, and VES) revealed areas important for the polyadenylation of their pol III transcripts: the polyadenylation signal and terminator in A-rich tail, β region positioned immediately downstream of the box B of pol III promoter, and τ region located upstream of the tail. In DIP and VES (but not in B2), the τ region is a polypyrimidine motif which is also characteristic of many other T(+) SINEs. Most likely, SINEs of different mammals acquired these structural features independently as a result of parallel evolution.
Collapse
|
15
|
Domené S, Bumaschny VF, de Souza FSJ, Franchini LF, Nasif S, Low MJ, Rubinstein M. Enhancer turnover and conserved regulatory function in vertebrate evolution. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130027. [PMID: 24218639 DOI: 10.1098/rstb.2013.0027] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Mutations in regulatory regions including enhancers are an important source of variation and innovation during evolution. Enhancers can evolve by changes in the sequence, arrangement and repertoire of transcription factor binding sites, but whole enhancers can also be lost or gained in certain lineages in a process of turnover. The proopiomelanocortin gene (Pomc), which encodes a prohormone, is expressed in the pituitary and hypothalamus of all jawed vertebrates. We have previously described that hypothalamic Pomc expression in mammals is controlled by two enhancers-nPE1 and nPE2-that are derived from transposable elements and that presumably replaced the ancestral neuronal Pomc regulatory regions. Here, we show that nPE1 and nPE2, even though they are mammalian novelties with no homologous counterpart in other vertebrates, nevertheless can drive gene expression specifically to POMC neurons in the hypothalamus of larval and adult transgenic zebrafish. This indicates that when neuronal Pomc enhancers originated de novo during early mammalian evolution, the newly created cis- and trans-codes were similar to the ancestral ones. We also identify the neuronal regulatory region of zebrafish pomca and confirm that it is not homologous to the mammalian enhancers. Our work sheds light on the process of gene regulatory evolution by showing how a locus can undergo enhancer turnover and nevertheless maintain the ancestral transcriptional output.
Collapse
Affiliation(s)
- Sabina Domené
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, , C1428ADN Buenos Aires, Argentina
| | | | | | | | | | | | | |
Collapse
|
16
|
Luchetti A, Mantovani B. Conserved domains and SINE diversity during animal evolution. Genomics 2013; 102:296-300. [PMID: 23981965 DOI: 10.1016/j.ygeno.2013.08.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2013] [Revised: 06/25/2013] [Accepted: 08/14/2013] [Indexed: 11/28/2022]
Abstract
Eukaryotic genomes harbour a number of mobile genetic elements (MGEs); moving from one genomic location to another, they are known to impact on the host genome. Short interspersed elements (SINEs) are well-represented, non-autonomous retroelements and they are likely the most diversified MGEs. In some instances, sequence domains conserved across unrelated SINEs have been identified; remarkably, one of these, called Nin, has been conserved since the Radiata-Bilateria splitting. Here we report on two new domains: Inv, derived from Nin, identified in insects and in deuterostomes, and Pln, restricted to polyneopteran insects. The identification of Inv and Pln sequences allowed us to retrieve new SINEs, two in insects and one in a hemichordate. The diverse structural combination of the different domains in different SINE families, during metazoan evolution, offers a clearer view of SINE diversity and their frequent de novo emergence through module exchange, possibly underlying the high evolutionary success of SINEs.
Collapse
Affiliation(s)
- Andrea Luchetti
- Dip. Scienze Biologiche, Geologiche e Ambientali (BiGeA) - Università di Bologna, via Selmi 3, 40126 Bologna, Italy.
| | | |
Collapse
|
17
|
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
Collapse
|
18
|
de Souza FS, Franchini LF, Rubinstein M. Exaptation of transposable elements into novel cis-regulatory elements: is the evidence always strong? Mol Biol Evol 2013; 30:1239-51. [PMID: 23486611 PMCID: PMC3649676 DOI: 10.1093/molbev/mst045] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Transposable elements (TEs) are mobile genetic sequences that can jump around the genome from one location to another, behaving as genomic parasites. TEs have been particularly effective in colonizing mammalian genomes, and such heavy TE load is expected to have conditioned genome evolution. Indeed, studies conducted both at the gene and genome levels have uncovered TE insertions that seem to have been co-opted--or exapted--by providing transcription factor binding sites (TFBSs) that serve as promoters and enhancers, leading to the hypothesis that TE exaptation is a major factor in the evolution of gene regulation. Here, we critically review the evidence for exaptation of TE-derived sequences as TFBSs, promoters, enhancers, and silencers/insulators both at the gene and genome levels. We classify the functional impact attributed to TE insertions into four categories of increasing complexity and argue that so far very few studies have conclusively demonstrated exaptation of TEs as transcriptional regulatory regions. We also contend that many genome-wide studies dealing with TE exaptation in recent lineages of mammals are still inconclusive and that the hypothesis of rapid transcriptional regulatory rewiring mediated by TE mobilization must be taken with caution. Finally, we suggest experimental approaches that may help attributing higher-order functions to candidate exapted TEs.
Collapse
Affiliation(s)
- Flávio S.J. de Souza
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Lucía F. Franchini
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Marcelo Rubinstein
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
- Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
19
|
Abstract
SINEBase (http://sines.eimb.ru) integrates the revisited body of knowledge about short interspersed elements (SINEs). A set of formal definitions concerning SINEs was introduced. All available sequence data were screened through these definitions and the genetic elements misidentified as SINEs were discarded. As a result, 175 SINE families have been recognized in animals, flowering plants and green algae. These families were classified by the modular structure of their nucleotide sequences and the frequencies of different patterns were evaluated. These data formed the basis for the database of SINEs. The SINEBase website can be used in two ways: first, to explore the database of SINE families, and second, to analyse candidate SINE sequences using specifically developed tools. This article presents an overview of the database and the process of SINE identification and analysis.
Collapse
Affiliation(s)
- Nikita S Vassetzky
- Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Moscow 119991, Russia
| | | |
Collapse
|
20
|
Nilsson MA, Janke A, Murchison EP, Ning Z, Hallström BM. Expansion of CORE-SINEs in the genome of the Tasmanian devil. BMC Genomics 2012; 13:172. [PMID: 22559330 PMCID: PMC3403934 DOI: 10.1186/1471-2164-13-172] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Accepted: 05/06/2012] [Indexed: 11/22/2022] Open
Abstract
Background The genome of the carnivorous marsupial, the Tasmanian devil (Sarcophilus harrisii, Order: Dasyuromorphia), was sequenced in the hopes of finding a cure for or gaining a better understanding of the contagious devil facial tumor disease that is threatening the species’ survival. To better understand the Tasmanian devil genome, we screened it for transposable elements and investigated the dynamics of short interspersed element (SINE) retroposons. Results The temporal history of Tasmanian devil SINEs, elucidated using a transposition in transposition analysis, indicates that WSINE1, a CORE-SINE present in around 200,000 copies, is the most recently active element. Moreover, we discovered a new subtype of WSINE1 (WSINE1b) that comprises at least 90% of all Tasmanian devil WSINE1s. The frequencies of WSINE1 subtypes differ in the genomes of two of the other Australian marsupial orders. A co-segregation analysis indicated that at least 66 subfamilies of WSINE1 evolved during the evolution of Dasyuromorphia. Using a substitution rate derived from WSINE1 insertions, the ages of the subfamilies were estimated and correlated with a newly established phylogeny of Dasyuromorphia. Phylogenetic analyses and divergence time estimates of mitochondrial genome data indicate a rapid radiation of the Tasmanian devil and the closest relative the quolls (Dasyurus) around 14 million years ago. Conclusions The radiation and abundance of CORE-SINEs in marsupial genomes indicates that they may be a major player in the evolution of marsupials. It is evident that the early phases of evolution of the carnivorous marsupial order Dasyuromorphia was characterized by a burst of SINE activity. A correlation between a speciation event and a major burst of retroposon activity is for the first time shown in a marsupial genome.
Collapse
Affiliation(s)
- Maria A Nilsson
- LOEWE-Biodiversity and Climate Research Center, BiK-F, Senckenberganlage 25, Frankfurt am Main D-60325, Germany.
| | | | | | | | | |
Collapse
|
21
|
Dufresne F, Jeffery N. A guided tour of large genome size in animals: what we know and where we are heading. Chromosome Res 2011; 19:925-38. [DOI: 10.1007/s10577-011-9248-x] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
22
|
Ryabinina NL, Bannikova AA, Sheremet’eva VA, Chikobava MG, Lapin BA, Kramerov DA. Analysis of DNA of higher primates using inter-SINE PCR. RUSS J GENET+ 2011. [DOI: 10.1134/s1022795408030046] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
23
|
Molecular characterization, genomic distribution and evolutionary dynamics of Short INterspersed Elements in the termite genome. Mol Genet Genomics 2010; 285:175-84. [PMID: 21184097 DOI: 10.1007/s00438-010-0595-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2010] [Accepted: 12/06/2010] [Indexed: 10/18/2022]
Abstract
Short INterspersed Elements (SINEs) in invertebrates, and especially in animal inbred genomes such that of termites, are poorly known; in this paper we characterize three new SINE families (Talub, Taluc and Talud) through the analyses of 341 sequences, either isolated from the Reticulitermes lucifugus genome or drawn from EST Genbank collection. We further add new data to the only isopteran element known so far, Talua. These SINEs are tRNA-derived elements, with an average length ranging from 258 to 372 bp. The tails are made up by poly(A) or microsatellite motifs. Their copy number varies from 7.9 × 10(3) to 10(5) copies, well within the range observed for other metazoan genomes. Species distribution, age and target site duplication analysis indicate Talud as the oldest, possibly inactive SINE originated before the onset of Isoptera (~150 Myr ago). Taluc underwent to substantial sequence changes throughout the evolution of termites and data suggest it was silenced and then re-activated in the R. lucifugus lineage. Moreover, Taluc shares a conserved sequence block with other unrelated SINEs, as observed for some vertebrate and cephalopod elements. The study of genomic environment showed that insertions are mainly surrounded by microsatellites and other SINEs, indicating a biased accumulation within non-coding regions. The evolutionary dynamics of Talu~ elements is explained through selective mechanisms acting in an inbred genome; in this respect, the study of termites' SINEs activity may provide an interesting framework to address the (co)evolution of mobile elements and the host genome.
Collapse
|
24
|
Morescalchi MA, Barucca M, Stingo V, Capriglione T. Polypteridae (Actinopterygii: Cladistia) and DANA-SINEs insertions. Mar Genomics 2010; 3:79-84. [PMID: 21798200 DOI: 10.1016/j.margen.2010.06.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2009] [Revised: 06/07/2010] [Accepted: 06/15/2010] [Indexed: 01/09/2023]
Abstract
SINE sequences are interspersed throughout virtually all eukaryotic genomes and greatly outnumber the other repetitive elements. These sequences are of increasing interest for phylogenetic studies because of their diagnostic power for establishing common ancestry among taxa, once properly characterized. We identified and characterized a peculiar family of composite tRNA-derived short interspersed SINEs, DANA-SINEs, associated with mutational activities in Danio rerio, in a group of species belonging to one of the most basal bony fish families, the Polypteridae, in order to investigate their own inner specific phylogenetic relationships. DANA sequences were identified, sequenced and then localized, by means of fluorescent in situ hybridization (FISH), in six Polypteridae species (Polypterus delhezi, P. ornatipinnis, P. palmas, P. buettikoferi P. senegalus and Erpetoichthys calabaricus) After cloning, the sequences obtained were aligned for phylogenetic analysis, comparing them with three Dipnoan lungfish species (Protopterus annectens, P. aethiopicus, Lepidosiren paradoxa), and Lethenteron reissneri (Petromyzontidae)was used as outgroup. The obtained overlapping MP, ML and NJ tree clustered together the species belonging to the two taxonomically different Osteichthyans groups: the Polypteridae, by one side, and the Protopteridae by the other, with the monotypic genus Erpetoichthys more distantly related to the Polypterus genus comprising three distinct groups: P. palmas and P. buettikoferi, P. delhezi and P. ornatipinnis and P. senegalus. In situ hybridization with DANA probes marked along the whole chromosome arms in the metaphases of all the Polypteridae species examined.
Collapse
Affiliation(s)
- Maria Alessandra Morescalchi
- Dipartimento di Scienze della Vita, Seconda Università degli Studi di Napoli, via Vivaldi 43, 81100, Caserta, Italy.
| | | | | | | |
Collapse
|
25
|
Akasaki T, Nikaido M, Nishihara H, Tsuchiya K, Segawa S, Okada N. Characterization of a novel SINE superfamily from invertebrates: "Ceph-SINEs" from the genomes of squids and cuttlefish. Gene 2009; 454:8-19. [PMID: 19914361 DOI: 10.1016/j.gene.2009.11.005] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2009] [Revised: 10/30/2009] [Accepted: 11/06/2009] [Indexed: 11/27/2022]
Abstract
Five tRNA-derived short interspersed repetitive elements (SINEs), named SepiaSINE, Sepioth-SINE1, Sepioth-SINE2A, Sepioth-SINE2B and OegopSINE, were isolated from the genomes of three decabrachian species [Sepia officinalis (order Sepiida), Sepiotheuthis lessoniana (suborder Myopsida), and Mastigoteuthis cordiformes (suborder Oegopsida)], by random sequencing and genome screening. In addition, two tRNA-derived SINEs, named IdioSINE1 and IdioSINE2, were further detected from EST (expressed sequence tag) data of Idiosepius paradoxus (order Idiosepiida), using a GenBank FASTA search with a conserved sequence of the SepiaSINE as the query. All the isolated SINEs had a common and unique highly conserved 149-bp sequence in their central structures (Sepioth-SINE2B and IdioSINEs, however, had a continuous 73-bp deletion in the conserved region.), and are therefore grouped as the fourth SINE superfamily "Ceph-SINEs", following the CORE-SINE, V-SINE, and DeuSINE superfamilies. Our analysis suggested that the central conserved region called the "Ceph-domain" might have originated before the diversification of cephalopods (505 myr ago). A sequence alignment of Sepioth-SINE1, Sepioth-SINE2A, and Sepioth-SINE2B demonstrated that Sepioth-SINE2A has a chimeric structure shared with two other SINEs. The above relationship suggests possible template switching in the central conserved domain during reverse transcription for the birth of Sepioth-SINE2A, providing the possibility that the presence of the conserved domain contributed to yield a variety of SINEs during evolution. Furthermore, the distributions of the isolated SINEs showed that order Sepiida, suborders Oegopsida and Myopsida, and order Idiosepiida have their own independent SINE(s), and suggest that order Sepiida can be largely separated into two groups, with clarification of the phylogenetic relatedness between subfamily Sepioteuthinae and the other loliginid squids.
Collapse
Affiliation(s)
- Tetsuya Akasaki
- Department of Biological Science, Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259, Nagatsuta-cho, Midori-ku, Yokohama 226-8501, Japan
| | | | | | | | | | | |
Collapse
|
26
|
Hasnaoui M, Doucet AJ, Meziane O, Gilbert N. Ancient repeat sequence derived from U6 snRNA in primate genomes. Gene 2009; 448:139-44. [PMID: 19647053 DOI: 10.1016/j.gene.2009.07.015] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Revised: 07/15/2009] [Accepted: 07/15/2009] [Indexed: 02/06/2023]
Abstract
LINE-1 (L1) is the most represented sequence of the human genome (17% of the total genomic mass). Moreover, it has been proposed for many years and demonstrated more recently that L1 has contributed to the mobilization of pseudogenes, small non-coding RNAs, such as tRNAs or snRNAs, and SINEs. In fact, it is estimated that L1 is responsible for at least 30% of our genome. The mobilization of non-L1 RNAs can occur in different ways and at different steps of the retrotransposition cycle. Here, by looking at U6 snRNA sequences mobilized by L1, we have observed an ancient repeat sequence derived from U6, present in all primate genomes. We were able to trace its origin in Euarchota genomes, most likely during the divergence of the four orders; Scandentia, Dermoptera, Plesiadapiform (extinct) and Primates.
Collapse
Affiliation(s)
- Manel Hasnaoui
- Institut de Génétique Humaine, Centre National de la Recherche Scientifique, 141 Rue de la Cardonille, 34396 Montpellier Cedex 5, France
| | | | | | | |
Collapse
|
27
|
Novick PA, Basta H, Floumanhaft M, McClure MA, Boissinot S. The Evolutionary Dynamics of Autonomous Non-LTR Retrotransposons in the Lizard Anolis Carolinensis Shows More Similarity to Fish Than Mammals. Mol Biol Evol 2009; 26:1811-22. [DOI: 10.1093/molbev/msp090] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
|
28
|
Gogolevsky KP, Vassetzky NS, Kramerov DA. 5S rRNA-derived and tRNA-derived SINEs in fruit bats. Genomics 2009; 93:494-500. [PMID: 19442632 DOI: 10.1016/j.ygeno.2009.02.001] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2008] [Revised: 02/04/2009] [Accepted: 02/04/2009] [Indexed: 11/24/2022]
Abstract
Most short retroposons (SINEs) descend from cellular tRNA of 7SL RNA. Here, four new SINEs were found in megabats (Megachiroptera) but neither in microbats nor in other mammals. Two of them, MEG-RS and MEG-RL, descend from another cellular RNA, 5S rRNA; one (MEG-T2) is a tRNA-derived SINE; and MEG-TR is a hybrid tRNA/5S rRNA SINE. Insertion locus analysis suggests that these SINEs were active in the recent fruit bat evolution. Analysis of MEG-RS and MEG-RL in comparison with other few 5S rRNA-derived SINEs demonstrates that the internal RNA polymerase III promoter is their most invariant region, while the secondary structure is more variable. The mechanisms underlying the modular structure of these and other SINEs as well as their variation are discussed. The scenario of evolution of MEG SINEs is proposed.
Collapse
Affiliation(s)
- Konstantin P Gogolevsky
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov St., Moscow 119991, Russia
| | | | | |
Collapse
|
29
|
Matveev V, Okada N. Retroposons of salmonoid fishes (Actinopterygii: Salmonoidei) and their evolution. Gene 2008; 434:16-28. [PMID: 18590946 DOI: 10.1016/j.gene.2008.04.022] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2008] [Revised: 04/28/2008] [Accepted: 04/29/2008] [Indexed: 11/27/2022]
Abstract
Short and long retroposons, or non-LTR retrotransposons (SINEs and LINEs, respectively) are two groups of interspersed repetitive elements amplifying in the genome via RNA and cDNA-mediated reverse transcription. In this process, SINEs entirely depend on the enzymatic machinery of autonomous LINEs. The impact of retroposons on the host genome is difficult to overestimate: their sequences account for significant portion of the eukaryotic genome, while propagation of their active copies gradually reshapes it. In this way, the retropositional activity plays a role of important evolutionary factor. More than 100 LINE and nearly 100 SINE families have been described to date from the genomes of various eukaryotes, and it is salmonoid fishes (Actinopterygii: Salmonoidei) that are particularly noticeable for the diversity of transposons they host in their genomes, including two LINE and seven SINE families. Moreover, this group of ray-finned fish represents an excellent opportunity to study such a rare evolutionary phenomenon as lateral gene transfer, due to a great variety of transposons and other sequences salmons share with a blood fluke, Schistosoma japonicum (Trematoda: Strigeiformes)--a parasitic helminth infecting various vertebrates. The aim of the present review is to structure all knowledge accumulated about salmonoid retroposons by now, as well as to complement it with the new data pertaining to the distribution of some SINE families.
Collapse
Affiliation(s)
- Vitaliy Matveev
- Faculty of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan
| | | |
Collapse
|
30
|
Core-SINE blocks comprise a large fraction of monotreme genomes; implications for vertebrate chromosome evolution. Chromosome Res 2008; 15:975-84. [DOI: 10.1007/s10577-007-1187-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2007] [Revised: 10/21/2007] [Accepted: 10/21/2007] [Indexed: 10/22/2022]
|
31
|
Santangelo AM, de Souza FSJ, Franchini LF, Bumaschny VF, Low MJ, Rubinstein M. Ancient exaptation of a CORE-SINE retroposon into a highly conserved mammalian neuronal enhancer of the proopiomelanocortin gene. PLoS Genet 2007; 3:1813-26. [PMID: 17922573 PMCID: PMC2000970 DOI: 10.1371/journal.pgen.0030166] [Citation(s) in RCA: 103] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2007] [Accepted: 08/15/2007] [Indexed: 02/01/2023] Open
Abstract
The proopiomelanocortin gene (POMC) is expressed in the pituitary gland and the ventral hypothalamus of all jawed vertebrates, producing several bioactive peptides that function as peripheral hormones or central neuropeptides, respectively. We have recently determined that mouse and human POMC expression in the hypothalamus is conferred by the action of two 5′ distal and unrelated enhancers, nPE1 and nPE2. To investigate the evolutionary origin of the neuronal enhancer nPE2, we searched available vertebrate genome databases and determined that nPE2 is a highly conserved element in placentals, marsupials, and monotremes, whereas it is absent in nonmammalian vertebrates. Following an in silico paleogenomic strategy based on genome-wide searches for paralog sequences, we discovered that opossum and wallaby nPE2 sequences are highly similar to members of the superfamily of CORE-short interspersed nucleotide element (SINE) retroposons, in particular to MAR1 retroposons that are widely present in marsupial genomes. Thus, the neuronal enhancer nPE2 originated from the exaptation of a CORE-SINE retroposon in the lineage leading to mammals and remained under purifying selection in all mammalian orders for the last 170 million years. Expression studies performed in transgenic mice showed that two nonadjacent nPE2 subregions are essential to drive reporter gene expression into POMC hypothalamic neurons, providing the first functional example of an exapted enhancer derived from an ancient CORE-SINE retroposon. In addition, we found that this CORE-SINE family of retroposons is likely to still be active in American and Australian marsupial genomes and that several highly conserved exonic, intronic and intergenic sequences in the human genome originated from the exaptation of CORE-SINE retroposons. Together, our results provide clear evidence of the functional novelties that transposed elements contributed to their host genomes throughout evolution. One of the most striking observations derived from the genomic era is the overwhelming contribution of transposed elements to mammalian genomes. For example, 45% of the human genome is derived from mobile element fragments. Although historically viewed as “junk DNA,” transposed elements could also contribute to novel advantageous functional elements in their host genomes, a process called exaptation. Functionally proven examples of exaptation derived from ancient retroposition events are rare. Using an in silico paleogenomic strategy, we unraveled the evolutionary origin of nPE2, a neuronal enhancer of the proopiomelancortin gene that participates in the production of hypothalamic peptides involved in feeding behavior and stress-induced analgesia. We demonstrate that nPE2 originated from the exaptation of a SINE retroposon in the lineage leading to mammals and remained under purifying selection for the last 170 million years. The difficulty in detecting nPE2 origin as an exapted retroposon illustrates the underestimation of this phenomenon and encourages the finding of the many thousands of retroposon-derived functional elements still hidden within the genomes. Their discovery will contribute to a better understanding of the dynamics of gene evolution and, at a larger scale, the origin of macroevolutionary novelties that lead to the appearance of new species, orders, or classes.
Collapse
Affiliation(s)
- Andrea M Santangelo
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Flávio S. J de Souza
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Lucía F Franchini
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Viviana F Bumaschny
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Malcolm J Low
- Center for the Study of Weight Regulation and Associated Disorders, Portland, Oregon, United States of America
- Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, Oregon, United States of America
| | - Marcelo Rubinstein
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular, Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
- Center for the Study of Weight Regulation and Associated Disorders, Portland, Oregon, United States of America
- Departmento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
- Centro de Estudios Científicos, Valdivia, Chile
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
32
|
Munemasa M, Nikaido M, Nishihara H, Donnellan S, Austin CC, Okada N. Newly discovered young CORE-SINEs in marsupial genomes. Gene 2007; 407:176-85. [PMID: 17988807 DOI: 10.1016/j.gene.2007.10.008] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2007] [Revised: 10/02/2007] [Accepted: 10/04/2007] [Indexed: 02/04/2023]
Abstract
Although recent mammalian genome projects have uncovered a large part of genomic component of various groups, several repetitive sequences still remain to be characterized and classified for particular groups. The short interspersed repetitive elements (SINEs) distributed among marsupial genomes are one example. We have identified and characterized two new SINEs from marsupial genomes that belong to the CORE-SINE family, characterized by a highly conserved "CORE" domain. PCR and genomic dot blot analyses revealed that the distribution of each SINE shows distinct patterns among the marsupial genomes, implying different timing of their retroposition during the evolution of marsupials. The members of Mar3 (Marsupialia 3) SINE are distributed throughout the genomes of all marsupials, whereas the Mac1 (Macropodoidea 1) SINE is distributed specifically in the genomes of kangaroos. Sequence alignment of the Mar3 SINEs revealed that they can be further divided into four subgroups, each of which has diagnostic nucleotides. The insertion patterns of each SINE at particular genomic loci, together with the distribution patterns of each SINE, suggest that the Mar3 SINEs have intensively amplified after the radiation of diprotodontians, whereas the Mac1 SINE has amplified only slightly after the divergence of hypsiprimnodons from other macropods. By compiling the information of CORE-SINEs characterized to date, we propose a comprehensive picture of how SINE evolution occurred in the genomes of marsupials.
Collapse
Affiliation(s)
- Maruo Munemasa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B21 Nagatsuta-cho, Midori-ku, Yokohama, Japan
| | | | | | | | | | | |
Collapse
|
33
|
Gogolevsky KP, Vassetzky NS, Kramerov DA. Bov-B-mobilized SINEs in vertebrate genomes. Gene 2007; 407:75-85. [PMID: 17976929 DOI: 10.1016/j.gene.2007.09.021] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2007] [Revised: 09/27/2007] [Accepted: 09/27/2007] [Indexed: 11/26/2022]
Abstract
Two new short retroposon families (SINEs) have been found in the genome of springhare Pedetes capensis (Rodentia). One of them, Ped-1, originated from 5S rRNA, while the other one, Ped-2, originated from tRNA-derived SINE ID. In contrast to most currently active mammalian SINEs mobilized by L1 long retrotransposon (LINE), Ped-1 and Ped-2 are mobilized by Bov-B, a LINE family of the widely distributed RTE clade. The 3' part of these SINEs originates from two sequences in the 5' and 3' regions of Bov-B. Such bipartite structure of the LINE-derived part has been revealed in all Bov-B-mobilized SINEs known to date (AfroSINE, Bov-tA, Mar-1, and Ped-1/2), which distinguishes them from other SINEs with only a 3' LINE-derived part. Structural analysis and the distribution of Bov-B LINEs and partner SINEs supports the horizontal transfer of Bov-B, while the SINEs emerged independently in lineages with this LINE.
Collapse
Affiliation(s)
- Konstantin P Gogolevsky
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov Street, Moscow, Russia
| | | | | |
Collapse
|
34
|
Grechko VV, Bannikova AA, Kosushkin SA, Ryabinina NL, Milto KD, Darevsky IS, Kramerov DA. Molecular genetic diversification of the lizard complex Darevskia raddei (Sauria: Lacertidae): Early stages of speciation. Mol Biol 2007. [DOI: 10.1134/s0026893307050093] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
35
|
Matveev V, Nishihara H, Okada N. Novel SINE families from salmons validate Parahucho (Salmonidae) as a distinct genus and give evidence that SINEs can incorporate LINE-related 3'-tails of other SINEs. Mol Biol Evol 2007; 24:1656-66. [PMID: 17470437 DOI: 10.1093/molbev/msm083] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Short interspersed elements (SINEs) constitute a group of retroposons propagating in the genome via a mechanism of reverse transcription, in which they depend on the enzymatic machinery of long retroposons (LINEs). Over 70 SINE families have been described to date from the genomes of various eukaryotes. Here, we characterize two novel SINEs from salmons (Actinopterygii: Salmonoidei). The first family, termed SlmI, was shown to be widespread among all genera of the suborder. These SINEs have a tRNA(Leu)-related promoter region at their 5'-end, a unique central conserved domain with a subfamily-specific region, and an end with RSg-1-LINE-derived 3'-terminus preceding the A/T-rich tail. The same LINE-related segment is also shared by two other salmonid SINEs: HpaI and OS-SINE1. The structural peculiarities and overall sequence identity of the SlmI 3'-terminus suggest that it has been acquired from HpaI SINEs but not directly from the partner LINE. This region plays a crucial role in the process of retrotransposition of short interspersed elements, and the case of its SINE-to-SINE transmission is the first recorded to date. Possible scenarios and potential evolutionary implications of the observed interaction between short retroposons are discussed. Apart from the above, we found a copy of the SlmI SINE in the GenBank entry for the blood fluke, Schistosoma japonicum (Trematoda: Strigeiformes) -- a trematode causing one of the most important human helminth infections, with its genome known to host other groups of salmonoid retroposons. In the present article, we suggest our views with regard to possible ways in which such an intensive horizontal transfer of salmonoid retroposons to the schistosomal genome occurs. The second novel SINE family, termed SlmII, originates from one of the SlmI subfamilies, with which it shares the same tRNA-related region, central domain, and a part of RSg-1-derived segment, but has a different 3'-tail of unidentified origin. Its distribution among salmonids validates Parahucho (Japanese huchen) as a distinct monotypic genus.
Collapse
Affiliation(s)
- Vitaliy Matveev
- Faculty of Bioscience and Biotechnology, Department of Biological Sciences, Tokyo Institute of Technology, Yokohama, Japan
| | | | | |
Collapse
|
36
|
Sun FJ, Fleurdépine S, Bousquet-Antonelli C, Caetano-Anollés G, Deragon JM. Common evolutionary trends for SINE RNA structures. Trends Genet 2006; 23:26-33. [PMID: 17126948 DOI: 10.1016/j.tig.2006.11.005] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2006] [Revised: 10/10/2006] [Accepted: 11/10/2006] [Indexed: 10/23/2022]
Abstract
Short interspersed elements (SINEs) and long interspersed elements (LINEs) are transposable elements in eukaryotic genomes that mobilize through an RNA intermediate. Understanding their evolution is important because of their impact on the host genome. Most eukaryotic SINEs are ancestrally related to tRNA genes, although the typical tRNA cloverleaf structure is not apparent for most SINE consensus RNAs. Using a cladistic method where RNA structural components were coded as polarized and ordered multistate characters, we showed that related structural motifs are present in most SINE RNAs from mammals, fishes and plants, suggesting common selective constraints imposed at the SINE RNA structural level. Based on these results, we propose a general multistep model for the evolution of tRNA-related SINEs in eukaryotes.
Collapse
Affiliation(s)
- Feng-Jie Sun
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | | | | | | | | |
Collapse
|
37
|
El-Mogharbel N, Wakefield M, Deakin JE, Tsend-Ayush E, Grützner F, Alsop A, Ezaz T, Marshall Graves JA. DMRT gene cluster analysis in the platypus: new insights into genomic organization and regulatory regions. Genomics 2006; 89:10-21. [PMID: 16962738 DOI: 10.1016/j.ygeno.2006.07.017] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2006] [Revised: 07/31/2006] [Accepted: 07/31/2006] [Indexed: 10/24/2022]
Abstract
We isolated and characterized a cluster of platypus DMRT genes and compared their arrangement, location, and sequence across vertebrates. The DMRT gene cluster on human 9p24.3 harbors, in order, DMRT1, DMRT3, and DMRT2, which share a DM domain. DMRT1 is highly conserved and involved in sexual development in vertebrates, and deletions in this region cause sex reversal in humans. Sequence comparisons of DMRT genes between species have been valuable in identifying exons, control regions, and conserved nongenic regions (CNGs). The addition of platypus sequences is expected to be particularly valuable, since monotremes fill a gap in the vertebrate genome coverage. We therefore isolated and fully sequenced platypus BAC clones containing DMRT3 and DMRT2 as well as DMRT1 and then generated multispecies alignments and ran prediction programs followed by experimental verification to annotate this gene cluster. We found that the three genes have 58-66% identity to their human orthologues, lie in the same order as in other vertebrates, and colocate on 1 of the 10 platypus sex chromosomes, X5. We also predict that optimal annotation of the newly sequenced platypus genome will be challenging. The analysis of platypus sequence revealed differences in structure and sequence of the DMRT gene cluster. Multispecies comparison was particularly effective for detecting CNGs, revealing several novel potential regulatory regions within DMRT3 and DMRT2 as well as DMRT1. RT-PCR indicated that platypus DMRT1 and DMRT3 are expressed specifically in the adult testis (and not ovary), but DMRT2 has a wider expression profile, as it does for other mammals. The platypus DMRT1 expression pattern, and its location on an X chromosome, suggests an involvement in monotreme sexual development.
Collapse
Affiliation(s)
- Nisrine El-Mogharbel
- Comparative Genomics Group, Research School of Biological Sciences, Australian National University, P.O. Box 475, Canberra, ACT 2601, Australia.
| | | | | | | | | | | | | | | |
Collapse
|
38
|
Fawcett JA, Kawahara T, Watanabe H, Yasui Y. A SINE family widely distributed in the plant kingdom and its evolutionary history. PLANT MOLECULAR BIOLOGY 2006; 61:505-14. [PMID: 16830182 DOI: 10.1007/s11103-006-0026-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2005] [Accepted: 02/10/2006] [Indexed: 05/02/2023]
Abstract
The distribution and evolution of Au SINE in plants were examined. Au SINE is a short interspersed element first identified in Aegilops umbellulata, a close relative of wheat. The Au SINE was previously found in species such as wheat, maize, tobacco, and tomato, but not in rice. In this study, we first searched public databases, and next examined the presence of Au in a broad range of plant species by PCR using internal primers of Au. Although Au is likely to be absent from many species including rice, it was identified in many Gramineae, Solanaceae, and Fabaceae species, and also in a basal angiosperm species, Asimina triloba. Phylogenetic studies suggest that Au SINE originated before the divergence of monocots and eudicots. Au SINE sequences of Asimina, Triticum, Zea, Nicotiana, Lotus, Medicago, and Glycine were aligned and compared. Although sequences of Au were highly conserved among distantly related species, every Au element in Glycine had a 16 bp deletion and its 3' end differed from sequences of other species. This type of Au could only be found in G. max, and not in other species including other Fabaceae species such as M. truncatula and L. japonicus. This is the first report of a plant SINE family present in multiple lineages, and the evolution of Au SINE in the plant kingdom, especially in Gramineae and Fabaceae is discussed.
Collapse
Affiliation(s)
- Jeffrey A Fawcett
- Laboratory of Crop Evolution, Graduate School of Agriculture, Kyoto University, Kitashirakawa-Oiwakecho, Sakyoku, Kyoto 606-8502, Japan.
| | | | | | | |
Collapse
|
39
|
Bannikova AA, Bulatova NS, Kramerov DA. Molecular variability in the common shrew Sorex araneus L. from european russia and siberia inferred from the length polymorphism of DNA regions flanked by short interspersed elements (inter-SINE PCR) and the relationships between the moscow and seliger chromosome races. RUSS J GENET+ 2006. [DOI: 10.1134/s1022795406060020] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
40
|
Pratt-Hyatt MJ, Kapadia KM, Wilson TE, Engelke DR. Increased recombination between active tRNA genes. DNA Cell Biol 2006; 25:359-64. [PMID: 16792506 PMCID: PMC3756803 DOI: 10.1089/dna.2006.25.359] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Transfer RNA genes are distributed throughout eukaryotic genomes, and are frequently found as multicopy families. In Saccharomyces cerevisiae, tRNA gene transcription by RNA polymerase III suppresses nearby transcription by RNA polymerase II, partially because the tRNA genes are clustered near the nucleolus. We have tested whether active transcription of tRNA genes might also suppress recombination, since recombination between identical copies of the repetitive tRNA genes could delete intervening genes and be detrimental to survival. The opposite proved to be the case. Recombination between active tRNA genes was elevated, but only when both genes are transcribed. We also tested the effects of tRNA genes on recombination between the direct terminal repeats of a neighboring retrotransposon, since most Ty retrotransposons reside next to tRNA genes, and the selective advantage of this arrangement is not known.
Collapse
Affiliation(s)
- Matthew J Pratt-Hyatt
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, 48109-0606, USA
| | | | | | | |
Collapse
|
41
|
Nishihara H, Smit AF, Okada N. Functional noncoding sequences derived from SINEs in the mammalian genome. Genome Res 2006; 16:864-74. [PMID: 16717141 PMCID: PMC1484453 DOI: 10.1101/gr.5255506] [Citation(s) in RCA: 173] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Recent comparative analyses of mammalian sequences have revealed that a large number of nonprotein-coding genomic regions are under strong selective constraint. Here, we report that some of these loci have been derived from a newly defined family of ancient SINEs (short interspersed repetitive elements). This is a surprising result, as SINEs and other transposable elements are commonly thought to be genomic parasites. We named the ancient SINE family AmnSINE1, for Amniota SINE1, because we found it to be present in mammals as well as in birds, and some copies predate the mammalian-bird split 310 million years ago (Mya). AmnSINE1 has a chimeric structure of a 5S rRNA and a tRNA-derived SINE, and is related to five tRNA-derived SINE families that we characterized here in the coelacanth, dogfish shark, hagfish, and amphioxus genomes. All of the newly described SINE families have a common central domain that is also shared by zebrafish SINE3, and we collectively name them the DeuSINE (Deuterostomia SINE) superfamily. Notably, of the approximately 1000 still identifiable copies of AmnSINE1 in the human genome, 105 correspond to loci phylogenetically highly conserved among mammalian orthologs. The conservation is strongest over the central domain. Thus, AmnSINE1 appears to be the best example of a transposable element of which a significant fraction of the copies have acquired genomic functionality.
Collapse
Affiliation(s)
- Hidenori Nishihara
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan
| | - Arian F.A. Smit
- Institute for Systems Biology, Seattle, Washington 98103, USA
| | - Norihiro Okada
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan
- Corresponding author.E-mail ; fax 81-45-924-5835
| |
Collapse
|
42
|
Bannikova AA, Lebedev VS, Kramerov DA, Zaitsev MV. Phylogeny and systematics of the Crocidura suaveolens species group: corroboration and controversy between nuclear and mitochondrial DNA markers / Phylogénie et systématique du groupe d'espèces Crocidura suaveolens: coordination et contradiction des marqueurs nucléaire et mitochondriaux de l'ADN. MAMMALIA 2006. [DOI: 10.1515/mamm.2006.011] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractDespite obvious advances in systematic research on Palaearctic white-toothed shrews ( Crocidura ), phylogenetic relationships and species diagnosis of 40-chromosome species ( suaveolens sp. group) remain poorly understood. Phylogenetic relationships of these shrews were analyzed on the basis of two independent molecular markers: interspersed repeat PCR fingerprints (inter-SINE-PCR) and complete (1140 bp) or partial (∼400 bp) sequences of the mtDNA cyt b gene. According to these data, C. suaveolens from Western Europe (Italy) appeared distinct from samples of C. suaveolens from Eastern Europe and Mongolia, as well as a Siberian sample. mtDNA introgression of Eastern European C. suaveolens with C. gueldenstaedtii in their contact zone in the Tuapse region was revealed. Hybrydization between C. gueldenstaedtii and C. suaveolens resulted in the formation of a population, nuclear DNA and morphological characteristics typical for C. gueldenstaedtii , while the mitochondrial genome is assimilated from C. suaveolens . The population of the Talysh region of the Caucasus ( C. caspica ) represents a separate entity that is clearly distinguished from the populations of Georgia and Tuapse ( C. gueldenstaedtii ) and C. suaveolens . Therefore, the position of C. caspica as a full species is supported. The present analysis of both inter-SINE-PCR and cyt b sequence data revealed two major clades in Palaearctic 40-chromosome Crocidura . The eastern clade is formed by true C. suaveolens/C. sibirica , together with C. caspica , and the western clade is formed by Western European C. suaveolens , which should be treated as a distinct species, C. mimula and the closely related C. gueldenstaedtii.
Collapse
|
43
|
Samollow PB. Status and applications of genomic resources for the gray, short-tailed opossum, Monodelphis domestica, an American marsupial model for comparative biology. AUST J ZOOL 2006. [DOI: 10.1071/zo05059] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Owing to its small size, favourable reproductive characteristics, and simple husbandry, the gray, short-tailed opossum, Monodelphis domestica, has become the most widely distributed and intensively utilised laboratory-bred research marsupial in the world today. This article provides an overview of the current state and future projections of genomic resources for this species and discusses the potential impact of this growing resource base on active research areas that use M. domestica as a model system. The resources discussed include: fully arrayed, bacterial artificial chromosome (BAC) libraries; an expanding linkage map; developing full-genome BAC-contig and chromosomal fluorescence in situ hybridisation maps; public websites providing access to the M. domestica whole-genome-shotgun sequence trace database and the whole-genome sequence assembly; and a new project underway to create an expressed-sequence database and microchip expression arrays for functional genomics applications. Major research areas discussed span a variety of genetic, evolutionary, physiologic, reproductive, developmental, and behavioural topics, including: comparative immunogenetics; genomic imprinting; reproductive biology; neurobiology; photobiology and carcinogenesis; genetics of lipoprotein metabolism; developmental and behavioural endocrinology; sexual differentiation and development; embryonic and fetal development; meiotic recombination; genome evolution; molecular evolution and phylogenetics; and more.
Collapse
|
44
|
Sironi M, Menozzi G, Comi GP, Cereda M, Cagliani R, Bresolin N, Pozzoli U. Gene function and expression level influence the insertion/fixation dynamics of distinct transposon families in mammalian introns. Genome Biol 2006; 7:R120. [PMID: 17181857 PMCID: PMC1794433 DOI: 10.1186/gb-2006-7-12-r120] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2006] [Revised: 10/25/2006] [Accepted: 12/20/2006] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Transposable elements (TEs) represent more than 45% of the human and mouse genomes. Both parasitic and mutualistic features have been shown to apply to the host-TE relationship but a comprehensive scenario of the forces driving TE fixation within mammalian genes is still missing. RESULTS We show that intronic multispecies conserved sequences (MCSs) have been affecting TE integration frequency over time. We verify that a selective economizing pressure has been acting on TEs to decrease their frequency in highly expressed genes. After correcting for GC content, MCS density and intron size, we identified TE-enriched and TE-depleted gene categories. In addition to developmental regulators and transcription factors, TE-depleted regions encompass loci that might require subtle regulation of transcript levels or precise activation timing, such as growth factors, cytokines, hormones, and genes involved in the immune response. The latter, despite having reduced frequencies of most TE types, are significantly enriched in mammalian-wide interspersed repeats (MIRs). Analysis of orthologous genes indicated that MIR over-representation also occurs in dog and opossum immune response genes, suggesting, given the partially independent origin of MIR sequences in eutheria and metatheria, the evolutionary conservation of a specific function for MIRs located in these loci. Consistently, the core MIR sequence is over-represented in defense response genes compared to the background intronic frequency. CONCLUSION Our data indicate that gene function, expression level, and sequence conservation influence TE insertion/fixation in mammalian introns. Moreover, we provide the first report showing that a specific TE family is evolutionarily associated with a gene function category.
Collapse
Affiliation(s)
- Manuela Sironi
- Scientific Institute IRCCS E Medea, Bioinformatic Lab, Via don L Monza, 23842 Bosisio Parini (LC), Italy
| | - Giorgia Menozzi
- Scientific Institute IRCCS E Medea, Bioinformatic Lab, Via don L Monza, 23842 Bosisio Parini (LC), Italy
| | - Giacomo P Comi
- Dino Ferrari Centre, Department of Neurological Sciences, University of Milan, IRCCS Ospedale Maggiore Policlinico, Mangiagalli and Regina Elena Foundation, 20100 Milan, Italy
| | - Matteo Cereda
- Scientific Institute IRCCS E Medea, Bioinformatic Lab, Via don L Monza, 23842 Bosisio Parini (LC), Italy
| | - Rachele Cagliani
- Scientific Institute IRCCS E Medea, Bioinformatic Lab, Via don L Monza, 23842 Bosisio Parini (LC), Italy
| | - Nereo Bresolin
- Scientific Institute IRCCS E Medea, Bioinformatic Lab, Via don L Monza, 23842 Bosisio Parini (LC), Italy
- Dino Ferrari Centre, Department of Neurological Sciences, University of Milan, IRCCS Ospedale Maggiore Policlinico, Mangiagalli and Regina Elena Foundation, 20100 Milan, Italy
| | - Uberto Pozzoli
- Scientific Institute IRCCS E Medea, Bioinformatic Lab, Via don L Monza, 23842 Bosisio Parini (LC), Italy
| |
Collapse
|
45
|
Lenoir A, Pélissier T, Bousquet-Antonelli C, Deragon JM. Comparative evolution history of SINEs in Arabidopsis thaliana and Brassica oleracea: evidence for a high rate of SINE loss. Cytogenet Genome Res 2005; 110:441-7. [PMID: 16093696 DOI: 10.1159/000084976] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2003] [Accepted: 10/16/2003] [Indexed: 11/19/2022] Open
Abstract
Brassica oleracea and Arabidopsis thaliana belong to the Brassicaceae(Cruciferae) family and diverged 16 to 19 million years ago. Although the genome size of B. oleracea (approximately 600 million base pairs) is more than four times that of A. thaliana (approximately 130 million base pairs), their gene content is believed to be very similar with more than 85% sequence identity in the coding region. Therefore, this important difference in genome size is likely to reflect a different rate of non-coding DNA accumulation. Transposable elements (TEs) constitute a major fraction of non-coding DNA in plant species. A different rate in TE accumulation between two closely related species can result in significant genome size variations in a short evolutionary period. Short interspersed elements (SINEs) are non-autonomous retroposons that have invaded the genome of most eukaryote species. Several SINE families are present in B. oleracea and A. thaliana and we found that two of them (called RathE1 and RathE2) are present in both species. In this study, the tempo of evolution of RathE1 and RathE2 SINE families in both species was compared. We observed that most B. oleracea RathE2 SINEs are "young" (close to the consensus sequence) and abundant while elements from this family are more degenerated and much less abundant in A. thaliana. However, the situation is different for the RathE1 SINE family for which the youngest elements are found in A. thaliana. Surprisingly, no SINE was found to occupy the same (orthologous) genomic locus in both species suggesting that either these SINE families were not amplified at a significant rate in the common ancestor of the two species or that older elements were lost and only the recent (lineage-specific) insertions remain. To test this latter hypothesis, loci containing a recently inserted SINE in the A. thaliana col-0 ecotype were selected and characterized in several other A. thaliana ecotypes. In addition to the expected SINE containing allele and the pre-integrative allele (i.e. the "empty" allele), we observed in the different ecotypes, alleles with truncated portions of the SINE (up to the complete loss of the element) and of the immediate genomic flanking sequences. The absence of SINEs in orthologous positions between B. oleracea and A. thaliana and the presence in recently diverged A. thaliana ecotypes of alleles containing severely truncated SINEs suggest a very high rate of SINE loss in these species.
Collapse
Affiliation(s)
- A Lenoir
- CNRS UMR6547 Biomove, Université Blaise Pascal, Aubière, France
| | | | | | | |
Collapse
|
46
|
Ohshima K, Okada N. SINEs and LINEs: symbionts of eukaryotic genomes with a common tail. Cytogenet Genome Res 2005; 110:475-90. [PMID: 16093701 DOI: 10.1159/000084981] [Citation(s) in RCA: 120] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2004] [Accepted: 04/27/2004] [Indexed: 01/26/2023] Open
Abstract
Many SINEs and LINEs have been characterized to date, and examples of the SINE and LINE pair that have the same 3' end sequence have also increased. We report the phylogenetic relationships of nearly all known LINEs from which SINEs are derived, including a new example of a SINE/LINE pair identified in the salmon genome. We also use several biological examples to discuss the impact and significance of SINEs and LINEs in the evolution of vertebrate genomes.
Collapse
Affiliation(s)
- K Ohshima
- School and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan.
| | | |
Collapse
|
47
|
Yu L, Zhang YP. Evolutionary implications of multiple SINE insertions in an intronic region from diverse mammals. Mamm Genome 2005; 16:651-60. [PMID: 16245022 DOI: 10.1007/s00335-004-2456-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2004] [Accepted: 05/20/2005] [Indexed: 10/25/2022]
Abstract
An analysis of the nuclear beta-fibrinogen intron 7 locus from 30 taxa representing 12 placental orders of mammals reveals the enriched occurrences of short interspersed element (SINE) insertion events. Mammalian-wide interspersed repeats (MIRs) are present at orthologous sites of all examined species except those in the order Rodentia. The higher substitution rate in mouse and a rare MIR deletion from rat account for the absence of MIR in the rodents. A minimum of five lineage-specific SINE sequences are also found to have independently inserted into this intron in Carnivora, Artiodactyla and Lagomorpha. In the case of Carnivora, the unique amplification pattern of order-specific CAN SINE provides important evidence for the "pan-carnivore" hypothesis of this repeat element and reveals that the CAN SINE family may still be active today. Particularly interesting is the finding that all identified lineage-specific SINE elements show a strong tendency to insert within or in very close proximity to the preexisting MIRs for their efficient integrations, suggesting that the MIR element is a hot spot for successive insertions of other SINEs. The unexpected MIR excision as a result of a random deletion in the rat intron locus and the non-random site targeting detected by this study indicate that SINEs actually have a greater insertional flexibility and regional specificity than had previously been recognized. Implications for SINE sequence evolution upon and following integration, as well as the fascinating interactions between retroposons and the host genomes are discussed.
Collapse
Affiliation(s)
- Li Yu
- Laboratory of Molecular Biology of Domestic Animals, and Cellular and Molecular Evolution, Kunming Institute of Zoology, Kunming, 650223, China
| | | |
Collapse
|
48
|
Abstract
A recent landmark paper demonstrates the unique contribution of marsupials and monotremes to comparative genome analysis, filling an evolutionary gap between the eutherian mammals and more distant vertebrate species. A recent landmark paper demonstrates the unique contribution of marsupials and monotremes to comparative genome analysis, filling an evolutionary gap between the eutherian mammals (including humans) and more distant vertebrate species.
Collapse
Affiliation(s)
- Matthew J Wakefield
- Division of Immunology and Genetics, John Curtin School of Medical Research, The Australian National University, Canberra 0200, Australia
| | | |
Collapse
|
49
|
Margulies EH, Maduro VVB, Thomas PJ, Tomkins JP, Amemiya CT, Luo M, Green ED. Comparative sequencing provides insights about the structure and conservation of marsupial and monotreme genomes. Proc Natl Acad Sci U S A 2005; 102:3354-9. [PMID: 15718282 PMCID: PMC549084 DOI: 10.1073/pnas.0408539102] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2004] [Indexed: 11/18/2022] Open
Abstract
Sequencing and comparative analyses of genomes from multiple vertebrates are providing insights about the genetic basis for biological diversity. To date, these efforts largely have focused on eutherian mammals, chicken, and fish. In this article, we describe the generation and study of genomic sequences from noneutherian mammals, a group of species occupying unusual phylogenetic positions. A large sequence data set (totaling >5 Mb) was generated for the same orthologous region in three marsupial (North American opossum, South American opossum, and Australian tammar wallaby) and one monotreme (platypus) genomes. These ancient mammalian genomes are characterized by unusual architectural features with respect to G + C and repeat content, as well as compression relative to human. Approximately 14% and 34% of the human sequence forms alignments with the orthologous sequence from platypus and the marsupials, respectively; these numbers are distinctly lower than that observed with nonprimate eutherian mammals (45-70%). The alignable sequences between human and each marsupial species are not completely overlapping (only 80% common to all three species) nor are the platypus-alignable sequences completely contained within the marsupial-alignable sequences. Phylogenetic analysis of synonymous coding positions reveals that platypus has a notably long branch length, with the human-platypus substitution rate being on average 55% greater than that seen with human-marsupial pairs. Finally, analyses of the major mammalian lineages reveal distinct patterns with respect to the common presence of evolutionarily conserved vertebrate sequences. Our results confirm that genomic sequence from noneutherian mammals can contribute uniquely to unraveling the functional and evolutionary histories of the mammalian genome.
Collapse
Affiliation(s)
- Elliott H Margulies
- Genome Technology Branch and NISC, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | | | | | | | | | |
Collapse
|
50
|
Phylogenetic relationships between Afrotropical and Palaearctic Crocidura species inferred from Inter-SINE-PCR. BIOCHEM SYST ECOL 2005. [DOI: 10.1016/j.bse.2004.05.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|