1
|
Kojima KK. Diversity and Evolution of DNA Transposons Targeting Multicopy Small RNA Genes from Actinopterygian Fish. BIOLOGY 2022; 11:biology11020166. [PMID: 35205033 PMCID: PMC8869645 DOI: 10.3390/biology11020166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 01/14/2022] [Accepted: 01/18/2022] [Indexed: 11/16/2022]
Abstract
Simple Summary DNA transposons are parasitic DNA segments that can move or duplicate themselves from one site to another in the genome. Dada is a unique group of DNA transposons, which specifically insert themselves into multicopy RNA genes such as transfer RNA (tRNA) genes or small nuclear RNA (snRNA) genes to avoid the disruption of single-copy functional genes. However, only a few Dada families have been characterized along with their target sequences. Here, vertebrate genomes were surveyed to characterize new Dada transposons, and over 120 Dada families were characterized from diverse fishes. They were classified into 12 groups with confirmed target specificities. Various tRNA genes, as well as 5S ribosomal RNA (rRNA) genes were inserted by Dada transposons. Phylogenetic analysis revealed that Dada transposons inserted in the same RNA genes are closely related. Phylogenetically related Dada transposons inserted in different RNA genes show the sequence similarity around their insertion sites, indicating Dada proteins recognize DNA nucleotide sequences to find their targets. Understanding how Dada discovers the targets would help develop target-specific insertions of foreign DNA segments. Abstract Dada is a unique superfamily of DNA transposons, inserted specifically in multicopy RNA genes. The zebrafish genome harbors five families of Dada transposons, whose targets are U6 and U1 snRNA genes, and tRNA-Ala and tRNA-Leu genes. Dada-U6, which is inserted specifically in U6 snRNA genes, is found in four animal phyla, but other target-specific lineages have been reported only from one or two species. Here, vertebrate genomes and transcriptomes were surveyed to characterize Dada families with new target specificities, and over 120 Dada families were characterized from the genomes of actinopterygian fish. They were classified into 12 groups with confirmed target specificities. Newly characterized Dada families target tRNA genes for Asp, Asn, Arg, Gly, Lys, Ser, Tyr, and Val, and 5S rRNA genes. Targeted positions inside of tRNA genes are concentrated in two regions: around the anticodon and the A box of RNA polymerase III promoter. Phylogenetic analysis revealed the relationships among actinopterygian Dada families, and one domestication event in the common ancestor of carps and minnows belonging to Cyprinoidei, Cypriniformes. Sequences targeted by phylogenetically related Dada families show sequence similarities, indicating that the target specificity of Dada is accomplished through the recognition of primary nucleotide sequences.
Collapse
Affiliation(s)
- Kenji K Kojima
- Genetic Information Research Institute, Cupertino, CA 95014, USA
| |
Collapse
|
2
|
R2 and Non-Site-Specific R2-Like Retrotransposons of the German Cockroach, Blattella germanica. Genes (Basel) 2020; 11:genes11101202. [PMID: 33076367 PMCID: PMC7650587 DOI: 10.3390/genes11101202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 10/10/2020] [Accepted: 10/12/2020] [Indexed: 11/17/2022] Open
Abstract
The structural and functional organization of the ribosomal RNA gene cluster and the full-length R2 non-LTR retrotransposon (integrated into a specific site of 28S ribosomal RNA genes) of the German cockroach, Blattella germanica, is described. A partial sequence of the R2 retrotransposon of the cockroach Rhyparobia maderae is also analyzed. The analysis of previously published next-generation sequencing data from the B. germanica genome reveals a new type of retrotransposon closely related to R2 retrotransposons but with a random distribution in the genome. Phylogenetic analysis reveals that these newly described retrotransposons form a separate clade. It is shown that proteins corresponding to the open reading frames of newly described retrotransposons exhibit unequal structural domains. Within these retrotransposons, a recombination event is described. New mechanism of transposition activity is discussed. The essential structural features of R2 retrotransposons are conserved in cockroaches and are typical of previously described R2 retrotransposons. However, the investigation of the number and frequency of 5′-truncated R2 retrotransposon insertion variants in eight B. germanica populations suggests recent mobile element activity. It is shown that the pattern of 5′-truncated R2 retrotransposon copies can be an informative molecular genetic marker for revealing genetic distances between insect populations.
Collapse
|
3
|
Kojima KK. Structural and sequence diversity of eukaryotic transposable elements. Genes Genet Syst 2019; 94:233-252. [DOI: 10.1266/ggs.18-00024] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Affiliation(s)
- Kenji K. Kojima
- Genetic Information Research Institute
- Department of Life Sciences, National Cheng Kung University
| |
Collapse
|
4
|
Kim DH, Lee BY, Kim HS, Jeong CB, Hwang DS, Kim IC, Lee JS. Identification and characterization of homeobox (Hox) genes and conservation of the single Hox cluster (324.6 kb) in the water flea Daphnia magna. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2018; 330:76-82. [PMID: 29441720 DOI: 10.1002/jez.b.22793] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2017] [Revised: 01/19/2018] [Accepted: 01/25/2018] [Indexed: 11/07/2022]
Abstract
We report the complete sequence analysis of the entire complement of eight typical homeobox (Hox) genes (Lab, Pb, Dfd, Scr, Antp, Ubx, Abd-A, and Abd-B) and two other genes (Hox3 and Ftz) in a 324.6-kb region in the water flea Daphnia magna. In the cluster of D. magna Hox genes, we found one long interspersed nuclear element (LINE)/R2-NeSL between Ubx and Abd-A that was not present in Daphnia pulex Hox genes. In basal expression of Hox genes at different developmental stages, biothorax complex genes (Ubx, Abd-A, and Abd-B) and some antennapedia complex genes (Lab, Scr, Antp) were moderately expressed, but the Hox3 gene was barely expressed. Three homeobox genes (Antp, Ubx, Abd-A) were highly expressed at 6-7 days after release from the brood chamber and/or in the adult stage. The structural array and transcribed orientation of Dm-Hox genes were identical to those of the sister species D. pulex (∼340 kb), indicating that the Hox gene structure in daphnids is highly conserved. However, Dm- and Dp-Hox3, -deformed (Dfd), and -fushi tarazu (Ftz) genes varied from orthologous genes in pancrustacean species.
Collapse
Affiliation(s)
- Duck-Hyun Kim
- Department of Biological Science, College of Science, Sungkyunkwan University, Suwon, South Korea
| | - Bo-Young Lee
- Department of Biological Science, College of Science, Sungkyunkwan University, Suwon, South Korea
| | - Hui-Su Kim
- Department of Biological Science, College of Science, Sungkyunkwan University, Suwon, South Korea
| | - Chang-Bum Jeong
- Department of Biological Science, College of Science, Sungkyunkwan University, Suwon, South Korea
| | - Dae-Sik Hwang
- Department of Biological Science, College of Science, Sungkyunkwan University, Suwon, South Korea
| | - Il-Chan Kim
- Division of Polar Life Sciences, Korea Polar Research Institute, Incheon, South Korea
| | - Jae-Seong Lee
- Department of Biological Science, College of Science, Sungkyunkwan University, Suwon, South Korea
| |
Collapse
|
5
|
Govindaraju A, Cortez JD, Reveal B, Christensen SM. Endonuclease domain of non-LTR retrotransposons: loss-of-function mutants and modeling of the R2Bm endonuclease. Nucleic Acids Res 2016; 44:3276-87. [PMID: 26961309 PMCID: PMC4838377 DOI: 10.1093/nar/gkw134] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2015] [Revised: 02/22/2016] [Accepted: 02/23/2016] [Indexed: 01/07/2023] Open
Abstract
Non-LTR retrotransposons are an important class of mobile elements that insert into host DNA by target-primed reverse transcription (TPRT). Non-LTR retrotransposons must bind to their mRNA, recognize and cleave their target DNA, and perform TPRT at the site of DNA cleavage. As DNA binding and cleavage are such central parts of the integration reaction, a better understanding of the endonuclease encoded by non-LTR retrotransposons is needed. This paper explores the R2 endonuclease domain from Bombyx mori using in vitro studies and in silico modeling. Mutations in conserved sequences located across the putative PD-(D/E)XK endonuclease domain reduced DNA cleavage, DNA binding and TPRT. A mutation at the beginning of the first α-helix of the modeled endonuclease obliterated DNA cleavage and greatly reduced DNA binding. It also reduced TPRT when tested on pre-cleaved DNA substrates. The catalytic K was located to a non-canonical position within the second α-helix. A mutation located after the fourth β-strand reduced DNA binding and cleavage. The motifs that showed impaired activity form an extensive basic region. The R2 biochemical and structural data are compared and contrasted with that of two other well characterized PD-(D/E)XK endonucleases, restriction endonucleases and archaeal Holliday junction resolvases.
Collapse
Affiliation(s)
- Aruna Govindaraju
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| | - Jeremy D. Cortez
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| | - Brad Reveal
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| | - Shawn M. Christensen
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019-0498, USA
| |
Collapse
|
6
|
Abstract
Although most of non-long terminal repeat (non-LTR) retrotransposons are incorporated in the host genome almost randomly, some non-LTR retrotransposons are incorporated into specific sequences within a target site. On the basis of structural and phylogenetic features, non-LTR retrotransposons are classified into two large groups, restriction enzyme-like endonuclease (RLE)-encoding elements and apurinic/apyrimidinic endonuclease (APE)-encoding elements. All clades of RLE-encoding non-LTR retrotransposons include site-specific elements. However, only two of more than 20 APE-encoding clades, Tx1 and R1, contain site-specific non-LTR elements. Site-specific non-LTR retrotransposons usually target within multi-copy RNA genes, such as rRNA gene (rDNA) clusters, or repetitive genomic sequences, such as telomeric repeats; this behavior may be a symbiotic strategy to reduce the damage to the host genome. Site- and sequence-specificity are variable even among closely related non-LTR elements and appeared to have changed during evolution. In the APE-encoding elements, the primary determinant of the sequence- specific integration is APE itself, which nicks one strand of the target DNA during the initiation of target primed reverse transcription (TPRT). However, other factors, such as interaction between mRNA and the target DNA, and access to the target region in the nuclei also affect the sequence-specificity. In contrast, in the RLE-encoding elements, DNA-binding motifs appear to affect their sequence-specificity, rather than the RLE domain itself. Highly specific integration properties of these site-specific non-LTR elements make them ideal alternative tools for sequence-specific gene delivery, particularly for therapeutic purposes in human diseases.
Collapse
|
7
|
Abstract
R2 elements are sequence specific non-LTR retrotransposons that exclusively insert in the 28S rRNA genes of animals. R2s encode an endonuclease that cleaves the insertion site and a reverse transcriptase that uses the cleaved DNA to prime reverse transcription of the R2 transcript, a process termed target primed reverse transcription. Additional unusual properties of the reverse transcriptase as well as DNA and RNA binding domains of the R2 encoded protein have been characterized. R2 expression is through co-transcription with the 28S gene and self-cleavage by a ribozyme encoded at the R2 5' end. Studies in laboratory stocks and natural populations of Drosophila suggest that R2 expression is tied to the distribution of R2-inserted units within the rDNA locus. Most individuals have no R2 expression because only a small fraction of their rRNA genes need to be active, and a contiguous region of the locus free of R2 insertions can be selected for activation. However, if the R2-free region is not large enough to produce sufficient rRNA, flanking units - including those inserted with R2 - must be activated. Finally, R2 copies rapidly turnover within the rDNA locus, yet R2 has been vertically maintained in animal lineages for hundreds of millions of years. The key to this stability is R2's ability to remain dormant in rDNA units outside the transcribed regions for generations until the stochastic nature of the crossovers that drive the concerted evolution of the rDNA locus inevitably reshuffle the inserted and uninserted units, resulting in transcription of the R2-inserted units.
Collapse
|
8
|
Kojima KK, Jurka J. Ancient Origin of the U2 Small Nuclear RNA Gene-Targeting Non-LTR Retrotransposons Utopia. PLoS One 2015; 10:e0140084. [PMID: 26556480 PMCID: PMC4640811 DOI: 10.1371/journal.pone.0140084] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Accepted: 09/21/2015] [Indexed: 11/22/2022] Open
Abstract
Most non-long terminal repeat (non-LTR) retrotransposons encoding a restriction-like endonuclease show target-specific integration into repetitive sequences such as ribosomal RNA genes and microsatellites. However, only a few target-specific lineages of non-LTR retrotransposons are distributed widely and no lineage is found across the eukaryotic kingdoms. Here we report the most widely distributed lineage of target sequence-specific non-LTR retrotransposons, designated Utopia. Utopia is found in three supergroups of eukaryotes: Amoebozoa, SAR, and Opisthokonta. Utopia is inserted into a specific site of U2 small nuclear RNA genes with different strength of specificity for each family. Utopia families from oomycetes and wasps show strong target specificity while only a small number of Utopia copies from reptiles are flanked with U2 snRNA genes. Oomycete Utopia families contain an “archaeal” RNase H domain upstream of reverse transcriptase (RT), which likely originated from a plant RNase H gene. Analysis of Utopia from oomycetes indicates that multiple lineages of Utopia have been maintained inside of U2 genes with few copy numbers. Phylogenetic analysis of RT suggests the monophyly of Utopia, and it likely dates back to the early evolution of eukaryotes.
Collapse
Affiliation(s)
- Kenji K. Kojima
- Genetic Information Research Institute, Los Altos, California, United States of America
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Minato-ku, Tokyo, Japan
- Institute of Medical Science, University of Tokyo, Minato-ku, Tokyo, Japan
- * E-mail:
| | - Jerzy Jurka
- Genetic Information Research Institute, Los Altos, California, United States of America
| |
Collapse
|
9
|
Martoni F, Eickbush DG, Scavariello C, Luchetti A, Mantovani B. Dead element replicating: degenerate R2 element replication and rDNA genomic turnover in the Bacillus rossius stick insect (Insecta: Phasmida). PLoS One 2015; 10:e0121831. [PMID: 25799008 PMCID: PMC4370867 DOI: 10.1371/journal.pone.0121831] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2014] [Accepted: 02/04/2015] [Indexed: 11/18/2022] Open
Abstract
R2 is an extensively investigated non-LTR retrotransposon that specifically inserts into the 28S rRNA gene sequences of a wide range of metazoans, disrupting its functionality. During R2 integration, first strand synthesis can be incomplete so that 5’ end deleted copies are occasionally inserted. While active R2 copies repopulate the locus by retrotransposing, the non-functional truncated elements should frequently be eliminated by molecular drive processes leading to the concerted evolution of the rDNA array(s). Although, multiple R2 lineages have been discovered in the genome of many animals, the rDNA of the stick insect Bacillus rossius exhibits a peculiar situation: it harbors both a canonical, functional R2 element (R2Brfun) as well as a full-length but degenerate element (R2Brdeg). An intensive sequencing survey in the present study reveals that all truncated variants in stick insects are present in multiple copies suggesting they were duplicated by unequal recombination. Sequencing results also demonstrate that all R2Brdeg copies are full-length, i. e. they have no associated 5' end deletions, and functional assays indicate they have lost the active ribozyme necessary for R2 RNA maturation. Although it cannot be completely ruled out, it seems unlikely that the degenerate elements replicate via reverse transcription, exploiting the R2Brfun element enzymatic machinery, but rather via genomic amplification of inserted 28S by unequal recombination. That inactive copies (both R2Brdeg or 5'-truncated elements) are not eliminated in a short term in stick insects contrasts with findings for the Drosophila R2, suggesting a widely different management of rDNA loci and a lower efficiency of the molecular drive while achieving the concerted evolution.
Collapse
Affiliation(s)
- Francesco Martoni
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Danna G. Eickbush
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Claudia Scavariello
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | - Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
- * E-mail:
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| |
Collapse
|
10
|
Bonandin L, Scavariello C, Luchetti A, Mantovani B. Evolutionary dynamics of R2 retroelement and insertion inheritance in the genome of bisexual and parthenogenetic Bacillus rossius populations (Insecta Phasmida). INSECT MOLECULAR BIOLOGY 2014; 23:808-820. [PMID: 25134735 DOI: 10.1111/imb.12126] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Theoretical and empirical studies have shown differential management of transposable elements in organisms with different reproductive strategies. To investigate this issue, we analysed the R2 retroelement structure and variability in parthenogenetic and bisexual populations of Bacillus rossius stick insects, as well as insertions inheritance in the offspring of parthenogenetic isolates and of crosses. The B. rossius genome hosts a functional (R2Br(fun) ) and a degenerate (R2Br(deg) ) element, their presence correlating with neither reproductive strategies nor population distribution. The median-joining network method indicated that R2Br(fun) duplicates through a multiple source model, while R2Br(deg) is apparently still duplicating via a master gene model. Offspring analyses showed that unisexual and bisexual offspring have a similar number of R2Br-occupied sites. Multiple or recent shifts from gonochoric to parthenogenetic reproduction may explain the observed data. Moreover, insertion frequency spectra show that higher-frequency insertions in unisexual offspring significantly outnumber those in bisexual offspring. This suggests that unisexual offspring eliminate insertions with lower efficiency. A comparison with simulated insertion frequencies shows that inherited insertions in unisexual and bisexual offspring are significantly different from the expectation. On the whole, different mechanisms of R2 elimination in unisexual vs bisexual offspring and a complex interplay between recombination effectiveness, natural selection and time can explain the observed data.
Collapse
Affiliation(s)
- L Bonandin
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali, Università di Bologna, Bologna, Italy
| | | | | | | |
Collapse
|
11
|
Jiang J, Zhao L, Yan L, Zhang L, Cao Y, Wang Y, Jiang Y, Yan T, Cao Y. Structural features and mechanism of translocation of non-LTR retrotransposons in Candida albicans. Virulence 2013; 5:245-52. [PMID: 24317340 PMCID: PMC3956500 DOI: 10.4161/viru.27278] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
A number of abundant mobile genetic elements called retrotransposons reverse transcribe RNA to generate DNA for insertion into eukaryotic genomes. Non-long-terminal repeat (non-LTR) retrotransposons represent a major class of retrotransposons, and transposons that move by target-primed reverse transcription lack LTRs characteristic of retroviruses and retroviral-like transposons. Yeast model systems in Candida albicans and Saccharomyces cerevisiae have been developed for the study of non-LTR retrotransposons. Non-LTR retrotransposons are divided into LINEs (long interspersed nuclear elements), SINEs (short interspersed nuclear elements), and SVA (SINE, VNTR, and Alu). LINE-1 elements have been described in fungi, and several families called Zorro elements have been detected from C. albicans. They are all members of L1 clades. Through a mechanism named target-primed reverse transcription (TPRT), LINEs translocate the new copy into the target site to initiate DNA synthesis primed by the 3′ OH of the broken strand. In this article, we describe some advances in the research on structural features and origin of non-LTR retrotransposons in C. albicans, and discuss mechanisms underlying their reverse transcription and integration of the donor copy into the target site.
Collapse
Affiliation(s)
- Jingchen Jiang
- Department of Pharmacology; School of Pharmacy; China Pharmaceutical University; Nanjing, PR China
| | - Liuya Zhao
- R & D Center of New Drug; School of Pharmacy; Second Military Medical University; Shanghai, PR China
| | - Lan Yan
- R & D Center of New Drug; School of Pharmacy; Second Military Medical University; Shanghai, PR China
| | - Lulu Zhang
- R & D Center of New Drug; School of Pharmacy; Second Military Medical University; Shanghai, PR China
| | - Yingying Cao
- R & D Center of New Drug; School of Pharmacy; Second Military Medical University; Shanghai, PR China
| | - Yan Wang
- R & D Center of New Drug; School of Pharmacy; Second Military Medical University; Shanghai, PR China
| | - Yuanying Jiang
- R & D Center of New Drug; School of Pharmacy; Second Military Medical University; Shanghai, PR China
| | - Tianhua Yan
- Department of Pharmacology; School of Pharmacy; China Pharmaceutical University; Nanjing, PR China
| | - Yongbing Cao
- R & D Center of New Drug; School of Pharmacy; Second Military Medical University; Shanghai, PR China
| |
Collapse
|
12
|
Mukha DV, Pasyukova EG, Kapelinskaya TV, Kagramanova AS. Endonuclease domain of the Drosophila melanogaster R2 non-LTR retrotransposon and related retroelements: a new model for transposition. Front Genet 2013; 4:63. [PMID: 23637706 PMCID: PMC3636483 DOI: 10.3389/fgene.2013.00063] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2012] [Accepted: 04/05/2013] [Indexed: 01/25/2023] Open
Abstract
The molecular mechanisms of the transposition of non-long terminal repeat (non-LTR) retrotransposons are not well understood; the key questions of how the 3′-ends of cDNA copies integrate and how site-specific integration occurs remain unresolved. Integration depends on properties of the endonuclease (EN) domain of retrotransposons. Using the EN domain of the Drosophila R2 retrotransposon as a model for other, closely related non-LTR retrotransposons, we investigated the EN domain and found that it resembles archaeal Holliday-junction resolvases. We suggest that these non-LTR retrotransposons are co-transcribed with the host transcript. Combined with the proposed resolvase activity of the EN domain, this model yields a novel mechanism for site-specific retrotransposition within this class of retrotransposons, with resolution proceeding via a Holliday junction intermediate.
Collapse
Affiliation(s)
- Dmitry V Mukha
- Vavilov Institute of General Genetics, Russian Academy of Sciences Moscow, Russia
| | | | | | | |
Collapse
|
13
|
Shivram H, Cawley D, Christensen SM. Targeting novel sites: The N-terminal DNA binding domain of non-LTR retrotransposons is an adaptable module that is implicated in changing site specificities. Mob Genet Elements 2011; 1:169-178. [PMID: 22479684 PMCID: PMC3312299 DOI: 10.4161/mge.1.3.18453] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2011] [Revised: 10/16/2011] [Accepted: 10/17/2011] [Indexed: 02/07/2023] Open
Abstract
Restriction-like endonuclease (RLE) bearing non-LTR retrotransposons are site-specific elements that integrate into the genome through target primed reverse transcription (TPRT). RLE-bearing elements have been used as a model system for investigating non-LTR retrotransposon integration. R2 elements target a specific site in the 28S rDNA gene. We previously demonstrated that the two major sub-classes of R2 (R2-A and R2-D) target the R2 insertion site in an opposing manner with regard to the pairing of known DNA binding domains and bound sequences-indicating that the A- and D-clades represent independently derived modes of targeting that site. Elements have been discovered that group phylogenetically with R2 but do not target the canonical R2 site. Here we extend our earlier studies to show that a separate R2-A clade element, which targets a site other than the canonical R2 site, does so by using the N-terminal zinc fingers and Myb motifs. We further extend our targeting studies beyond R2 clade elements by investigating the ability of the N-terminal zinc fingers from the nematode NeSL-1 element to target its integration site. Our data are consistent with the use of an N-terminal DNA binding domain as one of the major targeting determinants used by RLE-bearing non-LTR retrotransposons to secure a protein subunit near the insertion site. This N-terminal DNA binding domain can undergo modifications, allowing the element to target novel sites. The binding orientation of the N-terminal domain relative to the insertion site is quite variable.
Collapse
Affiliation(s)
- Haridha Shivram
- Department of Biology; University of Texas at Arlington; Arlington, TX USA
| | | | | |
Collapse
|
14
|
Thompson BK, Christensen SM. Independently derived targeting of 28S rDNA by A- and D-clade R2 retrotransposons: Plasticity of integration mechanism. Mob Genet Elements 2011; 1:29-37. [PMID: 22016843 PMCID: PMC3190273 DOI: 10.4161/mge.1.1.16485] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2011] [Revised: 05/16/2011] [Accepted: 05/16/2011] [Indexed: 12/24/2022] Open
Abstract
Restriction-like endonuclease (RLE) bearing non-LTR retrotransposons are site-specific elements that integrate into the genome through a target primed reverse transcription mechanism (TPRT). R2 elements have been used as a model system for investigating non-LTR retrotransposon integration. We previously demonstrated that R2 retrotransposons require two subunits of the element-encoded multifunctional protein to integrate-one subunit bound upstream of the insertion site and one bound downstream. R2 elements have been phylogenetically categorized into four clades: R2-A, B, C and D, that diverged from a common ancestor more than 850 million years ago. All R2 elements target the same sequence within 28S rDNA. The amino-terminal domain of R2Bm, an R2-D clade element, contains a single zinc finger and a Myb motif that are responsible for binding R2 protein downstream of the insertion site. Target site recognition is of interest as it is the first step in the integration reaction and may help elucidate evolutionary history and integration mechanism. The amino-terminal domain of R2-A clade members contains three zinc fingers and a Myb motif. We show here that R2Lp, an R2-A clade member, uses its amino-terminal DNA binding motifs to bind upstream of the insertion site. Because the R2-A and R2-D clade elements recognize 28S rDNA differently, we conclude the A- and D-clades represent independent targeting events to the 28S site. Our results also indicate a certain plasticity of insertional mechanics exists between the two clades.
Collapse
Affiliation(s)
- Blaine K Thompson
- Department of Biology; University of Texas at Arlington; Arlington, TX USA
| | | |
Collapse
|
15
|
Kagramanova AS, Kapelinskaya TV, Korolev AL, Mukha DV. Domain organization of the ORF2 C-terminal region of the German cockroach retroposon R1. RUSS J GENET+ 2010. [DOI: 10.1134/s102279541008003x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
16
|
Rho M, Schaack S, Gao X, Kim S, Lynch M, Tang H. LTR retroelements in the genome of Daphnia pulex. BMC Genomics 2010; 11:425. [PMID: 20618961 PMCID: PMC2996953 DOI: 10.1186/1471-2164-11-425] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2009] [Accepted: 07/09/2010] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Long terminal repeat (LTR) retroelements represent a successful group of transposable elements (TEs) that have played an important role in shaping the structure of many eukaryotic genomes. Here, we present a genome-wide analysis of LTR retroelements in Daphnia pulex, a cyclical parthenogen and the first crustacean for which the whole genomic sequence is available. In addition, we analyze transcriptional data and perform transposon display assays of lab-reared lineages and natural isolates to identify potential influences on TE mobility and differences in LTR retroelements loads among individuals reproducing with and without sex. RESULTS We conducted a comprehensive de novo search for LTR retroelements and identified 333 intact LTR retroelements representing 142 families in the D. pulex genome. While nearly half of the identified LTR retroelements belong to the gypsy group, we also found copia (95), BEL/Pao (66) and DIRS (19) retroelements. Phylogenetic analysis of reverse transcriptase sequences showed that LTR retroelements in the D. pulex genome form many lineages distinct from known families, suggesting that the majority are novel. Our investigation of transcriptional activity of LTR retroelements using tiling array data obtained from three different experimental conditions found that 71 LTR retroelements are actively transcribed. Transposon display assays of mutation-accumulation lines showed evidence for putative somatic insertions for two DIRS retroelement families. Losses of presumably heterozygous insertions were observed in lineages in which selfing occurred, but never in asexuals, highlighting the potential impact of reproductive mode on TE abundance and distribution over time. The same two families were also assayed across natural isolates (both cyclical parthenogens and obligate asexuals) and there were more retroelements in populations capable of reproducing sexually for one of the two families assayed. CONCLUSIONS Given the importance of LTR retroelements activity in the evolution of other genomes, this comprehensive survey provides insight into the potential impact of LTR retroelements on the genome of D. pulex, a cyclically parthenogenetic microcrustacean that has served as an ecological model for over a century.
Collapse
Affiliation(s)
- Mina Rho
- School of Informatics and Computing, Indiana University, Bloomington, IN 47405, USA
| | | | | | | | | | | |
Collapse
|
17
|
R2 retrotransposons encode a self-cleaving ribozyme for processing from an rRNA cotranscript. Mol Cell Biol 2010; 30:3142-50. [PMID: 20421411 DOI: 10.1128/mcb.00300-10] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The non-long terminal repeat (non-LTR) retrotransposon R2 is inserted into the 28S rRNA genes of many animals. Expression of the element appears to be by cotranscription with the rRNA gene unit. We show here that processing of the rRNA cotranscript at the 5' end of the R2 element in Drosophila simulans is rapid and utilizes an unexpected mechanism. Using RNA synthesized in vitro, the 5' untranslated region of R2 was shown capable of rapid and efficient self-cleavage of the 28S-R2 cotranscript. The 5' end generated in vitro by the R2 ribozyme was at the position identical to that found for in vivo R2 transcripts. The RNA segment corresponding to the R2 ribozyme could be folded into a double pseudoknot structure similar to that of the hepatitis delta virus (HDV) ribozyme. Remarkably, 21 of the nucleotide positions in and around the active site of the HDV ribozyme were identical in R2. R2 elements from other Drosophila species were also shown to encode HDV-like ribozymes capable of self-cleavage. Tracing their sequence evolution in the Drosophila lineage suggests that the extensive similarity of the R2 ribozyme from D. simulans to that of HDV was a result of convergent evolution, not common descent.
Collapse
|
18
|
Rho M, Tang H. MGEScan-non-LTR: computational identification and classification of autonomous non-LTR retrotransposons in eukaryotic genomes. Nucleic Acids Res 2010; 37:e143. [PMID: 19762481 PMCID: PMC2790886 DOI: 10.1093/nar/gkp752] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Computational methods for genome-wide identification of mobile genetic elements (MGEs) have become increasingly necessary for both genome annotation and evolutionary studies. Non-long terminal repeat (non-LTR) retrotransposons are a class of MGEs that have been found in most eukaryotic genomes, sometimes in extremely high numbers. In this article, we present a computational tool, MGEScan-non-LTR, for the identification of non-LTR retrotransposons in genomic sequences, following a computational approach inspired by a generalized hidden Markov model (GHMM). Three different states represent two different protein domains and inter-domain linker regions encoded in the non-LTR retrotransposons, and their scores are evaluated by using profile hidden Markov models (for protein domains) and Gaussian Bayes classifiers (for linker regions), respectively. In order to classify the non-LTR retrotransposons into one of the 12 previously characterized clades using the same model, we defined separate states for different clades. MGEScan-non-LTR was tested on the genome sequences of four eukaryotic organisms, Drosophila melanogaster, Daphnia pulex, Ciona intestinalis and Strongylocentrotus purpuratus. For the D. melanogaster genome, MGEScan-non-LTR found all known 'full-length' elements and simultaneously classified them into the clades CR1, I, Jockey, LOA and R1. Notably, for the D. pulex genome, in which no non-LTR retrotransposon has been annotated, MGEScan-non-LTR found a significantly larger number of elements than did RepeatMasker, using the current version of the RepBase Update library. We also identified novel elements in the other two genomes, which have only been partially studied for non-LTR retrotransposons.
Collapse
Affiliation(s)
- Mina Rho
- School of Informatics and Computing, Indiana University, Bloomington, IN 47408, USA
| | | |
Collapse
|
19
|
Yadav VP, Mandal PK, Rao DN, Bhattacharya S. Characterization of the restriction enzyme-like endonuclease encoded by the Entamoeba histolytica non-long terminal repeat retrotransposon EhLINE1. FEBS J 2009; 276:7070-82. [DOI: 10.1111/j.1742-4658.2009.07419.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
20
|
|
21
|
Pritham EJ. Transposable elements and factors influencing their success in eukaryotes. J Hered 2009; 100:648-55. [PMID: 19666747 DOI: 10.1093/jhered/esp065] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Recent advances in genome sequencing have led to a vast accumulation of transposable element data. Consideration of the genome sequencing projects in a phylogenetic context reveals that despite the hundreds of eukaryotic genomes that have been sequenced, a strong bias in sampling exists. There is a general under-representation of unicellular eukaryotes and a dearth of genome projects in many branches of the eukaryotic phylogeny. Among sequenced genomes, great variation in genome size exists, however, little difference in the total number of cellular genes is observed. For many eukaryotes, the remaining genomic space is extremely dynamic and predominantly composed of a menagerie of populations of transposable elements. Given the dynamic nature of the genomic niche filled by transposable elements, it is evident that these elements have played an important role in genome evolution. The contribution of transposable elements to genome architecture and to the advent of genetic novelty is likely to be dependent, at least in part, on the transposition mechanism, diversity, number, and rate of turnover of transposable elements in the genome at any given time. The focus of this review is the discussion of some of the forces that act to shape transposable element diversity within and between genomes.
Collapse
Affiliation(s)
- Ellen J Pritham
- Department of Biology, University of Texas, Arlington, Arlington, TX 76019, USA.
| |
Collapse
|
22
|
Novikova O, Fet V, Blinov A. Non-LTR retrotransposons in fungi. Funct Integr Genomics 2008; 9:27-42. [DOI: 10.1007/s10142-008-0093-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2008] [Revised: 07/01/2008] [Accepted: 07/01/2008] [Indexed: 12/31/2022]
|
23
|
Patrick KL, Luz PM, Ruan JP, Shi H, Ullu E, Tschudi C. Genomic rearrangements and transcriptional analysis of the spliced leader-associated retrotransposon in RNA interference-deficient Trypanosoma brucei. Mol Microbiol 2007; 67:435-47. [PMID: 18067542 DOI: 10.1111/j.1365-2958.2007.06057.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The Trypanosoma brucei genome is colonized by the site-specific non-LTR retrotransposon SLACS, or spliced leader-associated conserved sequence, which integrates exclusively into the spliced leader (SL) RNA genes. Although there is evidence that the RNA interference (RNAi) machinery regulates SLACS transcript levels, we do not know whether RNAi deficiency affects the genomic stability of SLACS, nor do we understand the mechanism of SLACS transcription. Here, we report that prolonged culturing of RNAi-deficient T. brucei cells, but not wild-type cells, results in genomic rearrangements of SLACS. Furthermore, two populations of SLACS transcripts persist in RNAi-deficient cells: a full-length transcript of approximately 7 kb and a heterogeneous population of small SLACS transcripts ranging in size from 450 to 550 nt. We provide evidence that SLACS transcription initiates at the +1 of the interrupted SL RNA gene and proceeds into the 5' UTR and open reading frame 1 (ORF1). This transcription is carried out by an RNA polymerase with alpha-amanitin sensitivity reminiscent of SL RNA synthesis and is dependent on the SL RNA promoter. Additionally, we show that both sense and antisense small SLACS transcripts originate from ORF1 and that they are associated with proteins in vivo. We speculate that the small SLACS transcripts serve as substrates for the production of siRNAs to regulate SLACS expression.
Collapse
Affiliation(s)
- Kristin L Patrick
- Department of Epidemiology and Public Health, Yale University Medical School, 295 Congress Avenue, New Haven, CT 06536-0812, USA
| | | | | | | | | | | |
Collapse
|
24
|
Gogolevsky KP, Vassetzky NS, Kramerov DA. Bov-B-mobilized SINEs in vertebrate genomes. Gene 2007; 407:75-85. [PMID: 17976929 DOI: 10.1016/j.gene.2007.09.021] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2007] [Revised: 09/27/2007] [Accepted: 09/27/2007] [Indexed: 11/26/2022]
Abstract
Two new short retroposon families (SINEs) have been found in the genome of springhare Pedetes capensis (Rodentia). One of them, Ped-1, originated from 5S rRNA, while the other one, Ped-2, originated from tRNA-derived SINE ID. In contrast to most currently active mammalian SINEs mobilized by L1 long retrotransposon (LINE), Ped-1 and Ped-2 are mobilized by Bov-B, a LINE family of the widely distributed RTE clade. The 3' part of these SINEs originates from two sequences in the 5' and 3' regions of Bov-B. Such bipartite structure of the LINE-derived part has been revealed in all Bov-B-mobilized SINEs known to date (AfroSINE, Bov-tA, Mar-1, and Ped-1/2), which distinguishes them from other SINEs with only a 3' LINE-derived part. Structural analysis and the distribution of Bov-B LINEs and partner SINEs supports the horizontal transfer of Bov-B, while the SINEs emerged independently in lineages with this LINE.
Collapse
Affiliation(s)
- Konstantin P Gogolevsky
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov Street, Moscow, Russia
| | | | | |
Collapse
|
25
|
Laha T, Kewgrai N, Loukas A, Brindley PJ. The dingo non-long terminal repeat retrotransposons from the genome of the hookworm, Ancylostoma caninum. Exp Parasitol 2006; 113:142-53. [PMID: 16445914 DOI: 10.1016/j.exppara.2005.12.018] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2005] [Revised: 12/21/2005] [Accepted: 12/22/2005] [Indexed: 11/22/2022]
Abstract
Members of the retrotransposable element (RTE) clade of non-long terminal repeat (LTR) retrotransposon are widely distributed among eukaryote taxa, with representatives known from Caenorhabditis elegans, mammals, mosquitoes, schistosomes, and other taxa. An RTE retrotransposon has not, however, been characterized in detail from a parasitic nematode. Here, we characterize two discrete copies of an RTE-like non-LTR retrotransposon from the genome of the dog hookworm, Ancylostoma caninum. The elements were named dingo-1 and dingo-2. The full-length dingo-1 and dingo-2 elements were 3421 and 3171bp in length, respectively. They exhibited 54% nucleotide sequence identity to one another across their entire length and 40%/58% amino-acid sequence identity/similarity across their open reading frames. dingo-1 and dingo-2 exhibited hallmark structures and sequences of non-LTR retrotransposons of the RTE family including a single open reading frame encoding apurinic-apyrimidinic endonuclease (EN) and reverse transcriptase (RT), in that order. Phylogenetic analyses targeting the RT and the EN domains both confirmed that dingo-1 and dingo-2 were members of the RTE clade and that they were closely related to RTE-1 from C. elegans, to BDDF from Bos taurus and to SR2 from Schistosoma mansoni. Dot blot hybridization indicated that as many as 100-1000 copies of dingo-1 reside within the genome of A. caninum, while detection by RT-PCR of transcripts encoding dingo-like elements suggested that dingo-1 and -2 may be retrotranspositionally active within the genome of A. caninum. The dingo elements are the first retrotransposons to be characterized from a hookworm genome.
Collapse
Affiliation(s)
- Thewarach Laha
- Department of Parasitology, Faculty of Medicine, Khon Kaen University, Khon Kaen 40002, Thailand
| | | | | | | |
Collapse
|
26
|
Glushkov S, Novikova O, Blinov A, Fet V. Divergent non-LTR retrotransposon lineages from the genomes of scorpions (Arachnida: Scorpiones). Mol Genet Genomics 2005; 275:288-96. [PMID: 16328371 DOI: 10.1007/s00438-005-0079-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2005] [Accepted: 11/12/2005] [Indexed: 01/28/2023]
Abstract
We screened across the taxonomic diversity of order Scorpiones (22 species belonging to 21 genera and 10 families) for the presence of seven different clades of non-LTR retrotransposons in their genomes using PCR with newly designed clade-specific consensus-degenerate hybrid oligonucleotide primers. Scorpion genomes were found to contain four known non-LTR retrotransposon clades: R1, I, Jockey, and CR1. In total, 35 fragments of reverse transcriptase genes of new elements from 22 scorpion species were obtained and analyzed for three clades, Jockey, I, and CR1. Phylogenies of different clades of elements were built using amino acid sequences inferred from 33 non-LTR retrotransposon clones. Distinct evolutionary lineages, with several major groups of the non-LTR retroelements were identified, showing significant variation. Four lineages were revealed in Jockey clade. The phylogeny of I clade showed strong support for the monophyletic origin of such group of elements in scorpions. Three separate lineages can be distinguished in the phylogenetic tree of CR1 clade. The large fraction of the isolated elements appeared to be defective.
Collapse
Affiliation(s)
- Sergei Glushkov
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Prospekt Lavrentyeva 10, 630090 Novosibirsk, Russia
| | | | | | | |
Collapse
|
27
|
Abstract
Retrotransposons commonly encode a reverse transcriptase (RT), but other functional domains are variable. The acquisition of new domains is the dominant evolutionary force that brings structural variety to retrotransposons. Non-long-terminal-repeat (non-LTR) retrotransposons are classified into two groups by their structure. Early branched non-LTR retrotransposons encode a restriction-like endonuclease (RLE), and recently branched non-LTR retrotransposons encode an apurinic/apyrimidinic endonuclease-like endonuclease (APE). In this study, we report a novel non-LTR retrotransposon family Dualen, identified from the Chlamydomonas reinhardtii genome. Dualen encodes two endonucleases, RLE and APE, with RT, ribonuclease H, and cysteine protease. Phylogenetic analyses of the RT domains revealed that Dualen is positioned at the midpoint between the early-branched and the recently branched groups. In the APE tree, Dualen was branched earlier than the I group and the Jockey group. The ribonuclease H domains among the Dualen family and other non-LTR retrotransposons are monophyletic. Phylogenies of three domains revealed the monophyly of the Dualen family members. The domain structure and the phylogeny of each domain imply that Dualen is a retrotransposon conserving the domain structure just after the acquisition of APE. From these observations, we discuss the evolution of domain structure of non-LTR retrotransposons.
Collapse
Affiliation(s)
- Kenji K Kojima
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Japan
| | | |
Collapse
|
28
|
Laha T, Kewgrai N, Loukas A, Brindley PJ. Characterization of SR3 reveals abundance of non-LTR retrotransposons of the RTE clade in the genome of the human blood fluke, Schistosoma mansoni. BMC Genomics 2005; 6:154. [PMID: 16271150 PMCID: PMC1291365 DOI: 10.1186/1471-2164-6-154] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2005] [Accepted: 11/04/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND It is becoming apparent that perhaps as much as half of the genome of the human blood fluke Schistosoma mansoni is constituted of mobile genetic element-related sequences. Non-long terminal repeat (LTR) retrotransposons, related to the LINE elements of mammals, comprise much of this repetitive component of the schistosome genome. Of more than 12 recognized clades of non-LTR retrotransposons, only members of the CR1, RTE, and R2 clades have been reported from the schistosome genome. RESULTS Inspection of the nucleotide sequence of bacterial artificial chromosome number 49_J_14 from chromosome 1 of the genome of Schistosoma mansoni (GenBank AC093105) revealed the likely presence of several RTE-like retrotransposons. Among these, a new non-LTR retrotransposon designated SR3 was identified and is characterized here. Analysis of gene structure and phylogenetic analysis of both the reverse transcriptase and endonuclease domains of the mobile element indicated that SR3 represented a new family of RTE-like non-LTR retrotransposons. Remarkably, two full-length copies of SR3-like elements were present in BAC 49-J-14, and one of 3,211 bp in length appeared to be intact, indicating SR3 to be an active non-LTR retrotransposon. Both were flanked by target site duplications of 10-12 bp. Southern hybridization and bioinformatics analyses indicated the presence of numerous copies (probably >1,000) of SR3 interspersed throughout the genome of S. mansoni. Bioinformatics analyses also revealed SR3 to be transcribed in both larval and adult developmental stages of S. mansoni and to be also present in the genomes of the other major schistosome parasites of humans, Schistosoma haematobium and S. japonicum. CONCLUSION Numerous copies of SR3, a novel non-LTR retrotransposon of the RTE clade are present in the genome of S. mansoni. Non-LTR retrotransposons of the RTE clade including SR3 appear to have been remarkably successful in colonizing, and proliferation within the schistosome genome.
Collapse
Affiliation(s)
- Thewarach Laha
- Department of Parasitology, Faculty of Medicine, Khon Kaen University, Khon Kaen 40002, Thailand
| | - Nonglack Kewgrai
- Department of Parasitology, Faculty of Medicine, Khon Kaen University, Khon Kaen 40002, Thailand
| | - Alex Loukas
- Division of Infectious Diseases & Immunology, Queensland Institute of Medical Research, Brisbane, Queensland, 4029, Australia
| | - Paul J Brindley
- Department of Tropical Medicine, and Center for Infectious Diseases, Tulane University, Health Sciences Center, New Orleans, Louisiana, 70112, USA
| |
Collapse
|
29
|
Ohshima K, Okada N. SINEs and LINEs: symbionts of eukaryotic genomes with a common tail. Cytogenet Genome Res 2005; 110:475-90. [PMID: 16093701 DOI: 10.1159/000084981] [Citation(s) in RCA: 121] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2004] [Accepted: 04/27/2004] [Indexed: 01/26/2023] Open
Abstract
Many SINEs and LINEs have been characterized to date, and examples of the SINE and LINE pair that have the same 3' end sequence have also increased. We report the phylogenetic relationships of nearly all known LINEs from which SINEs are derived, including a new example of a SINE/LINE pair identified in the salmon genome. We also use several biological examples to discuss the impact and significance of SINEs and LINEs in the evolution of vertebrate genomes.
Collapse
Affiliation(s)
- K Ohshima
- School and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan.
| | | |
Collapse
|
30
|
DeMarco R, Machado AA, Bisson-Filho AW, Verjovski-Almeida S. Identification of 18 new transcribed retrotransposons in Schistosoma mansoni. Biochem Biophys Res Commun 2005; 333:230-40. [PMID: 15939396 DOI: 10.1016/j.bbrc.2005.05.080] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2005] [Accepted: 05/13/2005] [Indexed: 11/28/2022]
Abstract
This work describes 18 new transcribed retrotransposons of the blood fluke Schistosoma mansoni. Among them, 9 were LTR, 8 non-LTR, and 1 Penelope-like element (PLE) retrotransposon. Sequences were generated by in silico reconstruction using S. mansoni ESTs and transcripts obtained by rapid amplification of cDNA ends, complemented in some cases by sequencing of genomic clones amplified by PCR. A novel element from the ancient R2/R4/CRE transposon group is described for the first time in S. mansoni. In addition, one non-LTR retrotransposon family displays long (40-450 bp) 3'-UTR with at least six different transcribed sequences among the copies, five LTR retrotransposons have abundantly transcribed incomplete copies lacking the sequence segment coding for the reverse transcriptase domain, and four non-LTR retrotransposons code for DNA-binding PHD domains that may give them a differential targeting. These results allow for a comprehensive description of the transcribed retrotransposon diversity of this complex human parasite.
Collapse
Affiliation(s)
- Ricardo DeMarco
- Departamento de Bioquimica, Instituto de Quimica, Universidade de São Paulo, Brazil
| | | | | | | |
Collapse
|
31
|
Fischer C, Bouneau L, Coutanceau JP, Weissenbach J, Ozouf-Costaz C, Volff JN. Diversity and clustered distribution of retrotransposable elements in the compact genome of the pufferfish Tetraodon nigroviridis. Cytogenet Genome Res 2005; 110:522-36. [PMID: 16093705 DOI: 10.1159/000084985] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2004] [Accepted: 03/25/2004] [Indexed: 12/15/2022] Open
Abstract
We report the characterization and chromosomal distribution of retroelements in the compact genome of the pufferfish Tetraodon nigroviridis. We have reconstructed partial/complete retroelement sequences, established their phylogenetic relationship to other known eukaryotic retrotransposons, and performed double-color FISH analyses to gain new insights into their patterns of chromosomal distribution. We could identify 43 different reverse transcriptase retrotransposons belonging to the three major known subclasses (14 non-LTR retrotransposons from seven clades, 25 LTR retrotransposons representing the five major known groups, and four Penelope-like elements), and well as two SINEs (non-autonomous retroelements). Such a diversity of retrotransposable elements, which seems to be relatively common in fish but not in mammals, is astonishing in such a compact genome. The total number of retroelements was approximately 3000, roughly representing only 2.6% of the genome of T. nigroviridis. This is much less than in other vertebrate genomes, reflecting the compact nature of the genome of this pufferfish. Major differences in copy number were observed between different clades, indicating differential success in invading and persisting in the genome. Some retroelements displayed evidence of recent activity. Finally, FISH analysis showed that retrotransposable elements preferentially accumulate in specific heterochromatic regions of the genome of T. nigroviridis, revealing a degree of genomic compartmentalization not observed in the human genome.
Collapse
Affiliation(s)
- C Fischer
- Genoscope/Centre National de Séquençage, CNRS-UMR 8030, Evry, France.
| | | | | | | | | | | |
Collapse
|
32
|
Crainey JL, Garvey CF, Malcolm CA. The origin and evolution of mosquito APE retroposons. Mol Biol Evol 2005; 22:2190-7. [PMID: 16033989 DOI: 10.1093/molbev/msi217] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The detection of horizontal transfer is important to understanding the origin and spread of transposable elements and in assessing their impact on genetic diversity. The occurrence of the phenomenon is not in doubt for two of the three major groups of elements, but is disputed for retroposons, largely on the grounds of data paucity and overreliance on divergence estimates between host species. We present here the most wide-ranging retroposon data set assembled to date for a species group, the mosquitoes. The results provide no evidence for horizontal transfer events and show conclusively that four previously reported events, involving Juan-A, Juan-C, T1, and Q, did not occur. We propose that the origin of all known mosquito retroposons can be attributed to vertical inheritance and that retroposons have therefore been a persistent source of genetic diversity in mosquito genomes since the emergence of the taxon. Furthermore, the data confirm that the unprecedented levels of retroposon diversity previously reported in Anopheles gambiae extends to at least seven other species representing five genera and all three mosquito subfamilies. Most notably, this included the L1 elements, which are not known in other insects. A number of novel well-defined monophyletic groups were also identified, particularly, JM2 and JM3 within the Jockey clade, which included sequences from seven and five mosquito species, respectively. As JM3 does not contain an Anopheles element, this represents a good example of stochastic loss and the best out of at least four found in this study. This exceptionally diverse data set when compared with the wealth of data available for the many unrelated species with which mosquitoes have intimate contact through blood feeding ought to be fertile ground for the discovery of horizontal transfer events. The absence of positive results therefore supports the view that retroposon horizontal transfer does not occur or is far more exceptional than for other types of transposable elements.
Collapse
Affiliation(s)
- James L Crainey
- School of Biological Sciences, Queen Mary, University of London, London, United Kingdom
| | | | | |
Collapse
|
33
|
Papusheva E, Gruhl MC, Berezikov E, Groudieva T, Scherbik SV, Martin J, Blinov A, Bergtrom G. The Evolution of SINEs and LINEs in the genus Chironomus (Diptera). J Mol Evol 2004; 58:269-79. [PMID: 15045482 DOI: 10.1007/s00239-003-2549-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2002] [Accepted: 08/15/2003] [Indexed: 11/25/2022]
Abstract
Genomic DNA amplification from 51 species of the family Chironomidae shows that most contain relatives of NLRCth1 LINE and CTRT1 SINE retrotransposons first found in Chironomus thummi. More than 300 cloned PCR products were sequenced. The amplified region of the reverse transcriptase gene in the LINEs is intact and highly conserved, suggesting active elements. The SINEs are less conserved, consistent with minimal/no selection after transposition. A mitochondrial gene phylogeny resolves the Chironomus genus into six lineages (Guryev et al. 2001). LINE and SINE phylogenies resolve five of these lineages, indicating their monophyletic origin and vertical inheritance. However, both the LINE and the SINE tree topologies differ from the species phylogeny, resolving the elements into "clusters I-IV" and "cluster V" families. The data suggest a descent of all LINE and SINE subfamilies from two major families. Based on the species phylogeny, a few LINEs and a larger number of SINEs are cladisitically misplaced. Most misbranch with LINEs or SINEs from species with the same families of elements. From sequence comparisons, cladistically misplaced LINEs and several misplaced SINEs arose by convergent base substitutions. More diverged SINEs result from early transposition and some are derived from multiple source SINEs in the same species. SINEs from two species (C. dorsalis, C. pallidivittatus), expected to belong to the clusters I-IV family, branch instead with cluster V family SINEs; apparently both families predate separation of cluster V from clusters I-IV species. Correlation of the distribution of active SINEs and LINEs, as well as similar 3' sequence motifs in CTRT1 and NLRCth1, suggests coevolving retrotransposon pairs in which CTRT1 transposition depends on enzymes active during NLRCth1 LINE mobility.
Collapse
|
34
|
Pyatkov KI, Arkhipova IR, Malkova NV, Finnegan DJ, Evgen'ev MB. Reverse transcriptase and endonuclease activities encoded by Penelope-like retroelements. Proc Natl Acad Sci U S A 2004; 101:14719-24. [PMID: 15465912 PMCID: PMC522041 DOI: 10.1073/pnas.0406281101] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2004] [Indexed: 11/18/2022] Open
Abstract
Penelope-like elements are a class of retroelement that have now been identified in >50 species belonging to at least 10 animal phyla. The Penelope element isolated from Drosophila virilis is the only transpositionally active representative of this class isolated so far. The single ORF of Penelope and its relatives contains regions homologous to a reverse transcriptase of atypical structure and to the GIY-YIG, or Uri, an endonuclease (EN) domain not previously found in retroelements. We have expressed the single ORF of Penelope in a baculovirus expression system and have shown that it encodes a polyprotein with reverse transcriptase activity that requires divalent cations (Mn2+ and Mg2+). We have also expressed and purified the EN domain in Escherichia coli and have demonstrated that it has EN activity in vitro. Mutations in the conserved residues of the EN catalytic module abolish its nicking activity, whereas the DNA-binding properties of the mutant proteins remain unaffected. Only one strand of the target sequence is cleaved, and there is a certain degree of cleavage specificity. We propose that the Penelope EN cleaves the target DNA during transposition, generating a primer for reverse transcription. Our results show that an active Uri EN has been adopted by a retrotransposon.
Collapse
|
35
|
Zagrobelny M, Jeffares DC, Arctander P. Differences in non-LTR retrotransposons within C. elegans and C. briggsae genomes. Gene 2004; 330:61-6. [PMID: 15087124 DOI: 10.1016/j.gene.2004.01.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2003] [Revised: 09/15/2003] [Accepted: 01/08/2004] [Indexed: 11/21/2022]
Abstract
An exhaustive study of the Sam/Frodo family of non-LTR retrotransposons in the Caenorhabditis elegans and Caenorhabditis briggsae genomes demonstrated that C. briggsae contains 60 Sam/Frodo elements including a new subfamily designated Merry, while at least 1000 elements are present in C. elegans. In contrast to C. elegans, C. briggsae does not contain any other non-LTR retrotransposons. The Sam/Frodo/Merry sequences in C. briggsae are shorter and less complete than the Sam/Frodo sequences in C. elegans probably because they all lack a functional first open reading frame (ORF1) and because the genome only encodes one functional reverse transcriptase gene of a non-LTR retrotransposon. Evidence of purifying selection for a functional reverse transcriptase sequence in master/leader elements was found in both nematodes in spite of low copy numbers in C. briggsae. Sam elements in C. elegans are the most abundant Sam/Frodo/Merry family members. They contain the only functional ORF1 copies and, unlike Frodo and Merry members, have a higher GC content than the genomic regions in which they reside. This may indicate a higher transcription rate within this subfamily.
Collapse
Affiliation(s)
- Mika Zagrobelny
- Department of Evolutionary Biology, Zoological Institute, University of Copenhagen, Copenhagen, Denmark.
| | | | | |
Collapse
|
36
|
Permanyer J, Gonzàlez-Duarte R, Albalat R. The non-LTR retrotransposons in Ciona intestinalis: new insights into the evolution of chordate genomes. Genome Biol 2003; 4:R73. [PMID: 14611659 PMCID: PMC329123 DOI: 10.1186/gb-2003-4-11-r73] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2003] [Revised: 09/17/2003] [Accepted: 09/25/2003] [Indexed: 11/10/2022] Open
Abstract
In silico and experimental approaches have been used to identify the non-long terminal repeat retrotransposons of the urochordate Ciona intestinalis providing valuable data for understanding the evolution of early chordate genomes. Background Non-long terminal repeat (non-LTR) retrotransposons have contributed to shaping the structure and function of genomes. In silico and experimental approaches have been used to identify the non-LTR elements of the urochordate Ciona intestinalis. Knowledge of the types and abundance of non-LTR elements in urochordates is a key step in understanding their contribution to the structure and function of vertebrate genomes. Results Consensus elements phylogenetically related to the I, LINE1, LINE2, LOA and R2 elements of the 14 eukaryotic non-LTR clades are described from C. intestinalis. The ascidian elements showed conservation of both the reverse transcriptase coding sequence and the overall structural organization seen in each clade. The apurinic/apyrimidinic endonuclease and nucleic-acid-binding domains encoded upstream of the reverse transcriptase, and the RNase H and the restriction enzyme-like endonuclease motifs encoded downstream of the reverse transcriptase were identified in the corresponding Ciona families. Conclusions The genome of C. intestinalis harbors representatives of at least five clades of non-LTR retrotransposons. The copy number per haploid genome of each element is low, less than 100, far below the values reported for vertebrate counterparts but within the range for protostomes. Genomic and sequence analysis shows that the ascidian non-LTR elements are unmethylated and flanked by genomic segments with a gene density lower than average for the genome. The analysis provides valuable data for understanding the evolution of early chordate genomes and enlarges the view on the distribution of the non-LTR retrotransposons in eukaryotes.
Collapse
Affiliation(s)
- Jon Permanyer
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Av. Diagonal 645, E-08028 Barcelona, Spain
| | - Roser Gonzàlez-Duarte
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Av. Diagonal 645, E-08028 Barcelona, Spain
| | - Ricard Albalat
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Av. Diagonal 645, E-08028 Barcelona, Spain
| |
Collapse
|
37
|
Bouneau L, Fischer C, Ozouf-Costaz C, Froschauer A, Jaillon O, Coutanceau JP, Körting C, Weissenbach J, Bernot A, Volff JN. An active non-LTR retrotransposon with tandem structure in the compact genome of the pufferfish Tetraodon nigroviridis. Genome Res 2003; 13:1686-95. [PMID: 12805276 PMCID: PMC403742 DOI: 10.1101/gr.726003] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The fish retrotransposable element Zebulon encodes a reverse transcriptase and a carboxy-terminal restriction enzyme-like endonuclease, and is related phylogenetically to site-specific non-LTR retrotransposons from nematodes. Zebulon was detected in the pufferfishes Tetraodon nigroviridis and Takifugu rubripes, as well as in the zebrafish Danio rerio. Structural analysis suggested that Zebulon, in contrast to most non-LTR retrotransposons, might be able to retrotranspose as a partial tandem array. Zebulon was active relatively recently in the compact genome of T. nigroviridis, in which it contributed to the extension of intergenic and intronic sequences, and possibly to the formation of genomic rearrangements. Accumulation of Zebulon together with other retrotransposons was observed in some heterochromatic chromosomal regions of the genome of T. nigroviridis that might serve as reservoirs for active elements. Hence, pufferfish compact genomes are not evolutionarily inert and contain active retrotransposons, suggesting the presence of mechanisms allowing accumulation of retrotransposable elements in heterochromatin, but minimizing their impact on euchromatic regions. Homologous recombination between partial tandem sequences eliminating active copies of Zebulon and reducing the size of insertions in intronic and intragenic regions might represent such a mechanism.
Collapse
Affiliation(s)
- Laurence Bouneau
- Genoscope/Centre National de Séquençage and CNRS-UMR 8030, F-91057 Evry Cedex 06, France
| | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Van Dellen K, Field J, Wang Z, Loftus B, Samuelson J. LINEs and SINE-like elements of the protist Entamoeba histolytica. Gene 2002; 297:229-39. [PMID: 12384304 DOI: 10.1016/s0378-1119(02)00917-4] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
A survey of whole genome shotgun sequences of the protozoan parasite Entamoeba histolytica revealed three families of non-long terminal repeat (LTR) retrotransposons or long interspersed elements (LINEs) (called EhLINEs in this report). The 4.8 kb EhLINEs each had a single open reading frame with a putative nucleic acid binding motif (CCHC) and restriction enzyme-like endonuclease domain located downstream of the reverse transcriptase (RT) domain. Phylogenetic analysis of the RT domain placed the EhLINEs in the R4 clade of non-LTR elements, a mixed clade of non-LTR elements that includes members from nematodes, insects, and vertebrates. EhLINE1 (which was previously identified as HMc and EhRLE) shared a common 3' end with a highly transcribed 0.55 kb short interspersed element (SINE)-like element previously identified as IE or ehapt2 and called EhLSINE1 in this report. Similarly, EhLINE2 shared a common 3' end with a highly transcribed 0.65 kb SINE-like element called EhLSINE2 in this report. The shared 3' end sequences of the EhLINEs and EhLSINEs suggested that EhLINEs are involved in the retrotransposition of the EhLSINEs. EhLSINEs were flanked by target site duplications and contained conserved 5' sequences, which likely regulate their transcription. The EhLSINEs are the first protist SINE-like elements identified that share a common 3' sequence with LINEs, and the first SINE-like elements that have been associated with the R4 clade of non-LTR elements.
Collapse
Affiliation(s)
- Katrina Van Dellen
- Department of Immunology and Infectious Diseases, Harvard School of Public Health, 665 Huntington Avenue, Boston, MA 02115, USA
| | | | | | | | | |
Collapse
|
39
|
Abstract
Mobile genetic elements, by virtue of their ability to move to new chromosomal locations, are considered important in shaping the evolutionary course of the genome. They are widespread in the biological kingdom. Among the protozoan parasites several types of transposable elements are encountered. The largest variety is seen in the trypanosomatids-Trypanosoma brucei, Trypanosoma cruzi and Crithidia fasciculata. They contain elements that insert site-specifically in the spliced-leader RNA genes, and others that are dispersed in a variety of genomic locations. Giardia lamblia contains three families of transposable elements. Two of these are subtleomeric in location while one is chromosome-internal. Entamoeba histolytica has an abundant retrotransposon dispersed in the genome. Nucleotide sequence analysis of all the elements shows that they are all retrotransposons, and, with the exception of one class of elements in T. cruzi, all of them are non-long-terminal-repeat retrotransposons. Although most copies have accumulated mutations, they can potentially encode reverse transcriptase, endonuclease and nucleic-acid-binding activities. Functionally and phylogenetically they do not belong to a single lineage, showing that retrotransposons were acquired early in the evolution of protozoan parasites. Many of the potentially autonomous elements that encode their own transposition functions have nonautonomous counterparts that probably utilize the functions in trans. In this respect these elements are similar to the mammalian LINEs and SINEs (long and short interspersed DNA elements), showing a common theme in the evolution of retrotransposons. So far there is no report of a DNA transposon in any protozoan parasite. The genome projects that are under way for most of these organisms will help understand the evolution and possible function of these genetic elements.
Collapse
Affiliation(s)
- Sudha Bhattacharya
- School of Environmental Sciences, Jawaharlal Nehru University, New Delhi 110 067, India.
| | | | | |
Collapse
|
40
|
Abstract
SINEs and LINEs are short and long interspersed retrotransposable elements, respectively, that invade new genomic sites using RNA intermediates. SINEs and LINEs are found in almost all eukaryotes (although not in Saccharomyces cerevisiae) and together account for at least 34% of the human genome. The noncoding SINEs depend on reverse transcriptase and endonuclease functions encoded by partner LINEs. With the completion of many genome sequences, including our own, the database of SINEs and LINEs has taken a great leap forward. The new data pose new questions that can only be answered by detailed studies of the mechanism of retroposition. Current work ranges from the biochemistry of reverse transcription and integration invitro, target site selection in vivo, nucleocytoplasmic transport of the RNA and ribonucleoprotein intermediates, and mechanisms of genomic turnover. Two particularly exciting new ideas are that SINEs may help cells survive physiological stress, and that the evolution of SINEs and LINEs has been shaped by the forces of RNA interference. Taken together, these studies promise to explain the birth and death of SINEs and LINEs, and the contribution of these repetitive sequence families to the evolution of genomes.
Collapse
Affiliation(s)
- Alan M Weiner
- Department of Biochemistry, HSB J417, University of Washington, Box 357350, Seattle, WA 98195-7350, USA.
| |
Collapse
|
41
|
Burke WD, Malik HS, Rich SM, Eickbush TH. Ancient lineages of non-LTR retrotransposons in the primitive eukaryote, Giardia lamblia. Mol Biol Evol 2002; 19:619-30. [PMID: 11961096 DOI: 10.1093/oxfordjournals.molbev.a004121] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Mobile elements that use reverse transcriptase to make new copies of themselves are found in all major lineages of eukaryotes. The non-long terminal repeat (non-LTR) retrotransposons have been suggested to be the oldest of these eukaryotic elements. Phylogenetic analysis of non-LTR elements suggests that they have predominantly undergone vertical transmission, as opposed to the frequent horizontal transmissions found for other mobile elements. One prediction of this vertical model of inheritance is that the oldest lineages of eukaryotes should exclusively harbor the oldest lineages of non-LTR retrotransposons. Here we characterize the non-LTR retrotransposons present in one of the most primitive eukaryotes, the diplomonad Giardia lamblia. Two families of elements were detected in the WB isolate of G. lamblia currently being used for the genome sequencing project. These elements are clearly distinct from all other previously described non-LTR lineages. Phylogenetic analysis indicates that these Genie elements (for Giardia early non-LTR insertion element) are among the oldest known lineages of non-LTR elements consistent with strict vertical descent. Genie elements encode a single open reading frame with a carboxyl terminal endonuclease domain. Genie 1 is site specific, as seven to eight copies are present in a single tandem array of a 771-bp repeat near the telomere of one chromosome. The function of this repeat is not known. One additional, highly divergent, element within the Genie 1 lineage is not located in this tandem array but is near a second telomere. Four different telomere addition sites could be identified within or near the Genie elements on each of these chromosomes. The second lineage of non-LTR elements, Genie 2, is composed of about 10 degenerate copies. Genie 2 elements do not appear to be site specific in their insertion. An unusual aspect of Genie 2 is that all copies contain inverted repeats up to 172 bp in length.
Collapse
Affiliation(s)
- William D Burke
- Department of Biology, University of Rochester, Rochester, NY 14627, USA
| | | | | | | |
Collapse
|
42
|
Arkhipova IR, Morrison HG. Three retrotransposon families in the genome of Giardia lamblia: two telomeric, one dead. Proc Natl Acad Sci U S A 2001; 98:14497-502. [PMID: 11734649 PMCID: PMC64710 DOI: 10.1073/pnas.231494798] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Transposable elements inhabiting eukaryotic genomes are generally regarded either as selfish DNA, which is selectively neutral to the host organism, or as parasitic DNA, deleterious to the host. Thus far, the only agreed-upon example of beneficial eukaryotic transposons is provided by Drosophila telomere-associated retrotransposons, which transpose directly to the chromosome ends and thereby protect them from degradation. This article reports the transposon content of the genome of the protozoan Giardia lamblia, one of the earliest-branching eukaryotes. A total of three non-long terminal repeat retrotransposon families have been identified, two of which are located at the ends of chromosomes, and the third one contains exclusively dead copies with multiple internal deletions, nucleotide substitutions, and frame shifts. No other reverse transcriptase- or transposase-related sequences were found. Thus, the entire genome of this protozoan, which is not known to reproduce sexually, contains only retrotransposons that are either confined to telomeric regions and possibly beneficial, or inactivated and completely nonfunctional.
Collapse
Affiliation(s)
- I R Arkhipova
- Department of Molecular and Cellular Biology, Harvard University, 7 Divinity Avenue, Cambridge, MA 02138, USA.
| | | |
Collapse
|
43
|
Lovsin N, Gubensek F, Kordi D. Evolutionary dynamics in a novel L2 clade of non-LTR retrotransposons in Deuterostomia. Mol Biol Evol 2001; 18:2213-24. [PMID: 11719571 DOI: 10.1093/oxfordjournals.molbev.a003768] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The evolution of the novel L2 clade of non-long terminal repeat (LTR) retrotransposons and their evolutionary dynamics in Deuterostomia has been examined. The short-term evolution of long interspersed nuclear element 2s (LINE2s) has been studied in 18 reptilian species by analysis of a PCR amplified 0.7-kb fragment encoding the palm/fingers subdomain of reverse transcriptase (RT). Most of the reptilian LINE2s examined are inactive since they contain multiple stop codons, indels, or frameshift mutations that disrupt the RT. Analysis of reptilian LINE2s has shown a high degree of sequence divergence and an unexpectedly large number of deletions. The evolutionary dynamics of LINE2s in reptiles has been found to be complex. LINE2s are shown to form a novel clade of non-LTR retrotransposons that is well separated from the CR1 clade. This novel L2 clade is more widely distributed than previously thought, and new representatives have been discovered in echinoderms, insects, teleost fishes, Xenopus, Squamata, and marsupials. There is an apparent absence of LINE2s from different vertebrate classes, such as cartilaginous fishes, Archosauria (birds and crocodiles), and turtles. Whereas the LINE2s are present in echinoderms and teleost fishes in a conserved form, in most tetrapods only highly degenerated pseudogenes can be found. The predominance of inactive LINE2s in Tetrapoda indicates that, in the host genomes, only inactive copies are still present. The present data indicate that the vertical inactivation of LINE2s might have begun at the time of Tetrapoda origin, 400 MYA. The evolutionary dynamics of the L2 clade in Deuterostomia can be described as a gradual vertical inactivation in Tetrapoda, stochastic loss in Archosauria and turtles, and strict vertical transmission in echinoderms and teleost fishes.
Collapse
Affiliation(s)
- N Lovsin
- Department of Chemistry and Biochemistry, Faculty of Chemistry and Chemical Technology, University of Ljubljana, Slovenia
| | | | | |
Collapse
|
44
|
Lenoir A, Lavie L, Prieto JL, Goubely C, Coté JC, Pélissier T, Deragon JM. The evolutionary origin and genomic organization of SINEs in Arabidopsis thaliana. Mol Biol Evol 2001; 18:2315-22. [PMID: 11719581 DOI: 10.1093/oxfordjournals.molbev.a003778] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We have characterized the two families of SINE retroposons present in Arabidopsis thaliana. The origin, distribution, organization, and evolutionary history of RAthE1 and RAthE2 elements were studied and compared to the well-characterized SINE S1 element from Brassica. Our studies show that RAthE1, RAthE2, and S1 retroposons were generated independently from three different tRNAs. The RAthE1 and RAthE2 families are older than the S1 family and are present in all tested Cruciferae species. The evolutionary history of the RAthE1 family is unusual for SINEs. The 144 RAthE1 elements of the Arabidopsis genome cannot be classified in distinct subfamilies of different evolutionary ages as is the case for S1, RAthE2, and mammalian SINEs. Instead, most RAthE1 elements were probably derived steadily from a single source gene that was maintained intact and active for at least 12-20 Myr, a result suggesting that the RAthE1 source gene was under selection. The distribution of RAthE1 and RAthE2 elements on the Arabidopsis physical map was studied. We observed that, in contrast to other Arabidopsis transposable elements, SINEs are not concentrated in the heterochromatic regions. Instead, SINEs are grouped in the euchromatic chromosome territories several hundred kilobase pairs long. In these territories, SINE elements are closely associated with genes. A retroposition partnership between Arabidopsis SINEs and LINEs is proposed.
Collapse
Affiliation(s)
- A Lenoir
- Centre National de la Recherche Scientifique, Université Blaise Pascal Clermont-Ferrand II, Aubière cedex, France
| | | | | | | | | | | | | |
Collapse
|
45
|
Zupunski V, Gubensek F, Kordis D. Evolutionary dynamics and evolutionary history in the RTE clade of non-LTR retrotransposons. Mol Biol Evol 2001; 18:1849-63. [PMID: 11557792 DOI: 10.1093/oxfordjournals.molbev.a003727] [Citation(s) in RCA: 76] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
This study examined the evolutionary dynamics of Bov-B LINEs in vertebrates and the evolution of the RTE clade of non-LTR retrotransposons. The first full-length reptilian Bov-B LINE element is described; it is 3.2 kb in length, with a structural organization typical of the RTE clade of non-LTR retrotransposons. The long-term evolution of Bov-B LINEs was studied in 10 species of Squamata by analysis of a PCR-amplified 1.8-kb fragment encoding part of apurinic/apyrimidinic endonuclease, the intervening domain, and the palm/fingers subdomain of reverse transcriptase. A very high level of conservation in Squamata Bov-B long interspersed nuclear elements has been found, reaching 86% identity in the nearly 600 amino acids of ORF2. The same level of conservation exists between the ancestral snake lineage and Ruminantia. Such a high level is exceptional when compared with the level of conservation observed in nuclear and mitochondrial proteins and in other transposable elements. The RTE clade has been found to be much more widely distributed than previously thought, and novel representatives have been discovered in plants, brown algae, annelids, crustaceans, mollusks, echinoderms, and teleost fishes. Evolutionary relationships in the RTE clade were deduced at the amino acid level from three separate regions of ORF2. By using different independent methods, including the divergence-versus-age analysis, several examples of horizontal transfer in the RTE clade were recognized, with important implications for the existence of HT in non-LTR retrotransposons.
Collapse
Affiliation(s)
- V Zupunski
- Department of Biochemistry and Molecular Biology, Jozef Stefan Institute, Ljubljana, Slovenia
| | | | | |
Collapse
|
46
|
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, et alLander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blöcker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowki J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, Szustakowki J. Initial sequencing and analysis of the human genome. Nature 2001; 409:860-921. [PMID: 11237011 DOI: 10.1038/35057062] [Show More Authors] [Citation(s) in RCA: 15023] [Impact Index Per Article: 626.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
Collapse
Affiliation(s)
- E S Lander
- Whitehead Institute for Biomedical Research, Center for Genome Research, Cambridge, MA 02142, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Gentile KL, Burke WD, Eickbush TH. Multiple lineages of R1 retrotransposable elements can coexist in the rDNA loci of Drosophila. Mol Biol Evol 2001; 18:235-45. [PMID: 11158382 DOI: 10.1093/oxfordjournals.molbev.a003797] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
R1 non-long terminal repeat retrotransposable elements insert specifically into the 28S rRNA genes of arthropods. One aspect of R1 evolution that has been difficult to explain is the presence of divergent lineages of R1 in the rDNA loci of the same species. Multiple lineages should compete for a limited number of insertion sites, in addition to being subject to the concerted evolution processes homogenizing the rRNA genes. The presence of multiple lineages suggests either the ability of the elements to overcome these factors and diverge within rDNA loci, or the introduction of new lineages by horizontal transmission. To address this issue, we attempted to characterize the complete set of R1 elements in the rDNA locus from five Drosophila species groups (melanogaster, obscura, testacea, quinaria, and repleta). Two major R1 lineages, A and B, that diverged about 100 MYA were found to exist in Drosophila. Elements of the A lineage were found in all 35 Drosophila species tested, while elements of the B lineage were found in only 11 species from three species groups. Phylogenetic analysis of the R1 elements, supported by comparison of their rates of nucleotide sequence substitution, revealed that both the A and the B lineages have been maintained by vertical descent. The B lineage was less stable and has undergone numerous, independent elimination events, while the A lineage has diverged into three sublineages, which were, in turn, differentially stable. We conclude that while the differential retention of multiple lineages greatly complicates its phylogenetic history, the available R1 data continue to be consistent with the strict vertical descent of these elements.
Collapse
Affiliation(s)
- K L Gentile
- Department of Biology, University of Rochester, Rochester, NY 14627, USA
| | | | | |
Collapse
|
48
|
Berezikov E, Bucheton A, Busseau I. A search for reverse transcriptase-coding sequences reveals new non-LTR retrotransposons in the genome of Drosophila melanogaster. Genome Biol 2000; 1:RESEARCH0012. [PMID: 11178266 PMCID: PMC16141 DOI: 10.1186/gb-2000-1-6-research0012] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2000] [Revised: 10/13/2000] [Accepted: 10/26/2000] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND Non-long terminal repeat (non-LTR) retrotransposons are eukaryotic mobile genetic elements that transpose by reverse transcription of an RNA intermediate. We have performed a systematic search for sequences matching the characteristic reverse transcriptase domain of non-LTR retrotransposons in the sequenced regions of the Drosophila melanogaster genome. RESULTS In addition to previously characterized BS, Doc, F, G, I and Jockey elements, we have identified new non-LTR retrotransposons: Waldo, You and JuanDm. Waldo elements are related to mosquito RTI elements. You to the Drosophila I factor, and JuanDm to mosquito Juan-A and Juan-C. Interestingly, all JuanDm elements are highly homogeneous in sequence, suggesting that they are recent components of the Drosophila genome. CONCLUSIONS The genome of D. melanogaster contains at least ten families of non-site-specific non-LTR retrotransposons representing three distinct clades. Many of these families contain potentially active members. Fine evolutionary analyses must await the more accurate sequences that are expected in the next future.
Collapse
Affiliation(s)
- Eugene Berezikov
- Institute of Cytology and Genetics, Prospect Lavrentjeva 10, Novosibirsk 630090, Russia
| | - Alain Bucheton
- Institut de Génétique Humaine, CNRS, rue de la Cardonille, Montpellier cedex 5, France
| | - Isabelle Busseau
- Institut de Génétique Humaine, CNRS, rue de la Cardonille, Montpellier cedex 5, France
| |
Collapse
|