1
|
Fell CW, Villiger L, Lim J, Hiraizumi M, Tagliaferri D, Yarnall MTN, Lee A, Jiang K, Kayabolen A, Krajeski RN, Schmitt-Ulms C, Ramani H, Yousef SM, Roberts N, Vakulskas CA, Nishimasu H, Abudayyeh OO, Gootenberg JS. Reprogramming site-specific retrotransposon activity to new DNA sites. Nature 2025:10.1038/s41586-025-08877-4. [PMID: 40205048 DOI: 10.1038/s41586-025-08877-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2022] [Accepted: 03/07/2025] [Indexed: 04/11/2025]
Abstract
Retroelements have a critical role in shaping eukaryotic genomes. For instance, site-specific non-long terminal repeat retrotransposons have spread widely through preferential integration into repetitive genomic sequences, such as microsatellite regions and ribosomal DNA genes1-6. Despite the widespread occurrence of these systems, their targeting constraints remain unclear. Here we use a computational pipeline to discover multiple new site-specific retrotransposon families, profile members both biochemically and in mammalian cells, find previously undescribed insertion preferences and chart potential evolutionary paths for retrotransposon retargeting. We identify R2Tg, an R2 retrotransposon from the zebra finch, Taeniopygia guttata, as an orthologue that can be retargeted by payload engineering for target cleavage, reverse transcription and scarless insertion of heterologous payloads at new genomic sites. We enhance this activity by fusing R2Tg to CRISPR-Cas9 nickases for efficient insertion at new genomic sites. Through further screening of R2 orthologues, we select an orthologue, R2Tocc, with natural reprogrammability and minimal insertion at its natural 28S site, to engineer SpCas9H840A-R2Tocc, a system we name site-specific target-primed insertion through targeted CRISPR homing of retroelements (STITCHR). STITCHR enables the scarless, efficient installation of edits, ranging from a single base to 12.7 kilobases, gene replacement and use of in vitro transcribed or synthetic RNA templates. Inspired by the prevalence of nLTR retrotransposons across eukaryotic genomes, we anticipate that STITCHR will serve as a platform for scarless programmable integration in dividing and non-dividing cells, with both research and therapeutic applications.
Collapse
Affiliation(s)
- Christopher W Fell
- Department of Medicine, Division of Engineering in Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute, Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Lukas Villiger
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Justin Lim
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Masahiro Hiraizumi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo, Tokyo, Japan
| | - Dario Tagliaferri
- Department of Medicine, Division of Engineering in Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute, Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Matthew T N Yarnall
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Anderson Lee
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Kaiyi Jiang
- Department of Medicine, Division of Engineering in Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute, Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Alisan Kayabolen
- Department of Medicine, Division of Engineering in Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute, Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Rohan N Krajeski
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Cian Schmitt-Ulms
- Department of Medicine, Division of Engineering in Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute, Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Harsh Ramani
- Department of Medicine, Division of Engineering in Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute, Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| | - Sarah M Yousef
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA
| | | | | | - Hiroshi Nishimasu
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo, Tokyo, Japan
- Structural Biology Division Research Center for Advanced Science and Technology, The University of Tokyo, Tokyo, Japan
- Inamori Research Institute for Science, Kyoto, Japan
| | - Omar O Abudayyeh
- Department of Medicine, Division of Engineering in Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
- Gene and Cell Therapy Institute, Mass General Brigham, Cambridge, MA, USA.
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA.
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Jonathan S Gootenberg
- Department of Medicine, Division of Engineering in Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
- Gene and Cell Therapy Institute, Mass General Brigham, Cambridge, MA, USA.
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA.
- McGovern Institute for Brain Research at MIT, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
2
|
López-Cortegano E, Craig RJ, Chebib J, Balogun EJ, Keightley PD. Rates and spectra of de novo structural mutations in Chlamydomonas reinhardtii. Genome Res 2023; 33:45-60. [PMID: 36617667 PMCID: PMC9977147 DOI: 10.1101/gr.276957.122] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 12/06/2022] [Indexed: 12/14/2022]
Abstract
Genetic variation originates from several types of spontaneous mutation, including single-nucleotide substitutions, short insertions and deletions (indels), and larger structural changes. Structural mutations (SMs) drive genome evolution and are thought to play major roles in evolutionary adaptation, speciation, and genetic disease, including cancers. Sequencing of mutation accumulation (MA) lines has provided estimates of rates and spectra of single-nucleotide and indel mutations in many species, yet the rate of new SMs is largely unknown. Here, we use long-read sequencing to determine the full mutation spectrum in MA lines derived from two strains (CC-1952 and CC-2931) of the green alga Chlamydomonas reinhardtii The SM rate is highly variable between strains and between MA lines, and SMs represent a substantial proportion of all mutations in both strains (CC-1952 6%; CC-2931 12%). The SM spectra differ considerably between the two strains, with almost all inversions and translocations occurring in CC-2931 MA lines. This variation is associated with heterogeneity in the number and type of active transposable elements (TEs), which comprise major proportions of SMs in both strains (CC-1952 22%; CC-2931 38%). In CC-2931, a Crypton and a previously undescribed type of DNA element have caused 71% of chromosomal rearrangements, whereas in CC-1952, a Dualen LINE is associated with 87% of duplications. Other SMs, notably large duplications in CC-2931, are likely products of various double-strand break repair pathways. Our results show that diverse types of SMs occur at substantial rates, and support prominent roles for SMs and TEs in evolution.
Collapse
Affiliation(s)
- Eugenio López-Cortegano
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Rory J Craig
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- California Institute for Quantitative Biosciences, UC Berkeley, Berkeley, California 94720, USA
| | - Jobran Chebib
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| | - Eniolaye J Balogun
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario ON M5S 3B2, Canada
- Department of Biology, University of Toronto Mississauga, Mississauga ON L5L 1C6, Canada
| | - Peter D Keightley
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
| |
Collapse
|
3
|
Craig RJ, Yushenova IA, Rodriguez F, Arkhipova IR. An ancient clade of Penelope-like retroelements with permuted domains is present in the green lineage and protists, and dominates many invertebrate genomes. Mol Biol Evol 2021; 38:5005-5020. [PMID: 34320655 PMCID: PMC8557442 DOI: 10.1093/molbev/msab225] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Penelope-like elements (PLEs) are an enigmatic clade of retrotransposons whose reverse transcriptases (RTs) share a most recent common ancestor with telomerase RTs. The single ORF of canonical endonuclease (EN)+ PLEs encodes RT and a C-terminal GIY–YIG EN that enables intrachromosomal integration, whereas EN− PLEs lack EN and are generally restricted to chromosome termini. EN+ PLEs have only been found in animals, except for one case of horizontal transfer to conifers, whereas EN− PLEs occur in several kingdoms. Here, we report a new, deep-branching PLE clade with a permuted domain order, whereby an N-terminal GIY–YIG EN is linked to a C-terminal RT by a short domain with a characteristic CxC motif. These N-terminal EN+ PLEs share a structural organization, including pseudo-LTRs and complex tandem/inverted insertions, with canonical EN+ PLEs from Penelope/Poseidon, Neptune, and Nematis clades, and show insertion bias for microsatellites, but lack canonical hammerhead ribozyme motifs. However, their phylogenetic distribution is much broader. The Naiads, found in numerous invertebrate phyla, can reach tens of thousands of copies per genome. In spiders and clams, Naiads independently evolved to encode selenoproteins containing multiple selenocysteines. Chlamys, which lack the CCHH motif universal to PLE ENs, occur in green algae, spike mosses (targeting ribosomal DNA), and slime molds. Unlike canonical PLEs, RTs of N-terminal EN+ PLEs contain the insertion-in-fingers domain (IFD), strengthening the link between PLEs and telomerases. Additionally, we describe Hydra, a novel metazoan C-terminal EN+ clade. Overall, we conclude that PLE diversity, taxonomic distribution, and abundance are comparable with non-LTR and LTR-retrotransposons.
Collapse
Affiliation(s)
- Rory J Craig
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK
| | - Irina A Yushenova
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA
| | - Fernando Rodriguez
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA
| | - Irina R Arkhipova
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA
| |
Collapse
|
4
|
Craig RJ, Hasan AR, Ness RW, Keightley PD. Comparative genomics of Chlamydomonas. THE PLANT CELL 2021; 33:1016-1041. [PMID: 33793842 PMCID: PMC8226300 DOI: 10.1093/plcell/koab026] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 01/22/2021] [Indexed: 05/06/2023]
Abstract
Despite its role as a reference organism in the plant sciences, the green alga Chlamydomonas reinhardtii entirely lacks genomic resources from closely related species. We present highly contiguous and well-annotated genome assemblies for three unicellular C. reinhardtii relatives: Chlamydomonas incerta, Chlamydomonas schloesseri, and the more distantly related Edaphochlamys debaryana. The three Chlamydomonas genomes are highly syntenous with similar gene contents, although the 129.2 Mb C. incerta and 130.2 Mb C. schloesseri assemblies are more repeat-rich than the 111.1 Mb C. reinhardtii genome. We identify the major centromeric repeat in C. reinhardtii as a LINE transposable element homologous to Zepp (the centromeric repeat in Coccomyxa subellipsoidea) and infer that centromere locations and structure are likely conserved in C. incerta and C. schloesseri. We report extensive rearrangements, but limited gene turnover, between the minus mating type loci of these Chlamydomonas species. We produce an eight-species core-Reinhardtinia whole-genome alignment, which we use to identify several hundred false positive and missing genes in the C. reinhardtii annotation and >260,000 evolutionarily conserved elements in the C. reinhardtii genome. In summary, these resources will enable comparative genomics analyses for C. reinhardtii, significantly extending the analytical toolkit for this emerging model system.
Collapse
Affiliation(s)
| | - Ahmed R Hasan
- Department of Biology, University of Toronto Mississauga, Mississauga, Onatrio, Canada L5L 1C6
| | - Rob W Ness
- Department of Biology, University of Toronto Mississauga, Mississauga, Onatrio, Canada L5L 1C6
| | - Peter D Keightley
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, EH9 3FL Edinburgh, UK
| |
Collapse
|
5
|
Kojima KK. Structural and sequence diversity of eukaryotic transposable elements. Genes Genet Syst 2019; 94:233-252. [DOI: 10.1266/ggs.18-00024] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Affiliation(s)
- Kenji K. Kojima
- Genetic Information Research Institute
- Department of Life Sciences, National Cheng Kung University
| |
Collapse
|
6
|
Ustyantsev K, Blinov A, Smyshlyaev G. Convergence of retrotransposons in oomycetes and plants. Mob DNA 2017; 8:4. [PMID: 28293305 PMCID: PMC5348765 DOI: 10.1186/s13100-017-0087-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2016] [Accepted: 03/07/2017] [Indexed: 12/30/2022] Open
Abstract
Background Retrotransposons comprise a ubiquitous and abundant class of eukaryotic transposable elements. All members of this class rely on reverse transcriptase activity to produce a DNA copy of the element from the RNA template. However, other activities of the retrotransposon-encoded polyprotein may differ between diverse retrotransposons. The polyprotein domains corresponding to each of these activities may have their own evolutionary history independent from that of the reverse transcriptase, thus underlying the modular view on the evolution of retrotransposons. Furthermore, some transposable elements can independently evolve similar domain architectures by acquiring functionally similar but phylogenetically distinct modules. This convergent evolution of retrotransposons may ultimately suggest similar regulatory pathways underlying the lifecycle of the elements. Results Here, we provide new examples of the convergent evolution of retrotransposons of species from two unrelated taxa: green plants and parasitic protozoan oomycetes. In the present study we first analyzed the available genomic sequences of oomycete species and characterized two groups of Ty3/Gypsy long terminal repeat retrotransposons, namely Chronos and Archon, and a subgroup of L1 non-long terminal repeat retrotransposons. The results demonstrated that the retroelements from these three groups each have independently acquired plant-related ribonuclease H domains. This process closely resembles the evolution of retrotransposons in the genomes of green plants. In addition, we showed that Chronos elements captured a chromodomain, mimicking the process of chromodomain acquisition by Chromoviruses, another group of Ty3/Gypsy retrotransposons of plants, fungi, and vertebrates. Conclusions Repeated and strikingly similar acquisitions of ribonuclease H domains and chromodomains by different retrotransposon groups from unrelated taxa indicate similar selection pressure acting on these elements. Thus, there are some major trends in the evolution of the structural composition of retrotransposons, and characterizing these trends may enhance the current understanding of the retrotransposon life cycle. Electronic supplementary material The online version of this article (doi:10.1186/s13100-017-0087-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kirill Ustyantsev
- Institute of Cytology and Genetics, Laboratory of Molecular Genetic Systems, Prospekt Lavrentyeva 10, 630090 Novosibirsk, Russia
| | - Alexandr Blinov
- Institute of Cytology and Genetics, Laboratory of Molecular Genetic Systems, Prospekt Lavrentyeva 10, 630090 Novosibirsk, Russia
| | - Georgy Smyshlyaev
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| |
Collapse
|
7
|
Involvement of Conserved Amino Acids in the C-Terminal Region of LINE-1 ORF2p in Retrotransposition. Genetics 2017; 205:1139-1149. [PMID: 28100588 DOI: 10.1534/genetics.116.191403] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Accepted: 12/10/2016] [Indexed: 01/09/2023] Open
Abstract
Long interspersed element 1 (L1) is the only currently active autonomous retroelement in the human genome. Along with the parasitic SVA and short interspersed element Alu, L1 is the source of DNA damage induced by retrotransposition: a copy-and-paste process that has the potential to disrupt gene function and cause human disease. The retrotransposition process is dependent upon the ORF2 protein (ORF2p). However, it is unknown whether most of the protein is important for retrotransposition. In particular, other than the Cys motif, the C terminus of the protein has not been intensely examined in the context of retrotransposition. Using evolutionary analysis and the Alu retrotransposition assay, we sought to identify additional amino acids in the C terminus important for retrotransposition. Here, we demonstrate that Gal4-tagged and untagged C-terminally truncated ORF2p fragments possess residual potential to drive Alu retrotransposition. Using sight-directed mutagenesis we identify that while the Y1180 amino acid is important for ORF2p- and L1-driven Alu retrotransposition, a mutation at this position improves L1 retrotransposition. Even though the mechanism of the contribution of Y1180 to Alu and L1 mobilization remains unknown, experimental evidence rules out its direct involvement in the ability of the ORF2p reverse transcriptase to generate complementary DNA. Additionally, our data support that ORF2p amino acids 1180 and 1250-1262 may be involved in the reported ORF1p-mediated increase in ORF2p-driven Alu retrotransposition.
Collapse
|
8
|
Abstract
Although most of non-long terminal repeat (non-LTR) retrotransposons are incorporated in the host genome almost randomly, some non-LTR retrotransposons are incorporated into specific sequences within a target site. On the basis of structural and phylogenetic features, non-LTR retrotransposons are classified into two large groups, restriction enzyme-like endonuclease (RLE)-encoding elements and apurinic/apyrimidinic endonuclease (APE)-encoding elements. All clades of RLE-encoding non-LTR retrotransposons include site-specific elements. However, only two of more than 20 APE-encoding clades, Tx1 and R1, contain site-specific non-LTR elements. Site-specific non-LTR retrotransposons usually target within multi-copy RNA genes, such as rRNA gene (rDNA) clusters, or repetitive genomic sequences, such as telomeric repeats; this behavior may be a symbiotic strategy to reduce the damage to the host genome. Site- and sequence-specificity are variable even among closely related non-LTR elements and appeared to have changed during evolution. In the APE-encoding elements, the primary determinant of the sequence- specific integration is APE itself, which nicks one strand of the target DNA during the initiation of target primed reverse transcription (TPRT). However, other factors, such as interaction between mRNA and the target DNA, and access to the target region in the nuclei also affect the sequence-specificity. In contrast, in the RLE-encoding elements, DNA-binding motifs appear to affect their sequence-specificity, rather than the RLE domain itself. Highly specific integration properties of these site-specific non-LTR elements make them ideal alternative tools for sequence-specific gene delivery, particularly for therapeutic purposes in human diseases.
Collapse
|
9
|
Abstract
This review focuses on recent developments in our understanding of group II intron function, the relationships of these introns to retrotransposons and spliceosomes, and how their common features have informed thinking about bacterial group II introns as key elements in eukaryotic evolution. Reverse transcriptase-mediated and host factor-aided intron retrohoming pathways are considered along with retrotransposition mechanisms to novel sites in bacteria, where group II introns are thought to have originated. DNA target recognition and movement by target-primed reverse transcription infer an evolutionary relationship among group II introns, non-LTR retrotransposons, such as LINE elements, and telomerase. Additionally, group II introns are almost certainly the progenitors of spliceosomal introns. Their profound similarities include splicing chemistry extending to RNA catalysis, reaction stereochemistry, and the position of two divalent metals that perform catalysis at the RNA active site. There are also sequence and structural similarities between group II introns and the spliceosome's small nuclear RNAs (snRNAs) and between a highly conserved core spliceosomal protein Prp8 and a group II intron-like reverse transcriptase. It has been proposed that group II introns entered eukaryotes during bacterial endosymbiosis or bacterial-archaeal fusion, proliferated within the nuclear genome, necessitating evolution of the nuclear envelope, and fragmented giving rise to spliceosomal introns. Thus, these bacterial self-splicing mobile elements have fundamentally impacted the composition of extant eukaryotic genomes, including the human genome, most of which is derived from close relatives of mobile group II introns.
Collapse
|
10
|
Lazareva E, Lezzhov A, Vassetzky N, Solovyev A, Morozov S. Acquisition of Full-Length Viral Helicase Domains by Insect Retrotransposon-Encoded Polypeptides. Front Microbiol 2015; 6:1447. [PMID: 26733982 PMCID: PMC4686597 DOI: 10.3389/fmicb.2015.01447] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Accepted: 12/04/2015] [Indexed: 12/18/2022] Open
Abstract
Recent metagenomic studies in insects identified many sequences unexpectedly closely related to plant virus genes. Here we describe a new example of this kind, insect R1 LINEs with an additional C-terminal domain in their open reading frame 2. This domain is similar to NTPase/helicase (SF1H) domains, which are found in replicative proteins encoded by plant viruses of the genus Tobamovirus. We hypothesize that the SF1H domain could be acquired by LINEs, directly or indirectly, upon insect feeding on virus-infected plants. Possible functions of this domain in LINE transposition and involvement in LINEs counteraction the silencing-based cell defense against retrotransposons are discussed.
Collapse
Affiliation(s)
- Ekaterina Lazareva
- Department of Virology, Faculty of Biology, Lomonosov Moscow State University Moscow, Russia
| | - Alexander Lezzhov
- Department of Virology, Faculty of Biology, Lomonosov Moscow State University Moscow, Russia
| | - Nikita Vassetzky
- Laboratory of Eukaryotic Genome Evolution, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences Moscow, Russia
| | - Andrey Solovyev
- Genetic Engineering of Plant Viruses, A.N. Belozersky Institute of Physico-Chemical Biology, Moscow State University Moscow, Russia
| | - Sergey Morozov
- Department of Virology, Faculty of Biology, Lomonosov Moscow State UniversityMoscow, Russia; Genetic Engineering of Plant Viruses, A.N. Belozersky Institute of Physico-Chemical Biology, Moscow State UniversityMoscow, Russia
| |
Collapse
|
11
|
Reverse transcriptase genes are highly abundant and transcriptionally active in marine plankton assemblages. ISME JOURNAL 2015; 10:1134-46. [PMID: 26613339 PMCID: PMC5029228 DOI: 10.1038/ismej.2015.192] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2015] [Revised: 08/27/2015] [Accepted: 09/22/2015] [Indexed: 11/18/2022]
Abstract
Genes encoding reverse transcriptases (RTs) are found in most eukaryotes, often
as a component of retrotransposons, as well as in retroviruses and in
prokaryotic retroelements. We investigated the abundance, classification and
transcriptional status of RTs based on Tara Oceans marine metagenomes
and metatranscriptomes encompassing a wide organism size range. Our analyses
revealed that RTs predominate large-size fraction metagenomes
(>5 μm), where they reached a maximum of 13.5% of the total
gene abundance. Metagenomic RTs were widely distributed across the phylogeny of
known RTs, but many belonged to previously uncharacterized clades.
Metatranscriptomic RTs showed distinct abundance patterns across samples
compared with metagenomic RTs. The relative abundances of viral and bacterial
RTs among identified RT sequences were higher in metatranscriptomes than in
metagenomes and these sequences were detected in all metatranscriptome size
fractions. Overall, these observations suggest an active proliferation of
various RT-assisted elements, which could be involved in genome evolution or
adaptive processes of plankton assemblage.
Collapse
|
12
|
Kojima KK, Jurka J. Ancient Origin of the U2 Small Nuclear RNA Gene-Targeting Non-LTR Retrotransposons Utopia. PLoS One 2015; 10:e0140084. [PMID: 26556480 PMCID: PMC4640811 DOI: 10.1371/journal.pone.0140084] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Accepted: 09/21/2015] [Indexed: 11/22/2022] Open
Abstract
Most non-long terminal repeat (non-LTR) retrotransposons encoding a restriction-like endonuclease show target-specific integration into repetitive sequences such as ribosomal RNA genes and microsatellites. However, only a few target-specific lineages of non-LTR retrotransposons are distributed widely and no lineage is found across the eukaryotic kingdoms. Here we report the most widely distributed lineage of target sequence-specific non-LTR retrotransposons, designated Utopia. Utopia is found in three supergroups of eukaryotes: Amoebozoa, SAR, and Opisthokonta. Utopia is inserted into a specific site of U2 small nuclear RNA genes with different strength of specificity for each family. Utopia families from oomycetes and wasps show strong target specificity while only a small number of Utopia copies from reptiles are flanked with U2 snRNA genes. Oomycete Utopia families contain an “archaeal” RNase H domain upstream of reverse transcriptase (RT), which likely originated from a plant RNase H gene. Analysis of Utopia from oomycetes indicates that multiple lineages of Utopia have been maintained inside of U2 genes with few copy numbers. Phylogenetic analysis of RT suggests the monophyly of Utopia, and it likely dates back to the early evolution of eukaryotes.
Collapse
Affiliation(s)
- Kenji K. Kojima
- Genetic Information Research Institute, Los Altos, California, United States of America
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Minato-ku, Tokyo, Japan
- Institute of Medical Science, University of Tokyo, Minato-ku, Tokyo, Japan
- * E-mail:
| | - Jerzy Jurka
- Genetic Information Research Institute, Los Altos, California, United States of America
| |
Collapse
|
13
|
Piégu B, Bire S, Arensburger P, Bigot Y. A survey of transposable element classification systems--a call for a fundamental update to meet the challenge of their diversity and complexity. Mol Phylogenet Evol 2015; 86:90-109. [PMID: 25797922 DOI: 10.1016/j.ympev.2015.03.009] [Citation(s) in RCA: 90] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Revised: 03/11/2015] [Accepted: 03/12/2015] [Indexed: 10/25/2022]
Abstract
The increase of publicly available sequencing data has allowed for rapid progress in our understanding of genome composition. As new information becomes available we should constantly be updating and reanalyzing existing and newly acquired data. In this report we focus on transposable elements (TEs) which make up a significant portion of nearly all sequenced genomes. Our ability to accurately identify and classify these sequences is critical to understanding their impact on host genomes. At the same time, as we demonstrate in this report, problems with existing classification schemes have led to significant misunderstandings of the evolution of both TE sequences and their host genomes. In a pioneering publication Finnegan (1989) proposed classifying all TE sequences into two classes based on transposition mechanisms and structural features: the retrotransposons (class I) and the DNA transposons (class II). We have retraced how ideas regarding TE classification and annotation in both prokaryotic and eukaryotic scientific communities have changed over time. This has led us to observe that: (1) a number of TEs have convergent structural features and/or transposition mechanisms that have led to misleading conclusions regarding their classification, (2) the evolution of TEs is similar to that of viruses by having several unrelated origins, (3) there might be at least 8 classes and 12 orders of TEs including 10 novel orders. In an effort to address these classification issues we propose: (1) the outline of a universal TE classification, (2) a set of methods and classification rules that could be used by all scientific communities involved in the study of TEs, and (3) a 5-year schedule for the establishment of an International Committee for Taxonomy of Transposable Elements (ICTTE).
Collapse
Affiliation(s)
- Benoît Piégu
- UMR INRA-CNRS 7247, PRC, Centre INRA de Nouzilly, 37380 Nouzilly, France
| | - Solenne Bire
- UMR INRA-CNRS 7247, PRC, Centre INRA de Nouzilly, 37380 Nouzilly, France; Institute of Biotechnology, University of Lausanne, Center for Biotechnology UNIL-EPFL, 1015 Lausanne, Switzerland
| | - Peter Arensburger
- UMR INRA-CNRS 7247, PRC, Centre INRA de Nouzilly, 37380 Nouzilly, France; Biological Sciences Department, California State Polytechnic University, Pomona, CA 91768, United States.
| | - Yves Bigot
- UMR INRA-CNRS 7247, PRC, Centre INRA de Nouzilly, 37380 Nouzilly, France.
| |
Collapse
|
14
|
Montiel EE, Cabrero J, Ruiz-Estévez M, Burke WD, Eickbush TH, Camacho JPM, López-León MD. Preferential occupancy of R2 retroelements on the B chromosomes of the grasshopper Eyprepocnemis plorans. PLoS One 2014; 9:e91820. [PMID: 24632855 PMCID: PMC3954772 DOI: 10.1371/journal.pone.0091820] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2014] [Accepted: 02/14/2014] [Indexed: 02/02/2023] Open
Abstract
R2 non-LTR retrotransposons exclusively insert into the 28S rRNA genes of their host, and are expressed by co-transcription with the rDNA unit. The grasshopper Eyprepocnemis plorans contains transcribed rDNA clusters on most of its A chromosomes, as well as non-transcribed rDNA clusters on the parasitic B chromosomes found in many populations. Here the structure of the E. plorans R2 element, its abundance relative to the number of rDNA units and its retrotransposition activity were determined. Animals screened from five populations contained on average over 12,000 rDNA units on their A chromosomes, but surprisingly only about 100 R2 elements. Monitoring the patterns of R2 insertions in individuals from these populations revealed only low levels of retrotransposition. The low rates of R2 insertion observed in E. plorans differ from the high levels of R2 insertion previously observed in insect species that have many fewer rDNA units. It is proposed that high levels of R2 are strongly selected against in E. plorans, because the rDNA transcription machinery in this species is unable to differentiate between R2-inserted and uninserted units. The B chromosomes of E. plorans contain an additional 7,000 to 15,000 rDNA units, but in contrast to the A chromosomes, from 150 to over 1,500 R2 elements. The higher concentration of R2 in the inactive B chromosomes rDNA clusters suggests these chromosomes can act as a sink for R2 insertions thus further reducing the level of insertions on the A chromosomes. These studies suggest an interesting evolutionary relationship between the parasitic B chromosomes and R2 elements.
Collapse
Affiliation(s)
- Eugenia E. Montiel
- Departamento de Genética, Facultad de Ciencias, Universidad de Granada, Granada, Spain
| | - Josefa Cabrero
- Departamento de Genética, Facultad de Ciencias, Universidad de Granada, Granada, Spain
| | - Mercedes Ruiz-Estévez
- Departamento de Genética, Facultad de Ciencias, Universidad de Granada, Granada, Spain
| | - William D. Burke
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Thomas H. Eickbush
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Juan Pedro M. Camacho
- Departamento de Genética, Facultad de Ciencias, Universidad de Granada, Granada, Spain
| | | |
Collapse
|
15
|
Acquisition of an Archaea-like ribonuclease H domain by plant L1 retrotransposons supports modular evolution. Proc Natl Acad Sci U S A 2013; 110:20140-5. [PMID: 24277848 DOI: 10.1073/pnas.1310958110] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Although a variety of non-LTR retrotransposons of the L1 superfamily have been found in plant genomes over recent decades, their diversity, distribution, and evolution have yet to be analyzed in depth. Here, we perform comprehensive comparative and evolutionary analyses of L1 retrotransposons from 29 genomes of land plants covering a wide range of taxa. We identify numerous L1 elements in these genomes and detect a striking diversity of their domain composition. We show that all known land plant L1 retrotransposons can be grouped into five major families based on their phylogenetic relationships and domain composition. Moreover, we trace the putative evolution timeline that created the current variants and reveal that evolutionary events included losses and acquisitions of diverse putative RNA-binding domains and the acquisition of an Archaea-like ribonuclease H (RNH) domain. We also show that the latter RNH domain is autonomously active in vitro and speculate that retrotransposons may play a role in the horizontal transfer of RNH between plants, Archaea, and bacteria. The acquisition of an Archaea-like RNH domain by plant L1 retrotransposons negates the hypothesis that RNH domains in non-LTR retrotransposons have a single origin and provides evidence that acquisition happened at least twice. Together, our data indicate that the evolution of the investigated retrotransposons can be mainly characterized by repeated events of domain rearrangements and identify modular evolution as a major trend in the evolution of plant L1 retrotransposons.
Collapse
|
16
|
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
Collapse
|
17
|
Mukha DV, Pasyukova EG, Kapelinskaya TV, Kagramanova AS. Endonuclease domain of the Drosophila melanogaster R2 non-LTR retrotransposon and related retroelements: a new model for transposition. Front Genet 2013; 4:63. [PMID: 23637706 PMCID: PMC3636483 DOI: 10.3389/fgene.2013.00063] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2012] [Accepted: 04/05/2013] [Indexed: 01/25/2023] Open
Abstract
The molecular mechanisms of the transposition of non-long terminal repeat (non-LTR) retrotransposons are not well understood; the key questions of how the 3′-ends of cDNA copies integrate and how site-specific integration occurs remain unresolved. Integration depends on properties of the endonuclease (EN) domain of retrotransposons. Using the EN domain of the Drosophila R2 retrotransposon as a model for other, closely related non-LTR retrotransposons, we investigated the EN domain and found that it resembles archaeal Holliday-junction resolvases. We suggest that these non-LTR retrotransposons are co-transcribed with the host transcript. Combined with the proposed resolvase activity of the EN domain, this model yields a novel mechanism for site-specific retrotransposition within this class of retrotransposons, with resolution proceeding via a Holliday junction intermediate.
Collapse
Affiliation(s)
- Dmitry V Mukha
- Vavilov Institute of General Genetics, Russian Academy of Sciences Moscow, Russia
| | | | | | | |
Collapse
|
18
|
Smyshlyaev GA, Blinov AG. Evolution and biodiversity of L1 retrotransposons in angiosperm genomes. ACTA ACUST UNITED AC 2012. [DOI: 10.1134/s2079059712010133] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
19
|
Kojima KK, Jurka J. Crypton transposons: identification of new diverse families and ancient domestication events. Mob DNA 2011; 2:12. [PMID: 22011512 PMCID: PMC3212892 DOI: 10.1186/1759-8753-2-12] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2011] [Accepted: 10/19/2011] [Indexed: 01/27/2023] Open
Abstract
Background "Domestication" of transposable elements (TEs) led to evolutionary breakthroughs such as the origin of telomerase and the vertebrate adaptive immune system. These breakthroughs were accomplished by the adaptation of molecular functions essential for TEs, such as reverse transcription, DNA cutting and ligation or DNA binding. Cryptons represent a unique class of DNA transposons using tyrosine recombinase (YR) to cut and rejoin the recombining DNA molecules. Cryptons were originally identified in fungi and later in the sea anemone, sea urchin and insects. Results Herein we report new Cryptons from animals, fungi, oomycetes and diatom, as well as widely conserved genes derived from ancient Crypton domestication events. Phylogenetic analysis based on the YR sequences supports four deep divisions of Crypton elements. We found that the domain of unknown function 3504 (DUF3504) in eukaryotes is derived from Crypton YR. DUF3504 is similar to YR but lacks most of the residues of the catalytic tetrad (R-H-R-Y). Genes containing the DUF3504 domain are potassium channel tetramerization domain containing 1 (KCTD1), KIAA1958, zinc finger MYM type 2 (ZMYM2), ZMYM3, ZMYM4, glutamine-rich protein 1 (QRICH1) and "without children" (WOC). The DUF3504 genes are highly conserved and are found in almost all jawed vertebrates. The sequence, domain structure, intron positions and synteny blocks support the view that ZMYM2, ZMYM3, ZMYM4, and possibly QRICH1, were derived from WOC through two rounds of genome duplication in early vertebrate evolution. WOC is observed widely among bilaterians. There could be four independent events of Crypton domestication, and one of them, generating WOC/ZMYM, predated the birth of bilaterian animals. This is the third-oldest domestication event known to date, following the domestication generating telomerase reverse transcriptase (TERT) and Prp8. Many Crypton-derived genes are transcriptional regulators with additional DNA-binding domains, and the acquisition of the DUF3504 domain could have added new regulatory pathways via protein-DNA or protein-protein interactions. Conclusions Cryptons have contributed to animal evolution through domestication of their YR sequences. The DUF3504 domains are domesticated YRs of animal Crypton elements.
Collapse
Affiliation(s)
- Kenji K Kojima
- Genetic Information Research Institute, 1925 Landings Drive, Mountain View, CA 94043, USA.
| | | |
Collapse
|
20
|
Unique functions of repetitive transcriptomes. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2010; 285:115-88. [PMID: 21035099 DOI: 10.1016/b978-0-12-381047-2.00003-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Repetitive sequences occupy a huge fraction of essentially every eukaryotic genome. Repetitive sequences cover more than 50% of mammalian genomic DNAs, whereas gene exons and protein-coding sequences occupy only ~3% and 1%, respectively. Numerous genomic repeats include genes themselves. They generally encode "selfish" proteins necessary for the proliferation of transposable elements (TEs) in the host genome. The major part of evolutionary "older" TEs accumulated mutations over time and fails to encode functional proteins. However, repeats have important functions also on the RNA level. Repetitive transcripts may serve as multifunctional RNAs by participating in the antisense regulation of gene activity and by competing with the host-encoded transcripts for cellular factors. In addition, genomic repeats include regulatory sequences like promoters, enhancers, splice sites, polyadenylation signals, and insulators, which actively reshape cellular transcriptomes. TE expression is tightly controlled by the host cells, and some mechanisms of this regulation were recently decoded. Finally, capacity of TEs to proliferate in the host genome led to the development of multiple biotechnological applications.
Collapse
|
21
|
Kapitonov VV, Tempel S, Jurka J. Simple and fast classification of non-LTR retrotransposons based on phylogeny of their RT domain protein sequences. Gene 2009; 448:207-13. [PMID: 19651192 DOI: 10.1016/j.gene.2009.07.019] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2009] [Revised: 07/19/2009] [Accepted: 07/22/2009] [Indexed: 11/29/2022]
Abstract
Rapidly growing number of sequenced genomes requires fast and accurate computational tools for analysis of different transposable elements (TEs). In this paper we focus on a rapid and reliable procedure for classification of autonomous non-LTR retrotransposons based on alignment and clustering of their reverse transcriptase (RT) domains. Typically, the RT domain protein sequences encoded by different non-LTR retrotransposons are similar to each other in terms of significant BLASTP E-values. Therefore, they can be easily detected by the routine BLASTP searches of genomic DNA sequences coding for proteins similar to the RT domains of known non-LTR retrotransposons. However, detailed classification of non-LTR retrotransposons, i.e. their assignment to specific clades, is a slow and complex procedure that is not formalized or integrated as a standard set of computational methods and data. Here we describe a tool (RTclass1) designed for the fast and accurate automated assignment of novel non-LTR retrotransposons to known or novel clades using phylogenetic analysis of the RT domain protein sequences. RTclass1 classifies a particular non-LTR retrotransposon based on its RT domain in less than 10 min on a standard desktop computer and achieves 99.5% accuracy. RT1class1 works either as a stand-alone program installed locally or as a web-server that can be accessed distantly by uploading sequence data through the internet (http://www.girinst.org/RTphylogeny/RTclass1).
Collapse
Affiliation(s)
- Vladimir V Kapitonov
- Genetic Information Research Institute, 1925 Landings Dr, Mountain View, CA 94041, USA.
| | | | | |
Collapse
|
22
|
Novikova OS, Blinov AG. Origin, evolution, and distribution of different groups of non-LTR retrotransposons among eukaryotes. RUSS J GENET+ 2009. [DOI: 10.1134/s102279540902001x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
23
|
Novikova O, Fet V, Blinov A. Non-LTR retrotransposons in fungi. Funct Integr Genomics 2008; 9:27-42. [DOI: 10.1007/s10142-008-0093-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2008] [Revised: 07/01/2008] [Accepted: 07/01/2008] [Indexed: 12/31/2022]
|
24
|
Novikova O, Śliwińska E, Fet V, Settele J, Blinov A, Woyciechowski M. CR1 clade of non-LTR retrotransposons from Maculinea butterflies (Lepidoptera: Lycaenidae): evidence for recent horizontal transmission. BMC Evol Biol 2007; 7:93. [PMID: 17588269 PMCID: PMC1925062 DOI: 10.1186/1471-2148-7-93] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2006] [Accepted: 06/25/2007] [Indexed: 01/28/2023] Open
Abstract
Background Non-long terminal repeat (non-LTR) retrotransposons are mobile genetic elements that propagate themselves by reverse transcription of an RNA intermediate. Non-LTR retrotransposons are known to evolve mainly via vertical transmission and random loss. Horizontal transmission is believed to be a very rare event in non-LTR retrotransposons. Our knowledge of distribution and diversity of insect non-LTR retrotransposons is limited to a few species – mainly model organisms such as dipteran genera Drosophila, Anopheles, and Aedes. However, diversity of non-LTR retroelements in arthropods seems to be much richer. The present study extends the analysis of non-LTR retroelements to CR1 clade from four butterfly species of genus Maculinea (Lepidoptera: Lycaenidae). The lycaenid genus Maculinea, the object of interest for evolutionary biologists and also a model group for European biodiversity studies, possesses a unique, specialized myrmecophilous lifestyle at larval stage. Their caterpillars, after three weeks of phytophagous life on specific food plants drop to the ground where they are adopted to the ant nest by Myrmica foraging workers. Results We found that the genome of Maculinea butterflies contains multiple CR1 lineages of non-LTR retrotransposons, including those from MacCR1A, MacCR1B and T1Q families. A comparative analysis of RT nucleotide sequences demonstrated an extremely high similarity among elements both in interspecific and intraspecific comparisons. CR1A-like elements were found only in family Lycaenidae. In contrast, MacCR1B lineage clones were extremely similar to CR1B non-LTR retrotransposons from Bombycidae moths: silkworm Bombyx mori and Oberthueria caeca. Conclusion The degree of coding sequence similarity of the studied elements, their discontinuous distribution, and results of divergence-versus-age analysis make it highly unlikely that these sequences diverged at the same time as their host taxa. The only reasonable alternative explanation is horizontal transfer. In addition, phylogenetic markers for population analysis of Maculinea could be developed based on the described non-LTR retrotransposons.
Collapse
Affiliation(s)
- Olga Novikova
- Laboratory of Molecular Evolution, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Ewa Śliwińska
- UJAG – Jagiellonian University, Institute of Environmental Sciences, Krakow, Poland
| | - Victor Fet
- Marshall University, Huntington, West Virginia, USA
| | - Josef Settele
- Department of Community Ecology, UFZ – Centre for Environmental Research Leipzig-Halle, Halle (Saale), Germany
| | - Alexander Blinov
- Laboratory of Molecular Evolution, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Michal Woyciechowski
- UJAG – Jagiellonian University, Institute of Environmental Sciences, Krakow, Poland
| |
Collapse
|
25
|
Maita N, Aoyagi H, Osanai M, Shirakawa M, Fujiwara H. Characterization of the sequence specificity of the R1Bm endonuclease domain by structural and biochemical studies. Nucleic Acids Res 2007; 35:3918-27. [PMID: 17537809 PMCID: PMC1919474 DOI: 10.1093/nar/gkm397] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2006] [Revised: 04/25/2007] [Accepted: 05/01/2007] [Indexed: 11/30/2022] Open
Abstract
R1Bm is a long interspersed element (LINE) inserted into a specific sequence within 28S rDNA of the silkworm genome. Of two open reading frames (ORFs) of R1Bm, ORF2 encodes a reverse transcriptase (RT) and an endonuclease (EN) domain which digests specifically both top and bottom strand of the target sequence in 28S rDNA. To elucidate the sequence specificity of EN domain of R1Bm (R1Bm EN), we examined the cleavage tendency for the target sequences, and found that 5'-A(G/C)(A/T)!(A/G)T-3' is the consensus sequence (! = cleavage site). We also determined the crystal structure of R1Bm EN at 2.0 A resolution. Its structure was basically similar to AP endonuclease family, but had a special beta-hairpin at the edge of the DNA binding surface, which is a common feature among EN of LINEs. Point-mutations on the DNA binding surface of R1Bm EN significantly decreased the cleavage activities, but did not affect the sequence recognition in most residues. However, two mutants Y98A and N180A had altered cleavage patterns, suggesting an important role of these residues (Y98 and N180) for the sequence recognition of R1Bm EN. In addition, Y98A mutant showed another cleavage pattern, that implies de novo design of novel sequence-specific EN.
Collapse
Affiliation(s)
- Nobuo Maita
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka 812-8582, Japan, Graduate School of Integrated Science, Yokohama City University, Yokohama 230-0045, Japan, Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Bioscience Building 501, Kashiwa, Chiba 277-8562, Japan, Graduate School of Engineering Kyoto University, Kyoto 615-8510, Japan and CREST, Japan Science and Technology Corporation, Kawaguchi, Saitama 332-0012, Japan
| | - Hideyuki Aoyagi
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka 812-8582, Japan, Graduate School of Integrated Science, Yokohama City University, Yokohama 230-0045, Japan, Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Bioscience Building 501, Kashiwa, Chiba 277-8562, Japan, Graduate School of Engineering Kyoto University, Kyoto 615-8510, Japan and CREST, Japan Science and Technology Corporation, Kawaguchi, Saitama 332-0012, Japan
| | - Mizuko Osanai
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka 812-8582, Japan, Graduate School of Integrated Science, Yokohama City University, Yokohama 230-0045, Japan, Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Bioscience Building 501, Kashiwa, Chiba 277-8562, Japan, Graduate School of Engineering Kyoto University, Kyoto 615-8510, Japan and CREST, Japan Science and Technology Corporation, Kawaguchi, Saitama 332-0012, Japan
| | - Masahiro Shirakawa
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka 812-8582, Japan, Graduate School of Integrated Science, Yokohama City University, Yokohama 230-0045, Japan, Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Bioscience Building 501, Kashiwa, Chiba 277-8562, Japan, Graduate School of Engineering Kyoto University, Kyoto 615-8510, Japan and CREST, Japan Science and Technology Corporation, Kawaguchi, Saitama 332-0012, Japan
| | - Haruhiko Fujiwara
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka 812-8582, Japan, Graduate School of Integrated Science, Yokohama City University, Yokohama 230-0045, Japan, Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Bioscience Building 501, Kashiwa, Chiba 277-8562, Japan, Graduate School of Engineering Kyoto University, Kyoto 615-8510, Japan and CREST, Japan Science and Technology Corporation, Kawaguchi, Saitama 332-0012, Japan
| |
Collapse
|
26
|
Kojima KK, Kuma KI, Toh H, Fujiwara H. Identification of rDNA-specific non-LTR retrotransposons in Cnidaria. Mol Biol Evol 2006; 23:1984-93. [PMID: 16870681 DOI: 10.1093/molbev/msl067] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Ribosomal RNA genes are abundant repetitive sequences in most eukaryotes. Ribosomal DNA (rDNA) contains many insertions derived from mobile elements including non-long terminal repeat (non-LTR) retrotransposons. R2 is the well-characterized 28S rDNA-specific non-LTR retrotransposon family that is distributed over at least 4 bilaterian phyla. R2 is a large family sharing the same insertion specificity and classified into 4 clades (R2-A, -B, -C, and -D) based on the N-terminal domain structure and the phylogeny. There is no observation of horizontal transfer of R2; therefore, the origin of R2 dates back to before the split between protostomes and deuterostomes. Here, we in silico identified 1 R2 element from the sea anemone Nematostella vectensis and 2 R2-like retrotransposons from the hydrozoan Hydra magnipapillata. R2 from N. vectensis was inserted into the 28S rDNA like other R2, but the R2-like elements from H. magnipapillata were inserted into the specific sequence in the highly conserved region of the 18S rDNA. We designated the Hydra R2-like elements R8. R8 is inserted at 37 bp upstream from R7, another 18S rDNA-specific retrotransposon family. There is no obvious sequence similarity between targets of R2 and R8, probably because they recognize long DNA sequences. Domain structure and phylogeny indicate that R2 from N. vectensis is the member of the R2-D clade, and R8 from H. magnipapillata belongs to the R2-A clade despite its different sequence specificity. These results suggest that R2 had been generated before the split between cnidarians and bilaterians and that R8 is a retrotransposon family that changed its target from the 28S rDNA to the 18S rDNA.
Collapse
Affiliation(s)
- Kenji K Kojima
- Institute for Chemical Research, Kyoto University, Uji, Japan.
| | | | | | | |
Collapse
|