1
Maiwald S, Mann L, Garcia S, Heitkam T. Evolving Together: Cassandra Retrotransposons Gradually Mirror Promoter Mutations of the 5S rRNA Genes. Mol Biol Evol 2024; 41:msae010. [PMID: 38262464 PMCID: PMC10853983 DOI: 10.1093/molbev/msae010] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 07/21/2023] [Revised: 10/26/2023] [Accepted: 12/11/2023] [Indexed: 01/25/2024] Open
Abstract
The 5S rRNA genes are among the most conserved nucleotide sequences across all species. Similar to the 5S preservation we observe the occurrence of 5S-related nonautonomous retrotransposons, so-called Cassandras. Cassandras harbor highly conserved 5S rDNA-related sequences within their long terminal repeats, advantageously providing them with the 5S internal promoter. However, the dynamics of Cassandra retrotransposon evolution in the context of 5S rRNA gene sequence information and structural arrangement are still unclear, especially: (1) do we observe repeated or gradual domestication of the highly conserved 5S promoter by Cassandras and (2) do changes in 5S organization such as in the linked 35S-5S rDNA arrangements impact Cassandra evolution? Here, we show evidence for gradual co-evolution of Cassandra sequences with their corresponding 5S rDNAs. To follow the impact of 5S rDNA variability on Cassandra TEs, we investigate the Asteraceae family where highly variable 5S rDNAs, including 5S promoter shifts and both linked and separated 35S-5S rDNA arrangements have been reported. Cassandras within the Asteraceae mirror 5S rDNA promoter mutations of their host genome, likely as an adaptation to the host's specific 5S transcription factors and hence compensating for evolutionary changes in the 5S rDNA sequence. Changes in the 5S rDNA sequence and in Cassandras seem uncorrelated with linked/separated rDNA arrangements. We place all these observations into the context of angiosperm 5S rDNA-Cassandra evolution, discuss Cassandra's origin hypotheses (single or multiple) and Cassandra's possible impact on rDNA and plant genome organization, giving new insights into the interplay of ribosomal genes and transposable elements.
Affiliation(s)
- Sophie Maiwald
- Faculty of Biology, Technische Universität Dresden, 01069 Dresden, Germany
- Ludwig Mann
- Faculty of Biology, Technische Universität Dresden, 01069 Dresden, Germany
- Sònia Garcia
- Institut Botànic de Barcelona, IBB (CSIC-MCNB), 08038 Barcelona, Catalonia, Spain
- Tony Heitkam
- Faculty of Biology, Technische Universität Dresden, 01069 Dresden, Germany
- Institute of Biology, NAWI Graz, Karl-Franzens-Universität, 8010 Graz, Austria
2
Warkocki Z. An update on post-transcriptional regulation of retrotransposons. FEBS Lett 2023; 597:380-406. [PMID: 36460901 DOI: 10.1002/1873-3468.14551] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 09/25/2022] [Revised: 11/09/2022] [Accepted: 11/18/2022] [Indexed: 12/04/2022]
Abstract
Retrotransposons, including LINE-1, Alu, SVA, and endogenous retroviruses, are one of the major constituents of human genomic repetitive sequences. Through the process of retrotransposition, some of them occasionally insert into new genomic locations by a copy-paste mechanism involving RNA intermediates. Irrespective of de novo genomic insertions, retrotransposon expression can lead to DNA double-strand breaks and stimulate cellular innate immunity through endogenous patterns. As a result, retrotransposons are tightly regulated by multi-layered regulatory processes to prevent the dangerous effects of their expression. In recent years, significant progress was made in revealing how retrotransposon biology intertwines with general post-transcriptional RNA metabolism. Here, I summarize current knowledge on the involvement of post-transcriptional factors in the biology of retrotransposons, focusing on LINE-1. I emphasize general RNA metabolisms such as methylation of adenine (m6A), RNA 3'-end polyadenylation and uridylation, RNA decay and translation regulation. I discuss the effects of retrotransposon RNP sequestration in cytoplasmic bodies and autophagy. Finally, I summarize how innate immunity restricts retrotransposons and how retrotransposons make use of cellular enzymes, including the DNA repair machinery, to complete their replication cycles.
Affiliation(s)
- Zbigniew Warkocki
- Department of RNA Metabolism, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
3
Ilina TV, Brosenitsch T, Sluis-Cremer N, Ishima R. Retroviral RNase H: Structure, mechanism, and inhibition. Enzymes 2021; 50:227-247. [PMID: 34861939 DOI: 10.1016/bs.enz.2021.07.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Indexed: 11/17/2022]
Abstract
All retroviruses encode the enzyme, reverse transcriptase (RT), which is involved in the conversion of the single-stranded viral RNA genome into double-stranded DNA. RT is a multifunctional enzyme and exhibits DNA polymerase and ribonuclease H (RNH) activities, both of which are essential to the reverse-transcription process. Despite the successful development of polymerase-targeting antiviral drugs over the last three decades, no bona fide inhibitor against the RNH activity of HIV-1 RT has progressed to clinical evaluation. In this review article, we describe the retroviral RNH function and inhibition, with primary consideration of the structural aspects of inhibition.
Affiliation(s)
- Tatiana V Ilina
- Department of Structural Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- Teresa Brosenitsch
- Department of Structural Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- Nicolas Sluis-Cremer
- Department of Medicine, Division of Infectious Diseases, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- Rieko Ishima
- Department of Structural Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States.
4
Fang C, Liu J, Zhang T, Su T, Li S, Cheng Q, Kong L, Li X, Bu T, Li H, Dong L, Lu S, Kong F, Liu B. A recent retrotransposon insertion of J caused E6 locus facilitating soybean adaptation into low latitude. J Integr Plant Biol 2021; 63:995-1003. [PMID: 33205888 DOI: 10.1111/jipb.13034] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Received: 09/06/2020] [Accepted: 11/09/2020] [Indexed: 06/11/2023]
Abstract
Soybean (Glycine max) is an important legume crop that was domesticated in temperate regions. Soybean varieties from these regions generally mature early and exhibit extremely low yield when grown under inductive short-day (SD) conditions at low latitudes. The long-juvenile (LJ) trait, which is characterized by delayed flowering and maturity, and improved yield under SD conditions, allowed the cultivation of soybean to expand to lower latitudes. Two major loci control the LJ trait: J and E6. In the current study, positional cloning, sequence analysis, and transgenic complementation confirmed that E6 is a novel allele of J, the ortholog of Arabidopsis thaliana EARLY FLOWERING 3 (ELF3). The mutant allele e6PG, which carries a Ty1/Copia-like retrotransposon insertion, does not suppress the legume-specific flowering repressor E1, allowing E1 to inhibit Flowering Locus T (FT) expression and thus delaying flowering and increasing yields under SD conditions. The e6PG allele is a rare allele that has not been incorporated into modern breeding programs. The dysfunction of J might have greatly facilitated the adaptation of soybean to low latitudes. Our findings increase our understanding of the molecular mechanisms underlying the LJ trait and provide valuable resources for soybean breeding.
Affiliation(s)
- Chao Fang
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- Jun Liu
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- Ting Zhang
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- Tong Su
- The Innovative Academy of Seed Design, Key Laboratory of Soybean Molecular Design Breeding, Northeast Institute of Geography and Agroecology, the Chinese Academy of Sciences, Harbin, 150000, China
- Shichen Li
- The Innovative Academy of Seed Design, Key Laboratory of Soybean Molecular Design Breeding, Northeast Institute of Geography and Agroecology, the Chinese Academy of Sciences, Harbin, 150000, China
- Qun Cheng
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- Lingping Kong
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- Xiaoming Li
- Key Laboratory of South China Agricultural Plant Molecular Analysis and Genetic Improvement, South China Botanical Garden, the Chinese Academy of Sciences, Guangzhou, 510650, China
- Tiantian Bu
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- Haiyang Li
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- National Key Laboratory of Crop Genetics and Germplasm Enhancement, Jiangsu Collaborative Innovation Center for Modern Crop Production, Nanjing Agricultural University, Nanjing, 210095, China
- Lidong Dong
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- Sijia Lu
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- Fanjiang Kong
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- The Innovative Academy of Seed Design, Key Laboratory of Soybean Molecular Design Breeding, Northeast Institute of Geography and Agroecology, the Chinese Academy of Sciences, Harbin, 150000, China
- Baohui Liu
- Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou, 510006, China
- The Innovative Academy of Seed Design, Key Laboratory of Soybean Molecular Design Breeding, Northeast Institute of Geography and Agroecology, the Chinese Academy of Sciences, Harbin, 150000, China
5
Nissan N, Cober ER, Sadowski M, Charette M, Golshani A, Samanfar B. Identifying new variation at the J locus, previously identified as e6, in long juvenile 'Paranagoiana' soybean. Theor Appl Genet 2021; 134:1007-1014. [PMID: 33386860 PMCID: PMC7973924 DOI: 10.1007/s00122-020-03746-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 06/24/2020] [Accepted: 12/05/2020] [Indexed: 05/05/2023]
Abstract
KEY MESSAGE A previously identified soybean maturity locus, E6, is discovered to be J, with the long juvenile allele in Paranagoiana now deemed j-x. Soybean grown at latitudes of ~20° or lower can produce lower grain yields due to the short days. This limitation can be overcome by using the long juvenile trait (LJ) which delays flowering under short day conditions. Two LJ loci have been mapped to the same location on Gm04, J and E6. The objective of this research was to investigate the e6 allele in 'Paranagoiana' and determine if E6 and J are the same locus or linked loci. KASP markers showed that e6 lines did not have the j-1 allele of LJ PI 159925. A population fixed for E1 but segregating for E6, with e6 introgressed from Paranagoiana, showed single gene control for flowering and maturity under short days. Sequencing Glyma.04G050200, the J gene, with long amplification Taq found that the e6 line 'Paranagoiana' contains a Ty1-copia retrotransposon of ~10,000 bp, inserted within exon 4. PCR amplification of the cDNA of Glyma.04G050200 also showed differences between the mRNA sequences (presence of insertion in j-x). Hence, we conclude that the loci E6 and J are one locus and deem this new variation found in Paranagoiana as j-x.
Affiliation(s)
- Nour Nissan
- Ottawa Research and Development Center, Agriculture and Agri-Food Canada, Ottawa, ON, Canada
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON, Canada
- Elroy R Cober
- Ottawa Research and Development Center, Agriculture and Agri-Food Canada, Ottawa, ON, Canada
- Michael Sadowski
- Ottawa Research and Development Center, Agriculture and Agri-Food Canada, Ottawa, ON, Canada
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON, Canada
- Martin Charette
- Ottawa Research and Development Center, Agriculture and Agri-Food Canada, Ottawa, ON, Canada
- Ashkan Golshani
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON, Canada
- Bahram Samanfar
- Ottawa Research and Development Center, Agriculture and Agri-Food Canada, Ottawa, ON, Canada.
- Department of Biology, Ottawa Institute of Systems Biology, Carleton University, Ottawa, ON, Canada.
6
Abstract
Transposable elements (TEs) are mobile DNA sequences that propagate within genomes. Through diverse invasion strategies, TEs have come to occupy a substantial fraction of nearly all eukaryotic genomes, and they represent a major source of genetic variation and novelty. Here we review the defining features of each major group of eukaryotic TEs and explore their evolutionary origins and relationships. We discuss how the unique biology of different TEs influences their propagation and distribution within and across genomes. Environmental and genetic factors acting at the level of the host species further modulate the activity, diversification, and fate of TEs, producing the dramatic variation in TE content observed across eukaryotes. We argue that cataloging TE diversity and dissecting the idiosyncratic behavior of individual elements are crucial to expanding our comprehension of their impact on the biology of genomes and the evolution of species.
Affiliation(s)
- Jonathan N Wells
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850
- Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850
7
Boman J, Frankl-Vilches C, da Silva Dos Santos M, de Oliveira EHC, Gahr M, Suh A. The Genome of Blue-Capped Cordon-Bleu Uncovers Hidden Diversity of LTR Retrotransposons in Zebra Finch. Genes (Basel) 2019; 10:E301. [PMID: 31013951 PMCID: PMC6523648 DOI: 10.3390/genes10040301] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Received: 03/06/2019] [Revised: 04/05/2019] [Accepted: 04/05/2019] [Indexed: 02/02/2023] Open
Abstract
Avian genomes have perplexed researchers by being conservative in both size and rearrangements, while simultaneously holding the blueprints for a massive species radiation during the last 65 million years (My). Transposable elements (TEs) in bird genomes are relatively scarce but have been implicated as important hotspots for chromosomal inversions. In zebra finch (Taeniopygia guttata), long terminal repeat (LTR) retrotransposons have proliferated and are positively associated with chromosomal breakpoint regions. Here, we present the genome, karyotype and transposons of blue-capped cordon-bleu (Uraeginthus cyanocephalus), an African songbird that diverged from zebra finch at the root of estrildid finches 10 million years ago (Mya). This constitutes the third linked-read sequenced genome assembly and fourth in-depth curated TE library of any bird. Exploration of TE diversity on this brief evolutionary timescale constitutes a considerable increase in resolution for avian TE biology and allowed us to uncover 4.5 Mb more LTR retrotransposons in the zebra finch genome. In blue-capped cordon-bleu, we likewise observed a recent LTR accumulation indicating that this is a shared feature of Estrildidae. Curiously, we discovered 25 new endogenous retrovirus-like LTR retrotransposon families of which at least 21 are present in zebra finch but were previously undiscovered. This highlights the importance of studying close relatives of model organisms.
Affiliation(s)
- Jesper Boman
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Science for Life Laboratory, Uppsala University, SE-752 36 Uppsala, Sweden.
- Carolina Frankl-Vilches
- Department of Behavioral Neurobiology, Max Planck Institute for Ornithology, 82319 Seewiesen, Germany.
- Michelly da Silva Dos Santos
- Laboratório de Cultura de Tecidos e Citogenética, SAMAM, Instituto Evandro Chagas, Ananindeua, Pará, and Faculdade de Ciências Naturais (ICEN), Universidade Federal do Pará, Belém 66075-110, Brazil.
- Edivaldo H C de Oliveira
- Laboratório de Cultura de Tecidos e Citogenética, SAMAM, Instituto Evandro Chagas, Ananindeua, Pará, and Faculdade de Ciências Naturais (ICEN), Universidade Federal do Pará, Belém 66075-110, Brazil.
- Manfred Gahr
- Department of Behavioral Neurobiology, Max Planck Institute for Ornithology, 82319 Seewiesen, Germany.
- Alexander Suh
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Science for Life Laboratory, Uppsala University, SE-752 36 Uppsala, Sweden.
8
Neumann P, Novák P, Hoštáková N, Macas J. Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification. Mob DNA 2019; 10:1. [PMID: 30622655 PMCID: PMC6317226 DOI: 10.1186/s13100-018-0144-1] [Citation(s) in RCA: 230] [Impact Index Per Article: 38.3] [Received: 10/25/2018] [Accepted: 12/20/2018] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Plant LTR-retrotransposons are classified into two superfamilies, Ty1/copia and Ty3/gypsy. They are further divided into an enormous number of families which are, due to the high diversity of their nucleotide sequences, usually specific to a single or a group of closely related species. Previous attempts to group these families into broader categories reflecting their phylogenetic relationships were limited either to analyzing a narrow range of plant species or to analyzing a small number of elements. Furthermore, there is no reference database that allows for similarity-based classification of LTR-retrotransposons. RESULTS We have assembled a database of retrotransposon-encoded polyprotein domain sequences extracted from 5410 Ty1/copia elements and 8453 Ty3/gypsy elements sampled from 80 species representing major groups of green plants (Viridiplantae). Phylogenetic analysis of the three most conserved polyprotein domains (RT, RH and INT) led to dividing Ty1/copia and Ty3/gypsy retrotransposons into 16 and 14 lineages respectively. We also characterized various features of LTR-retrotransposon sequences including additional polyprotein domains, extra open reading frames and primer binding sites, and found that the occurrence and/or type of these features correlates with phylogenies inferred from the three protein domains. CONCLUSIONS We have established an improved classification system applicable to LTR-retrotransposons from a wide range of plant species. This system reflects phylogenetic relationships as well as distinct sequence and structural features of the elements. A comprehensive database of retrotransposon protein domains (REXdb) that reflects this classification provides a reference for efficient and unified annotation of LTR-retrotransposons in plant genomes. Access to REXdb related tools is implemented in the RepeatExplorer web server (https://repeatexplorer-elixir.cerit-sc.cz/) or using a standalone version of REXdb that can be downloaded separately from the RepeatExplorer web page (http://repeatexplorer.org/).
Affiliation(s)
- Pavel Neumann
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
- Petr Novák
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
- Nina Hoštáková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
- Jiří Macas
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, 37005 České Budějovice, Czech Republic
9
Arkhipova IR. Using bioinformatic and phylogenetic approaches to classify transposable elements and understand their complex evolutionary histories. Mob DNA 2017; 8:19. [PMID: 29225705 PMCID: PMC5718144 DOI: 10.1186/s13100-017-0103-2] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Received: 10/12/2017] [Accepted: 11/28/2017] [Indexed: 12/11/2022] Open
Abstract
In recent years, much attention has been paid to comparative genomic studies of transposable elements (TEs) and the ensuing problems of their identification, classification, and annotation. Different approaches and diverse automated pipelines are being used to catalogue and categorize mobile genetic elements in the ever-increasing number of prokaryotic and eukaryotic genomes, with little or no connectivity between different domains of life. Here, an overview of the current picture of TE classification and evolutionary relationships is presented, updating the diversity of TE types uncovered in sequenced genomes. A tripartite TE classification scheme is proposed to account for their replicative, integrative, and structural components, and the need to expand in vitro and in vivo studies of their structural and biological properties is emphasized. Bioinformatic studies have now become front and center of novel TE discovery, and experimental pursuits of these discoveries hold great promise for both basic and applied science.
Affiliation(s)
- Irina R Arkhipova
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA 02543 USA
10
Ustyantsev K, Blinov A, Smyshlyaev G. Convergence of retrotransposons in oomycetes and plants. Mob DNA 2017; 8:4. [PMID: 28293305 PMCID: PMC5348765 DOI: 10.1186/s13100-017-0087-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Received: 12/19/2016] [Accepted: 03/07/2017] [Indexed: 12/30/2022] Open
Abstract
Background Retrotransposons comprise a ubiquitous and abundant class of eukaryotic transposable elements. All members of this class rely on reverse transcriptase activity to produce a DNA copy of the element from the RNA template. However, other activities of the retrotransposon-encoded polyprotein may differ between diverse retrotransposons. The polyprotein domains corresponding to each of these activities may have their own evolutionary history independent from that of the reverse transcriptase, thus underlying the modular view on the evolution of retrotransposons. Furthermore, some transposable elements can independently evolve similar domain architectures by acquiring functionally similar but phylogenetically distinct modules. This convergent evolution of retrotransposons may ultimately suggest similar regulatory pathways underlying the lifecycle of the elements. Results Here, we provide new examples of the convergent evolution of retrotransposons of species from two unrelated taxa: green plants and parasitic protozoan oomycetes. In the present study we first analyzed the available genomic sequences of oomycete species and characterized two groups of Ty3/Gypsy long terminal repeat retrotransposons, namely Chronos and Archon, and a subgroup of L1 non-long terminal repeat retrotransposons. The results demonstrated that the retroelements from these three groups each have independently acquired plant-related ribonuclease H domains. This process closely resembles the evolution of retrotransposons in the genomes of green plants. In addition, we showed that Chronos elements captured a chromodomain, mimicking the process of chromodomain acquisition by Chromoviruses, another group of Ty3/Gypsy retrotransposons of plants, fungi, and vertebrates. Conclusions Repeated and strikingly similar acquisitions of ribonuclease H domains and chromodomains by different retrotransposon groups from unrelated taxa indicate similar selection pressure acting on these elements. 
Thus, there are some major trends in the evolution of the structural composition of retrotransposons, and characterizing these trends may enhance the current understanding of the retrotransposon life cycle.
Affiliation(s)
- Kirill Ustyantsev
- Institute of Cytology and Genetics, Laboratory of Molecular Genetic Systems, Prospekt Lavrentyeva 10, 630090 Novosibirsk, Russia
- Alexandr Blinov
- Institute of Cytology and Genetics, Laboratory of Molecular Genetic Systems, Prospekt Lavrentyeva 10, 630090 Novosibirsk, Russia
- Georgy Smyshlyaev
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, 69117 Heidelberg, Germany
11
Spaller T, Kling E, Glöckner G, Hillmann F, Winckler T. Convergent evolution of tRNA gene targeting preferences in compact genomes. Mob DNA 2016; 7:17. [PMID: 27583033 PMCID: PMC5006619 DOI: 10.1186/s13100-016-0073-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Received: 06/23/2016] [Accepted: 08/20/2016] [Indexed: 11/30/2022] Open
Abstract
Background In gene-dense genomes, mobile elements are confronted with highly selective pressure to amplify without causing excessive damage to the host. The targeting of tRNA genes as potentially safe integration sites has been developed by retrotransposons in various organisms such as the social amoeba Dictyostelium discoideum and the yeast Saccharomyces cerevisiae. In D. discoideum, tRNA gene-targeting retrotransposons have expanded to approximately 3% of the genome. Recently obtained genome sequences of species representing the evolutionary history of social amoebae enabled us to determine whether the targeting of tRNA genes is a generally successful strategy for mobile elements to colonize compact genomes. Results During the evolution of dictyostelids, different retrotransposon types independently developed the targeting of tRNA genes at least six times. DGLT-A elements are long terminal repeat (LTR) retrotransposons that display integration preferences ~15 bp upstream of tRNA gene-coding regions reminiscent of the yeast Ty3 element. Skipper elements are chromoviruses that have developed two subgroups: one has canonical chromo domains that may favor integration in centromeric regions, whereas the other has diverged chromo domains and is found ~100 bp downstream of tRNA genes. The integration of D. discoideum non-LTR retrotransposons ~50 bp upstream (TRE5 elements) and ~100 bp downstream (TRE3 elements) of tRNA genes, respectively, likely emerged at the root of dictyostelid evolution. We identified two novel non-LTR retrotransposons unrelated to TREs: one with a TRE5-like integration behavior and the other with preference ~4 bp upstream of tRNA genes. Conclusions Dictyostelid retrotransposons demonstrate convergent evolution of tRNA gene targeting as a probable means to colonize the compact genomes of their hosts without being excessively mutagenic. However, high copy numbers of tRNA gene-associated retrotransposons, such as those observed in D. discoideum, are an exception, suggesting that the targeting of tRNA genes does not necessarily favor the amplification of position-specific integrating elements to high copy numbers under the repressive conditions that prevail in most host cells.
Affiliation(s)
- Thomas Spaller
- Institute of Pharmacy, Department of Pharmaceutical Biology, Friedrich Schiller University Jena, Semmelweisstraße 10, Jena, 07743 Germany
- Eva Kling
- Institute of Pharmacy, Department of Pharmaceutical Biology, Friedrich Schiller University Jena, Semmelweisstraße 10, Jena, 07743 Germany
- Gernot Glöckner
- Institute for Biochemistry I, Medical Faculty, University of Cologne, Berlin, Germany; Institute for Freshwater Ecology and Inland Fisheries, IGB, Berlin, Germany
- Falk Hillmann
- Junior Research Group Evolution of Microbial Interaction, Leibniz Institute for Natural Product Research and Infection Biology-Hans Knöll Institute, Jena, Germany
- Thomas Winckler
- Institute of Pharmacy, Department of Pharmaceutical Biology, Friedrich Schiller University Jena, Semmelweisstraße 10, Jena, 07743 Germany
12
Ilut DC, Lipka AE, Jeong N, Bae DN, Kim DH, Kim JH, Redekar N, Yang K, Park W, Kang ST, Kim N, Moon JK, Saghai Maroof MA, Gore MA, Jeong SC. Identification of haplotypes at the Rsv4 genomic region in soybean associated with durable resistance to soybean mosaic virus. Theor Appl Genet 2016; 129:453-68. [PMID: 26649868 DOI: 10.1007/s00122-015-2640-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Received: 08/24/2015] [Accepted: 11/16/2015] [Indexed: 05/29/2023]
Abstract
KEY MESSAGE Discovery of new germplasm sources and identification of haplotypes for the durable Soybean mosaic virus resistance gene, Rsv 4, provide novel resources for map-based cloning and genetic improvement efforts in soybean. The Soybean mosaic virus (SMV) resistance locus Rsv4 is of interest because it provides a durable type of resistance in soybean [Glycine max (L.) Merr.]. To better understand its molecular basis, we used a population of 309 BC3F2 individuals to fine-map Rsv4 to a ~120 kb interval and leveraged this genetic information in a second study to identify accessions 'Haman' and 'Ilpumgeomjeong' as new sources of Rsv4. These two accessions along with three other Rsv4 and 14 rsv4 accessions were used to examine the patterns of nucleotide diversity at the Rsv4 region based on high-depth resequencing data. Through a targeted association analysis of these 19 accessions within the ~120 kb interval, a cluster of four intergenic single-nucleotide polymorphisms (SNPs) was found to perfectly associate with SMV resistance. Interestingly, this ~120 kb interval did not contain any genes similar to previously characterized dominant disease resistance genes. Therefore, a haplotype analysis was used to further resolve the association signal to a ~94 kb region, which also resulted in the identification of at least two Rsv4 haplotypes. A haplotype phylogenetic analysis of this region suggests that the Rsv4 locus in G. max is recently introgressed from G. soja. This integrated study provides a strong foundation for efforts focused on the cloning of this durable virus resistance gene and marker-assisted selection of Rsv4-mediated SMV resistance in soybean breeding programs.
Affiliation(s)
- Daniel C Ilut
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Alexander E Lipka
- Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
| | - Namhee Jeong
- Bio-Evaluation Center, Korea Research Institute of Bioscience and Biotechnology, Ch'ongju, Chungbuk, 28116, Korea
| | - Dong Nyuk Bae
- Bio-Evaluation Center, Korea Research Institute of Bioscience and Biotechnology, Ch'ongju, Chungbuk, 28116, Korea
| | - Dong Hyun Kim
- Bio-Evaluation Center, Korea Research Institute of Bioscience and Biotechnology, Ch'ongju, Chungbuk, 28116, Korea
| | - Ji Hong Kim
- Bio-Evaluation Center, Korea Research Institute of Bioscience and Biotechnology, Ch'ongju, Chungbuk, 28116, Korea
| | - Neelam Redekar
- Department of Crop and Soil Environmental Sciences, Virginia Tech, Blacksburg, 24061, USA
| | - Kiwoung Yang
- Bio-Evaluation Center, Korea Research Institute of Bioscience and Biotechnology, Ch'ongju, Chungbuk, 28116, Korea
| | - Won Park
- Epigenomics Research Center, Genome Institute, Korea Research Institute of Bioscience and Biotechnology, Taejon, 34141, Korea
- Department of Functional Genomics, Korea University of Science and Technology, Taejon, 34141, Korea
| | - Sung-Taeg Kang
- Department of Crop Science and Biotechnology, Dankook University, Cheonan, Chungnam, 31116, Korea
| | - Namshin Kim
- Epigenomics Research Center, Genome Institute, Korea Research Institute of Bioscience and Biotechnology, Taejon, 34141, Korea
- Department of Functional Genomics, Korea University of Science and Technology, Taejon, 34141, Korea
| | - Jung-Kyung Moon
- National Institute of Crop Science, Rural Development Administration, Wanju, Jeonbuk, 55365, Korea
| | - M A Saghai Maroof
- Department of Crop and Soil Environmental Sciences, Virginia Tech, Blacksburg, 24061, USA
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA.
| | - Soon-Chun Jeong
- Bio-Evaluation Center, Korea Research Institute of Bioscience and Biotechnology, Ch'ongju, Chungbuk, 28116, Korea.
|
13
|
Virus world as an evolutionary network of viruses and capsidless selfish elements. Microbiol Mol Biol Rev 2015; 78:278-303. [PMID: 24847023 DOI: 10.1128/mmbr.00049-13] [Citation(s) in RCA: 160] [Impact Index Per Article: 16.0] [Indexed: 12/17/2022] Open
Abstract
Viruses were defined as one of the two principal types of organisms in the biosphere, namely, as capsid-encoding organisms in contrast to ribosome-encoding organisms, i.e., all cellular life forms. Structurally similar, apparently homologous capsids are present in a huge variety of icosahedral viruses that infect bacteria, archaea, and eukaryotes. These findings prompted the concept of the capsid as the virus "self" that defines the identity of deep, ancient viral lineages. However, several other widespread viral "hallmark genes" encode key components of the viral replication apparatus (such as polymerases and helicases) and combine with different capsid proteins, given the inherently modular character of viral evolution. Furthermore, diverse, widespread, capsidless selfish genetic elements, such as plasmids and various types of transposons, share hallmark genes with viruses. Viruses appear to have evolved from capsidless selfish elements, and vice versa, on multiple occasions during evolution. At the earliest, precellular stage of life's evolution, capsidless genetic parasites most likely emerged first and subsequently gave rise to different classes of viruses. In this review, we develop the concept of a greater virus world which forms an evolutionary network that is held together by shared conserved genes and includes both bona fide capsid-encoding viruses and different classes of capsidless replicons. Theoretical studies indicate that selfish replicons (genetic parasites) inevitably emerge in any sufficiently complex evolving ensemble of replicators. Therefore, the key signature of the greater virus world is not the presence of a capsid but rather genetic, informational parasitism itself, i.e., various degrees of reliance on the information processing systems of the host.
|
14
|
Koonin EV, Dolja VV, Krupovic M. Origins and evolution of viruses of eukaryotes: The ultimate modularity. Virology 2015; 479-480:2-25. [PMID: 25771806 PMCID: PMC5898234 DOI: 10.1016/j.virol.2015.02.039] [Citation(s) in RCA: 352] [Impact Index Per Article: 35.2] [Received: 01/27/2015] [Revised: 02/19/2015] [Accepted: 02/20/2015] [Indexed: 01/04/2023]
Abstract
Viruses and other selfish genetic elements are dominant entities in the biosphere, with respect to both physical abundance and genetic diversity. Various selfish elements parasitize on all cellular life forms. The relative abundances of different classes of viruses are dramatically different between prokaryotes and eukaryotes. In prokaryotes, the great majority of viruses possess double-stranded (ds) DNA genomes, with a substantial minority of single-stranded (ss) DNA viruses and only limited presence of RNA viruses. In contrast, in eukaryotes, RNA viruses account for the majority of the virome diversity although ssDNA and dsDNA viruses are common as well. Phylogenomic analysis yields tangible clues for the origins of major classes of eukaryotic viruses and in particular their likely roots in prokaryotes. Specifically, the ancestral genome of positive-strand RNA viruses of eukaryotes might have been assembled de novo from genes derived from prokaryotic retroelements and bacteria although a primordial origin of this class of viruses cannot be ruled out. Different groups of double-stranded RNA viruses derive either from dsRNA bacteriophages or from positive-strand RNA viruses. The eukaryotic ssDNA viruses apparently evolved via a fusion of genes from prokaryotic rolling circle-replicating plasmids and positive-strand RNA viruses. Different families of eukaryotic dsDNA viruses appear to have originated from specific groups of bacteriophages on at least two independent occasions. Polintons, the largest known eukaryotic transposons, predicted to also form virus particles, most likely, were the evolutionary intermediates between bacterial tectiviruses and several groups of eukaryotic dsDNA viruses including the proposed order "Megavirales" that unites diverse families of large and giant viruses. 
Strikingly, evolution of all classes of eukaryotic viruses appears to have involved fusion between structural and replicative gene modules derived from different sources along with additional acquisitions of diverse genes.
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | - Valerian V Dolja
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA.
| | - Mart Krupovic
- Institut Pasteur, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Paris 75015, France.
|
15
|
|
16
|
Ustyantsev K, Novikova O, Blinov A, Smyshlyaev G. Convergent evolution of ribonuclease H in LTR retrotransposons and retroviruses. Mol Biol Evol 2015; 32:1197-207. [PMID: 25605791 PMCID: PMC4408406 DOI: 10.1093/molbev/msv008] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Indexed: 11/13/2022] Open
Abstract
Ty3/Gypsy long terminal repeat (LTR) retrotransposons are structurally and phylogenetically close to retroviruses. Two notable structural differences between these groups of genetic elements are 1) the presence in retroviruses of an additional envelope gene, env, which mediates infection, and 2) a specific dual ribonuclease H (RNH) domain encoded by the retroviral pol gene. However, similar to retroviruses, many Ty3/Gypsy LTR retrotransposons harbor additional env-like genes, promoting concepts of the infective mode of these retrotransposons. Here, we provide a further line of evidence of similarity between retroviruses and some Ty3/Gypsy LTR retrotransposons. We identify that, together with their additional genes, plant Ty3/Gypsy LTR retrotransposons of the Tat group have a second RNH, as do retroviruses. Most importantly, we show that the resulting dual RNHs of Tat LTR retrotransposons and retroviruses emerged independently, providing strong evidence for their convergent evolution. The convergent resemblance of Tat LTR retrotransposons and retroviruses may indicate similar selection pressures acting on these diverse groups of elements and reveal potential evolutionary constraints on their structure. We speculate that dual RNH is required to accelerate retrotransposon evolution through increased rates of strand transfer events and subsequent recombination events.
Affiliation(s)
- Kirill Ustyantsev
- Laboratory of Molecular Genetic Systems, Institute of Cytology and Genetics, Novosibirsk, Russia
| | - Olga Novikova
- Department of Biological Sciences and RNA Institute, University at Albany
| | - Alexander Blinov
- Laboratory of Molecular Genetic Systems, Institute of Cytology and Genetics, Novosibirsk, Russia
| | - Georgy Smyshlyaev
- Laboratory of Molecular Genetic Systems, Institute of Cytology and Genetics, Novosibirsk, Russia
- Department of Natural Sciences, Novosibirsk State University, Novosibirsk, Russia
|
17
|
Abstract
Retroelements with long terminal repeats (LTRs) inhabit nearly all eukaryotic genomes. Over their rich evolutionary history they have developed highly diverse forms, ranging from ordinary retrotransposons to complex pathogenic retroviruses such as HIV-1. Errantiviruses are a group of insect endogenous LTR elements that share structural and functional features with vertebrate endogenous retroviruses. The errantiviruses illustrate one of the evolutionary strategies by which retrotransposons become infective, which, together with their similarities to vertebrate retroviruses, makes them an attractive object of research promising to shed more light on the evolution of retroviruses.
Affiliation(s)
- Yury Stefanov
- Engelhardt Institute of Molecular Biology; Russian Academy of Sciences; Moscow, Russia
|
18
|
Majorek KA, Dunin-Horkawicz S, Steczkiewicz K, Muszewska A, Nowotny M, Ginalski K, Bujnicki JM. The RNase H-like superfamily: new members, comparative structural analysis and evolutionary classification. Nucleic Acids Res 2014; 42:4160-79. [PMID: 24464998 PMCID: PMC3985635 DOI: 10.1093/nar/gkt1414] [Citation(s) in RCA: 107] [Impact Index Per Article: 9.7] [Received: 09/23/2013] [Revised: 12/12/2013] [Accepted: 12/26/2013] [Indexed: 11/13/2022] Open
Abstract
Ribonuclease H-like (RNHL) superfamily, also called the retroviral integrase superfamily, groups together numerous enzymes involved in nucleic acid metabolism and implicated in many biological processes, including replication, homologous recombination, DNA repair, transposition and RNA interference. The RNHL superfamily proteins show extensive divergence of sequences and structures. We conducted database searches to identify members of the RNHL superfamily (including those previously unknown), yielding >60 000 unique domain sequences. Our analysis led to the identification of new RNHL superfamily members, such as RRXRR (PF14239), DUF460 (PF04312, COG2433), DUF3010 (PF11215), DUF429 (PF04250 and COG2410, COG4328, COG4923), DUF1092 (PF06485), COG5558, OrfB_IS605 (PF01385, COG0675) and Peptidase_A17 (PF05380). Based on the clustering analysis we grouped all identified RNHL domain sequences into 152 families. Phylogenetic studies revealed relationships between these families, and suggested a possible history of the evolution of RNHL fold and its active site. Our results revealed clear division of the RNHL superfamily into exonucleases and endonucleases. Structural analyses of features characteristic for particular groups revealed a correlation between the orientation of the C-terminal helix with the exonuclease/endonuclease function and the architecture of the active site. Our analysis provides a comprehensive picture of sequence-structure-function relationships in the RNHL superfamily that may guide functional studies of the previously uncharacterized protein families.
Affiliation(s)
- Karolina A. Majorek, Stanislaw Dunin-Horkawicz, Kamil Steczkiewicz, Anna Muszewska, Marcin Nowotny, Krzysztof Ginalski, Janusz M. Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland, Department of Molecular Physiology and Biological Physics, University of Virginia, 1340 Jefferson Park Avenue, Charlottesville, VA USA-22908, USA, Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Laboratory of Bioinformatics and Systems Biology, Centre of New Technologies, University of Warsaw, Zwirki i Wigury 93, PL-02-089 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5A, PL-02-106 Warsaw, Poland and Laboratory of Protein Structure, International Institute of Molecular and Cell Biology, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
|
19
|
Acquisition of an Archaea-like ribonuclease H domain by plant L1 retrotransposons supports modular evolution. Proc Natl Acad Sci U S A 2013; 110:20140-5. [PMID: 24277848 DOI: 10.1073/pnas.1310958110] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Indexed: 11/18/2022] Open
Abstract
Although a variety of non-LTR retrotransposons of the L1 superfamily have been found in plant genomes over recent decades, their diversity, distribution, and evolution have yet to be analyzed in depth. Here, we perform comprehensive comparative and evolutionary analyses of L1 retrotransposons from 29 genomes of land plants covering a wide range of taxa. We identify numerous L1 elements in these genomes and detect a striking diversity of their domain composition. We show that all known land plant L1 retrotransposons can be grouped into five major families based on their phylogenetic relationships and domain composition. Moreover, we trace the putative evolution timeline that created the current variants and reveal that evolutionary events included losses and acquisitions of diverse putative RNA-binding domains and the acquisition of an Archaea-like ribonuclease H (RNH) domain. We also show that the latter RNH domain is autonomously active in vitro and speculate that retrotransposons may play a role in the horizontal transfer of RNH between plants, Archaea, and bacteria. The acquisition of an Archaea-like RNH domain by plant L1 retrotransposons negates the hypothesis that RNH domains in non-LTR retrotransposons have a single origin and provides evidence that acquisition happened at least twice. Together, our data indicate that the evolution of the investigated retrotransposons can be mainly characterized by repeated events of domain rearrangements and identify modular evolution as a major trend in the evolution of plant L1 retrotransposons.
|
20
|
Muszewska A, Steczkiewicz K, Ginalski K. DIRS and Ngaro Retrotransposons in Fungi. PLoS One 2013; 8:e76319. [PMID: 24086727 PMCID: PMC3783388 DOI: 10.1371/journal.pone.0076319] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Received: 04/30/2013] [Accepted: 08/23/2013] [Indexed: 11/19/2022] Open
Abstract
Retrotransposons with a tyrosine recombinase (YR) have been discovered recently and lack thorough annotation in fungi. YR retrotransposons are divided into 3 groups: DIRS, Ngaro and VIPER (known only from Kinetoplastida). We used comparative genomics to investigate the evolutionary patterns of retrotransposons in the fungal kingdom. The identification of both functional and remnant elements provides a unique view on both recent and past transposition activity. Our searches covering a wide range of fungal genomes allowed us to identify 2241 YR retrotransposons. Based on CLANS clustering of concatenated sequences of the reverse transcriptase (RT), RNase H (RH), DNA N-6-adenine-methyltransferase (MT) and YR protein domains we propose a revised classification of YR elements expanded by two new categories of Ngaro elements. A phylogenetic analysis of 477 representatives supports this observation and additionally demonstrates that DIRS and Ngaro abundance changed independently in Basidiomycota and Blastocladiomycota/Mucoromycotina/Kickxellomycotina. Interestingly, a single remnant Ngaro element could be identified in an Ascomycota genome. Our analysis also revealed that 3 Pucciniomycotina taxa, known for their overall mobile element abundance and big genome size, encode an elevated number of Ngaro retrotransposons. Considering the presence of DIRS elements in all analyzed Mucoromycotina, Kickxellomycotina and Blastocladiomycota genomes, one might assume a common origin of fungal DIRS retrotransposons with a loss in Dikarya. Ngaro elements, described to date from Opisthokonta, seem to have invaded the common ancestor of Agaricomycotina and Pucciniomycotina after Ustilagomycotina divergence. Yet most of the analyzed genomes are devoid of YR elements and most identified retrotransposons are incomplete.
Affiliation(s)
- Anna Muszewska
- Laboratory of Bioinformatics and Systems Biology, CeNT, University of Warsaw, Warsaw, Poland
| | - Kamil Steczkiewicz
- Laboratory of Bioinformatics and Systems Biology, CeNT, University of Warsaw, Warsaw, Poland
| | - Krzysztof Ginalski
- Laboratory of Bioinformatics and Systems Biology, CeNT, University of Warsaw, Warsaw, Poland
|
21
|
Novikov A, Smyshlyaev G, Novikova O. Evolutionary history of LTR retrotransposon chromodomains in plants. Int J Plant Genomics 2012; 2012:874743. [PMID: 22611377 PMCID: PMC3350952 DOI: 10.1155/2012/874743] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Received: 09/15/2011] [Revised: 01/27/2012] [Accepted: 02/12/2012] [Indexed: 05/25/2023]
Abstract
Chromodomain-containing LTR retrotransposons are one of the most successful groups of mobile elements in plant genomes. Previously, we demonstrated that two types of chromodomains (CHDs) are carried by plant LTR retrotransposons. Chromodomains from group I (CHD_I) were detected only in Tcn1-like LTR retrotransposons from nonseed plants such as mosses (including the model moss species Physcomitrella) and lycophytes (the Selaginella species). LTR retrotransposon chromodomains from group II (CHD_II) have been described from a wide range of higher plants. In the present study, we performed computer-based mining of plant LTR retrotransposon CHDs from diverse plants with an emphasis on spike-moss Selaginella. Our extended comparative and phylogenetic analysis demonstrated that two types of CHDs are present only in the Selaginella genome, which puts this species in a unique position among plants. It appears that a transition from CHD_I to CHD_II and further diversification occurred in the evolutionary history of plant LTR retrotransposons at approximately 400 MYA and most probably was associated with the evolution of chromatin organization.
Affiliation(s)
- Anton Novikov
- Laboratory of Molecular Genetic Systems, Institute of Cytology and Genetics, Novosibirsk, 630090, Russia
| | - Georgiy Smyshlyaev
- Department of Natural Sciences, Novosibirsk State University, Novosibirsk, 630090, Russia
| | - Olga Novikova
- Department of Plant Pathology, University of Kentucky, Lexington, KY 40546, USA
- Department of Biological Sciences, University at Albany, Life Sciences Building 2061, 1400 Washington Avenue, Albany, NY 12222, USA
|
22
|
Smyshlyaev GA, Blinov AG. Evolution and biodiversity of L1 retrotransposons in angiosperm genomes. ACTA ACUST UNITED AC 2012. [DOI: 10.1134/s2079059712010133] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 11/23/2022]
|
23
|
Muszewska A, Hoffman-Sommer M, Grynberg M. LTR retrotransposons in fungi. PLoS One 2011; 6:e29425. [PMID: 22242120 PMCID: PMC3248453 DOI: 10.1371/journal.pone.0029425] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.6] [Received: 03/18/2011] [Accepted: 11/28/2011] [Indexed: 01/17/2023] Open
Abstract
Transposable elements with long terminal direct repeats (LTR TEs) are one of the best studied groups of mobile elements. They are ubiquitous elements present in almost all eukaryotic genomes. Their number and state of conservation can be a highlight of genome dynamics. We searched all published fungal genomes for LTR-containing retrotransposons, including both complete, functional elements and remnant copies. We identified a total of over 66,000 elements, all of which belong to the Ty1/Copia or Ty3/Gypsy superfamilies. Most of the detected Gypsy elements represent Chromoviridae, i.e. they carry a chromodomain in the pol ORF. We analyzed our data from a genome-ecology perspective, looking at the abundance of various types of LTR TEs in individual genomes and at the highest-copy element from each genome. The TE content is very variable among the analyzed genomes. Some genomes are very scarce in LTR TEs (<50 elements), others demonstrate huge expansions (>8000 elements). The data shows that transposon expansions in fungi usually involve an increase both in the copy number of individual elements and in the number of element types. The majority of the highest-copy TEs from all genomes are Ty3/Gypsy transposons. Phylogenetic analysis of these elements suggests that TE expansions have appeared independently of each other, in distant genomes and at different taxonomical levels. We also analyzed the evolutionary relationships between protein domains encoded by the transposon pol ORF and we found that the protease is the fastest evolving domain whereas reverse transcriptase and RNase H evolve much slower and in correlation with each other.
Affiliation(s)
- Anna Muszewska
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Warsaw, Poland.
|
24
|
Thomas MC, Macias F, Alonso C, López MC. The biology and evolution of transposable elements in parasites. Trends Parasitol 2010; 26:350-62. [PMID: 20444649 DOI: 10.1016/j.pt.2010.04.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Received: 07/09/2009] [Revised: 03/30/2010] [Accepted: 04/01/2010] [Indexed: 12/19/2022]
Abstract
Transposable elements (TEs) are dynamic elements that can reshape host genomes by generating rearrangements with the potential to create or disrupt genes, to shuffle existing genes, and to modulate their patterns of expression. In the genomes of parasites that infect mammals several TEs have been identified that probably have been maintained throughout evolution due to their contribution to gene function and regulation of gene expression. This review addresses how TEs are organized, how they colonize the genomes of mammalian parasites, the functional role these elements play in parasite biology, and the interactions between these elements and the parasite genome.
Affiliation(s)
- M Carmen Thomas
- Departamento de Biología Molecular, Instituto de Parasitología y Biomedicina López Neyra - CSIC, Parque Tecnológico de Ciencias de la Salud, 18100 Granada, Spain
|
25
|
Novikova O, Fet V, Blinov A. Non-LTR retrotransposons in fungi. Funct Integr Genomics 2008; 9:27-42. [DOI: 10.1007/s10142-008-0093-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Received: 02/29/2008] [Revised: 07/01/2008] [Accepted: 07/01/2008] [Indexed: 12/31/2022]
|
26
|
Eickbush TH, Jamburuthugoda VK. The diversity of retrotransposons and the properties of their reverse transcriptases. Virus Res 2008; 134:221-34. [PMID: 18261821 PMCID: PMC2695964 DOI: 10.1016/j.virusres.2007.12.010] [Citation(s) in RCA: 174] [Impact Index Per Article: 10.2] [Received: 10/31/2007] [Revised: 12/14/2007] [Accepted: 12/14/2007] [Indexed: 11/30/2022]
Abstract
A number of abundant mobile genetic elements called retrotransposons reverse transcribe RNA to generate DNA for insertion into eukaryotic genomes. Four major classes of retrotransposons are described here. First, the long-terminal-repeat (LTR) retrotransposons have similar structures and mechanisms to those of the vertebrate retroviruses. Genes that may enable these retrotransposons to leave a cell have been acquired by these elements in a number of animal and plant lineages. Second, the tyrosine recombinase retrotransposons are similar to the LTR retrotransposons except that they have substituted a recombinase for the integrase and recombine into the host chromosomes. Third, the non-LTR retrotransposons use a cleaved chromosomal target site generated by an encoded endonuclease to prime reverse transcription. Finally, the Penelope-like retrotransposons are not well understood but appear to also use cleaved DNA or the ends of chromosomes as primer for reverse transcription. Described in the second part of this review are the enzymatic properties of the reverse transcriptases (RTs) encoded by retrotransposons. The RTs of the LTR retrotransposons are highly divergent in sequence but have similar enzymatic activities to those of retroviruses. The RTs of the non-LTR retrotransposons have several unique properties reflecting their adaptation to a different mechanism of retrotransposition.
Affiliation(s)
- Thomas H Eickbush
- Department of Biology, University of Rochester, Rochester, NY 14627, USA.
|
27
|
Bousalem M, Douzery EJP, Seal SE. Taxonomy, molecular phylogeny and evolution of plant reverse transcribing viruses (family Caulimoviridae) inferred from full-length genome and reverse transcriptase sequences. Arch Virol 2008; 153:1085-102. [PMID: 18483693 DOI: 10.1007/s00705-008-0095-9] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Received: 12/06/2007] [Accepted: 03/20/2008] [Indexed: 11/26/2022]
Abstract
This study constitutes the first evaluation and application of quantitative taxonomy to the family Caulimoviridae and the first in-depth phylogenetic study of the family Caulimoviridae that integrates the common origin between LTR retrotransposons and caulimoviruses. The phylogenetic trees and PASC analyses derived from the full genome and from the corresponding partial RT concurred, providing strong support for the current genus classification based mainly on genome organisation and use of partial RT sequence as a molecular marker. The PASC distributions obtained are multimodal, making it possible to distinguish between genus, species and strain. The taxonomy of badnaviruses infecting banana (Musa spp.) was clarified, and the consequence of endogenous badnaviruses on the genetic diversity and evolution of caulimoviruses is discussed. The use of LTR retrotransposons as outgroups reveals a structured bipolar topology separating the genus Badnavirus from the other genera. Badnaviruses appear to be the most recent genus, with the genus Tungrovirus in an intermediary position. This structuring intersects the one established by genomic and biological properties and allows us to make a correlation between phylogeny and biogeography. The variability shown between members of the family Caulimoviridae is in a similar range to that reported within other DNA and RNA plant virus families.
Affiliation(s)
- M Bousalem
- INRA Antilles-Guyane, UR979 Productions Végétales, Domaine Duclos, 97170 Petit-Bourg, Guadeloupe.
|
28
|
A Copia-like Retrotransposon Gene Encoding Gypsy-like Integrase in a Red Alga, Porphyra yezoensis. J Mol Evol 2007; 66:72-9. [DOI: 10.1007/s00239-007-9057-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 10/01/2007] [Accepted: 11/07/2007] [Indexed: 11/26/2022]
|
29
|
Transcriptome analysis of the venom gland of the Mexican scorpion Hadrurus gertschi (Arachnida: Scorpiones). BMC Genomics 2007; 8:119. [PMID: 17506894 PMCID: PMC1904202 DOI: 10.1186/1471-2164-8-119] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.0] [Received: 03/17/2007] [Accepted: 05/16/2007] [Indexed: 11/19/2022] Open
Abstract
Background: Scorpions, like other venomous animals, possess a highly specialized organ that produces, secretes and disposes the venom components. In these animals, the last postabdominal segment, named the telson, contains a pair of venomous glands connected to the stinger. The isolation of numerous scorpion toxins, along with cDNA-based gene cloning and, more recently, proteomic analyses, has provided us with a large collection of venom component sequences. However, all of them are secreted, or at least are predicted to be secretable, gene products. Therefore, very little is known about the cellular processes that normally take place inside the glands for production of the venom mixture. To gain insights into scorpion venom gland biology, we decided to perform a transcriptomic analysis by constructing a cDNA library and conducting a random sequencing screening of the transcripts. Results: From the cDNA library prepared from a single venom gland of the scorpion Hadrurus gertschi, 160 expressed sequence tags (ESTs) were analyzed. These transcripts were further clustered into 68 unique sequences (20 contigs and 48 singlets), with an average length of 919 bp. Half of the ESTs can be confidently assigned as homologues of annotated gene products. Annotation of these ESTs, with the aid of Gene Ontology terms and homology to eukaryotic orthologous groups, reveals some cellular processes important for venom gland function, including high protein synthesis and tuned posttranslational processing and trafficking. Nonetheless, the main group of the identified gene products includes ESTs similar to known scorpion toxins or other previously characterized scorpion venom components, which account for nearly 60% of the identified proteins. Conclusion: To the best of our knowledge, this report contains the first transcriptome analysis of genes transcribed by the venomous gland of a scorpion. The data were obtained for the species Hadrurus gertschi, belonging to the family Caraboctonidae. One hundred and sixty ESTs were analyzed, showing enrichment in genes that encode products similar to known venom components; the analysis also provides the first sketch of cellular components, molecular functions, biological processes and some unique sequences of the scorpion venom gland.
|