1
|
Vertacnik KL, Herrig DK, Godfrey RK, Hill T, Geib SM, Unckless RL, Nelson DR, Linnen CR. Evolution of five environmentally responsive gene families in a pine-feeding sawfly, Neodiprion lecontei (Hymenoptera: Diprionidae). Ecol Evol 2023; 13:e10506. [PMID: 37791292 PMCID: PMC10542623 DOI: 10.1002/ece3.10506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 07/17/2023] [Accepted: 07/21/2023] [Indexed: 10/05/2023] Open
Abstract
A central goal in evolutionary biology is to determine the predictability of adaptive genetic changes. Despite many documented cases of convergent evolution at individual loci, little is known about the repeatability of gene family expansions and contractions. To address this void, we examined gene family evolution in the redheaded pine sawfly Neodiprion lecontei, a noneusocial hymenopteran and exemplar of a pine-specialized lineage evolved from angiosperm-feeding ancestors. After assembling and annotating a draft genome, we manually annotated multiple gene families with chemosensory, detoxification, or immunity functions before characterizing their genomic distributions and molecular evolution. We find evidence of recent expansions of bitter gustatory receptor, clan 3 cytochrome P450, olfactory receptor, and antimicrobial peptide subfamilies, with strong evidence of positive selection among paralogs in a clade of gustatory receptors possibly involved in the detection of bitter compounds. In contrast, these gene families had little evidence of recent contraction via pseudogenization. Overall, our results are consistent with the hypothesis that in response to novel selection pressures, gene families that mediate ecological interactions may expand and contract predictably. Testing this hypothesis will require the comparative analysis of high-quality annotation data from phylogenetically and ecologically diverse insect species and functionally diverse gene families. To this end, increasing sampling in under-sampled hymenopteran lineages and environmentally responsive gene families and standardizing manual annotation methods should be prioritized.
Collapse
Affiliation(s)
- Kim L. Vertacnik
- Department of EntomologyUniversity of KentuckyLexingtonKentuckyUSA
| | | | - R. Keating Godfrey
- McGuire Center for Lepidoptera and Biodiversity, University of FloridaGainesvilleFloridaUSA
| | - Tom Hill
- National Institute of Allergy and Infectious DiseasesBethesdaMarylandUSA
| | - Scott M. Geib
- Tropical Crop and Commodity Protection Research UnitUnited States Department of Agriculture: Agriculture Research Service Pacific Basin Agricultural Research CenterHiloHawaiiUSA
| | - Robert L. Unckless
- Department of Molecular BiosciencesUniversity of KansasLawrenceKansasUSA
| | - David R. Nelson
- Department of Microbiology, Immunology and BiochemistryUniversity of Tennessee Health Science CenterMemphisTennesseeUSA
| | | |
Collapse
|
2
|
Díaz-González J, Domínguez A. Different structural variants of roo retrotransposon are active in Drosophila melanogaster. Gene 2020; 741:144546. [PMID: 32165306 DOI: 10.1016/j.gene.2020.144546] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Revised: 01/30/2020] [Accepted: 03/08/2020] [Indexed: 11/29/2022]
Abstract
Retrotransposon roo is one of the most active elements in Drosophila melanogaster. The level of nucleotide diversity between copies of roo is very low but structural variation in the 5'-UTR is considerable. Transposition of roo at high frequency (around 5 × 10-2 per generation) has been shown previously in the set of mutation accumulation lines named Oviedo. Here we isolated thirteen individual insertions by inverse PCR and sequenced the 5' end of the elements (between 1663 and 2039 nt) including the LTR, the 5'-UTR and a fragment of 661 nucleotides from the ORF, to study whether the new transposed copies come from a unique variant (the master copy model) or different elements are able to move (the transposon model). The elements in the Oviedo lines presented the same structural variants as the reference genome. Different structural variants were active, a behaviour compatible with the "transposon model" in which the copies localized in multiple sites in the genome are able to transpose. At the level of sequence, the copies of roo in our lines are highly similar to the elements in the reference genome. The phylogenetic tree shows a shallow diversification with unsupported nodes denoting that all the elements currently active are very young. This observation together with the great polymorphism in insertion sites implies a rapid turnover of the elements.
Collapse
Affiliation(s)
- J Díaz-González
- Departamento de Biología Funcional, Área de Genética. Universidad de Oviedo, 33071 Oviedo, Spain
| | - A Domínguez
- Departamento de Biología Funcional, Área de Genética. Universidad de Oviedo, 33071 Oviedo, Spain.
| |
Collapse
|
3
|
Distinguishing friends, foes, and freeloaders in giant genomes. Curr Opin Genet Dev 2018; 49:49-55. [DOI: 10.1016/j.gde.2018.02.013] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Revised: 02/23/2018] [Accepted: 02/26/2018] [Indexed: 12/11/2022]
|
4
|
Xiong TL, Xiao JH, Li YX, Bian SN, Huang DW. Diversity and evolution of Ty1-copia retroelements within Chalcidoidea by reverse transcriptase domain analysis. INSECT MOLECULAR BIOLOGY 2015; 24:503-516. [PMID: 26079156 DOI: 10.1111/imb.12167] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Ty1-copia retrotransposons are widespread and diverse in insects. Some features of their hosts, such as mating and genetic systems, are predicted to influence the spread of selfish genetic elements like Ty1-copia. Using part of the reverse transcriptase gene as a reference, we experimentally surveyed Ty1-copia elements in eight species of fig wasps (Hymenoptera: Chalcidoidea), and performed an in silico analysis of six available genomes of chalcid wasps. Contrary to initial expectations that selfish elements such as Ty1-copia would be purged from the genomes of these species because of inbreeding and haplodiploidy, almost all of these wasps harbour an abundance of diverse Ty1-copia elements. Phylogenetic analyses suggest that the families of Ty1-copia elements found in these species have had a long association with their chalcid hosts. These results suggest an evolutionary scenario in which there was ancestral polymorphism followed by some taxa-specific events including stochastic loss and further diversification. Furthermore, estimating natural selection within the internal and terminal portions of the Ty1-copia phylogenies demonstrated that the elements are under strong evolutionary constraints for their long-term survival, but evolve like pseudogenes in the short term, accompanied by the rise and fall of parasitic elements in the history of wasp lineage.
Collapse
Affiliation(s)
- T-L Xiong
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100039, China
| | - J-H Xiao
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Y-X Li
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
- College of Plant Protection, Shandong Agricultural University, Tai'an, 271018, China
| | - S-N Bian
- College of Plant Protection, Shandong Agricultural University, Tai'an, 271018, China
| | - D-W Huang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
- College of Plant Protection, Shandong Agricultural University, Tai'an, 271018, China
| |
Collapse
|
5
|
Chalup L, Grabiele M, Neffa VS, Seijo G. DNA content in South American endemic species of Lathyrus. JOURNAL OF PLANT RESEARCH 2014; 127:469-480. [PMID: 24840864 DOI: 10.1007/s10265-014-0637-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2013] [Accepted: 03/24/2014] [Indexed: 06/03/2023]
Abstract
The genome size was surveyed in 13 Notolathyrus species endemic to South America by flow cytometry and analyzed in an evolutionary and biogeographic context. A DNA content variation of 1.7-fold was registered, and four groups of species with different DNA content were determined. Although, the 2C values were correlated with the total chromosome length and intrachromosomal asymmetry index (A1), the karyotype formula remained almost constant. The conservation of the karyotype formula is in agreement with proportional changes of DNA in the chromosome arms. Species with annual life cycle and shorter generation time had the lowest DNA content and the data suggest that changes in DNA content involved reductions of genome size in the perennial to annual transitions. The variation of 2C values was correlated with precipitation of the coldest quarter and, to some extent, with altitude. Additional correlations with other variables were observed when the species were analyzed separately according to the biogeographic regions. In general, the species with higher DNA content were found in more stable environments. The bulk of evidence suggests that changes on genome size would have been one of the most important mechanisms that drove or accompanied the diversification of Notolathyrus species.
Collapse
Affiliation(s)
- Laura Chalup
- Instituto de Botánica del Nordeste (UNNE, Facultad de Ciencias Agrarias -CONICET), Casilla de Correo 209, 3400, Corrientes, Argentina,
| | | | | | | |
Collapse
|
6
|
Eisman RC, Kaufman TC. Probing the boundaries of orthology: the unanticipated rapid evolution of Drosophila centrosomin. Genetics 2013; 194:903-26. [PMID: 23749319 PMCID: PMC3730919 DOI: 10.1534/genetics.113.152546] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2013] [Accepted: 05/28/2013] [Indexed: 11/18/2022] Open
Abstract
The rapid evolution of essential developmental genes and their protein products is both intriguing and problematic. The rapid evolution of gene products with simple protein folds and a lack of well-characterized functional domains typically result in a low discovery rate of orthologous genes. Additionally, in the absence of orthologs it is difficult to study the processes and mechanisms underlying rapid evolution. In this study, we have investigated the rapid evolution of centrosomin (cnn), an essential gene encoding centrosomal protein isoforms required during syncytial development in Drosophila melanogaster. Until recently the rapid divergence of cnn made identification of orthologs difficult and questionable because Cnn violates many of the assumptions underlying models for protein evolution. To overcome these limitations, we have identified a group of insect orthologs and present conserved features likely to be required for the functions attributed to cnn in D. melanogaster. We also show that the rapid divergence of Cnn isoforms is apparently due to frequent coding sequence indels and an accelerated rate of intronic additions and eliminations. These changes appear to be buffered by multi-exon and multi-reading frame maximum potential ORFs, simple protein folds, and the splicing machinery. These buffering features also occur in other genes in Drosophila and may help prevent potentially deleterious mutations due to indels in genes with large coding exons and exon-dense regions separated by small introns. This work promises to be useful for future investigations of cnn and potentially other rapidly evolving genes and proteins.
Collapse
Affiliation(s)
- Robert C. Eisman
- Department of Biology, Indiana University, Bloomington, Indiana 47405
| | - Thomas C. Kaufman
- Department of Biology, Indiana University, Bloomington, Indiana 47405
| |
Collapse
|
7
|
Novel genes from formation to function. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:821645. [PMID: 22811949 PMCID: PMC3395120 DOI: 10.1155/2012/821645] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2012] [Accepted: 04/26/2012] [Indexed: 11/29/2022]
Abstract
The study of the evolution of novel genes generally focuses on the formation of new coding sequences. However, equally important in the evolution of novel functional genes are the formation of regulatory regions that allow the expression of the genes and the effects of the new genes in the organism as well. Herein, we discuss the current knowledge on the evolution of novel functional genes, and we examine in more detail the youngest genes discovered. We examine the existing data on a very recent and rapidly evolving cluster of duplicated genes, the Sdic gene cluster. This cluster of genes is an excellent model for the evolution of novel genes, as it is very recent and may still be in the process of evolving.
Collapse
|
8
|
Fernández-Medina RD, Ribeiro JMC, Carareto CMA, Velasque L, Struchiner CJ. Losing identity: structural diversity of transposable elements belonging to different classes in the genome of Anopheles gambiae. BMC Genomics 2012; 13:272. [PMID: 22726298 PMCID: PMC3442997 DOI: 10.1186/1471-2164-13-272] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2012] [Accepted: 06/08/2012] [Indexed: 01/10/2023] Open
Abstract
Background Transposable elements (TEs), both DNA transposons and retrotransposons, are genetic elements with the main characteristic of being able to mobilize and amplify their own representation within genomes, utilizing different mechanisms of transposition. An almost universal feature of TEs in eukaryotic genomes is their inability to transpose by themselves, mainly as the result of sequence degeneration (by either mutations or deletions). Most of the elements are thus either inactive or non-autonomous. Considering that the bulk of some eukaryotic genomes derive from TEs, they have been conceived as “TE graveyards.” It has been shown that once an element has been inactivated, it progressively accumulates mutations and deletions at neutral rates until completely losing its identity or being lost from the host genome; however, it has also been shown that these “neutral sequences” might serve as raw material for domestication by host genomes. Results We have analyzed the sequence structural variations, nucleotide divergence, and pattern of insertions and deletions of several superfamilies of TEs belonging to both class I (long terminal repeats [LTRs] and non-LTRs [NLTRs]) and II in the genome of Anopheles gambiae, aiming at describing the landscape of deterioration of these elements in this particular genome. Our results describe a great diversity in patterns of deterioration, indicating lineage-specific differences including the presence of Solo-LTRs in the LTR lineage, 5′-deleted NLTRs, and several non-autonomous and MITEs in the class II families. Interestingly, we found fragments of NLTRs corresponding to the RT domain, which preserves high identity among them, suggesting a possible remaining genomic role for these domains. Conclusions We show here that the TEs in the An. gambiae genome deteriorate in different ways according to the class to which they belong. This diversity certainly has implications not only at the host genomic level but also at the amplification dynamic and evolution of the TE families themselves.
Collapse
Affiliation(s)
- Rita D Fernández-Medina
- Escola Nacional de Saúde Pública Sergio Arouca, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil.
| | | | | | | | | |
Collapse
|
9
|
Blass E, Bell M, Boissinot S. Accumulation and rapid decay of non-LTR retrotransposons in the genome of the three-spine stickleback. Genome Biol Evol 2012; 4:687-702. [PMID: 22534163 PMCID: PMC3381678 DOI: 10.1093/gbe/evs044] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The diversity and abundance of non–long terminal repeat (LTR) retrotransposons (nLTR-RT) differ drastically among vertebrate genomes. At one extreme, the genome of placental mammals is littered with hundreds of thousands of copies resulting from the activity of a single clade of nLTR-RT, the L1 clade. In contrast, fish genomes contain a much more diverse repertoire of nLTR-RT, represented by numerous active clades and families. Yet, the number of nLTR-RT copies in teleostean fish is two orders of magnitude smaller than in mammals. The vast majority of insertions appear to be very recent, suggesting that nLTR-RT do not accumulate in fish genomes. This pattern had previously been explained by a high rate of turnover, in which the insertion of new elements is offset by the selective loss of deleterious inserts. The turnover model was proposed because of the similarity between fish and Drosophila genomes with regard to their nLTR-RT profile. However, it is unclear if this model applies to fish. In fact, a previous study performed on the puffer fish suggested that transposable element insertions behave as neutral alleles. Here we examined the dynamics of amplification of nLTR-RT in the three-spine stickleback (Gasterosteus aculeatus). In this species, the vast majority of nLTR-RT insertions are relatively young, as suggested by their low level of divergence. Contrary to expectations, a majority of these insertions are fixed in lake and oceanic populations; thus, nLTR-RT do indeed accumulate in the genome of their fish host. This is not to say that nLTR-RTs are fully neutral, as the lack of fixed long elements in this genome suggests a deleterious effect related to their length. This analysis does not support the turnover model and strongly suggests that a much higher rate of DNA loss in fish than in mammals is responsible for the relatively small number of nLTR-RT copies and for the scarcity of ancient elements in fish genomes. We further demonstrate that nLTR-RT decay in fish occurs mostly through large deletions and not by the accumulation of small deletions.
Collapse
Affiliation(s)
- Eryn Blass
- Department of Biology, Queens College, City University of New York, Flushing, NY, USA
| | | | | |
Collapse
|
10
|
Klimopoulos A, Sellis D, Almirantis Y. Widespread occurrence of power-law distributions in inter-repeat distances shaped by genome dynamics. Gene 2012; 499:88-98. [PMID: 22370293 DOI: 10.1016/j.gene.2012.02.005] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2011] [Revised: 02/05/2012] [Accepted: 02/06/2012] [Indexed: 11/25/2022]
Abstract
Repetitive DNA sequences derived from transposable elements (TE) are distributed in a non-random way, co-clustering with other classes of repeat elements, genes and other genomic components. In a previous work we reported power-law-like size distributions (linearity in log-log scale) in the spatial arrangement of Alu and LINE1 elements in the human genome. Here we investigate the large-scale features of the spatial arrangement of all principal classes of TEs in 14 genomes from phylogenetically distant organisms by studying the size distribution of inter-repeat distances. Power-law-like size distributions are found to be widespread, extending up to several orders of magnitude. In order to understand the emergence of this distributional pattern, we introduce an evolutionary scenario, which includes (i) Insertions of DNA segments (e.g., more recent repeats) into the considered sequence and (ii) Eliminations of members of the studied TE family. In the proposed model we also incorporate the potential for transposition events (characteristic of the DNA transposons' life-cycle) and segmental duplications. Simulations reproduce the main features of the observed size distributions. Furthermore, we investigate the effects of various genomic features on the presence and extent of power-law size distributions including TE class and age, mode of parental TE transmission, GC content, deletion and recombination rates in the studied genomic region, etc. Our observations corroborate the hypothesis that insertions of genomic material and eliminations of repeats are at the basis of power-laws in inter-repeat distances. The existence of these power-laws could facilitate the formation of the recently proposed "fractal globule" for the confined chromatin organization.
Collapse
Affiliation(s)
- Alexandros Klimopoulos
- National Center for Scientific Research "Demokritos," Institute of Biology, 153 10 Athens, Greece.
| | | | | |
Collapse
|
11
|
Granzotto A, Lopes FR, Vieira C, Carareto CMA. Vertical inheritance and bursts of transposition have shaped the evolution of the BS non-LTR retrotransposon in Drosophila. Mol Genet Genomics 2011; 286:57-66. [DOI: 10.1007/s00438-011-0629-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2011] [Accepted: 05/10/2011] [Indexed: 01/13/2023]
|
12
|
Muñoz-López M, García-Pérez JL. DNA transposons: nature and applications in genomics. Curr Genomics 2010; 11:115-28. [PMID: 20885819 PMCID: PMC2874221 DOI: 10.2174/138920210790886871] [Citation(s) in RCA: 282] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2009] [Revised: 11/18/2009] [Accepted: 12/01/2009] [Indexed: 12/19/2022] Open
Abstract
Repeated DNA makes up a large fraction of a typical mammalian genome, and some repetitive elements are able to move within the genome (transposons and retrotransposons). DNA transposons move from one genomic location to another by a cut-and-paste mechanism. They are powerful forces of genetic change and have played a significant role in the evolution of many genomes. As genetic tools, DNA transposons can be used to introduce a piece of foreign DNA into a genome. Indeed, they have been used for transgenesis and insertional mutagenesis in different organisms, since these elements are not generally dependent on host factors to mediate their mobility. Thus, DNA transposons are useful tools to analyze the regulatory genome, study embryonic development, identify genes and pathways implicated in disease or pathogenesis of pathogens, and even contribute to gene therapy. In this review, we will describe the nature of these elements and discuss recent advances in this field of research, as well as our evolving knowledge of the DNA transposons most widely used in these studies.
Collapse
Affiliation(s)
- Martín Muñoz-López
- Andalusian Stem Cell Bank, Center for Biomedical Research, University of Granada, Avda. del Conocimiento s/n, Armilla, 18100, Granada, Spain
| | - José L. García-Pérez
- Andalusian Stem Cell Bank, Center for Biomedical Research, University of Granada, Avda. del Conocimiento s/n, Armilla, 18100, Granada, Spain
| |
Collapse
|
13
|
Carr M, Nelson M, Leadbeater BSC, Baldauf SL. Three families of LTR retrotransposons are present in the genome of the choanoflagellate Monosiga brevicollis. Protist 2008; 159:579-90. [PMID: 18621583 DOI: 10.1016/j.protis.2008.05.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2008] [Accepted: 05/01/2008] [Indexed: 11/29/2022]
Abstract
The choanoflagellates are a ubiquitous group of nanoflagellates and the sister group of Metazoa. Examination of the initial draft version of the first choanoflagellate genome, that of Monosiga brevicollis, reveals the presence of three novel families of long terminal repeat (LTR) retrotransposons and an apparent absence of non-LTR retrotransposons and transposons. One of the newly discovered LTR families falls in the chromovirus clade of the Ty3/gypsy group while the other two families are closely related members of the Ty1/copia group. Examination of EST sequences and nucleotide analyses show that all three families are transcriptionally active and potentially functional within the genome of M. brevicollis.
Collapse
Affiliation(s)
- Martin Carr
- Department of Biology, University of York, Heslington, York YO10 5YW, UK
| | | | | | | |
Collapse
|
14
|
Lanier W, Moustafa A, Bhattacharya D, Comeron JM. EST analysis of Ostreococcus lucimarinus, the most compact eukaryotic genome, shows an excess of introns in highly expressed genes. PLoS One 2008; 3:e2171. [PMID: 18478122 PMCID: PMC2367439 DOI: 10.1371/journal.pone.0002171] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2008] [Accepted: 03/25/2008] [Indexed: 11/19/2022] Open
Abstract
Background The genome of the pico-eukaryotic (bacterial-sized) prasinophyte green alga Ostreococcus lucimarinus has one of the highest gene densities known in eukaryotes, yet it contains many introns. Phylogenetic studies suggest this unusually compact genome (13.2 Mb) is an evolutionarily derived state among prasinophytes. The presence of introns in the highly reduced O. lucimarinus genome appears to be in opposition to simple explanations of genome evolution based on unidirectional tendencies, either neutral or selective. Therefore, patterns of intron retention in this species can potentially provide insights into the forces governing intron evolution. Methodology/Principal Findings Here we studied intron features and levels of expression in O. lucimarinus using expressed sequence tags (ESTs) to annotate the current genome assembly. ESTs were assembled into unigene clusters that were mapped back to the O. lucimarinus Build 2.0 assembly using BLAST and the level of gene expression was inferred from the number of ESTs in each cluster. We find a positive correlation between expression levels and both intron number (R = +0.0893, p = <0.0005) and intron density (number of introns/kb of CDS; R = +0.0753, p = <0.005). Conclusions/Significance In a species with a genome that has been recently subjected to a great reduction of non-coding DNA, these results imply the existence of selective/functional roles for introns that are principally detectable in highly expressed genes. In these cases, introns are likely maintained by balancing the selective forces favoring their maintenance with other mutational and/or selective forces acting on genome size.
Collapse
Affiliation(s)
- William Lanier
- Interdisciplinary Program in Genetics, University of Iowa, Iowa, United States of America
| | - Ahmed Moustafa
- Interdisciplinary Program in Genetics, University of Iowa, Iowa, United States of America
| | - Debashish Bhattacharya
- Interdisciplinary Program in Genetics, University of Iowa, Iowa, United States of America
- Department of Biological Sciences and Roy J. Carver Center for Comparative Genomics, University of Iowa, Iowa, United States of America
| | - Josep M. Comeron
- Interdisciplinary Program in Genetics, University of Iowa, Iowa, United States of America
- Department of Biological Sciences and Roy J. Carver Center for Comparative Genomics, University of Iowa, Iowa, United States of America
- * E-mail:
| |
Collapse
|
15
|
Johnson LJ. The Genome Strikes Back: The Evolutionary Importance of Defence Against Mobile Elements. Evol Biol 2007. [DOI: 10.1007/s11692-007-9012-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
|
16
|
Carr M. Multiple subfamilies of mariner transposable elements are present in stalk-eyed flies (Diptera: Diopsidae). Genetica 2007; 132:113-22. [PMID: 17562187 DOI: 10.1007/s10709-007-9157-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2006] [Accepted: 04/30/2007] [Indexed: 10/23/2022]
Abstract
The Diopsid stalk-eyed flies are an increasingly well-studied group. Presented here is evidence of the first known transposable elements discovered in these flies. The vertumnana mariner subfamily was identified in the Diopsini tribe, but could not be amplified in species of the Sphyracephalini tribe. PCR screening with degenerate primers revealed that multiple mariner subfamilies are present within the Diopsidae. Most of the sequenced elements appear to be pseudogenes; however two subfamilies are shown to be evolving under purifying selection, raising the possibility that mariner is active in some Diopsid species. Evidence is presented of a possible horizontal transfer event involving an unknown Teleopsis species and the Tephritid fly Bactrocera neohumeralis.
Collapse
Affiliation(s)
- Martin Carr
- Department of Biology, University of York, Heslington, York, YO10 5YW, UK.
| |
Collapse
|
17
|
Choudhary M, Zanhua X, Fu YX, Kaplan S. Genome analyses of three strains of Rhodobacter sphaeroides: evidence of rapid evolution of chromosome II. J Bacteriol 2006; 189:1914-21. [PMID: 17172323 PMCID: PMC1855717 DOI: 10.1128/jb.01498-06] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Three strains of Rhodobacter sphaeroides of diverse origin have been under investigation in our laboratory for their genome complexities, including the presence of multiple chromosomes and the distribution of essential genes within their genomes. The genome of R. sphaeroides 2.4.1 has been completely sequenced and fully annotated, and now two additional strains (ATCC 17019 and ATCC 17025) of R. sphaeroides have been sequenced. Thus, genome comparisons have become a useful approach in determining the evolutionary relationships among different strains of R. sphaeroides. In this study, the concatenated chromosomal sequences from the three strains of R. sphaeroides were aligned, using Mauve, to examine the extent of shared DNA regions and the degree of relatedness among their chromosome-specific DNA sequences. In addition, the exact intra- and interchromosomal DNA duplications were analyzed using Mummer. Genome analyses employing these two independent approaches revealed that strain ATCC 17025 diverged considerably from the other two strains, 2.4.1 and ATCC 17029, and that the two latter strains are more closely related to one another. Results further demonstrated that chromosome II (CII)-specific DNA sequences of R. sphaeroides have rapidly evolved, while CI-specific DNA sequences have remained highly conserved. Aside from the size variation of CII of R. sphaeroides, variation in sequence lengths of the CII-shared DNA regions and their high sequence divergence among strains of R. sphaeroides suggest the involvement of CII in the evolution of strain-specific genomic rearrangements, perhaps requiring strains to adapt in specialized niches.
Collapse
Affiliation(s)
- M Choudhary
- Department of Microbiology and Molecular Genetics, The University of Texas Medical School, Houston, Texas 77030, USA
| | | | | | | |
Collapse
|
18
|
Hawkins JS, Kim H, Nason JD, Wing RA, Wendel JF. Differential lineage-specific amplification of transposable elements is responsible for genome size variation in Gossypium. Genes Dev 2006; 16:1252-61. [PMID: 16954538 PMCID: PMC1581434 DOI: 10.1101/gr.5282906] [Citation(s) in RCA: 289] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2006] [Accepted: 05/22/2006] [Indexed: 11/25/2022]
Abstract
The DNA content of eukaryotic nuclei (C-value) varies approximately 200,000-fold, but there is only a approximately 20-fold variation in the number of protein-coding genes. Hence, most C-value variation is ascribed to the repetitive fraction, although little is known about the evolutionary dynamics of the specific components that lead to genome size variation. To understand the modes and mechanisms that underlie variation in genome composition, we generated sequence data from whole genome shotgun (WGS) libraries for three representative diploid (n = 13) members of Gossypium that vary in genome size from 880 to 2460 Mb (1C) and from a phylogenetic outgroup, Gossypioides kirkii, with an estimated genome size of 588 Mb. Copy number estimates including all dispersed repetitive sequences indicate that 40%-65% of each genome is composed of transposable elements. Inspection of individual sequence types revealed differential, lineage-specific expansion of various families of transposable elements among the different plant lineages. Copia-like retrotransposable element sequences have differentially accumulated in the Gossypium species with the smallest genome, G. raimondii, while gypsy-like sequences have proliferated in the lineages with larger genomes. Phylogenetic analyses demonstrated a pattern of lineage-specific amplification of particular subfamilies of retrotransposons within each species studied. One particular group of gypsy-like retrotransposon sequences, Gorge3 (Gossypium retrotransposable gypsy-like element), appears to have undergone a massive proliferation in two plant lineages, accounting for a major fraction of genome-size change. Like maize, Gossypium has undergone a threefold increase in genome size due to the accumulation of LTR retrotransposons over the 5-10 Myr since its origin.
Collapse
Affiliation(s)
- Jennifer S. Hawkins
- Iowa State University, Department of Ecology, Evolution and Organismal Biology, Ames, Iowa 50011, USA
| | - HyeRan Kim
- University of Arizona, Department of Plant Sciences, Arizona Genomics Institute, Tucson, Arizona 85721, USA
| | - John D. Nason
- Iowa State University, Department of Ecology, Evolution and Organismal Biology, Ames, Iowa 50011, USA
| | - Rod A. Wing
- University of Arizona, Department of Plant Sciences, Arizona Genomics Institute, Tucson, Arizona 85721, USA
| | - Jonathan F. Wendel
- Iowa State University, Department of Ecology, Evolution and Organismal Biology, Ames, Iowa 50011, USA
| |
Collapse
|
19
|
Ponce R, Hartl DL. The evolution of the novel Sdic gene cluster in Drosophila melanogaster. Gene 2006; 376:174-83. [PMID: 16765537 DOI: 10.1016/j.gene.2006.02.011] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2005] [Revised: 02/14/2006] [Accepted: 02/16/2006] [Indexed: 11/17/2022]
Abstract
The origin of new genes and of new functions for existing genes are fundamental processes in molecular evolution. Sdic is a newly evolved gene that arose recently in the D. melanogaster lineage. The gene encodes a novel sperm motility protein. It is a chimeric gene formed by duplication of two other genes followed by multiple deletions and other sequence rearrangements. The Sdic gene exists in several copies in the X chromosome, and is presumed to have undergone several duplications to form a tandemly arrayed gene cluster. Given the very recent origin of the gene and the gene cluster, the analysis of the composition of this gene cluster represents an excellent opportunity to study the origin and evolution of new gene functions and the fate of gene duplications. We have analyzed the nucleotide sequence of this region and reconstructed the evolutionary history of this gene cluster. We found that the cluster is composed by four tandem copies of Sdic; these duplicates are very similar but can be distinguished by the unique pattern of insertions, deletions, and point mutations in each copy. The oldest gene copy in the array has a 3' exon that has undergone accelerated diversification, and also shows divergent regulatory sequences. Moreover, there is evidence that this might be the only gene copy in the tandem array that is transcribed at a significant level, expressing a novel sperm-specific protein. There is also a retrotransposon located at the 3' end of each Sdic gene copy. We argue that this gene cluster was formed in the last two million years by at least three tandem duplications and one retrotransposition event.
Collapse
MESH Headings
- 3' Untranslated Regions
- Amino Acid Motifs
- Amino Acid Sequence
- Animals
- Axonemal Dyneins
- Base Sequence
- DNA, Intergenic/chemistry
- Drosophila Proteins/chemistry
- Drosophila Proteins/genetics
- Drosophila melanogaster/genetics
- Dyneins/chemistry
- Dyneins/genetics
- Evolution, Molecular
- Exons
- Gene Deletion
- Gene Dosage
- Gene Duplication
- Gene Rearrangement
- Genes, Insect
- Genes, X-Linked
- Genetic Variation
- Molecular Sequence Data
- Multigene Family
- Mutagenesis, Insertional
- Phylogeny
- Point Mutation
- Promoter Regions, Genetic
- Protein Structure, Tertiary
- Regulatory Sequences, Nucleic Acid
- Retroelements
- Sequence Homology, Amino Acid
- Transcription, Genetic
Collapse
Affiliation(s)
- Rita Ponce
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
| | | |
Collapse
|
20
|
Docking TR, Saadé FE, Elliott MC, Schoen DJ. Retrotransposon Sequence Variation in Four Asexual Plant Species. J Mol Evol 2006; 62:375-87. [PMID: 16547645 DOI: 10.1007/s00239-004-0350-y] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2004] [Accepted: 12/05/2005] [Indexed: 11/30/2022]
Abstract
Transposable elements (TEs) can be viewed as genetic parasites that persist in populations due to their capacity for increase in copy number and the inefficacy of selection against them. A corollary of this hypothesis is that TEs are more likely to spread within sexual populations and be eliminated or inactivated within asexual populations. While previous work with animals has shown that asexual taxa may contain less TE diversity than sexual taxa, comparable work with plants has been lacking. Here we report the results of a study of Ty1/copia, Ty3/gypsy, and LINE-like retroelement diversity in four asexual plant species. Retroelement-like sequences, with a high degree of conservation both within and between species, were isolated from all four species. The sequences correspond to several previously annotated retroelement subfamilies. They also exhibit a pattern of nucleotide substitution characterized by an excess of synonymous substitutions, suggestive of a history of purifying selection. These findings were compared with retroelement sequence evolution in sexual plant taxa. One likely explanation for the discovery of conserved TE sequences in the genomes of these asexual taxa is simply that asexuality within these taxa evolved relatively recently, such that the loss and breakdown of TEs is not yet detectable through analysis of sequence diversity. This explanation is examined by conducting stochastic simulation of TE evolution and by using published information to infer rough estimates of the ages of asexual taxa.
Collapse
Affiliation(s)
- T Roderick Docking
- Department of Biology, McGill University, 1205 Avenue Docteur Penfield, Montréal, H3A 1B1, Québec, Canada
| | | | | | | |
Collapse
|
21
|
Nelson CE, Hersh BM, Carroll SB. The regulatory content of intergenic DNA shapes genome architecture. Genome Biol 2004; 5:R25. [PMID: 15059258 PMCID: PMC395784 DOI: 10.1186/gb-2004-5-4-r25] [Citation(s) in RCA: 98] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2003] [Revised: 01/09/2004] [Accepted: 02/08/2004] [Indexed: 11/21/2022] Open
Abstract
The relationship between regulatory complexity and gene spacing was examined in Caenorhabditis elegans and Drosophila melanogaster. Intergenic distance, and hence genome architecture, is shaped by regulatory information contained in noncoding DNA. Background Factors affecting the organization and spacing of functionally unrelated genes in metazoan genomes are not well understood. Because of the vast size of a typical metazoan genome compared to known regulatory and protein-coding regions, functional DNA is generally considered to have a negligible impact on gene spacing and genome organization. In particular, it has been impossible to estimate the global impact, if any, of regulatory elements on genome architecture. Results To investigate this, we examined the relationship between regulatory complexity and gene spacing in Caenorhabditis elegans and Drosophila melanogaster. We found that gene density directly reflects local regulatory complexity, such that the amount of noncoding DNA between a gene and its nearest neighbors correlates positively with that gene's regulatory complexity. Genes with complex functions are flanked by significantly more noncoding DNA than genes with simple or housekeeping functions. Genes of low regulatory complexity are associated with approximately the same amount of noncoding DNA in D. melanogaster and C. elegans, while loci of high regulatory complexity are significantly larger in the more complex animal. Complex genes in C. elegans have larger 5' than 3' noncoding intervals, whereas those in D. melanogaster have roughly equivalent 5' and 3' noncoding intervals. Conclusions Intergenic distance, and hence genome architecture, is highly nonrandom. Rather, it is shaped by regulatory information contained in noncoding DNA. Our findings suggest that in compact genomes, the species-specific loss of nonfunctional DNA reveals a landscape of regulatory information by leaving a profile of functional DNA in its wake.
Collapse
Affiliation(s)
- Craig E Nelson
- Howard Hughes Medical Institute, University of Wisconsin-Madison, 1525 Linden Drive, Madison, WI 53703, USA.
| | | | | |
Collapse
|
22
|
Abstract
Numerous theories have been proposed to account for the pronounced differences in the quantity of non-coding DNA among eukaryotic genomes, but the current repertoire remains incomplete because the only explicit mechanisms it provides involve DNA gain. It has been proposed more recently that biases in spontaneous insertions and deletions (indels) can lead to genome shrinkage by mutational mechanisms alone. The present article provides the first detailed critical discussion of this approach, and covers three different ideas related to it: (1) the general notion of DNA loss by deletion bias, (2) the "DNA loss hypothesis" which supposes that variation in genome size can be attributed to differences in DNA loss rate, and (3) the "mutational equilibrium model" which attempts to describe the long-term evolution of genome size. The mutational equilibrium model is found to be problematic, and it is noted that DNA loss by small indels is too slow in real time to determine variation in genome size above a relatively low threshold. Some alternative explanations for the observed patterns are provided, and the critique also identifies some potential problems with the current dataset. These include a failure to cite a more detailed (and somewhat contradictory) mammalian dataset, a questionable use of arithmetic means with highly skewed data, and important discrepancies among the particular DNA sequences so far analyzed. Overall, evolutionary reductions in genome size are considered important, but the specific mechanism relating to small deletion bias is far too weak to be accepted as a primary determinant of genome size variation in general.
Collapse
Affiliation(s)
- T Ryan Gregory
- Division of Invertebrate Zoology, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA.
| |
Collapse
|
23
|
Lerat E, Rizzon C, Biémont C. Sequence divergence within transposable element families in the Drosophila melanogaster genome. Genome Res 2003; 13:1889-96. [PMID: 12869581 PMCID: PMC403780 DOI: 10.1101/gr.827603] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The availability of the sequenced Drosophila melanogaster genome provides an opportunity to study sequence variation between copies within transposable element families. In this study,we analyzed the 624 copies of 22 transposable element (TE) families (14 LTR retrotransposons, five non-LTR retrotransposons, and three transposons). LTR and non-LTR retrotransposons possessed far fewer divergent elements than the transposons,suggesting that the difference depends on the transposition mechanism. However,there was not a continuous range of divergence of the copies in each class,which were either very similar to the canonical elements,or very divergent from them. This sequence homogeneity among TE family copies matches the theoretical models of the dynamics of these repeated sequences. The sequenced Drosophila genome thus appears to be composed of a mixture of TEs that are still active and of ancient relics that have degenerated and the distribution of which along the chromosomes results from natural selection. This clearly demonstrates that the TEs are highly active within the genome,suggesting that the genetic variability of the Drosophila genome is still being renewed by the action of TEs.
Collapse
Affiliation(s)
- Emmanuelle Lerat
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, 69622 Villeurbanne cedex, France
| | | | | |
Collapse
|
24
|
Neafsey DE, Palumbi SR. Genome size evolution in pufferfish: a comparative analysis of diodontid and tetraodontid pufferfish genomes. Genome Res 2003; 13:821-30. [PMID: 12727902 PMCID: PMC430906 DOI: 10.1101/gr.841703] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Smooth pufferfish of the family Tetraodontidae have the smallest vertebrate genomes yet measured. They have a haploid genome size of approximately 400 million bp (Mb), which is almost eight times smaller than the human genome. Given that spiny pufferfish from the sister family Diodontidae and a fish from the outgroup Molidae have genomes twice as large as smooth puffers, it appears that the genome size of smooth puffers has contracted in the last 50-70 million years since their divergence from the spiny puffers. Here we use renaturation kinetics to compare the repetitive nature of the smooth and spiny puffer genomes. We also estimate the rates of small (<400 bp) insertions and deletions in smooth and spiny puffers using defunct non-LTR retrotransposons. We find a significantly greater abundance of a transposon-like repetitive DNA class in spiny puffers relative to smooth puffers, in addition to nearly identical indel rates. We comment on the role that large insertions may play in the evolution of genome size in these two groups.
Collapse
Affiliation(s)
- Daniel E Neafsey
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA.
| | | |
Collapse
|
25
|
Navarro-Quezada A, Schoen DJ. Sequence evolution and copy number of Ty1-copia retrotransposons in diverse plant genomes. Proc Natl Acad Sci U S A 2002; 99:268-73. [PMID: 11752395 PMCID: PMC117550 DOI: 10.1073/pnas.012422299] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2001] [Indexed: 11/18/2022] Open
Abstract
Sequence evolution of the reverse transcriptase (RT) gene in retrotransposons belonging to the Ty1-copia class was studied in 11 plant species. Phylogenetic reconstruction of the evolutionary history of RT sequences indicated a strong pattern of purifying selection, manifested as high ratios of third to first plus second codon position substitutions, and low ratios of nonsynonymous substitutions per nonsynonymous site to synonymous substitutions per synonymous site, especially in internal portions of the element phylogenies. Evidence of purifying selection was most pronounced in plant species with low estimated copy numbers of Ty1-copia elements. This finding is consistent with the hypothesis that high element turnover rates (e.g., caused by high rates of element loss and selection against high element copy number) favors elements capable of transposition. Simulations of RT sequence evolution were conducted to help verify the logical validity of this conclusion. The results argue that it is incorrect to assume that low copy numbers of transposable elements are the product of reduced levels of element activity.
Collapse
Affiliation(s)
- Aura Navarro-Quezada
- Department of Biology, McGill University, 1205 Avenue Docteur Penfield, Montreal, QC, Canada
| | | |
Collapse
|
26
|
Chipman AD, Khaner O, Haas A, Tchernov E. The evolution of genome size: what can be learned from anuran development? THE JOURNAL OF EXPERIMENTAL ZOOLOGY 2001; 291:365-74. [PMID: 11754015 DOI: 10.1002/jez.1135] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Differences in nuclear DNA content in vertebrates have been shown to be correlated with cell size, cell division rate, and embryonic developmental rate. We compare seven species of anuran amphibians with a three-fold range of genome sizes. Parameters examined include the number and density of cells in a number of embryonic structures, and the change in cell number in the CNS during development. We show that genome size is correlated with cell proliferation rate and with developmental rate at different stages of embryonic development, but that the correlation between genome size and cell size is only evident at later stages. We discuss the evolution of genome size in amphibians. Our discussion takes into account data that reportedly support two conflicting hypotheses: the "skeletal DNA" hypothesis, which claims a selective role for differences in genome size, and the "junk DNA" hypothesis, which claims that differences in genome size are a random result of the accumulation of noncoding DNA sequences. We show that these supposedly conflicting hypotheses can be integrated into a more complex and inclusive model for the evolution of genome size.
Collapse
Affiliation(s)
- A D Chipman
- The Department for Cell and Animal Biology, The Hebrew University, Jerusalem 91904, Israel.
| | | | | | | |
Collapse
|
27
|
Lovsin N, Gubensek F, Kordi D. Evolutionary dynamics in a novel L2 clade of non-LTR retrotransposons in Deuterostomia. Mol Biol Evol 2001; 18:2213-24. [PMID: 11719571 DOI: 10.1093/oxfordjournals.molbev.a003768] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The evolution of the novel L2 clade of non-long terminal repeat (LTR) retrotransposons and their evolutionary dynamics in Deuterostomia has been examined. The short-term evolution of long interspersed nuclear element 2s (LINE2s) has been studied in 18 reptilian species by analysis of a PCR amplified 0.7-kb fragment encoding the palm/fingers subdomain of reverse transcriptase (RT). Most of the reptilian LINE2s examined are inactive since they contain multiple stop codons, indels, or frameshift mutations that disrupt the RT. Analysis of reptilian LINE2s has shown a high degree of sequence divergence and an unexpectedly large number of deletions. The evolutionary dynamics of LINE2s in reptiles has been found to be complex. LINE2s are shown to form a novel clade of non-LTR retrotransposons that is well separated from the CR1 clade. This novel L2 clade is more widely distributed than previously thought, and new representatives have been discovered in echinoderms, insects, teleost fishes, Xenopus, Squamata, and marsupials. There is an apparent absence of LINE2s from different vertebrate classes, such as cartilaginous fishes, Archosauria (birds and crocodiles), and turtles. Whereas the LINE2s are present in echinoderms and teleost fishes in a conserved form, in most tetrapods only highly degenerated pseudogenes can be found. The predominance of inactive LINE2s in Tetrapoda indicates that, in the host genomes, only inactive copies are still present. The present data indicate that the vertical inactivation of LINE2s might have begun at the time of Tetrapoda origin, 400 MYA. The evolutionary dynamics of the L2 clade in Deuterostomia can be described as a gradual vertical inactivation in Tetrapoda, stochastic loss in Archosauria and turtles, and strict vertical transmission in echinoderms and teleost fishes.
Collapse
Affiliation(s)
- N Lovsin
- Department of Chemistry and Biochemistry, Faculty of Chemistry and Chemical Technology, University of Ljubljana, Slovenia
| | | | | |
Collapse
|
28
|
Bowen NJ, McDonald JF. Drosophila euchromatic LTR retrotransposons are much younger than the host species in which they reside. Genome Res 2001; 11:1527-40. [PMID: 11544196 PMCID: PMC311128 DOI: 10.1101/gr.164201] [Citation(s) in RCA: 124] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The recent release of the complete euchromatic genome sequence of Drosophila melanogaster offers a unique opportunity to explore the evolutionary history of transposable elements (TEs) within the genome of a higher eukaryote. In this report, we describe the annotation and phylogenetic comparison of 178 full-length long terminal repeat (LTR) retrotransposons from the sequenced component of the D. melanogaster genome. We report the characterization of 17 LTR retrotransposon families described previously and five newly discovered element families. Phylogenetically, these families can be divided into three distinct lineages that consist of members from the canonical Copia and Gypsy groups as well as a newly discovered third group containing BEL, mazi, and roo elements. Each family consists of members with average pairwise identities > or =99% at the nucleotide level, indicating they may be the products of recent transposition events. Consistent with the recent transposition hypothesis, we found that 70% (125/178) of the elements (across all families) have identical intra-element LTRs. Using the synonymous substitution rate that has been calculated previously for Drosophila (.016 substitutions per site per million years) and the intra-element LTR divergence calculated here, the average age of the remaining 30% (53/178) of the elements was found to be 137,000 +/-89,000 yr. Collectively, these results indicate that many full-length LTR retrotransposons present in the D. melanogaster genome have transposed well after this species diverged from its closest relative Drosophila simulans, 2.3 +/-.3 million years ago.
Collapse
Affiliation(s)
- N J Bowen
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
| | | |
Collapse
|
29
|
Abstract
Studies of neutrally evolving sequences suggest that differences in eukaryotic genome sizes result from different rates of DNA loss. However, very few pseudogenes have been identified in microbial species, and the processes whereby genes and genomes deteriorate in bacteria remain largely unresolved. The typhus-causing agent, Rickettsia prowazekii, is exceptional in that as much as 24% of its 1.1-Mb genome consists of noncoding DNA and pseudogenes. To test the hypothesis that the noncoding DNA in the R. prowazekii genome represents degraded remnants of ancestral genes, we systematically examined all of the identified pseudogenes and their flanking sequences in three additional Rickettsia species. Consistent with the hypothesis, we observe sequence similarities between genes and pseudogenes in one species and intergenic DNA in another species. We show that the frequencies and average sizes of deletions are larger than insertions in neutrally evolving pseudogene sequences. Our results suggest that inactivated genetic material in the Rickettsia genomes deteriorates spontaneously due to a mutation bias for deletions and that the noncoding sequences represent DNA in the final stages of this degenerative process.
Collapse
Affiliation(s)
- J O Andersson
- Department of Molecular Evolution, University of Uppsala, Uppsala, Sweden
| | | |
Collapse
|
30
|
Blesa D, Gandía M, Martínez-Sebastián MJ. Distribution of the bilbo non-LTR retrotransposon in Drosophilidae and its evolution in the Drosophila obscura species group. Mol Biol Evol 2001; 18:585-92. [PMID: 11264411 DOI: 10.1093/oxfordjournals.molbev.a003839] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The bilbo element is a non-LTR retrotransposon isolated from Drosophila subobscura. We conducted a distribution survey by Southern blot for 52 species of the family Drosophilidae, mainly from the obscura and melanogaster groups. Most of the analyzed species bear sequences homologous to bilbo from D. subobscura. In the obscura group, species from the same species subgroup also share similar Southern blot patterns. To investigate the phylogenetic relationship among these elements, we analyzed eight copies of a short sequence of the element from several species of the obscura group. The obtained phylogram agrees with the phylogeny of the species, which suggests vertical transmission of the element.
Collapse
Affiliation(s)
- D Blesa
- Departament de Genètica, Universitat de València, València, Spain
| | | | | |
Collapse
|
31
|
Abstract
Factors that influence the genesis and genomic distribution of microsatellite DNA are poorly understood. We have identified a novel class of Dipteran mobile elements, mini-me elements, which help elucidate both of these issues. These retroposons contain two internal proto-microsatellite regions that commonly expand into lengthy microsatellite repeats. These elements are highly abundant, accounting for approximately 1.2% of the Drosophila melanogaster genome, giving them the potential to be a prolific source of microsatellite DNA variation. They also give us the opportunity to observe the outcomes of multiple microsatellite genesis events (initiating from the same proto-microsatellite) at separate mini-me loci. Based on these observations, we determined that the genesis of microsatellites within mini-me elements occurs through two separate mutational processes: the expansion of preexisting tandem repeats and the conversion of sequence with high cryptic simplicity into tandemly repetitive DNA. These modes of microsatellite genesis can be generalized beyond the case of mini-me elements and help to explain the genesis of microsatellites in any sequence region that is not constrained by selection.
Collapse
Affiliation(s)
- J Wilder
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA.
| | | |
Collapse
|
32
|
Abstract
Eukaryotic genomes come in a wide variety of sizes. Haploid DNA contents (C values) range > 80,000-fold without an apparent correlation with either the complexity of the organism or the number of genes. This puzzling observation, the C-value paradox, has remained a mystery for almost half a century, despite much progress in the elucidation of the structure and function of genomes. Here I argue that new approaches focussing on the genetic mechanisms that generate genome-size differences could shed much light on the evolution of genome size.
Collapse
Affiliation(s)
- D A Petrov
- Department of Biological Sciences, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
33
|
Goldfeld AE, Leung JY, Sawyer SA, Hartl DL. Post-genomics and the neutral theory: variation and conservation in the tumor necrosis factor-alpha promoter. Gene 2000; 261:19-25. [PMID: 11164033 DOI: 10.1016/s0378-1119(00)00477-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In the post-genomics era, molecular evolutionary geneticists have come to possess the molecular, statistical, and computational tools for estimating the relative importance of selection and random genetic drift in virtually any gene in almost any organism. We have examined single-nucleotide polymorphisms (SNPs) and nucleotide divergence across a region of approximately 1 kb in the promoter of the human tumor necrosis factor alpha (TNF-alpha) gene. TNF-alpha, which plays an important role in lymphocyte biology and in the pathogenesis of infectious and autoimmune diseases, is tightly regulated at the level of transcription through sequence-specific binding of transcription factors to cognate binding sites in a relatively small region of the 5' non-coding region of the gene. Analysis of the promoter region in 207 human chromosomes revealed nine SNPs, none of which were located in regions known to be important in transcriptional activation. Comparison with one promoter sequence in each of seven species of primates revealed 162 nucleotide sites occupied by a monomorphic nucleotide in the human sample but occupied by a different nucleotide in at least one of the primate sequences (a 'fixed human difference'). The fixed human differences were found outside the regions known to be important in transcriptional activation, and their large number suggests that they might be effectively neutral (Ns<<1). With regard to the human SNPs, although the hypothesis Ns approximately 0 cannot be rejected, the sample configurations suggest that the substitutions might be mildly deleterious. We emphasize the analytical insight to be gained from interspecific comparisons: through the interspecific comparisons, 3.1% of the total sequence information yielded 94.7% of the variable nucleotides. This combined approach, using interspecific comparisons and human polymorphism together with data from functional analyses, provides valuable insights into the evolutionary history and regulation of a key gene in the human immune response.
Collapse
Affiliation(s)
- A E Goldfeld
- The Center for Blood Research and Department of Medicine, Harvard Medical School, 800 Huntington Avenue, Boston, MA 02115, USA
| | | | | | | |
Collapse
|
34
|
Abstract
For 50 years now, one of the enigmas of molecular evolution has been the C-value paradox, which refers to the often massive, counterintuitive and seemingly arbitrary differences in genome size observed among eukaryotic organisms. For example, the genome of the fruitfly Drosophila melanogaster is 180 megabases (Mb), whereas that of the European brown grasshopper Podisma pedestris is 18,000 Mb. The difference in genome size of a factor of 100 is difficult to explain in view of the apparently similar levels of evolutionary, developmental and behavioural complexity of these organisms.
Collapse
Affiliation(s)
- D L Hartl
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA.
| |
Collapse
|
35
|
Petrov DA, Sangster TA, Johnston JS, Hartl DL, Shaw KL. Evidence for DNA loss as a determinant of genome size. Science 2000; 287:1060-2. [PMID: 10669421 DOI: 10.1126/science.287.5455.1060] [Citation(s) in RCA: 220] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Eukaryotic genome sizes range over five orders of magnitude. This variation cannot be explained by differences in organismic complexity (the C value paradox). To test the hypothesis that some variation in genome size can be attributed to differences in the patterns of insertion and deletion (indel) mutations among organisms, this study examines the indel spectrum in Laupala crickets, which have a genome size 11 times larger than that of Drosophila. Consistent with the hypothesis, DNA loss is more than 40 times slower in Laupala than in Drosophila.
Collapse
Affiliation(s)
- D A Petrov
- Harvard University Society of Fellows, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
| | | | | | | | | |
Collapse
|
36
|
Affiliation(s)
- P Capy
- Laboratoire Populations, Genetique & Evoluation, CNRS, Gif-sur-Yvette, France.
| |
Collapse
|
37
|
Robertson HM. The large srh family of chemoreceptor genes in Caenorhabditis nematodes reveals processes of genome evolution involving large duplications and deletions and intron gains and losses. Genome Res 2000; 10:192-203. [PMID: 10673277 DOI: 10.1101/gr.10.2.192] [Citation(s) in RCA: 107] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The srh family of chemoreceptors in the nematode Caenorhabditis elegans is very large, containing 214 genes and 90 pseudogenes. It is related to the str, stl, and srd families of seven-transmembrane or serpentine receptors. Like these three families, most srh genes are concentrated on chromosome V, and mapping of their chromosomal locations on a phylogenetic tree reveals 27 different movements of genes to other chromosomes. Mapping of intron gains and losses onto the phylogenetic tree reveals that the last common ancestral gene of the family had five introns, which are inferred to have been lost 70 times independently during evolution of the family. In addition, seven intron gains are revealed, three of which are fairly recent. Comparisons with 20 family members in the C. briggsae genome confirms these patterns, including two intron losses in C. briggsae since the species split. There are 14 clear C. elegans orthologs for these 20 genes, whose average amino acid divergence of 68% allows estimation of 85 gene duplications in the C. elegans lineage since the species split. The absence of six orthologs in C. elegans also indicates that gene loss occurs; consideration of all deletions and terminal truncations of srh pseudogenes reveals that large deletions are common. Together these observations provide insight into the evolutionary dynamics of this compact animal genome.
Collapse
Affiliation(s)
- H M Robertson
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801 USA.
| |
Collapse
|
38
|
Abstract
Studies of noncoding and pseudogene sequence diversity, particularly in Rickettsia, have begun to reveal the basic principles of genome degradation in microorganisms. Increasingly, studies of genes and genomes suggest that there has been an extensive amount of horizontal gene transfer among microorganisms. As this inflow of genetic material does not seem generally to have resulted in genome size expansions, however, degenerative processes must be at the very least as widespread as horizontal gene transfer. The basic principles of gene degradation and elimination that are being explored in Rickettsia are likely to be of major importance for our understanding of how microbial genomes evolve.
Collapse
Affiliation(s)
- J O Andersson
- Department of Molecular Evolution, Uppsala University, Biomedical Center, Box 590, Uppsala, 751 24, Sweden.
| | | |
Collapse
|
39
|
Lozovskaya ER, Nurminsky DI, Petrov DA, Hartl DL. Genome size as a mutation-selection-drift process. Genes Genet Syst 1999; 74:201-7. [PMID: 10734601 DOI: 10.1266/ggs.74.201] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
A novel method for estimating neutral rates and patterns of DNA evolution in Drosophila takes advantage of the propensity of non-LTR retrotransposable elements to create nonfunctional, transpositionally inactive copies as a product of transposition. For many LINE elements, most copies present in a genome at any one time are nonfunctional "dead-on-arrival" (DOA) copies. Because these are off-shoots of active, transpositionally competent "master" lineages, in a gene tree of a LINE element from multiple samples from related species, the DOA lineages are expected to map to the terminal branches and the active lineages to the internal branches, the primary exceptions being when the sample includes DOA copies that are allelic or orthologous. Analysis of nucleotide substitutions and other changes along the terminal branches therefore allows estimation of the fixation process in the DOA copies, which are unconstrained with respect to protein coding; and under selective neutrality, the fixation process estimates the underlying mutational pattern. We have studied the retroelement Helena in Drosophila. An unexpectedly high rate of DNA loss was observed, yielding a half-life of unconstrained DNA sequences approximately 60-fold faster in Drosophila than in mammals. The high rate of DNA loss suggests a straightforward explanation of the seeming paradox that Drosophila has many fewer pseudogenes than found in mammalian species. Differential rates of deletion in different taxa might also contribute to the celebrated C-value paradox of why some closely related organisms can have very different DNA contents. New data presented here rule out the possibility that the transposition process itself is highly mutagenic, hence the observed linear relation between number of deletions and number of nucleotide substitutions is most easily explained by the hypothesis that both types of changes accumulate in unconstrained sequences over time.
Collapse
Affiliation(s)
- E R Lozovskaya
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA
| | | | | | | |
Collapse
|
40
|
Gregory TR, Hebert PD. The Modulation of DNA Content: Proximate Causes and Ultimate Consequences. Genome Res 1999. [DOI: 10.1101/gr.9.4.317] [Citation(s) in RCA: 66] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The forces responsible for modulating the large-scale features of the genome remain one of the most difficult issues confronting evolutionary biology. Although diversity in chromosomal architecture, nucleotide composition, and genome size has been well documented, there is little understanding of either the evolutionary origins or impact of much of this variation. The 80,000-fold divergence in genome sizes among eukaryotes represents perhaps the greatest challenge for genomic holists. Although some researchers continue to characterize much variation in genome size as a mere by-product of an intragenomic selfish DNA “free-for-all” there is increasing evidence for the primacy of selection in molding genome sizes via impacts on cell size and division rates. Moreover, processes inducing quantum or doubling series variation in gametic or somatic genome sizes are common. These abrupt shifts have broad effects on phenotypic attributes at both cellular and organismal levels and may play an important role in explaining episodes of rapid—or even saltational—character state evolution.
Collapse
|
41
|
Petrov DA, Hartl DL. Patterns of nucleotide substitution in Drosophila and mammalian genomes. Proc Natl Acad Sci U S A 1999; 96:1475-9. [PMID: 9990048 PMCID: PMC15487 DOI: 10.1073/pnas.96.4.1475] [Citation(s) in RCA: 108] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To estimate patterns of molecular evolution of unconstrained DNA sequences, we used maximum parsimony to separate phylogenetic trees of a non-long terminal repeat retrotransposable element into either internal branches, representing mainly the constrained evolution of active lineages, or into terminal branches, representing mainly nonfunctional "dead-on-arrival" copies that are unconstrained by selection and evolve as pseudogenes. The pattern of nucleotide substitutions in unconstrained sequences is expected to be congruent with the pattern of point mutation. We examined the retrotransposon Helena in the Drosophila virilis species group (subgenus Drosophila) and the Drosophila melanogaster species subgroup (subgenus Sophophora). The patterns of point mutation are indistinguishable, suggesting considerable stability over evolutionary time (40-60 million years). The relative frequencies of different point mutations are unequal, but the "transition bias" results largely from an approximately 2-fold excess of G.C to A.T substitutions. Spontaneous mutation is biased toward A.T base pairs, with an expected mutational equilibrium of approximately 65% A + T (quite similar to that of long introns). These data also enable the first detailed comparison of patterns of point mutations in Drosophila and mammals. Although the patterns are different, all of the statistical significance comes from a much greater rate of G.C to A.T substitution in mammals, probably because of methylated cytosine "hotspots." When the G.C to A.T substitutions are discounted, the remaining differences are considerably reduced and not statistically significant.
Collapse
Affiliation(s)
- D A Petrov
- Harvard University Society of Fellows, 78 Mt. Auburn Street, Cambridge, MA 02138, USA.
| | | |
Collapse
|
42
|
Robertson HM. Two large families of chemoreceptor genes in the nematodes Caenorhabditis elegans and Caenorhabditis briggsae reveal extensive gene duplication, diversification, movement, and intron loss. Genome Res 1998; 8:449-63. [PMID: 9582190 DOI: 10.1101/gr.8.5.449] [Citation(s) in RCA: 119] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The str family of genes encoding seven-transmembrane G-protein-coupled or serpentine receptors related to the ODR-10 diacetyl chemoreceptor is very large, with at least 197 members in the Caenorhabditis elegans genome. The closely related stl family has 43 genes, and both families are distantly related to the srd family with 55 genes. Analysis of the structures of these genes indicates that a third of them are clearly or likely pseudogenes. Preliminary surveys of other candidate chemoreceptor families indicates that as many as 800 genes and pseudogenes or 6% of the genome might encode 550 functional chemoreceptors constituting 4% of the C. elegans protein complement. Phylogenetic analyses of the str and stl families, and comparisons with a few orthologs in Caenorhabditis briggsae, reveal ongoing processes of gene duplication, diversification, and movement. The reconstructed ancestral gene structures for these two families have eight introns each, four of which are homologous. Mapping of intron distributions on the phylogenetic tree reveals that each intron has been lost many times independently. Most of these introns were lost individually, which might best be explained by precise in-frame deletions involving nonhomologous recombination between short direct repeats at their termini. [Alignment of the putatively functional proteins in the str and stl families is available from Pfam (http://genome. wustl.edu/Pfam); alignments of all translations are available at http://cshl.org/gr; alignments of the genes are available from the author at hughrobe@uiuc.edu]
Collapse
Affiliation(s)
- H M Robertson
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.
| |
Collapse
|
43
|
Robertson HM, Martos R. Molecular evolution of the second ancient human mariner transposon, Hsmar2, illustrates patterns of neutral evolution in the human genome lineage. Gene 1997; 205:219-28. [PMID: 9461396 DOI: 10.1016/s0378-1119(97)00471-x] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
A consensus sequence for the second ancient mariner identified in the human genome, Hsmar2, was constructed by majority rule from full-length and partial sequences of 44 of the +/-1000 copies in the genome. This 1300 base pair (bp) consensus has 31 bp imperfect terminal repeats (ITRs) and encodes a 351 amino acid (aa) mariner transposase. The sequence of this transposase has allowed classification of Hsmar2 as a basal lineage of the irritans subfamily of mariners, sharing at most 38% aa identity with other members of the subfamily. The individual copies in the human genome are all highly mutated from the consensus, having suffered numerous small and some large insertions and deletions (indels), including many insertions of S and J subfamily Alu elements. The copies differ, on average, from the consensus by 11.6%, have suffered 11.8 indels per kilobase (kb), and only 3.7% of the 30 hypermutable CpG dinucleotide pairs in the consensus remain intact. This level of divergence indicates that the ancestrally active Hsmar2 element represented by the consensus was present in the human genome lineage about 80 million years (Myr) ago. Each copy has apparently evolved since then largely independently of the others, and with little constraint on its transposase coding capacity. This pattern of molecular evolution fits the current model for mariner transposon evolution. These copies provide multiple independent datasets for evaluating the pattern of neutral evolution in the human genome, for example, they confirm that most indels are very short and that deletions are twice as common as insertions.
Collapse
Affiliation(s)
- H M Robertson
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana 61801, USA.
| | | |
Collapse
|