1
|
Polygenic architecture of flowering time and its relationship with local environments in the grass Brachypodium distachyon. Genetics 2024; 227:iyae042. [PMID: 38504651 DOI: 10.1093/genetics/iyae042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 01/12/2024] [Accepted: 03/07/2024] [Indexed: 03/21/2024] Open
Abstract
Synchronizing the timing of reproduction with the environment is crucial in the wild. Among the multiple mechanisms, annual plants evolved to sense their environment, the requirement of cold-mediated vernalization is a major process that prevents individuals from flowering during winter. In many annual plants including crops, both a long and short vernalization requirement can be observed within species, resulting in so-called early-(spring) and late-(winter) flowering genotypes. Here, using the grass model Brachypodium distachyon, we explored the link between flowering-time-related traits (vernalization requirement and flowering time), environmental variation, and diversity at flowering-time genes by combining measurements under greenhouse and outdoor conditions. These experiments confirmed that B. distachyon natural accessions display large differences regarding vernalization requirements and ultimately flowering time. We underline significant, albeit quantitative effects of current environmental conditions on flowering-time-related traits. While disentangling the confounding effects of population structure on flowering-time-related traits remains challenging, population genomics analyses indicate that well-characterized flowering-time genes may contribute significantly to flowering-time variation and display signs of polygenic selection. Flowering-time genes, however, do not colocalize with genome-wide association peaks obtained with outdoor measurements, suggesting that additional genetic factors contribute to flowering-time variation in the wild. Altogether, our study fosters our understanding of the polygenic architecture of flowering time in a natural grass system and opens new avenues of research to investigate the gene-by-environment interaction at play for this trait.
Collapse
|
2
|
The evolution of transposable elements in Brachypodium distachyon is governed by purifying selection, while neutral and adaptive processes play a minor role. eLife 2024; 12:RP93284. [PMID: 38606833 PMCID: PMC11014726 DOI: 10.7554/elife.93284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024] Open
Abstract
Understanding how plants adapt to changing environments and the potential contribution of transposable elements (TEs) to this process is a key question in evolutionary genomics. While TEs have recently been put forward as active players in the context of adaptation, few studies have thoroughly investigated their precise role in plant evolution. Here, we used the wild Mediterranean grass Brachypodium distachyon as a model species to identify and quantify the forces acting on TEs during the adaptation of this species to various conditions, across its entire geographic range. Using sequencing data from more than 320 natural B. distachyon accessions and a suite of population genomics approaches, we reveal that putatively adaptive TE polymorphisms are rare in wild B. distachyon populations. After accounting for changes in past TE activity, we show that only a small proportion of TE polymorphisms evolved neutrally (<10%), while the vast majority of them are under moderate purifying selection regardless of their distance to genes. TE polymorphisms should not be ignored when conducting evolutionary studies, as they can be linked to adaptation. However, our study clearly shows that while they have a large potential to cause phenotypic variation in B. distachyon, they are not favored during evolution and adaptation over other types of mutations (such as point mutations) in this species.
Collapse
|
3
|
Karyotype and LTR-RTs analysis provide insights into oak genomic evolution. BMC Genomics 2024; 25:328. [PMID: 38566015 PMCID: PMC10988972 DOI: 10.1186/s12864-024-10177-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 03/01/2024] [Indexed: 04/04/2024] Open
Abstract
BACKGROUND Whole-genome duplication and long terminal repeat retrotransposons (LTR-RTs) amplification in organisms are essential factors that affect speciation, local adaptation, and diversification of organisms. Understanding the karyotype projection and LTR-RTs amplification could contribute to untangling evolutionary history. This study compared the karyotype and LTR-RTs evolution in the genomes of eight oaks, a dominant lineage in Northern Hemisphere forests. RESULTS Karyotype projections showed that chromosomal evolution was relatively conservative in oaks, especially on chromosomes 1 and 7. Modern oak chromosomes formed through multiple fusions, fissions, and rearrangements after an ancestral triplication event. Species-specific chromosomal rearrangements revealed fragments preserved through natural selection and adaptive evolution. A total of 441,449 full-length LTR-RTs were identified from eight oak genomes, and the number of LTR-RTs for oaks from section Cyclobalanopsis was larger than in other sections. Recent amplification of the species-specific LTR-RTs lineages resulted in significant variation in the abundance and composition of LTR-RTs among oaks. The LTR-RTs insertion suppresses gene expression, and the suppressed intensity in gene regions was larger than in promoter regions. Some centromere and rearrangement regions indicated high-density peaks of LTR/Copia and LTR/Gypsy. Different centromeric regional repeat units (32, 78, 79 bp) were detected on different Q. glauca chromosomes. CONCLUSION Chromosome fusions and arm exchanges contribute to the formation of oak karyotypes. The composition and abundance of LTR-RTs are affected by its recent amplification. LTR-RTs random retrotransposition suppresses gene expression and is enriched in centromere and chromosomal rearrangement regions. This study provides novel insights into the evolutionary history of oak karyotypes and the organization, amplification, and function of LTR-RTs.
Collapse
|
4
|
Profiling genome-wide methylation in two maples: Fine-scale approaches to detection with nanopore technology. Evol Appl 2024; 17:e13669. [PMID: 38633133 PMCID: PMC11022628 DOI: 10.1111/eva.13669] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 02/04/2024] [Accepted: 02/12/2024] [Indexed: 04/19/2024] Open
Abstract
DNA methylation is critical to the regulation of transposable elements and gene expression and can play an important role in the adaptation of stress response mechanisms in plants. Traditional methods of methylation quantification rely on bisulfite conversion that can compromise accuracy. Recent advances in long-read sequencing technologies allow for methylation detection in real time. The associated algorithms that interpret these modifications have evolved from strictly statistical approaches to Hidden Markov Models and, recently, deep learning approaches. Much of the existing software focuses on methylation in the CG context, but methylation in other contexts is important to quantify, as it is extensively leveraged in plants. Here, we present methylation profiles for two maple species across the full range of 5mC sequence contexts using Oxford Nanopore Technologies (ONT) long-reads. Hybrid and reference-guided assemblies were generated for two new Acer accessions: Acer negundo (box elder; 65x ONT and 111X Illumina) and Acer saccharum (sugar maple; 93x ONT and 148X Illumina). The ONT reads generated for these assemblies were re-basecalled, and methylation detection was conducted in a custom pipeline with the published Acer references (PacBio assemblies) and hybrid assemblies reported herein to generate four epigenomes. Examination of the transposable element landscape revealed the dominance of LTR Copia elements and patterns of methylation associated with different classes of TEs. Methylation distributions were examined at high resolution across gene and repeat density and described within the broader angiosperm context, and more narrowly in the context of gene family dynamics and candidate nutrient stress genes.
Collapse
|
5
|
Transposition of HOPPLA in siRNA-deficient plants suggests a limited effect of the environment on retrotransposon mobility in Brachypodium distachyon. PLoS Genet 2024; 20:e1011200. [PMID: 38470914 PMCID: PMC10959353 DOI: 10.1371/journal.pgen.1011200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 03/22/2024] [Accepted: 02/23/2024] [Indexed: 03/14/2024] Open
Abstract
Long terminal repeat retrotransposons (LTR-RTs) are powerful mutagens regarded as a major source of genetic novelty and important drivers of evolution. Yet, the uncontrolled and potentially selfish proliferation of LTR-RTs can lead to deleterious mutations and genome instability, with large fitness costs for their host. While population genomics data suggest that an ongoing LTR-RT mobility is common in many species, the understanding of their dual role in evolution is limited. Here, we harness the genetic diversity of 320 sequenced natural accessions of the Mediterranean grass Brachypodium distachyon to characterize how genetic and environmental factors influence plant LTR-RT dynamics in the wild. When combining a coverage-based approach to estimate global LTR-RT copy number variations with mobilome-sequencing of nine accessions exposed to eight different stresses, we find little evidence for a major role of environmental factors in LTR-RT accumulations in B. distachyon natural accessions. Instead, we show that loss of RNA polymerase IV (Pol IV), which mediates RNA-directed DNA methylation in plants, results in high transcriptional and transpositional activities of RLC_BdisC024 (HOPPLA) LTR-RT family elements, and that these effects are not stress-specific. This work supports findings indicating an ongoing mobility in B. distachyon and reveals that host RNA-directed DNA methylation rather than environmental factors controls their mobility in this wild grass model.
Collapse
|
6
|
Multi-integrated genomic data for Passiflora foetida provides insights into genome size evolution and floral development in Passiflora. MOLECULAR HORTICULTURE 2023; 3:27. [PMID: 38105261 PMCID: PMC10726625 DOI: 10.1186/s43897-023-00076-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 12/03/2023] [Indexed: 12/19/2023]
Abstract
Passiflora is a plant genus known for its extremely distinctive and colorful flowers and a wide range of genome size variation. However, how genome characteristics are related to flower traits among Passiflora species remains poorly understood. Here, we assembled a chromosome-scale genome of P. foetida, which belongs to the same subgenus as the commercial passionfruit P. edulis. The genome of P. foetida is smaller (424.16 Mb) and contains fewer copies of long terminal repeat retrotransposons (LTR-RTs). The disparity in LTR-RTs is one of the main contributors to the differences in genome sizes between these two species and possibly in floral traits. Additionally, we observed variation in insertion times and copy numbers of LTR-RTs across different transposable element (TE) lineages. Then, by integrating transcriptomic data from 33 samples (eight floral organs and flower buds at three developmental stages) with phylogenomic and metabolomic data, we conducted an in-depth analysis of the expression, phylogeny, and copy number of MIKC-type MADS-box genes and identified essential biosynthetic genes responsible for flower color and scent from glandular bracts and other floral organs. Our study pinpoints LRT-RTs as an important player in genome size variation in Passiflora species and provides insights into future genetic improvement.
Collapse
|
7
|
Transposable element evolution in plant genome ecosystems. CURRENT OPINION IN PLANT BIOLOGY 2023; 75:102418. [PMID: 37459733 DOI: 10.1016/j.pbi.2023.102418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 05/22/2023] [Accepted: 06/20/2023] [Indexed: 09/18/2023]
Abstract
The relationship of transposable elements (TEs) with their host genomes has usually been seen as an arms race between TEs and their host genomes. Consequently, TEs are supposed to amplify by bursts of transposition, when the TE escapes host surveillance, followed by long periods of TE quiescence and efficient host control. Recent data obtained from an increasing number of assembled plant genomes and resequencing population datasets show that TE dynamics is more complex and varies among TE families and their host genomes. This variation ranges from large genomes that accommodate large TE populations to genomes that are very active in TE elimination, and from inconspicuous elements with very low activity to elements with high transposition and elimination rates. The dynamics of each TE family results from a long history of interaction with the host in a genome populated by many other TE families, very much like an evolving ecosystem.
Collapse
|
8
|
The nature and genomic landscape of repetitive DNA classes in Chrysanthemum nankingense shows recent genomic changes. ANNALS OF BOTANY 2023; 131:215-228. [PMID: 35639931 PMCID: PMC9904347 DOI: 10.1093/aob/mcac066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 05/24/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND AND AIMS Tandemly repeated DNA and transposable elements represent most of the DNA in higher plant genomes. High-throughput sequencing allows a survey of the DNA in a genome, but whole-genome assembly can miss a substantial fraction of highly repeated sequence motifs. Chrysanthemum nankingense (2n = 2x = 18; genome size = 3.07 Gb; Asteraceae), a diploid reference for the many auto- and allopolyploids in the genus, was considered as an ancestral species and serves as an ornamental plant and high-value food. We aimed to characterize the major repetitive DNA motifs, understand their structure and identify key features that are shaped by genome and sequence evolution. METHODS Graph-based clustering with RepeatExplorer was used to identify and classify repetitive motifs in 2.14 millions of 250-bp paired-end Illumina reads from total genomic DNA of C. nankingense. Independently, the frequency of all canonical motifs k-bases long was counted in the raw read data and abundant k-mers (16, 21, 32, 64 and 128) were extracted and assembled to generate longer contigs for repetitive motif identification. For comparison, long terminal repeat retrotransposons were checked in the published C. nankingense reference genome. Fluorescent in situ hybridization was performed to show the chromosomal distribution of the main types of repetitive motifs. KEY RESULTS Apart from rDNA (0.86 % of the total genome), a few microsatellites (0.16 %), and telomeric sequences, no highly abundant tandem repeats were identified. There were many transposable elements: 40 % of the genome had sequences with recognizable domains related to transposable elements. Long terminal repeat retrotransposons showed widespread distribution over chromosomes, although different sequence families had characteristic features such as abundance at or exclusion from centromeric or subtelomeric regions. Another group of very abundant repetitive motifs, including those most identified as low-complexity sequences (9.07 %) in the genome, showed no similarity to known sequence motifs or tandemly repeated elements. CONCLUSIONS The Chrysanthemum genome has an unusual structure with a very low proportion of tandemly repeated sequences (~1.02 %) in the genome, and a high proportion of low-complexity sequences, most likely degenerated remains of transposable elements. Identifying the presence, nature and genomic organization of major genome fractions enables inference of the evolutionary history of sequences, including degeneration and loss, critical to understanding biodiversity and diversification processes in the genomes of diploid and polyploid Chrysanthemum, Asteraceae and plants more widely.
Collapse
|
9
|
Multiple origins, one evolutionary trajectory: gradual evolution characterizes distinct lineages of allotetraploid Brachypodium. Genetics 2022; 223:6758249. [PMID: 36218464 PMCID: PMC9910409 DOI: 10.1093/genetics/iyac146] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 09/16/2022] [Indexed: 11/13/2022] Open
Abstract
The "genomic shock" hypothesis posits that unusual challenges to genome integrity such as whole genome duplication may induce chaotic genome restructuring. Decades of research on polyploid genomes have revealed that this is often, but not always the case. While some polyploids show major chromosomal rearrangements and derepression of transposable elements in the immediate aftermath of whole genome duplication, others do not. Nonetheless, all polyploids show gradual diploidization over evolutionary time. To evaluate these hypotheses, we produced a chromosome-scale reference genome for the natural allotetraploid grass Brachypodium hybridum, accession "Bhyb26." We compared 2 independently derived accessions of B. hybridum and their deeply diverged diploid progenitor species Brachypodium stacei and Brachypodium distachyon. The 2 B. hybridum lineages provide a natural timecourse in genome evolution because one formed 1.4 million years ago, and the other formed 140 thousand years ago. The genome of the older lineage reveals signs of gradual post-whole genome duplication genome evolution including minor gene loss and genome rearrangement that are missing from the younger lineage. In neither B. hybridum lineage do we find signs of homeologous recombination or pronounced transposable element activation, though we find evidence supporting steady post-whole genome duplication transposable element activity in the older lineage. Gene loss in the older lineage was slightly biased toward 1 subgenome, but genome dominance was not observed at the transcriptomic level. We propose that relaxed selection, rather than an abrupt genomic shock, drives evolutionary novelty in B. hybridum, and that the progenitor species' similarity in transposable element load may account for the subtlety of the observed genome dominance.
Collapse
|
10
|
Brachypodium: 20 years as a grass biology model system; the way forward? TRENDS IN PLANT SCIENCE 2022; 27:1002-1016. [PMID: 35644781 DOI: 10.1016/j.tplants.2022.04.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 04/13/2022] [Accepted: 04/26/2022] [Indexed: 06/15/2023]
Abstract
It has been 20 years since Brachypodium distachyon was suggested as a model grass species, but ongoing research now encompasses the entire genus. Extensive Brachypodium genome sequencing programmes have provided resources to explore the determinants and drivers of population diversity. This has been accompanied by cytomolecular studies to make Brachypodium a platform to investigate speciation, polyploidisation, perenniality, and various aspects of chromosome and interphase nucleus organisation. The value of Brachypodium as a functional genomic platform has been underscored by the identification of key genes for development, biotic and abiotic stress, and cell wall structure and function. While Brachypodium is relevant to the biofuel industry, its impact goes far beyond that as an intriguing model to study climate change and combinatorial stress.
Collapse
|
11
|
Evolutionary Dynamics of the Repeatome Explains Contrasting Differences in Genome Sizes and Hybrid and Polyploid Origins of Grass Loliinae Lineages. FRONTIERS IN PLANT SCIENCE 2022; 13:901733. [PMID: 35845705 PMCID: PMC9284676 DOI: 10.3389/fpls.2022.901733] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 05/25/2022] [Indexed: 06/15/2023]
Abstract
The repeatome is composed of diverse families of repetitive DNA that keep signatures on the historical events that shaped the evolution of their hosting species. The cold seasonal Loliinae subtribe includes worldwide distributed taxa, some of which are the most important forage and lawn species (fescues and ray-grasses). The Loliinae are prone to hybridization and polyploidization. It has been observed a striking two-fold difference in genome size between the broad-leaved (BL) and fine-leaved (FL) Loliinae diploids and a general trend of genome reduction of some high polyploids. We have used genome skimming data to uncover the composition, abundance, and potential phylogenetic signal of repetitive elements across 47 representatives of the main Loliinae lineages. Independent and comparative analyses of repetitive sequences and of 5S rDNA loci were performed for all taxa under study and for four evolutionary Loliinae groups [Loliinae, Broad-leaved (BL), Fine-leaved (FL), and Schedonorus lineages]. Our data showed that the proportion of the genome covered by the repeatome in the Loliinae species was relatively high (average ∼ 51.8%), ranging from high percentages in some diploids (68.7%) to low percentages in some high-polyploids (30.7%), and that changes in their genome sizes were likely caused by gains or losses in their repeat elements. Ty3-gypsy Retand and Ty1-copia Angela retrotransposons were the most frequent repeat families in the Loliinae although the relatively more conservative Angela repeats presented the highest correlation of repeat content with genome size variation and the highest phylogenetic signal of the whole repeatome. By contrast, Athila retrotransposons presented evidence of recent proliferations almost exclusively in the Lolium clade. The repeatome evolutionary networks showed an overall topological congruence with the nuclear 35S rDNA phylogeny and a geographic-based structure for some lineages. The evolution of the Loliinae repeatome suggests a plausible scenario of recurrent allopolyploidizations followed by diploidizations that generated the large genome sizes of BL diploids as well as large genomic rearrangements in highly hybridogenous lineages that caused massive repeatome and genome contractions in the Schedonorus and Aulaxyper polyploids. Our study has contributed to disentangling the impact of the repeatome dynamics on the genome diversification and evolution of the Loliinae grasses.
Collapse
|
12
|
Diverse and mobile: eccDNA-based identification of carrot low-copy-number LTR retrotransposons active in callus cultures. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:1811-1828. [PMID: 35426957 PMCID: PMC9324142 DOI: 10.1111/tpj.15773] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 03/15/2022] [Accepted: 03/29/2022] [Indexed: 05/28/2023]
Abstract
Long terminal repeat retrotransposons (LTR-RTs) are mobilized via an RNA intermediate using a 'copy and paste' mechanism, and account for the majority of repetitive DNA in plant genomes. As a side effect of mobilization, the formation of LTR-RT-derived extrachromosomal circular DNAs (eccDNAs) occurs. Thus, high-throughput sequencing of eccDNA can be used to identify active LTR-RTs in plant genomes. Despite the release of a reference genome assembly, carrot LTR-RTs have not yet been thoroughly characterized. LTR-RTs are abundant and diverse in the carrot genome. We identified 5976 carrot LTR-RTs, 2053 and 1660 of which were attributed to Copia and Gypsy superfamilies, respectively. They were further classified into lineages, families and subfamilies. More diverse LTR-RT lineages, i.e. lineages comprising many low-copy-number subfamilies, were more frequently associated with genic regions. Certain LTR-RT lineages have been recently active in Daucus carota. In particular, low-copy-number LTR-RT subfamilies, e.g. those belonging to the DcAle lineage, have significantly contributed to carrot genome diversity as a result of continuing activity. We utilized eccDNA sequencing to identify and characterize two DcAle subfamilies, Alex1 and Alex3, active in carrot callus. We documented 14 and 32 de novo insertions of Alex1 and Alex3, respectively, which were positioned in non-repetitive regions.
Collapse
|
13
|
Comparative Analysis of Transposable Elements and the Identification of Candidate Centromeric Elements in the Prunus Subgenus Cerasus and Its Relatives. Genes (Basel) 2022; 13:genes13040641. [PMID: 35456447 PMCID: PMC9028240 DOI: 10.3390/genes13040641] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 03/29/2022] [Accepted: 03/31/2022] [Indexed: 12/04/2022] Open
Abstract
The subgenus Cerasus and its relatives include many crucial economic drupe fruits and ornamental plants. Repetitive elements make up a large part of complex genomes, and some of them play an important role in gene regulation that can affect phenotypic variation. However, the variation in their genomes remains poorly understood. This work conducted a comprehensive repetitive sequence identification across the draft genomes of eight taxa of the genus Prunus, including four of the Prunus subgenus Cerasus (Prunus pseudocerasus, P. avium, P. yedoensis and P. × yedoensis) as well as congeneric species (Prunus salicina, P. armeniaca, P. dulcis and P. persica). Annotation results showed high proportions of transposable elements in their genomes, ranging from 52.28% (P. armeniaca) to 61.86% (P. pseudocerasus). The most notable differences in the contents of long terminal repeat retrotransposons (LTR-RTs) and tandem repeats (TRs) were confirmed with de novo identification based on the structure of each genome, which significantly contributed to their genome size variation, especially in P. avium and P.salicina. Sequence comparisons showed many similar LTR-RTs closely related to their phylogenetic relationships, and a highly similar monomer unit of the TR sequence was conserved among species. Additionally, the predicted centromere-associated sequence was located in centromeric regions with FISH in the 12 taxa of Prunus. It presented significantly different signal intensities, even within the diverse interindividual phenotypes for Prunus tomentosa. This study provides insight into the LTR-RT and TR variation within Prunus and increases our knowledge about its role in genome evolution.
Collapse
|
14
|
Migration without interbreeding: Evolutionary history of a highly selfing Mediterranean grass inferred from whole genomes. Mol Ecol 2022; 31:70-85. [PMID: 34601787 PMCID: PMC9298040 DOI: 10.1111/mec.16207] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 09/07/2021] [Accepted: 09/28/2021] [Indexed: 11/30/2022]
Abstract
Wild plant populations show extensive genetic subdivision and are far from the ideal of panmixia which permeates population genetic theory. Understanding the spatial and temporal scale of population structure is therefore fundamental for empirical population genetics - and of interest in itself, as it yields insights into the history and biology of a species. In this study we extend the genomic resources for the wild Mediterranean grass Brachypodium distachyon to investigate the scale of population structure and its underlying history at whole-genome resolution. A total of 86 accessions were sampled at local and regional scales in Italy and France, which closes a conspicuous gap in the collection for this model organism. The analysis of 196 accessions, spanning the Mediterranean from Spain to Iraq, suggests that the interplay of high selfing and seed dispersal rates has shaped genetic structure in B. distachyon. At the continental scale, the evolution in B. distachyon is characterized by the independent expansion of three lineages during the Upper Pleistocene. Today, these lineages may occur on the same meadow yet do not interbreed. At the regional scale, dispersal and selfing interact and maintain high genotypic diversity, thus challenging the textbook notion that selfing in finite populations implies reduced diversity. Our study extends the population genomic resources for B. distachyon and suggests that an important use of this wild plant model is to investigate how selfing and dispersal, two processes typically studied separately, interact in colonizing plant species.
Collapse
|
15
|
Transposable element variants and their potential adaptive impact in urban populations of the malaria vector Anopheles coluzzii. Genome Res 2021; 32:189-202. [PMID: 34965939 PMCID: PMC8744685 DOI: 10.1101/gr.275761.121] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 11/24/2021] [Indexed: 11/28/2022]
Abstract
Anopheles coluzzii is one of the primary vectors of human malaria in sub-Saharan Africa. Recently, it has spread into the main cities of Central Africa threatening vector control programs. The adaptation of An. coluzzii to urban environments partly results from an increased tolerance to organic pollution and insecticides. Some of the molecular mechanisms for ecological adaptation are known, but the role of transposable elements (TEs) in the adaptive processes of this species has not been studied yet. As a first step toward assessing the role of TEs in rapid urban adaptation, we sequenced using long reads six An. coluzzii genomes from natural breeding sites in two major Central Africa cities. We de novo annotated TEs in these genomes and in an additional high-quality An. coluzzii genome, and we identified 64 new TE families. TEs were nonrandomly distributed throughout the genome with significant differences in the number of insertions of several superfamilies across the studied genomes. We identified seven putatively active families with insertions near genes with functions related to vectorial capacity, and several TEs that may provide promoter and transcription factor binding sites to insecticide resistance and immune-related genes. Overall, the analysis of multiple high-quality genomes allowed us to generate the most comprehensive TE annotation in this species to date and identify several TE insertions that could potentially impact both genome architecture and the regulation of functionally relevant genes. These results provide a basis for future studies of the impact of TEs on the biology of An. coluzzii.
Collapse
|
16
|
Transposable Element Populations Shed Light on the Evolutionary History of Wheat and the Complex Co-Evolution of Autonomous and Non-Autonomous Retrotransposons. ADVANCED GENETICS (HOBOKEN, N.J.) 2021; 3:2100022. [PMID: 36619351 PMCID: PMC9744471 DOI: 10.1002/ggn2.202100022] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Indexed: 01/11/2023]
Abstract
Wheat has one of the largest and most repetitive genomes among major crop plants, containing over 85% transposable elements (TEs). TEs populate genomes much in the way that individuals populate ecosystems, diversifying into different lineages, sub-families and sub-populations. The recent availability of high-quality, chromosome-scale genome sequences from ten wheat lines enables a detailed analysis how TEs evolved in allohexaploid wheat, its diploids progenitors, and in various chromosomal haplotype segments. LTR retrotransposon families evolved into distinct sub-populations and sub-families that were active in waves lasting several hundred thousand years. Furthermore, It is shown that different retrotransposon sub-families were active in the three wheat sub-genomes, making them useful markers to study and date polyploidization events and chromosomal rearrangements. Additionally, haplotype-specific TE sub-families are used to characterize chromosomal introgressions in different wheat lines. Additionally, populations of non-autonomous TEs co-evolved over millions of years with their autonomous partners, leading to complex systems with multiple types of autonomous, semi-autonomous and non-autonomous elements. Phylogenetic and TE population analyses revealed the relationships between non-autonomous elements and their mobilizing autonomous partners. TE population analysis provided insights into genome evolution of allohexaploid wheat and genetic diversity of species, and may have implication for future crop breeding.
Collapse
|
17
|
Differentially Amplified Repetitive Sequences Among Aegilops tauschii Subspecies and Genotypes. FRONTIERS IN PLANT SCIENCE 2021; 12:716750. [PMID: 34490015 PMCID: PMC8417419 DOI: 10.3389/fpls.2021.716750] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2021] [Accepted: 07/27/2021] [Indexed: 06/13/2023]
Abstract
Genomic repetitive sequences commonly show species-specific sequence type, abundance, and distribution patterns, however, their intraspecific characteristics have been poorly described. We quantified the genomic repetitive sequences and performed single nucleotide polymorphism (SNP) analysis between 29 Ae. tauschii genotypes and subspecies using publicly available raw genomic Illumina sequence reads and used fluorescence in situ hybridization (FISH) to experimentally analyze some repeats. The majority of the identified repetitive sequences had similar contents and proportions between anathera, meyeri, and strangulata subspecies. However, two Ty3/gypsy retrotransposons (CL62 and CL87) showed significantly higher abundances, and CL1, CL119, CL213, CL217 tandem repeats, and CL142 retrotransposon (Ty1/copia type) showed significantly lower abundances in subspecies strangulata compared with the subspecies anathera and meyeri. One tandem repeat and 45S ribosomal DNA (45S rDNA) abundances showed a high variation between genotypes but their abundances were not subspecies specific. Phylogenetic analysis using the repeat abundances of the aforementioned clusters placed the strangulata subsp. in a distinct clade but could not discriminate anathera and meyeri. A near complete differentiation of anathera and strangulata subspecies was observed using SNP analysis; however, var. meyeri showed higher genetic diversity. FISH using major tandem repeats couldn't detect differences between subspecies, although (GAA)10 signal patterns generated two different karyotype groups. Taken together, the different classes of repetitive DNA sequences have differentially accumulated between strangulata and the other two subspecies of Ae. tauschii that is generally in agreement with spike morphology, implying that factors affecting repeatome evolution are variable even among highly closely related lineages.
Collapse
|
18
|
Rare transposable elements challenge the prevailing view of transposition dynamics in plants. AMERICAN JOURNAL OF BOTANY 2021; 108:1310-1314. [PMID: 34415576 PMCID: PMC9290919 DOI: 10.1002/ajb2.1709] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 05/10/2021] [Accepted: 05/12/2021] [Indexed: 06/01/2023]
|
19
|
On the Origin of Tetraploid Vernal Grasses ( Anthoxanthum) in Europe. Genes (Basel) 2021; 12:966. [PMID: 34202779 PMCID: PMC8308110 DOI: 10.3390/genes12070966] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 06/19/2021] [Accepted: 06/23/2021] [Indexed: 11/16/2022] Open
Abstract
Polyploidy has played a crucial role in the evolution of many plant taxa, namely in higher latitudinal zones. Surprisingly, after several decades of an intensive research on polyploids, there are still common polyploid species whose evolutionary history is virtually unknown. Here, we addressed the origin of sweet vernal grass (Anthoxanthum odoratum) using flow cytometry, DNA sequencing, and in situ hybridization-based cytogenetic techniques. An allotetraploid and polytopic origin of the species has been verified. The chromosome study reveals an extensive variation between the European populations. In contrast, an autopolyploid origin of the rarer tetraploid vernal grass species, A. alpinum, has been corroborated. Diploid A. alpinum played an essential role in the polyploidization of both European tetraploids studied.
Collapse
|
20
|
InpactorDB: A Classified Lineage-Level Plant LTR Retrotransposon Reference Library for Free-Alignment Methods Based on Machine Learning. Genes (Basel) 2021; 12:genes12020190. [PMID: 33525408 PMCID: PMC7910972 DOI: 10.3390/genes12020190] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/21/2021] [Accepted: 01/22/2021] [Indexed: 12/04/2022] Open
Abstract
Long terminal repeat (LTR) retrotransposons are mobile elements that constitute the major fraction of most plant genomes. The identification and annotation of these elements via bioinformatics approaches represent a major challenge in the era of massive plant genome sequencing. In addition to their involvement in genome size variation, LTR retrotransposons are also associated with the function and structure of different chromosomal regions and can alter the function of coding regions, among others. Several sequence databases of plant LTR retrotransposons are available for public access, such as PGSB and RepetDB, or restricted access such as Repbase. Although these databases are useful to identify LTR-RTs in new genomes by similarity, the elements of these databases are not fully classified to the lineage (also called family) level. Here, we present InpactorDB, a semi-curated dataset composed of 130,439 elements from 195 plant genomes (belonging to 108 plant species) classified to the lineage level. This dataset has been used to train two deep neural networks (i.e., one fully connected and one convolutional) for the rapid classification of these elements. In lineage-level classification approaches, we obtain up to 98% performance, indicated by the F1-score, precision and recall scores.
Collapse
|
21
|
Rapid Genome Evolution and Adaptation of Thlaspi arvense Mediated by Recurrent RNA-Based and Tandem Gene Duplications. FRONTIERS IN PLANT SCIENCE 2021; 12:772655. [PMID: 35058947 PMCID: PMC8764390 DOI: 10.3389/fpls.2021.772655] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 11/09/2021] [Indexed: 05/21/2023]
Abstract
Retrotransposons are the most abundant group of transposable elements (TEs) in plants, providing an extraordinarily versatile source of genetic variation. Thlaspi arvense, a close relative of the model plant Arabidopsis thaliana with worldwide distribution, thrives from sea level to above 4,000 m elevation in the Qinghai-Tibet Plateau (QTP), China. Its strong adaptability renders it an ideal model system for studying plant adaptation in extreme environments. However, how the retrotransposons affect the T. arvense genome evolution and adaptation is largely unknown. We report a high-quality chromosome-scale genome assembly of T. arvense with a scaffold N50 of 59.10 Mb. Long terminal repeat retrotransposons (LTR-RTs) account for 56.94% of the genome assembly, and the Gypsy superfamily is the most abundant TEs. The amplification of LTR-RTs in the last six million years primarily contributed to the genome size expansion in T. arvense. We identified 351 retrogenes and 303 genes flanked by LTRs, respectively. A comparative analysis showed that orthogroups containing those retrogenes and genes flanked by LTRs have a higher percentage of significantly expanded orthogroups (SEOs), and these SEOs possess more recent tandem duplicated genes. All present results indicate that RNA-based gene duplication (retroduplication) accelerated the subsequent tandem duplication of homologous genes resulting in family expansions, and these expanded gene families were implicated in plant growth, development, and stress responses, which were one of the pivotal factors for T. arvense's adaptation to the harsh environment in the QTP regions. In conclusion, the high-quality assembly of the T. arvense genome provides insights into the retroduplication mediated mechanism of plant adaptation to extreme environments.
Collapse
|
22
|
TE-greedy-nester: structure-based detection of LTR retrotransposons and their nesting. Bioinformatics 2020; 36:4991-4999. [PMID: 32663247 PMCID: PMC7755421 DOI: 10.1093/bioinformatics/btaa632] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 06/08/2020] [Accepted: 07/07/2020] [Indexed: 11/23/2022] Open
Abstract
Motivation Transposable elements (TEs) in eukaryotes often get inserted into one another, forming sequences that become a complex mixture of full-length elements and their fragments. The reconstruction of full-length elements and the order in which they have been inserted is important for genome and transposon evolution studies. However, the accumulation of mutations and genome rearrangements over evolutionary time makes this process error-prone and decreases the efficiency of software aiming to recover all nested full-length TEs. Results We created software that uses a greedy recursive algorithm to mine increasingly fragmented copies of full-length LTR retrotransposons in assembled genomes and other sequence data. The software called TE-greedy-nester considers not only sequence similarity but also the structure of elements. This new tool was tested on a set of natural and synthetic sequences and its accuracy was compared to similar software. We found TE-greedy-nester to be superior in a number of parameters, namely computation time and full-length TE recovery in highly nested regions. Availability and implementation http://gitlab.fi.muni.cz/lexa/nested. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
|
23
|
Impact of Transposable Elements on Methylation and Gene Expression across Natural Accessions of Brachypodium distachyon. Genome Biol Evol 2020; 12:1994-2001. [PMID: 32853352 PMCID: PMC7643609 DOI: 10.1093/gbe/evaa180] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/22/2020] [Indexed: 12/25/2022] Open
Abstract
Transposable elements (TEs) constitute a large fraction of plant genomes and are mostly present in a transcriptionally silent state through repressive epigenetic modifications, such as DNA methylation. TE silencing is believed to influence the regulation of adjacent genes, possibly as DNA methylation spreads away from the TE. Whether this is a general principle or a context-dependent phenomenon is still under debate, pressing for studying the relationship between TEs, DNA methylation, and nearby gene expression in additional plant species. Here, we used the grass Brachypodium distachyon as a model and produced DNA methylation and transcriptome profiles for 11 natural accessions. In contrast to what is observed in Arabidopsis thaliana, we found that TEs have a limited impact on methylation spreading and that only few TE families are associated with a low expression of their adjacent genes. Interestingly, we found that a subset of TE insertion polymorphisms is associated with differential gene expression across accessions. Thus, although not having a global impact on gene expression, distinct TE insertions may contribute to specific gene expression patterns in B. distachyon.
Collapse
|
24
|
Repeat-sequence turnover shifts fundamentally in species with large genomes. NATURE PLANTS 2020; 6:1325-1329. [PMID: 33077876 DOI: 10.1038/s41477-020-00785-x] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 09/14/2020] [Indexed: 05/04/2023]
Abstract
Given the 2,400-fold range of genome sizes (0.06-148.9 Gbp (gigabase pair)) of seed plants (angiosperms and gymnosperms) with a broadly similar gene content (amounting to approximately 0.03 Gbp), the repeat-sequence content of the genome might be expected to increase with genome size, resulting in the largest genomes consisting almost entirely of repetitive sequences. Here we test this prediction, using the same bioinformatic approach for 101 species to ensure consistency in what constitutes a repeat. We reveal a fundamental change in repeat turnover in genomes above around 10 Gbp, such that species with the largest genomes are only about 55% repetitive. Given that genome size influences many plant traits, habits and life strategies, this fundamental shift in repeat dynamics is likely to affect the evolutionary trajectory of species lineages.
Collapse
|
25
|
Advances on genomics, biology, ecology and evolution of Brachypodium, a bridging model grass system for cereals and biofuel grasses. THE NEW PHYTOLOGIST 2020; 227:1587-1590. [PMID: 33439505 DOI: 10.1111/nph.16831] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
|