1
|
Chromosome-level genome assembly of milk thistle (Silybum marianum (L.) Gaertn.). Sci Data 2024; 11:342. [PMID: 38580686 PMCID: PMC10997770 DOI: 10.1038/s41597-024-03178-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Accepted: 03/22/2024] [Indexed: 04/07/2024] Open
Abstract
Silybum marianum (L.) Gaertn., commonly known as milk thistle, is a medicinal plant belonging to the Asteraceae family. This plant has been recognized for its medicinal properties for over 2,000 years. However, the genome of this plant remains largely undiscovered, having no reference genome at a chromosomal level. Here, we assembled the chromosome-level genome of S. marianum, allowing for the annotation of 53,552 genes and the identification of transposable elements comprising 58% of the genome. The genome assembly from this study showed 99.1% completeness as determined by BUSCO assessment, while the previous assembly (ASM154182v1) showed 36.7%. Functional annotation of the predicted genes showed 50,329 genes (94% of total genes) with known protein functions in public databases. Comparative genome analysis among Asteraceae plants revealed a striking conservation of collinearity between S. marianum and C. cardunculus. The genomic information generated from this study will be a valuable resource for milk thistle breeding and for use by the larger research community.
Collapse
|
2
|
Genome-wide analysis of horizontal transfer in non-model wild species from a natural ecosystem reveals new insights into genetic exchange in plants. PLoS Genet 2023; 19:e1010964. [PMID: 37856455 PMCID: PMC10586619 DOI: 10.1371/journal.pgen.1010964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 09/11/2023] [Indexed: 10/21/2023] Open
Abstract
Horizontal transfer (HT) refers to the exchange of genetic material between divergent species by mechanisms other than reproduction. In recent years, several studies have demonstrated HTs in eukaryotes, particularly in the context of parasitic relationships and in model species. However, very little is known about HT in natural ecosystems, especially those involving non-parasitic wild species, and the nature of the ecological relationships that promote these HTs. In this work, we conducted a pilot study investigating HTs by sequencing the genomes of 17 wild non-model species from a natural ecosystem, the Massane forest, located in southern France. To this end, we developed a new computational pipeline called INTERCHANGE that is able to characterize HTs at the whole genome level without prior annotation and directly in the raw sequencing reads. Using this pipeline, we identified 12 HT events, half of which occurred between lianas and trees. We found that mainly low copy number LTR-retrotransposons from the Copia superfamily were transferred between these wild plant species, especially those of the Ivana and Ale lineages. This study revealed a possible new route for HTs between non-parasitic plants and provides new insights into the genomic characteristics of horizontally transferred DNA in plant genomes.
Collapse
|
3
|
Horizontal Gene Transfers in Plants. Life (Basel) 2021; 11:life11080857. [PMID: 34440601 PMCID: PMC8401529 DOI: 10.3390/life11080857] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Revised: 08/10/2021] [Accepted: 08/16/2021] [Indexed: 12/24/2022] Open
Abstract
In plants, as in all eukaryotes, the vertical transmission of genetic information through reproduction ensures the maintenance of the integrity of species. However, many reports over the past few years have clearly shown that horizontal gene transfers, referred to as HGTs (the interspecific transmission of genetic information across reproductive barriers) are very common in nature and concern all living organisms including plants. The advent of next-generation sequencing technologies (NGS) has opened new perspectives for the study of HGTs through comparative genomic approaches. In this review, we provide an up-to-date view of our current knowledge of HGTs in plants.
Collapse
|
4
|
The m 6A pathway protects the transcriptome integrity by restricting RNA chimera formation in plants. Life Sci Alliance 2019; 2:2/3/e201900393. [PMID: 31142640 PMCID: PMC6545605 DOI: 10.26508/lsa.201900393] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Revised: 05/20/2019] [Accepted: 05/20/2019] [Indexed: 11/24/2022] Open
Abstract
This study reveals that an m6A-assisted polyadenylation pathway comprising conserved m6A writer proteins and a plant-specific m6A reader contributes to transcriptome integrity in Arabidopsis thaliana by restricting RNA chimera formation at rearranged loci. Global, segmental, and gene duplication–related processes are driving genome size and complexity in plants. Despite their evolutionary potentials, those processes can also have adverse effects on genome regulation, thus implying the existence of specialized corrective mechanisms. Here, we report that an N6-methyladenosine (m6A)–assisted polyadenylation (m-ASP) pathway ensures transcriptome integrity in Arabidopsis thaliana. Efficient m-ASP pathway activity requires the m6A methyltransferase-associated factor FIP37 and CPSF30L, an m6A reader corresponding to an YT512-B Homology Domain-containing protein (YTHDC)-type domain containing isoform of the 30-kD subunit of cleavage and polyadenylation specificity factor. Targets of the m-ASP pathway are enriched in recently rearranged gene pairs, displayed an atypical chromatin signature, and showed transcriptional readthrough and mRNA chimera formation in FIP37- and CPSF30L-deficient plants. Furthermore, we showed that the m-ASP pathway can also restrict the formation of chimeric gene/transposable-element transcript, suggesting a possible implication of this pathway in the control of transposable elements at specific locus. Taken together, our results point to selective recognition of 3′-UTR m6A as a safeguard mechanism ensuring transcriptome integrity at rearranged genomic loci in plants.
Collapse
|
5
|
The genome sequence of segmental allotetraploid peanut Arachis hypogaea. Nat Genet 2019; 51:877-884. [PMID: 31043755 DOI: 10.1038/s41588-019-0405-z] [Citation(s) in RCA: 285] [Impact Index Per Article: 57.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Accepted: 03/28/2019] [Indexed: 12/24/2022]
Abstract
Like many other crops, the cultivated peanut (Arachis hypogaea L.) is of hybrid origin and has a polyploid genome that contains essentially complete sets of chromosomes from two ancestral species. Here we report the genome sequence of peanut and show that after its polyploid origin, the genome has evolved through mobile-element activity, deletions and by the flow of genetic information between corresponding ancestral chromosomes (that is, homeologous recombination). Uniformity of patterns of homeologous recombination at the ends of chromosomes favors a single origin for cultivated peanut and its wild counterpart A. monticola. However, through much of the genome, homeologous recombination has created diversity. Using new polyploid hybrids made from the ancestral species, we show how this can generate phenotypic changes such as spontaneous changes in the color of the flowers. We suggest that diversity generated by these genetic mechanisms helped to favor the domestication of the polyploid A. hypogaea over other diploid Arachis species cultivated by humans.
Collapse
|
6
|
Publisher Correction: Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza. Nat Genet 2018; 50:1618. [PMID: 30291357 DOI: 10.1038/s41588-018-0261-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
This article was not made open access when initially published online, which was corrected before print publication. In addition, ORCID links were missing for 12 authors and have been added to the HTML and PDF versions of the article.
Collapse
|
7
|
Genic C-Methylation in Soybean Is Associated with Gene Paralogs Relocated to Transposable Element-Rich Pericentromeres. MOLECULAR PLANT 2018; 11:485-495. [PMID: 29476915 DOI: 10.1016/j.molp.2018.02.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2017] [Revised: 02/15/2018] [Accepted: 02/15/2018] [Indexed: 06/08/2023]
Abstract
Most plants are polyploid due to whole-genome duplications (WGD) and can thus have duplicated genes. Following a WGD, paralogs are often fractionated (lost) and few duplicate pairs remain. Little attention has been paid to the role of DNA methylation in the functional divergence of paralogous genes. Using high-resolution methylation maps of accessions of domesticated and wild soybean, we show that in soybean, a recent paleopolyploid with many paralogs, DNA methylation likely contributed to the elimination of genetic redundancy of polyploidy-derived gene paralogs. Transcriptionally silenced paralogs exhibit particular genomic features as they are often associated with proximal transposable elements (TEs) and are preferentially located in pericentromeres, likely due to gene movement during evolution. Additionally, we provide evidence that gene methylation associated with proximal TEs is implicated in the divergence of expression profiles between orthologous genes of wild and domesticated soybean, and within populations.
Collapse
|
8
|
Reconciling the evolutionary origin of bread wheat (Triticum aestivum). THE NEW PHYTOLOGIST 2017; 213:1477-1486. [PMID: 27551821 DOI: 10.1111/nph.14113] [Citation(s) in RCA: 76] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Accepted: 06/18/2016] [Indexed: 05/26/2023]
Abstract
The origin of bread wheat (Triticum aestivum; AABBDD) has been a subject of controversy and of intense debate in the scientific community over the last few decades. In 2015, three articles published in New Phytologist discussed the origin of hexaploid bread wheat (AABBDD) from the diploid progenitors Triticum urartu (AA), a relative of Aegilops speltoides (BB) and Triticum tauschii (DD). Access to new genomic resources since 2013 has offered the opportunity to gain novel insights into the paleohistory of modern bread wheat, allowing characterization of its origin from its diploid progenitors at unprecedented resolution. We propose a reconciled evolutionary scenario for the modern bread wheat genome based on the complementary investigation of transposable element and mutation dynamics between diploid, tetraploid and hexaploid wheat. In this scenario, the structural asymmetry observed between the A, B and D subgenomes in hexaploid bread wheat derives from the cumulative effect of diploid progenitor divergence, the hybrid origin of the D subgenome, and subgenome partitioning following the polyploidization events.
Collapse
|
9
|
Comprehensive definition of genome features in Spirodela polyrhiza by high-depth physical mapping and short-read DNA sequencing strategies. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 89:617-635. [PMID: 27754575 DOI: 10.1111/tpj.13400] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2016] [Revised: 10/05/2016] [Accepted: 10/07/2016] [Indexed: 05/15/2023]
Abstract
Spirodela polyrhiza is a fast-growing aquatic monocot with highly reduced morphology, genome size and number of protein-coding genes. Considering these biological features of Spirodela and its basal position in the monocot lineage, understanding its genome architecture could shed light on plant adaptation and genome evolution. Like many draft genomes, however, the 158-Mb Spirodela genome sequence has not been resolved to chromosomes, and important genome characteristics have not been defined. Here we deployed rapid genome-wide physical maps combined with high-coverage short-read sequencing to resolve the 20 chromosomes of Spirodela and to empirically delineate its genome features. Our data revealed a dramatic reduction in the number of the rDNA repeat units in Spirodela to fewer than 100, which is even fewer than that reported for yeast. Consistent with its unique phylogenetic position, small RNA sequencing revealed 29 Spirodela-specific microRNA, with only two being shared with Elaeis guineensis (oil palm) and Musa balbisiana (banana). Combining DNA methylation data and small RNA sequencing enabled the accurate prediction of 20.5% long terminal repeats (LTRs) that doubled the previous estimate, and revealed a high Solo:Intact LTR ratio of 8.2. Interestingly, we found that Spirodela has the lowest global DNA methylation levels (9%) of any plant species tested. Taken together our results reveal a genome that has undergone reduction, likely through eliminating non-essential protein coding genes, rDNA and LTRs. In addition to delineating the genome features of this unique plant, the methodologies described and large-scale genome resources from this work will enable future evolutionary and functional studies of this basal monocot family.
Collapse
|
10
|
A Comparative Epigenomic Analysis of Polyploidy-Derived Genes in Soybean and Common Bean. PLANT PHYSIOLOGY 2015; 168:1433-47. [PMID: 26149573 PMCID: PMC4528746 DOI: 10.1104/pp.15.00408] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Accepted: 07/03/2015] [Indexed: 05/02/2023]
Abstract
Soybean (Glycine max) and common bean (Phaseolus vulgaris) share a paleopolyploidy (whole-genome duplication [WGD]) event, approximately 56.5 million years ago, followed by a genus Glycine-specific polyploidy, approximately 10 million years ago. Cytosine methylation is an epigenetic mark that plays an important role in the regulation of genes and transposable elements (TEs); however, the role of DNA methylation in the fate/evolution of genes following polyploidy and speciation has not been fully explored. Whole-genome bisulfite sequencing was used to produce nucleotide resolution methylomes for soybean and common bean. We found that, in soybean, CG body-methylated genes were abundant in WGD genes, which were, on average, more highly expressed than single-copy genes and had slower evolutionary rates than unmethylated genes, suggesting that WGD genes evolve more slowly than single-copy genes. CG body-methylated genes were also enriched in shared single-copy genes (single copy in both species) that may be responsible for the broad and high expression patterns of this class of genes. In addition, diverged methylation patterns in non-CG contexts between paralogs were due mostly to TEs in or near genes, suggesting a role for TEs and non-CG methylation in regulating gene expression post polyploidy. Reference methylomes for both soybean and common bean were constructed, providing resources for investigating epigenetic variation in legume crops. Also, the analysis of methylation patterns of duplicated and single-copy genes has provided insights into the functional consequences of polyploidy and epigenetic regulation in plant genomes.
Collapse
|
11
|
A new approach for annotation of transposable elements using small RNA mapping. Nucleic Acids Res 2015; 43:e84. [PMID: 25813049 PMCID: PMC4513842 DOI: 10.1093/nar/gkv257] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2014] [Revised: 03/10/2015] [Accepted: 03/15/2015] [Indexed: 12/31/2022] Open
Abstract
Transposable elements (TEs) are mobile genomic DNA sequences found in most organisms. They so densely populate the genomes of many eukaryotic species that they are often the major constituents. With the rapid generation of many plant genome sequencing projects over the past few decades, there is an urgent need for improved TE annotation as a prerequisite for genome-wide studies. Analogous to the use of RNA-seq for gene annotation, we propose a new method for de novo TE annotation that uses as a guide 24 nt-siRNAs that are a part of TE silencing pathways. We use this new approach, called TASR (for Transposon Annotation using Small RNAs), for de novo annotation of TEs in Arabidopsis, rice and soybean and demonstrate that this strategy can be successfully applied for de novo TE annotation in plants.Executable PERL is available for download from: http://tasr-pipeline.sourceforge.net/.
Collapse
|
12
|
RiTE database: a resource database for genus-wide rice genomics and evolutionary biology. BMC Genomics 2015; 16:538. [PMID: 26194356 PMCID: PMC4508813 DOI: 10.1186/s12864-015-1762-3] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2015] [Accepted: 07/09/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Comparative evolutionary analysis of whole genomes requires not only accurate annotation of gene space, but also proper annotation of the repetitive fraction which is often the largest component of most if not all genomes larger than 50 kb in size. RESULTS Here we present the Rice TE database (RiTE-db)--a genus-wide collection of transposable elements and repeated sequences across 11 diploid species of the genus Oryza and the closely-related out-group Leersia perrieri. The database consists of more than 170,000 entries divided into three main types: (i) a classified and curated set of publicly-available repeated sequences, (ii) a set of consensus assemblies of highly-repetitive sequences obtained from genome sequencing surveys of 12 species; and (iii) a set of full-length TEs, identified and extracted from 12 whole genome assemblies. CONCLUSIONS This is the first report of a repeat dataset that spans the majority of repeat variability within an entire genus, and one that includes complete elements as well as unassembled repeats. The database allows sequence browsing, downloading, and similarity searches. Because of the strategy adopted, the RiTE-db opens a new path to unprecedented direct comparative studies that span the entire nuclear repeat content of 15 million years of Oryza diversity.
Collapse
|
13
|
Abstract
Cytosine DNA methylation is the addition of a methyl group to the 5' position of a cytosine, which plays a crucial role in plant development and gene silencing. Genome-wide profiling of DNA methylation is now possible using various techniques and strategies. Using these technologies, we are beginning to elucidate the extent and impact of variation in DNA methylation between individuals and/or tissues. Here, we review the different techniques used to analyze the methylomes at the whole-genome level and their applications to better understand epigenetic variations in plants.
Collapse
|
14
|
Abstract
Vertical, transgenerational transmission of genetic material occurs through reproduction of living organisms. In addition to vertical inheritance, horizontal gene transfer between reproductively isolated species has recently been shown to be an important, if not dominant, mechanism in the evolution of prokaryotic genomes. In contrast, only a few horizontal transfer (HT) events have been characterized so far in eukaryotes and mainly concern transposable elements (TEs). Whether these are frequent and have a significant impact on genome evolution remains largely unknown. We performed a computational search for highly conserved LTR retrotransposons among 40 sequenced eukaryotic genomes representing the major plant families. We found that 26 genomes (65%) harbor at least one case of horizontal TE transfer (HTT). These transfers concern species as distantly related as palm and grapevine, tomato and bean, or poplar and peach. In total, we identified 32 cases of HTTs, which could translate into more than 2 million among the 13,551 monocot and dicot genera. Moreover, we show that these TEs have remained functional after their transfer, occasionally causing a transpositional burst. This suggests that plants can frequently exchange genetic material through horizontal transfers and that this mechanism may be important in TE-driven genome evolution.
Collapse
|
15
|
Comparative genomic paleontology across plant kingdom reveals the dynamics of TE-driven genome evolution. Genome Biol Evol 2013; 5:954-65. [PMID: 23426643 PMCID: PMC3673626 DOI: 10.1093/gbe/evt025] [Citation(s) in RCA: 114] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Long terminal repeat-retrotransposons (LTR-RTs) are the most abundant class of transposable elements (TEs) in plants. They strongly impact the structure, function, and evolution of their host genome, and, in particular, their role in genome size variation has been clearly established. However, the dynamics of the process through which LTR-RTs have differentially shaped plant genomes is still poorly understood because of a lack of comparative studies. Using a new robust and automated family classification procedure, we exhaustively characterized the LTR-RTs in eight plant genomes for which a high-quality sequence is available (i.e., Arabidopsis thaliana, A. lyrata, grapevine, soybean, rice, Brachypodium dystachion, sorghum, and maize). This allowed us to perform a comparative genome-wide study of the retrotranspositional landscape in these eight plant lineages from both monocots and dicots. We show that retrotransposition has recurrently occurred in all plant genomes investigated, regardless their size, and through bursts, rather than a continuous process. Moreover, in each genome, only one or few LTR-RT families have been active in the recent past, and the difference in genome size among the species studied could thus mostly be accounted for by the extent of the latest transpositional burst(s). Following these bursts, LTR-RTs are efficiently eliminated from their host genomes through recombination and deletion, but we show that the removal rate is not lineage specific. These new findings lead us to propose a new model of TE-driven genome evolution in plants.
Collapse
|