1
|
RNA-Seq transcriptome profiling of immature grain wheat is a technique for understanding comparative modeling of baking quality. Sci Rep 2024; 14:10940. [PMID: 38740888 DOI: 10.1038/s41598-024-61528-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/23/2023] [Accepted: 05/07/2024] [Indexed: 05/16/2024]
Abstract
Improving baking quality is a primary challenge in the wheat flour production value chain, as baking quality is a crucial factor in determining the overall value of flour. In the present study, we conducted a comparative RNA-Seq analysis of the high baking quality mutant "O-64.1.10" genotype and its low baking quality wild type, the "Omid" cultivar, to identify potential genes associated with bread quality. The cDNA libraries were constructed from immature grains at 15 days post-anthesis, with an average of 16.24 and 18.97 million paired-end short-read sequences in the mutant and wild type, respectively. A total of 733 transcripts with differential expression were identified, 585 genes up-regulated and 188 genes down-regulated in the "O-64.1.10" genotype compared to "Omid". In addition, the HSF, bZIP, C2C2-Dof, B3-ARF, BES1, C3H, GRF, HB-HD-ZIP, PLATZ, MADS-MIKC, GARP-G2-like, NAC, OFP and TUB families appeared as the key transcription factors with specific expression in the "O-64.1.10" genotype. At the same time, pathways related to baking quality were identified through the Kyoto Encyclopedia of Genes and Genomes. Collectively, we found that the endoplasmic network, metabolic pathways, secondary metabolite biosynthesis, hormone signaling pathway, B group vitamins, protein pathways, pathways associated with carbohydrate and fat metabolism, as well as the biosynthesis and metabolism of various amino acids, have considerable potential to play a significant role in baking quality. Ultimately, the RNA-seq results were confirmed using quantitative reverse transcription PCR for some hub genes such as alpha-gliadin, low molecular weight glutenin subunit and terpene synthase (gibberellin). As a resource for future study, 127 EST-SSR primers were generated using the RNA-seq data.
|
2
|
Homoplastic versus xenoplastic evolution: exploring the emergence of key intrinsic and extrinsic traits in the montane genus Soldanella (Primulaceae). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 118:753-765. [PMID: 38217489 DOI: 10.1111/tpj.16630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/08/2023] [Revised: 12/02/2023] [Accepted: 12/27/2023] [Indexed: 01/15/2024]
Abstract
Specific ecological conditions in the high mountain environment exert a selective pressure that often leads to convergent trait evolution. Reticulations induced by incomplete lineage sorting and introgression can lead to discordant trait patterns among gene and species trees (hemiplasy/xenoplasy), providing a false illusion that the traits under study are homoplastic. Using phylogenetic species networks, we explored the effect of gene exchange on trait evolution in Soldanella, a genus profoundly influenced by historical introgression. At least three features evolved independently multiple times: the single-flowered dwarf phenotype, dysploid cytotype, and ecological generalism. The present analyses also indicated that the recurring occurrence of stoloniferous growth might have been prompted by an introgression event between an ancestral lineage and a still extant species, although its emergence via convergent evolution cannot be completely ruled out. Phylogenetic regression suggested that the independent evolution of larger genomes in snowbells is most likely a result of the interplay between hybridization events of dysploid and euploid taxa and hostile environments at the range margins of the genus. The emergence of key intrinsic and extrinsic traits in snowbells has been significantly impacted not only by convergent evolution but also by historical and recent introgression events.
|
3
|
Genome assembly and microsatellite marker development using Illumina and PacBio sequencing in Persicaria maackiana (Polygonaceae) from Korea. Genes Genomics 2024; 46:187-202. [PMID: 38240922 DOI: 10.1007/s13258-023-01479-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/26/2023] [Accepted: 11/23/2023] [Indexed: 01/30/2024]
Abstract
BACKGROUND Persicaria maackiana (Regel) is a potential medicinal plant that exerts anti-diabetic effects. However, the lack of genomic information on P. maackiana hinders research at the molecular level. OBJECTIVE Herein, we aimed to construct a draft genome assembly and obtain comprehensive genomic information on P. maackiana using high-throughput sequencing tools PacBio Sequel II and Illumina. METHODS Persicaria maackiana samples from three natural populations in Gaecheon, Gichi, and Uiryeong reservoirs in South Korea were used to generate genomic DNA libraries, perform genome de novo assembly, gene ontology analysis, phylogenetic tree analysis, genotyping, and identify microsatellite markers. RESULTS The assembled P. maackiana genome yielded 32,179 contigs. Assessment of assembly integrity revealed 1503 (93.12%) complete Benchmarking Universal Single-Copy Orthologs. A total of 64,712 protein-coding genes were predicted and annotated successfully in the protein database. In the Kyoto Encyclopedia of Genes and Genomes (KEGG) orthologs, 13,778 genes were annotated into 18 categories. Genes that activated AMPK were identified in the KEGG pathway. A total of 316,992 microsatellite loci were identified, and primers targeting the flanking regions were developed for 292,059 microsatellite loci. Of these, 150 primer sets were randomly selected for amplification, and 30 of these primer sets were identified as polymorphic. These primers amplified 3-9 alleles. The mean observed and expected heterozygosity were 0.189 and 0.593, respectively. Polymorphism information content values of the markers were 0.361-0.754. CONCLUSION Collectively, our study provides a valuable resource for future comparative genomics, phylogeny, and population studies of P. maackiana.
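The heterozygosity and polymorphism information content (PIC) values reported in this abstract are conventionally derived from per-locus allele frequencies and genotype calls. The following is a minimal sketch, assuming the standard definitions He = 1 - sum(p_i^2) and the Botstein-style PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2. The abstract does not state which software or estimator the authors used, and the function names and example data are hypothetical.

```python
from itertools import combinations


def expected_heterozygosity(freqs):
    """He = 1 - sum(p_i^2) for allele frequencies that sum to 1."""
    return 1.0 - sum(p * p for p in freqs)


def polymorphism_information_content(freqs):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2."""
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2.0 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term


def observed_heterozygosity(genotypes):
    """Fraction of diploid genotypes (allele_a, allele_b) that are heterozygous."""
    return sum(1 for a, b in genotypes if a != b) / len(genotypes)


if __name__ == "__main__":
    # Hypothetical three-allele microsatellite locus (illustrative values only).
    example_freqs = [0.5, 0.3, 0.2]
    example_genotypes = [(150, 150), (150, 154), (154, 158), (150, 150)]
    print("He :", expected_heterozygosity(example_freqs))
    print("PIC:", polymorphism_information_content(example_freqs))
    print("Ho :", observed_heterozygosity(example_genotypes))
```

An observed heterozygosity well below the expected value, as reported here (0.189 versus 0.593), is the kind of gap these two functions make visible when applied to the same locus.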
|
4
|
Structure characteristics of mutation sites in two waxy alleles from Yunnan waxy maize (Zea mays L. var. certaina Kulesh) landraces. PLoS One 2023; 18:e0291116. [PMID: 37682926 PMCID: PMC10490952 DOI: 10.1371/journal.pone.0291116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/29/2023] [Accepted: 08/22/2023] [Indexed: 09/10/2023]
Abstract
A large number of waxy maize landraces are distributed in Yunnan and surrounding areas, and these landraces carry abundant waxy alleles of different types. Identifying waxy alleles helps the protection and utilization of these waxy landraces. This study describes the structural characteristics of waxy genes from two specific landraces of Yunnan, Zinuoyumi and Myanmar Four-Row Wax. Zinuoyumi has two waxy alleles, wx-Cin4 and wx-Cin4-2; Myanmar Four-Row Wax has three waxy alleles, wx-D10, wx-Reina and wx-D11. The wx-Cin4-2 and wx-D11 alleles are first reported in this study. The wx-Cin4-2 allele has two mutation sites: a 30 bp deletion in exon 10 and an insertion of a 1,267 bp non-long terminal repeat (non-LTR) retrotransposon, Cin4, in intron 10, with a 13 bp extra sequence found at the 5' end of Cin4. The mutation site of wx-D11 is a 1,082 bp deletion spanning exons 11 to 14 of the waxy gene, replaced with a 72 bp filler sequence. This study enriches the known types of waxy alleles from Yunnan waxy maize landraces and further discusses the molecular basis for the formation of the mutation sites of wx-Cin4-2 and wx-D11.
|
5
|
Suppressing a phosphohydrolase of cytokinin nucleotide enhances grain yield in rice. Nat Genet 2023; 55:1381-1389. [PMID: 37500729 DOI: 10.1038/s41588-023-01454-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 08/11/2022] [Accepted: 06/21/2023] [Indexed: 07/29/2023]
Abstract
One-step and two-step pathways are proposed to synthesize cytokinin in plants. The one-step pathway is mediated by LONELY GUY (LOG) proteins. However, the enzyme for the two-step pathway remains to be identified. Here, we show that quantitative trait locus GY3 may boost grain yield by more than 20% through manipulating a two-step pathway. Locus GY3 encodes a LOG protein that acts as a 5'-ribonucleotide phosphohydrolase by excessively consuming the cytokinin precursors, which contrasts with the activity of canonical LOG members as phosphoribohydrolases in a one-step pathway. The residue S41 of GY3 is crucial for the dephosphorylation of iPRMP to produce iPR. A solo-LTR insertion within the promoter of GY3 suppressed its expression and resulted in a higher content of active cytokinins in young panicles. Introgression of GY302428 increased grain yield per plot by 7.4% to 16.3% in all investigated indica backgrounds, which demonstrates the great value of GY302428 in indica rice production.
|
6
|
Lineage-specific amplification and epigenetic regulation of LTR-retrotransposons contribute to the structure, evolution, and function of Fabaceae species. BMC Genomics 2023; 24:423. [PMID: 37501164 PMCID: PMC10373317 DOI: 10.1186/s12864-023-09530-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/07/2023] [Accepted: 07/22/2023] [Indexed: 07/29/2023]
Abstract
BACKGROUND Long terminal repeat (LTR)-retrotransposons (LTR-RTs) are ubiquitous and make up the majority of nearly all sequenced plant genomes, whereas their pivotal roles in genome evolution and gene expression regulation, as well as their epigenetic regulation, are still not well understood, especially in a large number of closely related species. RESULTS Here, we analyzed the abundance and dynamic evolution of LTR-RTs in 54 species from an economically and agronomically important family, Fabaceae, and also selected two representative species for further analysis of the expression of associated genes and the transcriptional activity and DNA methylation patterns of LTR-RTs. Annotation results revealed highly varied proportions of LTR-RTs in these genomes (5.1%-68.4%), and their correlation with genome size was highly positive; they contributed significantly to the variance in genome size through species-specific unique amplifications. Almost all of the intact LTR-RTs were inserted into the genomes 4 Mya (million years ago), and more than 50% of them were inserted in the last 0.5 million years, suggesting that recent amplifications of LTR-RTs were an important force driving genome evolution. In addition, expression levels of genes with intronic, promoter, and downstream LTR-RT insertions in Glycine max and Vigna radiata, two agronomically important crops in Fabaceae, showed that the LTR-RTs located in promoter or downstream regions suppressed associated gene expression. However, the LTR-RTs within introns promoted gene expression or had no effect on it. Additionally, shorter and younger LTR-RTs maintained higher mobility and transpositional potential. Compared with the transcriptionally silent LTR-RTs, the active elements showed significantly lower DNA methylation levels in all three contexts. The distributions of transcriptionally active and silent LTR-RT methylation varied across different lineages, owing to the positions of the LTR-RTs or potential epigenetic regulation. CONCLUSION Lineage-specific amplification patterns were observed, and higher methylation levels may repress the activity of LTR-RTs and further influence evolution in Fabaceae species. This study offers valuable clues to the evolution, function, transcriptional activity and epigenetic regulation of LTR-RTs in Fabaceae genomes.
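Insertion ages of intact LTR-RTs, as discussed in this abstract, are commonly estimated from the divergence between the two LTRs of a single element, which are identical at the time of insertion. The following is a minimal sketch, assuming the widely used relation T = K / (2r) with a Jukes-Cantor corrected distance K. The abstract does not give the authors' distance model or substitution rate, so the rate in the example is a placeholder and all function names are hypothetical.

```python
import math


def p_distance(ltr5, ltr3):
    """Proportion of mismatches between two aligned LTRs, skipping gap columns."""
    if len(ltr5) != len(ltr3):
        raise ValueError("sequences must be aligned to equal length")
    pairs = [
        (a, b)
        for a, b in zip(ltr5.upper(), ltr3.upper())
        if a != "-" and b != "-"
    ]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(a != b for a, b in pairs) / len(pairs)


def jukes_cantor(p):
    """Jukes-Cantor distance K = -3/4 * ln(1 - 4p/3); undefined for p >= 0.75."""
    if p >= 0.75:
        raise ValueError("divergence too high for Jukes-Cantor correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def insertion_age_years(ltr5, ltr3, rate_per_site_per_year):
    """T = K / (2r), where r is an assumed substitution rate per site per year."""
    k = jukes_cantor(p_distance(ltr5, ltr3))
    return k / (2.0 * rate_per_site_per_year)


if __name__ == "__main__":
    # Toy aligned LTR pair; the rate is a placeholder, not a value from the study.
    five_prime = "TGTTAGGACCATGCAT-GGTCA"
    three_prime = "TGTTAGGATCATGCATAGGTCA"
    print(insertion_age_years(five_prime, three_prime, 1.0e-8))
```

Because the age scales inversely with the assumed rate, reported insertion-age distributions are only comparable across studies that use the same rate and distance model.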
|
7
|
A Chromosome-level Reference Genome of African Oil Palm Provides Insights into Its Divergence and Stress Adaptation. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:440-454. [PMID: 36435453 PMCID: PMC10787024 DOI: 10.1016/j.gpb.2022.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/23/2021] [Revised: 10/02/2022] [Accepted: 11/17/2022] [Indexed: 11/27/2022]
Abstract
The palm family (Arecaceae), consisting of ∼ 2600 species, is the third most economically important family of plants. The African oil palm (Elaeis guineensis) is one of the most important palms. However, the genome sequences of palms that are currently available are still limited and fragmented. Here, we report a high-quality chromosome-level reference genome of an oil palm, Dura, assembled by integrating long reads with ∼ 150× genome coverage. The assembled genome was 1.7 Gb in size, covering 94.5% of the estimated genome, of which 91.6% was assigned into 16 pseudochromosomes and 73.7% was repetitive sequences. Relying on the conserved synteny with oil palm, the existing draft genome sequences of both date palm and coconut were further assembled into chromosomal level. Transposon burst, particularly long terminal repeat retrotransposons, following the last whole-genome duplication, likely explains the genome size variation across palms. Sequence analysis of the VIRESCENS gene in palms suggests that DNA variations in this gene are related to fruit colors. Recent duplications of highly tandemly repeated pathogenesis-related proteins from the same tandem arrays play an important role in defense responses to Ganoderma. Whole-genome resequencing of both ancestral African and introduced oil palms in Southeast Asia reveals that genes under putative selection are notably associated with stress responses, suggesting adaptation to stresses in the new habitat. The genomic resources and insights gained in this study could be exploited for accelerating genetic improvement and understanding the evolution of palms.
|
8
|
Chromatographic printed array strip (C-PAS) method for cultivar-specific identification of sweetpotato cultivars 'Beniharuka' and 'Fukumurasaki'. BREEDING SCIENCE 2023; 73:313-321. [PMID: 37840975 PMCID: PMC10570877 DOI: 10.1270/jsbbs.22101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/26/2022] [Accepted: 03/22/2023] [Indexed: 10/17/2023]
Abstract
Sweetpotato (Ipomoea batatas) cultivars grown in Japan are highly valued for their excellent sweetness, high quality, and good texture. The export volume of sweetpotato from Japan has been rising rapidly, with a 10-fold increase on a weight basis over the last 10 years. However, since sweetpotato is propagated vegetatively from storage roots, it is easy to cultivate and propagate this crop, prompting concerns that Japanese sweetpotato cultivars/lines are being exported overseas, cultivated without permission, or reimported. Therefore, a rapid and accurate cultivar identification methodology is needed. In this study, we comprehensively analyzed the insertion sites of Cl8 retrotransposon to develop a cultivar identification technique for the Japanese cultivars 'Beniharuka' and 'Fukumurasaki'. These two cultivars were successfully distinguished from other cultivars using a minimum of two marker sets. Using the chromatographic printed array strip (C-PAS) method for DNA signal detection, 'Beniharuka' and 'Fukumurasaki' can be precisely identified using a single strip of chromatographic paper based on multiplex DNA signals derived from the amplicons of the Cl8 insertion sites. Since this method can detect DNA signals in only ~15 minutes, we expect that our method will facilitate rapid, reliable, and convenient cultivar discrimination for on-site inspection of sweetpotato.
|
9
|
How to start a LINE: 5' switching rejuvenates LINE retrotransposons in tobacco and related Nicotiana species. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2023. [PMID: 36965091 DOI: 10.1111/tpj.16208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/01/2022] [Revised: 02/10/2023] [Accepted: 02/19/2023] [Indexed: 06/18/2023]
Abstract
By contrast to their conserved mammalian counterparts, plant long interspersed nuclear elements (LINEs) are highly variable, splitting into many low-copy families. Curiously, LINE families from the retrotransposable element (RTE) clade retain a stronger sequence conservation and hence reach higher copy numbers. The cause of this RTE-typical property is not yet understood, but would help clarify why some transposable elements are removed quickly, whereas others persist in plant genomes. Here, we bring forward a detailed study of RTE LINE structure, diversity and evolution in plants. For this, we argue that the nightshade family is the ideal taxon to follow the evolutionary trajectories of RTE LINEs, given their high abundance, recent activity and partnership to non-autonomous elements. Using bioinformatic, cytogenetic and molecular approaches, we detect 4029 full-length RTE LINEs across the Solanaceae. We finely characterize and manually curate a core group of 458 full-length LINEs in allotetraploid tobacco, show an integration event after polyploidization and trace hybridization by RTE LINE composition of parental genomes. Finally, we reveal the role of the untranslated regions (UTRs) as causes for the unique RTE LINE amplification and evolution pattern in plants. On the one hand, we detected a highly conserved motif at the 3' UTR, suggesting strong selective constraints acting on the RTE terminus. On the other hand, we observed successive rounds of 5' UTR cycling, constantly rejuvenating the promoter sequences. This interplay between exchangeable promoters and conserved LINE bodies and 3' UTR likely allows RTE LINEs to persist and thrive in plant genomes.
|
10
|
Discovering the Repeatome of Five Species Belonging to the Asteraceae Family: A Computational Study. PLANTS (BASEL, SWITZERLAND) 2023; 12:1405. [PMID: 36987093 PMCID: PMC10058865 DOI: 10.3390/plants12061405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/08/2023] [Revised: 03/08/2023] [Accepted: 03/20/2023] [Indexed: 06/19/2023]
Abstract
Genome divergence by repeat proliferation and/or loss is a process that plays a crucial role in species evolution. Nevertheless, knowledge of the variability related to repeat proliferation among species of the same family is still limited. Considering the importance of the Asteraceae family, here we present a first contribution towards the metarepeatome of five Asteraceae species. A comprehensive picture of the repetitive components of all genomes was obtained by genome skimming with Illumina sequence reads and by analyzing a pool of full-length long terminal repeat retrotransposons (LTR-REs). Genome skimming allowed us to estimate the abundance and variability of repetitive components. The structure of the metagenome of the selected species was composed of 67% repetitive sequences, of which LTR-REs represented the bulk of annotated clusters. The species essentially shared ribosomal DNA sequences, whereas the other classes of repetitive DNA were highly variable among species. The pool of full-length LTR-REs was retrieved from all the species and their age of insertion was established, showing several lineage-specific proliferation peaks over the last 15 million years. Overall, a large variability of repeat abundance at superfamily, lineage, and sublineage levels was observed, indicating that repeats within individual genomes followed different evolutionary and temporal dynamics, and that different events of amplification or loss of these sequences may have occurred after species differentiation.
|
11
|
PlantLTRdb: An interactive database for 195 plant species LTR-retrotransposons. FRONTIERS IN PLANT SCIENCE 2023; 14:1134627. [PMID: 36950350 PMCID: PMC10025401 DOI: 10.3389/fpls.2023.1134627] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 12/30/2022] [Accepted: 02/16/2023] [Indexed: 05/29/2023]
Abstract
LTR-retrotransposons (LTR-RTs) are a large group of transposable elements that replicate through an RNA intermediate and alter genome structure. The activities of LTR-RTs in plant genomes provide helpful information about genome evolution and gene function. LTR-RTs near or within genes can directly alter gene function. This work introduces PlantLTRdb, an intact LTR-RT database for 195 plant species. Using homology- and de novo structure-based methods, a total of 150.18 Gbp representing 3,079,469 pseudomolecules/scaffolds were analyzed to identify, characterize, and annotate LTR-RTs, estimate insertion ages, detect LTR-RT-gene chimeras, and determine nearby genes. Accordingly, 520,194 intact LTR-RTs were discovered, including 29,462 autonomous and 490,732 nonautonomous LTR-RTs. The autonomous LTR-RTs included 10,286 Gypsy and 19,176 Copia, while the nonautonomous were divided into 224,906 Gypsy, 218,414 Copia, 1,768 BARE-2, 3,147 TR-GAG and 42,497 unknown. Analysis of the identified LTR-RTs located within genes showed that a total of 36,236 LTR-RTs were LTR-RT-gene chimeras and 11,619 LTR-RTs were within pseudo-genes. In addition, 50,026 genes are within 1 kbp of LTR-RTs, and 250,587 had a distance of 1 to 10 kbp from LTR-RTs. PlantLTRdb allows researchers to search, visualize, BLAST and analyze plant LTR-RTs. PlantLTRdb can contribute to the understanding of structural variations, genome organization, functional genomics, and the development of LTR-RT target markers for molecular plant breeding. PlantLTRdb is available at https://bioinformatics.um6p.ma/PlantLTRdb.
|
12
|
The origin of genetic and metabolic systems: Evolutionary structural insights. Heliyon 2023; 9:e14466. [PMID: 36967965 PMCID: PMC10036676 DOI: 10.1016/j.heliyon.2023.e14466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/22/2022] [Revised: 02/27/2023] [Accepted: 03/06/2023] [Indexed: 03/16/2023]
Abstract
DNA is derived from reverse transcription and its origin is related to reverse transcriptase, DNA polymerase and integrase. The gene structure originated from the evolution of the first RNA polymerase. Thus, an explanation of the origin of the genetic system must also explain the evolution of these enzymes. This paper proposes a polymer structure model, termed the stable complex evolution model, which explains the evolution of enzymes and functional molecules. Enzymes evolved their functions by forming locally tightly packed complexes with specific substrates. A metabolic reaction can therefore be considered to be the result of adaptive evolution in this way when a certain essential molecule is lacking in a cell. The evolution of the primitive genetic and metabolic systems was thus coordinated and synchronized. According to the stable complex model, almost all functional molecules establish binding affinity and specific recognition through complementary interactions, and functional molecules therefore have the nature of being auto-reactive. This is thermodynamically favorable and leads to functional duplication and self-organization. Therefore, it can be speculated that biological systems have a certain tendency to maintain functional stability or are influenced by an inherent selective power. The evolution of dormant bacteria may support this hypothesis, and inherent selectivity can be unified with natural selection at the molecular level.
|
13
|
The comparison of polymorphism among Avena species revealed by retrotransposon-based DNA markers and soluble carbohydrates in seeds. J Appl Genet 2023; 64:247-264. [PMID: 36719514 PMCID: PMC10076396 DOI: 10.1007/s13353-023-00748-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/08/2022] [Revised: 01/13/2023] [Accepted: 01/18/2023] [Indexed: 02/01/2023]
Abstract
Here, we compared the polymorphism among 13 Avena species revealed by iPBS markers and soluble carbohydrate profiles in seeds. The application of seven iPBS markers generated 83 bands, of which 20.5% were polymorphic. No species-specific bands were scored. Shannon's information index (I) and expected heterozygosity (He) revealed low genetic diversity, with the highest values observed for A. nuda (I = 0.099; He = 0.068). UPGMA clustering of the studied Avena accessions and PCoA results showed that the polyploidy level is the main grouping criterion. High-resolution gas chromatography revealed that the studied Avena accessions share the same composition of soluble carbohydrates, but significant differences in the content of total (5.30-22.38 mg g⁻¹ of dry weight) and particular sugars among the studied samples were observed. Sucrose was the most abundant sugar (mean 61.52% of total soluble carbohydrates), followed by raffinose family oligosaccharides (31.23%), myo-inositol and its galactosides (6.16%), and monosaccharides (1.09%). The pattern of interspecific variation in soluble carbohydrates, shown by PCA, was convergent with that revealed by iPBS markers. Thus, both methods proved to be sources of valuable data useful in the characterization of Avena resources or in the discussion of the evolution of this genus.
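Diversity statistics for dominant band data such as iPBS profiles are usually computed per locus from band frequencies and then averaged. The following is a minimal sketch, assuming the per-locus definitions I = -(p ln p + q ln q) and He = 2pq, and taking p directly as the band-presence frequency with q = 1 - p (a simplification; some programs first estimate the null-allele frequency from band absence). The abstract does not say which program or estimator was used, and the function names and toy matrix are hypothetical.

```python
import math


def band_frequencies(matrix):
    """Column-wise band-presence frequency for a 0/1 matrix (rows = accessions)."""
    n_rows = len(matrix)
    return [sum(column) / n_rows for column in zip(*matrix)]


def shannon_index(p):
    """Per-locus Shannon index I = -(p ln p + q ln q), with 0 * ln 0 taken as 0."""
    return -sum(x * math.log(x) for x in (p, 1.0 - p) if x > 0.0)


def band_heterozygosity(p):
    """Per-locus expected heterozygosity He = 2pq for a two-state locus."""
    return 2.0 * p * (1.0 - p)


def summarize(matrix):
    """Mean I, mean He, and percentage of polymorphic loci across all bands."""
    freqs = band_frequencies(matrix)
    n_loci = len(freqs)
    return {
        "I": sum(shannon_index(p) for p in freqs) / n_loci,
        "He": sum(band_heterozygosity(p) for p in freqs) / n_loci,
        "percent_polymorphic": 100.0 * sum(0.0 < p < 1.0 for p in freqs) / n_loci,
    }


if __name__ == "__main__":
    # Toy presence/absence matrix: 4 accessions x 3 bands (illustrative only).
    toy = [
        [1, 0, 1],
        [1, 1, 0],
        [1, 0, 0],
        [1, 1, 1],
    ]
    print(summarize(toy))
```

Monomorphic bands contribute zero to both I and He, which is why a low share of polymorphic bands translates into low mean diversity values.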
|
14
|
Gapless genome assembly of azalea and multi-omics investigation into divergence between two species with distinct flower color. HORTICULTURE RESEARCH 2023; 10:uhac241. [PMID: 36643737 PMCID: PMC9832866 DOI: 10.1093/hr/uhac241] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 08/15/2022] [Accepted: 10/17/2022] [Indexed: 05/09/2023]
Abstract
The genus Rhododendron (Ericaceae), with more than 1000 species highly diverse in flower color, provides distinct ornamental value and a model system for flower color studies. Here, we investigated the divergence between two parental species with different flower colors that are widely used for azalea breeding. A gapless genome assembly was generated for the yellow-flowered azalea, Rhododendron molle. Comparative genomics found that recent proliferation of long terminal repeat retrotransposons (LTR-RTs), especially Gypsy, has resulted in a 125 Mb (19%) genome size increase in species-specific regions, and a significant number of dispersed gene duplicates (13 402) and pseudogenes (17 437). Metabolomic assessment revealed that yellow flower coloration is attributed to dynamic changes in carotenoid/flavonol biosynthesis and chlorophyll degradation. Time-ordered gene co-expression networks (TO-GCNs) and their comparison confirmed the metabolome results and uncovered the specific gene regulatory changes underpinning the distinct flower pigmentation. B3 and ERF TFs were found to dominate the gene regulation of carotenoid/flavonol-characterized pigmentation in R. molle, while WRKY, ERF, WD40, C2H2, and NAC TFs collectively regulated the anthocyanin-characterized pigmentation in the red-flowered R. simsii. This study employed a multi-omics strategy to disentangle the complex divergence between two important azaleas and provides references for further functional genetics and molecular breeding.
|
15
|
Genetic studies on continuous flowering in woody plant Osmanthus fragrans. FRONTIERS IN PLANT SCIENCE 2022; 13:1049479. [PMID: 36407607 PMCID: PMC9671776 DOI: 10.3389/fpls.2022.1049479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/20/2022] [Accepted: 10/10/2022] [Indexed: 06/16/2023]
Abstract
Continuous flowering is a key horticultural trait in ornamental plants, whereas the specific molecular regulation mechanism remains largely unknown. In sweet osmanthus (Osmanthus fragrans Lour.), plants based on their flowering characteristics are divided into once-flowering (OF) habit and continuous flowering (CF) habit. Here, we first described the flowering phenology shifts of OF and CF habits in sweet osmanthus through paraffin section and microscope assay. Phenotypic characterization showed that CF plants had constant new shoot growth, floral transition, and blooming for 1 year, which might lead to a continuous flowering trait. We performed the transcriptome sequencing of OF and CF sweet osmanthus and analyzed the transcriptional activity of flowering-related genes. Among the genes, three floral integrators, OfFT, OfTFL1, and OfBFT, had a differential expression during the floral transition process in OF and CF habits. The expression patterns of the three genes in 1 year were revealed. The results suggested that their accumulations corresponded to the new shoots occurring and the floral transition process. Function studies suggested that OfFT acted as a flowering activator, whereas OfBFT was a flowering inhibitor. Yeast one-hybrid assay indicated that OfSPL8 was a common upstream transcription factor of OfFT and OfBFT, suggesting the vital role of OfSPL8 in continuous flowering regulation. These results provide a novel insight into the molecular mechanism of continuous flowering.
|
16
|
Spontaneous, Artificial, and Genome Editing-Mediated Mutations in Prunus. Int J Mol Sci 2022; 23:ijms232113273. [PMID: 36362061 PMCID: PMC9653787 DOI: 10.3390/ijms232113273] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 09/30/2022] [Revised: 10/27/2022] [Accepted: 10/28/2022] [Indexed: 11/06/2022]
Abstract
Mutation is a source of genetic diversity widely used in breeding programs for the acquisition of agronomically interesting characters in commercial varieties of Prunus species, as well as in other crop species. Mutation can occur in nature at a very low frequency or can be induced artificially. Spontaneous or bud sport mutations in somatic cells can be vegetatively propagated to obtain an individual with the mutant phenotype. Unlike animals, plants have unlimited growth and totipotent cells that allow somatic mutations to be transmitted to the progeny. On the other hand, in vitro tissue culture makes it possible to induce mutation in plant material and perform large screenings for mutant selection and cleaning of chimeras. Finally, targeted mutagenesis has been boosted by the application of CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas9 and Transcription activator-like effector nuclease (TALEN) editing technologies. Over the last few decades, environmental stressors such as global warming have been threatening the capacity to meet the global demand for food expected from population growth in the near future. To address this, the release of new varieties adapted to such changes is a requisite, and Prunus mutants selected or generated by properly regulated mechanisms could be helpful to this task. In this work, we reviewed the most relevant mutations for breeding traits in Prunus species, such as flowering time, self-compatibility, fruit quality, and disease tolerance, including new molecular perspectives in the present postgenomic era such as CRISPR/Cas9 and TALEN editing technologies.
|
17
|
Genome-Wide Identification and Expression Analysis of Wall-Associated Kinase (WAK) Gene Family in Cannabis sativa L. PLANTS (BASEL, SWITZERLAND) 2022; 11:2703. [PMID: 36297727 PMCID: PMC9609219 DOI: 10.3390/plants11202703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/07/2022] [Revised: 10/07/2022] [Accepted: 10/10/2022] [Indexed: 06/16/2023]
Abstract
Wall-associated kinases (WAKs) are receptors that bind pectin or small pectic fragments in the cell wall and play roles in cell elongation and pathogen response. In the Cannabis sativa (Cs) genome, 53 CsWAK/CsWAKL (WAK-like) protein family members were identified and characterized; their amino acid lengths and molecular weights varied from 582 to 983, and from 65.6 to 108.8 kDa, respectively. They were classified into four main groups by a phylogenetic tree. Out of the 53 identified CsWAK/CsWAKL genes, 23 CsWAK/CsWAKL genes were unevenly distributed among six chromosomes. Two pairs of genes on chromosomes 4 and 7 have undergone duplication. The number of introns and exons among CsWAK/CsWAKL genes ranged from 1 to 6 and from 2 to 7, respectively. The promoter regions of 23 CsWAKs/CsWAKLs possessed diverse cis-regulatory elements that are involved in light, development, environmental stress, and hormone responsiveness. The expression profiles indicated that our candidate genes (CsWAK1, CsWAK4, CsWAK7, CsWAKL1, and CsWAKL7) are expressed in leaf tissue. These genes exhibit different expression patterns than their homologs in other plant species. These initial findings are useful resources for further research work on the potential roles of CsWAK/CsWAKL in cellular signalling during development, environmental stress conditions, and hormone treatments.
|
18
|
Distinct composition and amplification dynamics of transposable elements in sacred lotus (Nelumbo nucifera Gaertn.). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 112:172-192. [PMID: 35959634 PMCID: PMC9804982 DOI: 10.1111/tpj.15938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/21/2022] [Revised: 07/19/2022] [Accepted: 08/08/2022] [Indexed: 06/15/2023]
Abstract
Sacred lotus (Nelumbo nucifera Gaertn.) is a basal eudicot plant with a unique lifestyle, physiological features, and evolutionary characteristics. Here we report the unique profile of transposable elements (TEs) in the genome, using a manually curated repeat library. TEs account for 59% of the genome, and hAT (Ac/Ds) elements alone represent 8%, more than in any other known plant genome. About 18% of the lotus genome is comprised of Copia LTR retrotransposons, and over 25% of them are associated with non-canonical termini (non-TGCA). Such high abundance of non-canonical LTR retrotransposons has not been reported for any other organism. TEs are very abundant in genic regions, with retrotransposons enriched in introns and DNA transposons primarily in flanking regions of genes. The recent insertion of TEs in introns has led to significant intron size expansion, with a total of 200 Mb in the 28 455 genes. This is accompanied by declining TE activity in intergenic regions, suggesting distinct control efficacy of TE amplification in different genomic compartments. Despite the prevalence of TEs in genic regions, some genes are associated with fewer TEs, such as those involved in fruit ripening and stress responses. Other genes are enriched with TEs, and genes in epigenetic pathways are the most associated with TEs in introns, indicating a dynamic interaction between TEs and the host surveillance machinery. The dramatic differential abundance of TEs with genes involved in different biological processes as well as the variation of target preference of different TEs suggests the composition and activity of TEs influence the path of evolution.
|
19
|
Transposable Elements in the Revealing of Polymorphism-Based Differences in the Seeds of Flax Varieties Grown in Remediated Chernobyl Area. PLANTS 2022; 11:plants11192567. [PMID: 36235434 PMCID: PMC9571286 DOI: 10.3390/plants11192567] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 07/14/2022] [Revised: 09/20/2022] [Accepted: 09/21/2022] [Indexed: 11/16/2022]
Abstract
The nuclear reactor accident in Chernobyl, Ukraine, had effects both locally and farther away. Most of the contaminated areas were agricultural fields and forests. Experimental fields were established near Chernobyl: radioactively contaminated fields located 5 km from the Chernobyl Nuclear Power Plant, as well as remediated soil located directly in the town of Chernobyl. Two flax varieties grown under chronic exposure to ionizing radiation were used for this study: the local Ukrainian variety Kyivskyi and the commercial variety Bethune. Length polymorphism generated by transposable element insertions was screened. All known types of common flax transposons and retrotransposons, as well as the iPBS approach, were used. In the iPBS multiplex analysis, a unique addition was found in seeds of the Kyivskyi variety from the radioactively contaminated field, whereas a total of five amplicon additions and one deletion were obtained for the Bethune variety. For the TRIM Cassandra fingerprints, two amplicon additions were generated in seeds of the Bethune variety from the radioactively contaminated fields. In summary, the data show genetic diversity between the control and irradiated subgroups of flax seeds from the Chernobyl area and indicate the presence of transposable elements activated by irradiation stress.
|
20
|
Impact of LTR-Retrotransposons on Genome Structure, Evolution, and Function in Curcurbitaceae Species. Int J Mol Sci 2022; 23:ijms231710158. [PMID: 36077556 PMCID: PMC9456015 DOI: 10.3390/ijms231710158] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 08/05/2022] [Revised: 09/02/2022] [Accepted: 09/02/2022] [Indexed: 11/17/2022] Open
Abstract
Long terminal repeat (LTR)-retrotransposons (LTR-RTs) comprise a major portion of many plant genomes and may exert a profound impact on genome structure, function, and evolution. Although many studies have focused on these elements in individual species, their dynamics at the family level remain elusive. Here, we investigated the abundance, evolutionary dynamics, and impact on associated genes of LTR-RTs in 16 species of an economically important plant family, Cucurbitaceae. Results showed that full-length LTR-RT numbers and LTR-RT content varied greatly among species and were highly correlated with genome size. Most of the full-length LTR-RTs were amplified after the speciation event, reflecting the ongoing rapid evolution of these genomes. LTR-RTs contributed substantially to genome size variation via distinct species-specific proliferations. The Angela and Tekay lineages, with a greater evolutionary age, were amplified in Trichosanthes anguina, whereas a recent activity burst of Reina and another ancient round of Tekay activity were detected in Sechium edule. In addition, the Tekay and Retand lineages, both belonging to the Gypsy superfamily, underwent a recent burst in Gynostemma pentaphyllum. Detailed investigation of genes with intronic and promoter LTR-RT insertions showed diverse functions, but the term metabolism was enriched in most species. Further gene expression analysis in G. pentaphyllum revealed that LTR-RTs within introns suppress expression of the corresponding gene, whereas LTR-RTs within promoters exert a complex influence on downstream gene expression, mainly promoting it. This study provides novel insights into the organization, evolution, and function of LTR-RTs in Cucurbitaceae genomes.
|
21
|
Comprehensive survey of transposon mPing insertion sites and transcriptome analysis for identifying candidate genes controlling high protein content of rice. FRONTIERS IN PLANT SCIENCE 2022; 13:969582. [PMID: 36119631 PMCID: PMC9479144 DOI: 10.3389/fpls.2022.969582] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 06/15/2022] [Accepted: 08/05/2022] [Indexed: 06/15/2023]
Abstract
Rice is the most important crop species in the world, being the staple food of more than 80% of people in Asia. About 80% of rice grain is composed of carbohydrates (starch), with a protein content as low as 7-8%. Therefore, increasing the protein content of rice offers a way to create a stable protein source that helps alleviate malnutrition and health problems worldwide. We detected two rice lines with a significantly higher protein content (namely, HP5-7 and HP7-5) in the EG4 population. The EG4 strain of rice is a unique material in that the transposon mPing has high transpositional activity and high copy numbers under natural conditions. Other research indicated that mPing is abundant in the gene-rich euchromatic regions, suggesting that mPing amplification should create new allelic variants, novel regulatory networks, and phenotypic changes in the EG4 population. Here, we aimed to identify the candidate genes and/or mPing insertion sites causing high protein content by comprehensively identifying the mPing insertion sites and carrying out an RNA-seq-based transcriptome analysis. Using next-generation sequencing (NGS)-based methods, ca. 570 mPing insertion sites were identified per line in the EG4 population. Our results also indicated that mPing apparently prefers to insert near genes, with 38 genes in total found to contain an mPing insertion in the HP lines, of which 21 and 17 genes were specific to HP5-7 and HP7-5, respectively. Transcriptome analysis revealed that most of the genes related to protein synthesis (encoding glutelin, prolamin, and globulin) were up-regulated in HP lines relative to the control line. Interestingly, the differentially expressed gene (DEG) analysis revealed that the expression levels of many genes related to photosynthesis decreased in both HP lines; this suggests the amount of starch may have decreased, indirectly contributing to the increased protein content.
The high-protein lines studied here are expected to contribute to the development of high protein-content rice by introducing valuable phenotypic traits such as high and stable yield, disease resistance, and abundant nutrients.
|
22
|
Stress does not induce a general transcription of transposable elements in Drosophila. Mol Biol Rep 2022; 49:9033-9040. [PMID: 35980533 DOI: 10.1007/s11033-022-07839-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/06/2022] [Accepted: 08/03/2022] [Indexed: 11/28/2022]
Abstract
Transposable elements, also known as "jumping genes," have the ability to hop within the host genome. Nonetheless, this capacity is kept in check by the host cell defense systems to avoid unbridled TE mobilization. Different types of stressors can activate TEs in Drosophila, suggesting that TEs may play an adaptive role in the stress response, especially in generating genetic variability for adaptive evolution. TE activation by stressors may also lead to the notion, usually found in the literature, that any form of stress could activate all or the majority of TEs. In this review, we define what stress is. We then present and discuss RNA sequencing results from several studies demonstrating that stress does not trigger TE transcription broadly in Drosophila. An explanation for the LTR order of TEs being the most overexpressed is also proposed.
|
23
|
Evolution of complex genome architecture in gymnosperms. Gigascience 2022; 11:6659718. [PMID: 35946987 PMCID: PMC9364684 DOI: 10.1093/gigascience/giac078] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 05/13/2022] [Revised: 06/09/2022] [Accepted: 07/15/2022] [Indexed: 11/25/2022] Open
Abstract
Gymnosperms represent an ancient lineage that diverged from early spermatophytes during the Devonian. Their long fossil record and the low diversity of living species attest to a complex evolutionary history, which included ancient radiations and massive extinctions. Due to their ultra-large genome sizes, whole-genome assemblies of gymnosperms have been generated only in the past 10 years and are now being expanded to broader taxonomic representation. Here, we provide an overview of the publicly available gymnosperm genome resources and discuss their assembly quality and recent findings in large genome architectures. In particular, we describe the genomic features most related to changes affecting the whole genome. We also highlight new insights into repetitive sequence dynamics, paleopolyploidy, and long introns. Based on the results of relevant genomic studies of gymnosperms, we suggest additional efforts should be made toward exploring the genomes of medium-sized (5–15 gigabases) species. Lastly, more comparative analyses among high-quality assemblies are needed to understand the genomic shifts and the early species diversification of seed plants.
|
24
|
Annotation of Siberian Larch (Larix sibirica Ledeb.) Nuclear Genome—One of the Most Cold-Resistant Tree Species in the Only Deciduous GENUS in Pinaceae. PLANTS 2022; 11:plants11152062. [PMID: 35956540 PMCID: PMC9370799 DOI: 10.3390/plants11152062] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 06/24/2022] [Revised: 07/22/2022] [Accepted: 07/26/2022] [Indexed: 11/17/2022]
Abstract
The recent release of the nuclear, chloroplast and mitochondrial genome assemblies of Siberian larch (Larix sibirica Ledeb.), one of the most cold-resistant tree species in the only deciduous genus of Pinaceae, with seasonal senescence and a valuable rot-resistant timber widely used in construction, greatly contributed to the development of genomic resources for the larch genus. Here, we present an extensive repeatome analysis and the first annotation of the draft nuclear Siberian larch genome assembly. About 66% of the larch genome consists of highly repetitive elements (REs), with the likely wave of retrotransposon insertions into the larch genome estimated to have occurred 4–5 MYA. In total, 39,370 gene models were predicted, with 87% of them having homology to the Arabidopsis-annotated proteins and 78% having at least one GO term assignment. The current state of the genome annotations allows for the exploration of the gymnosperm and angiosperm species for relative gene abundance in different functional categories. Comparative analysis of functional gene categories across different angiosperm and gymnosperm species finds that the Siberian larch genome has an overabundance of genes associated with programmed cell death (PCD), autophagy, stress hormone biosynthesis and regulatory pathways; these genes may play important roles in seasonal senescence and stress response to extreme cold in larch. Despite being incomplete, the draft assemblies and annotations of the conifer genomes are at a point of development where they now represent a valuable source for further genomic, genetic and population studies.
|
25
|
Comparative repeatome analysis reveals new evidence on genome evolution in wild diploid Arachis (Fabaceae) species. PLANTA 2022; 256:50. [PMID: 35895167 DOI: 10.1007/s00425-022-03961-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/21/2022] [Accepted: 07/12/2022] [Indexed: 06/15/2023]
Abstract
Opposing changes in the abundance of satellite DNA and long terminal repeat (LTR) retroelements are the main contributors to the variation in genome size and heterochromatin amount in Arachis diploids. The South American genus Arachis (Fabaceae) comprises 83 species organized in nine taxonomic sections. Among them, section Arachis is characterized by species with wide genome and karyotype diversity. Such diversity is determined mainly by the amount and composition of repetitive DNA. Here we performed computational analysis on low-coverage genome sequencing to infer the dynamics of changes in major repeat families that led to the differentiation of genomes in diploid species (x = 10) of genus Arachis, focusing on section Arachis. Estimated repeat content ranged from 62.50 to 71.68% of the genomes. Species with different genome composition tended to have different landscapes of repeated sequences. Athila family retrotransposons were the most abundant and variable lineage among Arachis repeatomes, with peaks of transpositional activity inferred at different times in the evolution of the species. Satellite DNAs (satDNAs) were less abundant, but differentially represented among species. High rates of evolution of an AT-rich superfamily of satDNAs led to the differential accumulation of heterochromatin in Arachis genomes. The relationship between genome size variation and the repetitive content is complex. However, the largest genomes presented a higher accumulation of LTR elements and lower contents of satDNAs. In contrast, species with the smallest genomes tended to accumulate satDNAs to the detriment of LTR elements. Phylogenetic analysis based on repetitive DNA supported the genome arrangement of section Arachis. Altogether, our results provide the most comprehensive picture of the repeatome dynamics that led to the genome differentiation of Arachis species.
|
26
|
Melatonin: Regulation of Viral Phase Separation and Epitranscriptomics in Post-Acute Sequelae of COVID-19. Int J Mol Sci 2022; 23:8122. [PMID: 35897696 PMCID: PMC9368024 DOI: 10.3390/ijms23158122] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 05/25/2022] [Revised: 07/09/2022] [Accepted: 07/20/2022] [Indexed: 01/27/2023] Open
Abstract
The relentless, protracted evolution of the SARS-CoV-2 virus imposes tremendous pressure on herd immunity and demands versatile adaptations by the human host genome to counter transcriptomic and epitranscriptomic alterations associated with a wide range of short- and long-term manifestations during acute infection and post-acute recovery, respectively. To promote viral replication during active infection and viral persistence, the SARS-CoV-2 envelope protein regulates host cell microenvironment including pH and ion concentrations to maintain a high oxidative environment that supports template switching, causing extensive mitochondrial damage and activation of pro-inflammatory cytokine signaling cascades. Oxidative stress and mitochondrial distress induce dynamic changes to both the host and viral RNA m6A methylome, and can trigger the derepression of long interspersed nuclear element 1 (LINE1), resulting in global hypomethylation, epigenetic changes, and genomic instability. The timely application of melatonin during early infection enhances host innate antiviral immune responses by preventing the formation of "viral factories" by nucleocapsid liquid-liquid phase separation that effectively blockades viral genome transcription and packaging, the disassembly of stress granules, and the sequestration of DEAD-box RNA helicases, including DDX3X, vital to immune signaling. Melatonin prevents membrane depolarization and protects cristae morphology to suppress glycolysis via antioxidant-dependent and -independent mechanisms. By restraining the derepression of LINE1 via multifaceted strategies, and maintaining the balance in m6A RNA modifications, melatonin could be the quintessential ancient molecule that significantly influences the outcome of the constant struggle between virus and host to gain transcriptomic and epitranscriptomic dominance over the host genome during acute infection and PASC.
|
27
|
Automatic curation of LTR retrotransposon libraries from plant genomes through machine learning. J Integr Bioinform 2022; 19:jib-2021-0036. [PMID: 35822734 PMCID: PMC9521825 DOI: 10.1515/jib-2021-0036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/08/2021] [Accepted: 06/10/2022] [Indexed: 11/19/2022] Open
Abstract
Transposable elements are mobile sequences that can move and insert themselves into chromosomes, activating under internal or external stimuli and giving the organism the ability to adapt to the environment. Annotating transposable elements in genomic data is currently considered a crucial task for understanding key aspects of organisms such as phenotype variability, species evolution, and genome size, among others. Because of the way they replicate, LTR retrotransposons are the most common transposable elements in plants, accounting in some cases for up to 80% of all DNA information. To annotate these elements, a reference library is usually created and curated to eliminate TE fragments and false positives, and the elements are then annotated in the genome using the homology method. However, the curation process can take weeks and requires extensive manual work and the execution of multiple time-consuming bioinformatics tools. Here, we propose a machine learning-based approach to perform this process automatically on plant genomes, obtaining up to 91.18% F1-score. This approach was tested with four plant species, obtaining up to 93.6% F1-score (Oryza granulata) in only 22.61 s, where bioinformatics methods took approximately 6 h. This acceleration demonstrates that the ML-based approach is efficient and could be used in massive sequencing projects.
|
28
|
SubPhaser: a robust allopolyploid subgenome phasing method based on subgenome-specific k-mers. THE NEW PHYTOLOGIST 2022; 235:801-809. [PMID: 35460274 DOI: 10.1111/nph.18173] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Received: 10/20/2021] [Accepted: 04/04/2022] [Indexed: 05/02/2023]
Abstract
With advanced sequencing technology, dozens of complex polyploid plant genomes have been characterized. However, for many polyploid species, their diploid ancestors are unknown or extinct, making it impossible to unravel the subgenomes and genome evolution directly. We developed a novel subgenome-phasing algorithm, SubPhaser, specifically designed for a neoallopolyploid or a homoploid hybrid. SubPhaser first searches for the subgenome-specific sequence (k-mer), then assigns homoeologous chromosomes into subgenomes, and further provides tools to annotate and investigate specific sequences. SubPhaser works well on neoallopolyploids and homoploid hybrids containing subgenome-specific sequences like wheat, but fails on autopolyploids lacking subgenome-specific sequences like alfalfa, indicating that SubPhaser can phase neoallopolyploid/homoploid hybrids with high accuracy, sensitivity and performance. This highly accurate, highly sensitive, ancestral data free chromosome phasing algorithm, SubPhaser, offers significant application value for subgenome phasing in neoallopolyploids and homoploid hybrids, and for the subsequent exploration of genome evolution and related genetic/epigenetic mechanisms.
|
29
|
Amplification of LTRs of extrachromosomal linear DNAs (ALE-seq) identifies two active Oryco LTR retrotransposons in the rice cultivar Dongjin. Mob DNA 2022; 13:18. [PMID: 35698176 PMCID: PMC9190103 DOI: 10.1186/s13100-022-00274-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/25/2022] [Accepted: 05/27/2022] [Indexed: 11/26/2022] Open
Abstract
Long terminal repeat retrotransposons (LTR-RTs) make up a considerable portion of plant genomes. New insertions of active LTR-RTs modify gene structures and functions and play an important role in genome evolution. Therefore, identifying active forms of LTR-RTs could uncover the effects of these elements in plants. Extrachromosomal linear DNA (eclDNA) forms during LTR-RT replication; therefore, amplification of LTRs of eclDNAs followed by sequencing (ALE-seq) uncovers the current transpositional potential of the LTR-RTs. The ALE-seq protocol was validated by identification of Tos17 in callus of the Nipponbare cultivar. Here, we identified two active LTR-RTs belonging to the Oryco family on chromosomes 6 and 9 in callus of the rice cultivar Dongjin using the ALE-seq technology. Each Oryco family member has paired LTRs with identical sequences and internal domain regions. Comparison of the two LTR-RTs revealed 97% sequence identity in their internal domains and 65% sequence identity in their LTRs. These two putatively active Oryco LTR-RT family members could be used to expand our knowledge of retrotransposition mechanisms and the effects of LTR-RTs on the rice genome.
|
30
|
Characterisation of LTR-Retrotransposons of Stevia rebaudiana and Their Use for the Analysis of Genetic Variability. Int J Mol Sci 2022; 23:ijms23116220. [PMID: 35682899 PMCID: PMC9181549 DOI: 10.3390/ijms23116220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/24/2022] [Revised: 05/25/2022] [Accepted: 05/30/2022] [Indexed: 02/01/2023] Open
Abstract
Stevia rebaudiana is one of the most important crops belonging to the Asteraceae family. Stevia is cultivated all over the world as it represents a valid natural alternative to artificial sweeteners thanks to its leaves, which produce steviol glycosides that have high sweetening power and reduced caloric value. In this work, the stevia genome sequence was used to isolate and characterise full-length long-terminal repeat retrotransposons (LTR-REs), which account for more than half of the genome. The Gypsy retrotransposons were twice as abundant as the Copia ones. A disproportionate abundance of elements belonging to the Chromovirus/Tekay lineage was observed among the Gypsy elements. Only the SIRE and Angela lineages represented significant portions of the genome among the Copia elements. The dynamics with which LTR-REs colonised the stevia genome were also estimated; all isolated full-length elements turned out to be relatively young, with a proliferation peak around 1–2 million years ago. However, a different analysis conducted by comparing sequences encoding retrotranscriptase showed the occurrence of an older period in which there was a lot of LTR-RE proliferation. Finally, a group of isolated full-length elements belonging to the lineage Angela was used to analyse the genetic variability in 25 accessions of S. rebaudiana using the Inter-Retrotransposon Amplified Polymorphism (IRAP) protocol. The obtained fingerprints highlighted a high degree of genetic variability and were used to study the genomic structures of the different accessions. It was hypothesised that there are four ancestral subpopulations at the root of the analysed accessions, which all turned out to be admixed. Overall, these data may be useful for genome sequence annotations and for evaluating genetic variability in this species, which may be useful in stevia breeding.
|
31
|
Genome Size Variation in Dianthus sylvestris Wulfen sensu lato (Caryophyllaceae). PLANTS (BASEL, SWITZERLAND) 2022; 11:1481. [PMID: 35684254 PMCID: PMC9183063 DOI: 10.3390/plants11111481] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/11/2022] [Revised: 05/26/2022] [Accepted: 05/28/2022] [Indexed: 06/15/2023]
Abstract
Genome size (GS) is an important characteristic that may be helpful in the delimitation of taxa, and multiple studies have shown correlations between intraspecific GS variation and morphological or environmental factors, as well as its geographical segregation. We estimated the relative GS (RGS) of 707 individuals from 162 populations of Dianthus sylvestris with a geographic focus on the Balkan Peninsula, but also including several populations from the European Alps. Dianthus sylvestris is a morphologically variable species thriving in various habitats, and six subspecies have been recognized from the Balkan Peninsula. Our RGS data, backed up by chromosome counts, revealed that the majority of populations were diploid (2n = 30), but ten tetraploid populations were recorded in D. sylvestris subsp. sylvestris from Istria (Croatia, Italy). Their monoploid RGS is significantly lower than that of the diploids, indicating genome downsizing. In addition, the tetraploids significantly differ from their diploid counterparts in an array of morphological and environmental characteristics. Within the diploid populations, the RGS is geographically and only partly taxonomically correlated, with the highest RGS inferred in the southern Balkan Peninsula and the Alps. We demonstrate greater RGS variation among the Balkan populations compared to the Alps, which is likely a result of more pronounced evolutionary differentiation within the Balkan Peninsula. In addition, a deep RGS divergence within the Alps likely points to persistence of the alpine populations in different Pleistocene refugia.
|
32
|
Genome size evolution in the beetle genus Diabrotica. G3 (BETHESDA, MD.) 2022; 12:jkac052. [PMID: 35234880 PMCID: PMC8982398 DOI: 10.1093/g3journal/jkac052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/28/2022] [Accepted: 02/22/2022] [Indexed: 11/20/2022]
Abstract
Diabroticite corn rootworms are among the most economically significant pests of maize in the United States and Europe and an emerging model for insect-plant interactions. Genome sizes of several species in the genus Diabrotica were estimated using flow cytometry, along with that of Acalymma vittatum as an outgroup. Genome sizes ranged between 1.56 and 1.64 gigabase pairs (Gb) for the Diabrotica subgroup fucata and between 2.26 and 2.59 Gb for the subgroup virgifera; the Acalymma vittatum genome size was around 1.65 Gb. This result indicated that a substantial increase in genome size occurred in the ancestor of the virgifera group. Further analysis of genome sequencing reads from the fucata and virgifera groups indicated that the genome size difference between the Diabrotica subgroups could be attributed to a higher content of transposable elements, mostly miniature inverted-repeat transposable elements and gypsy-like long terminal repeat retroelements.
|
33
|
Long-distance transport RNAs between rootstocks and scions and graft hybridization. Planta 2022; 255:96. [PMID: 35348893 DOI: 10.1007/s00425-022-03863-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 08/15/2021] [Accepted: 02/17/2022] [Indexed: 06/14/2023]
Abstract
The present review addresses advances in the identification methods, functions, and transportation mechanisms of long-distance transport RNAs between rootstock and scion. In addition, we highlight how graft hybridization came to be recognized and its potential mechanisms. Phloem, the main transport channel of higher plants, plays an important role in the growth and development of plants. Numerous studies have identified a large number of RNAs, including mRNAs, miRNAs, siRNAs, and lncRNAs, in the plant phloem. They can not only be transported over long distances across the grafting junction in the phloem, but also act as signal molecules to regulate the growth, development, and stress resistance of remote cells or tissues, resulting in changes in the traits of rootstocks and scions. Many mobile RNAs have been discovered, but their detection methods, functions, and long-distance transport mechanisms remain to be elucidated. In addition, graft hybridization, a phenomenon that has been questioned before, and which has an important role in selecting for superior traits, is gradually being recognized with the emergence of new evidence and the prevalence of horizontal gene transfer between parasitic plants. In this review, we outline the species, functions, identification methods, and potential mechanisms of long-distance transport RNAs between rootstocks and scions after grafting. In addition, we summarize the process of recognition and the potential mechanisms of graft hybridization. This study aimed to emphasize the role of grafting in the study of long-distance signals and selection for superior traits and to provide ideas and clues for further research on long-distance transport RNAs and graft hybridization.
|
34
|
A 69 kbp Deletion at the Berry Color Locus Is Responsible for Berry Color Recovery in Vitis vinifera L. Cultivar 'Riesling Rot'. Int J Mol Sci 2022; 23:ijms23073708. [PMID: 35409066 PMCID: PMC8998622 DOI: 10.3390/ijms23073708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2022] [Revised: 03/23/2022] [Accepted: 03/25/2022] [Indexed: 11/18/2022] Open
Abstract
‘Riesling Weiss’ is a white grapevine variety famous worldwide for fruity wines with higher acidity. Hardly known is ‘Riesling Rot’, a red-berried variant of ‘Riesling Weiss’ that disappeared from commercial cultivation but has increased in awareness in the last decades. The question arises of which variant, white or red, is the original and, consequently, which cultivar is the true ancestor. Sequencing the berry color locus of ‘Riesling Rot’ revealed a new VvmybA gene variant in one of the two haplophases called VvmybA3/1RR. The allele displays homologous recombination of VvmybA3 and VvmybA1 with a deletion of about 69 kbp between both genes that restores VvmybA1 transcripts. Furthermore, analysis of ‘Riesling Weiss’, ‘Riesling Rot’, and the ancestor ‘Heunisch Weiss’ along chromosome 2 using SSR (simple sequence repeat) markers elucidated that the haplophase of ‘Riesling Weiss’ was inherited from the white-berried parent variety ‘Heunisch Weiss’. Since no color mutants of ‘Heunisch Weiss’ are described that could have served as allele donors, we concluded that, in contrast to the public opinion, ‘Riesling Rot’ resulted from a mutational event in ‘Riesling Weiss’ and not vice versa.
|
35
|
Characterization of Repetitive DNA in Saccharum officinarum and Saccharum spontaneum by Genome Sequencing and Cytological Assays. Frontiers in Plant Science 2022; 13:814620. [PMID: 35273624 PMCID: PMC8902033 DOI: 10.3389/fpls.2022.814620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/14/2021] [Accepted: 01/28/2022] [Indexed: 06/14/2023]
Abstract
In most plant species, repetitive DNA elements such as satellites and retrotransposons make up the majority of the genome. Saccharum officinarum (2n = 8x = 80) and S. spontaneum (2n = 40-128) are the two fundamental donors of modern sugarcane cultivars. These two species are polyploids with large genome sizes and are enriched in repetitive elements. In this work, we adopted a de novo strategy to isolate highly repetitive and abundant sequences in S. officinarum LA Purple and S. spontaneum SES208. The findings obtained from alignment to the genome assemblies revealed that the vast majority of the repeats (97.9% in LA Purple and 96.5% in SES208) were dispersed in the respective genomes. Fluorescence in situ hybridization assays were performed on 27 representative repeats to investigate their distributions and abundances. The results showed that the copies of some highly repeated sequences, including rDNA and centromeric or telomeric repeats, were underestimated in current genome assemblies. The analysis of the raw read mapping strategy showed more copy numbers for all studied repeats, suggesting that copy number underestimation is common for highly repeated sequences in current genome assemblies of LA Purple and SES208. In addition, the data showed that the centromeric retrotransposons in all SES208 centromeres were absent in certain S. spontaneum clones with different ploidies. This rapid turnover of centromeric DNA in sugarcane provides new clues regarding the pattern of centromeric retrotransposon formation and accumulation.
|
36
|
AnnoSINE: a short interspersed nuclear elements annotation tool for plant genomes. Plant Physiology 2022; 188:955-970. [PMID: 34792587 PMCID: PMC8825457 DOI: 10.1093/plphys/kiab524] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 07/16/2021] [Accepted: 10/01/2021] [Indexed: 06/13/2023]
Abstract
Short interspersed nuclear elements (SINEs) are a widespread type of small transposable element (TE). With increasing evidence for their impact on gene function and genome evolution in plants, accurate genome-scale SINE annotation becomes a fundamental step for studying the regulatory roles of SINEs and their relationship with other components in the genomes. Despite the overall promising progress made in TE annotation, SINE annotation remains a major challenge. Unlike some other TEs, SINEs are short and heterogeneous, and they usually lack well-conserved sequence or structural features. Thus, current SINE annotation tools have either low sensitivity or high false discovery rates. Given the demand and challenges, we aimed to provide a more accurate and efficient SINE annotation tool for plant genomes. The pipeline starts with maximizing the pool of SINE candidates via profile hidden Markov model-based homology search and de novo SINE search using structural features. Then, it excludes the false positives by integrating all known features of SINEs and the features of other types of TEs that can often be misannotated as SINEs. As a result, the pipeline substantially improves the tradeoff between sensitivity and accuracy, with both values close to or over 90%. We tested our tool in Arabidopsis thaliana and rice (Oryza sativa), and the results show that our tool competes favorably against existing SINE annotation tools. The simplicity and effectiveness of this tool would potentially be useful for generating more accurate SINE annotations for other plant species. The pipeline is freely available at https://github.com/yangli557/AnnoSINE.
|
37
|
An Eruption of LTR Retrotransposons in the Autopolyploid Genomes of Chrysanthemum nankingense (Asteraceae). Plants (Basel, Switzerland) 2022; 11:plants11030315. [PMID: 35161296 PMCID: PMC8839533 DOI: 10.3390/plants11030315] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/11/2022] [Revised: 01/21/2022] [Accepted: 01/22/2022] [Indexed: 05/09/2023]
Abstract
Whole genome duplication, associated with the induction of widespread genetic changes, has played an important role in the evolution of many plant taxa. All extant angiosperm species have undergone at least one polyploidization event, forming either an auto- or allopolyploid organism. Compared with allopolyploidization, however, few studies have examined autopolyploidization, and few studies have focused on the response of genetic changes to autopolyploidy. In the present study, newly synthesized C. nankingense autotetraploids (Asteraceae) were employed to characterize the genome shock following autopolyploidization. Available evidence suggested that the genetic changes primarily involved the loss of old fragments and the gain of novel fragments, and some novel sequences were potential long terminal repeat (LTR) retrotransposons. As Ty1-copia and Ty3-gypsy elements represent the two main superfamilies of LTR retrotransposons, the dynamics of Ty1-copia and Ty3-gypsy were evaluated using RT-PCR, transcriptome sequencing, and LTR retrotransposon-based molecular marker techniques. Additionally, fluorescence in situ hybridization (FISH) results suggest that autopolyploidization might also be accompanied by perturbations of LTR retrotransposons, and emergence retrotransposon insertions might show more rapid divergence, resulting in diploid-like behaviour, potentially accelerating the evolutionary process among progenies. Our results strongly suggest a need to expand the current evolutionary framework to include a genetic dimension when seeking to understand genomic shock following autopolyploidization in Asteraceae.
|
38
|
Centromere-Specific Retrotransposons and Very-Long-Chain Fatty Acid Biosynthesis in the Genome of Yellowhorn (Xanthoceras sorbifolium, Sapindaceae), an Oil-Producing Tree With Significant Drought Resistance. Frontiers in Plant Science 2021; 12:766389. [PMID: 34880890 PMCID: PMC8647845 DOI: 10.3389/fpls.2021.766389] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/02/2021] [Accepted: 10/18/2021] [Indexed: 05/17/2023]
Abstract
In-depth genome characterization is still lacking for most biofuel crops, especially for centromeres, which play a fundamental role during nuclear division and in the maintenance of genome stability. This study applied long-read sequencing technologies to assemble a highly contiguous genome for yellowhorn (Xanthoceras sorbifolium), an oil-producing tree, and conducted extensive comparative analyses to understand centromere structure and evolution, and fatty acid biosynthesis. We produced a reference-level genome of yellowhorn, ∼470 Mb in length with ∼95% of contigs anchored onto 15 chromosomes. Genome annotation identified 22,049 protein-coding genes and 65.7% of the genome sequence as repetitive elements. Long terminal repeat retrotransposons (LTR-RTs) account for ∼30% of the yellowhorn genome, which is maintained by a moderate birth rate and a low removal rate. We identified the centromeric regions on each chromosome and found enrichment of centromere-specific retrotransposons of LINE1 and Gypsy in these regions, which have evolved recently (∼0.7 MYA). We compared the genomes of three cultivars and found frequent inversions. We analyzed the transcriptomes from different tissues and identified the candidate genes involved in very-long-chain fatty acid biosynthesis and their expression profiles. Collinear block analysis showed that yellowhorn shared the gamma (γ) hexaploidy event with Vitis vinifera but did not undergo any further whole-genome duplication. This study provides excellent genomic resources for understanding centromere structure and evolution and for functional studies in this important oil-producing plant.
|
39
|
Comparative Genomics Analysis of Repetitive Elements in Ten Gymnosperm Species: "Dark Repeatome" and Its Abundance in Conifer and Gnetum Species. Life (Basel) 2021; 11:life11111234. [PMID: 34833110 PMCID: PMC8620675 DOI: 10.3390/life11111234] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/24/2021] [Revised: 11/09/2021] [Accepted: 11/09/2021] [Indexed: 11/16/2022] Open
Abstract
Repetitive elements (RE) and transposons (TE) can comprise up to 80% of some plant genomes and may be essential for regulating their evolution and adaptation. The “repeatome” information is often unavailable in assembled genomes because genomic areas of repeats are challenging to assemble and are often missing from the final assembly. However, raw genomic sequencing data contain rich information about RE/TEs. Here, raw genomic NGS reads of 10 gymnosperm species were studied for the content and abundance patterns of their “repeatome”. We utilized a combination of alignment on databases of repetitive elements and de novo assembly of highly repetitive sequences from genomic sequencing reads to characterize and calculate the abundance of known and putative repetitive elements in the genomes of 10 gymnosperm species: Pinus taeda, Pinus sylvestris, Pinus sibirica, Picea glauca, Picea abies, Abies sibirica, Larix sibirica, Juniperus communis, Taxus baccata, and Gnetum gnemon. We found that genome abundances of known and newly discovered putative repeats are specific to phylogenetically close groups of species and match biological taxa. The grouping of species based on abundances of known repeats closely matches the grouping based on abundances of newly discovered putative repeats (kChains) and matches the known taxonomic relations.
|
40
|
Impact of transposable elements on the evolution of complex living systems and their epigenetic control. Biosystems 2021; 210:104566. [PMID: 34718084 DOI: 10.1016/j.biosystems.2021.104566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/01/2021] [Revised: 10/21/2021] [Accepted: 10/21/2021] [Indexed: 10/20/2022]
Abstract
Transposable elements (TEs) contribute to genomic innovations, as well as genome instability, across a wide variety of species. Popular designations such as 'selfish DNA' and 'junk DNA,' common in the 1980s, may be either inaccurate or misleading, while a more enlightened view of the TE-host relationship covers a range from parasitism to mutualism. Both plant and animal hosts have evolved epigenetic mechanisms to reduce the impact of TEs, both by directly silencing them and by reducing their ability to transpose in the genome. However, TEs have also been co-opted by both plant and animal genomes to perform a variety of physiological functions, ranging from TE-derived proteins acting directly in normal biological functions to innovations in transcription factor activity and also influencing gene expression. Their presence, in fact, can affect a range of features at genome, phenotype, and population levels. The impact TEs have had on evolution is multifaceted, and many aspects still remain unexplored. In this review, the epigenetic control of TEs is contextualized according to the evolution of complex living systems.
|
41
|
Regulation of retrotransposition in Arabidopsis. Biochem Soc Trans 2021; 49:2241-2251. [PMID: 34495315 DOI: 10.1042/bst20210337] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/29/2021] [Revised: 08/10/2021] [Accepted: 08/12/2021] [Indexed: 01/01/2023]
Abstract
Plant genomes are largely composed of retrotransposons, which can replicate through 'copy and paste' mechanisms. Long terminal repeat (LTR) retrotransposons are the major class of retrotransposons in plant species, and importantly they broadly affect the expression of nearby genes. Although most LTR retrotransposons are non-functional, active retrotranspositions have been reported in plant species or mutants under normal growth conditions and environmental stresses. With the well-defined reference genome and numerous mutant alleles, Arabidopsis studies have significantly expanded our understanding of retrotransposon regulation. Active LTR retrotransposon loci produce virus-like particles to perform reverse transcription, and their complementary DNA can be inserted into new genomic loci. Due to the detrimental consequences of retrotransposition, plants, like animals, have developed transcriptional and post-transcriptional silencing mechanisms. Recently, several different genome-wide techniques have been developed to understand LTR retrotransposition in Arabidopsis and different plant species. Transposome, methylome, transcriptome, translatome and small RNA sequencing data have revealed how host silencing mechanisms can affect multiple steps of retrotransposition. These recent advances shed light on future mechanistic studies of retrotransposition as well as retrotransposon diversity.
|
42
|
DRD1, a SWI/SNF-like chromatin remodeling protein, regulates a heat-activated transposon in Arabidopsis thaliana. Genes Genet Syst 2021; 96:151-158. [PMID: 34373369 DOI: 10.1266/ggs.21-00005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/23/2022] Open
Abstract
ONSEN is a heat-activated LTR retrotransposon in Arabidopsis thaliana. Screens to identify transcriptional regulatory factors of ONSEN revealed a SWI/SNF-like chromatin remodeling protein, DRD1, which cooperates with plant-specific RNA polymerase and is involved in RNA-directed DNA methylation. ONSEN transcript level was increased in the drd1 mutant relative to wild-type under heat stress, indicating that DRD1 plays a significant role in the silencing of activated ONSEN under the stress condition. The transcript level of HsfA2, which is directly involved in transcriptional activation of ONSEN, was not higher in the drd1 mutant than in the wild-type. Interestingly, no transgenerational transposition of ONSEN was observed in the drd1 mutant, even though DNA methylation levels were significantly reduced and expression levels were increased compared to the wild-type. These results suggest that other factors are involved in the regulation of ONSEN transposition in addition to the transcript level of ONSEN.
|
43
|
In Silico Analysis of Fatty Acid Desaturases Structures in Camelina sativa, and Functional Evaluation of Csafad7 and Csafad8 on Seed Oil Formation and Seed Morphology. Int J Mol Sci 2021; 22:ijms221910857. [PMID: 34639198 PMCID: PMC8532002 DOI: 10.3390/ijms221910857] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 09/14/2021] [Revised: 10/01/2021] [Accepted: 10/05/2021] [Indexed: 12/19/2022] Open
Abstract
Fatty acid desaturases add a second bond into a single bond of carbon atoms in fatty acid chains, resulting in an unsaturated bond between the two carbons. They are classified into soluble and membrane-bound desaturases, according to their structure, subcellular location, and function. The orthologous genes in Camelina sativa were identified and analyzed, and a total of 62 desaturase genes were identified. It was revealed that they had the common fatty acid desaturase domain, which has evolved separately, and the proteins of the same family also originated from the same ancestry. A mix of conserved, gained, or lost intron structure was obvious. Besides, conserved histidine motifs were found in each family, and transmembrane domains were exclusively revealed in the membrane-bound desaturases. The expression profile analysis of C. sativa desaturases revealed an increase in young leaves, seeds, and flowers. C. sativa ω3-fatty acid desaturases CsaFAD7 and CsaFAD8 were cloned and the subcellular localization analysis showed their location in the chloroplast. They were transferred into Arabidopsis thaliana to obtain transgenic lines. It was revealed that the ω3-fatty acid desaturase could increase the C18:3 level at the expense of C18:2, but decreases in oil content and seed weight, and wrinkled phenotypes were observed in transgenic CsaFAD7 lines, while no significant change was observed in transgenic CsaFAD8 lines in comparison to the wild-type. These findings gave insights into the characteristics of desaturase genes, which could provide an excellent basis for further investigation for C. sativa improvement, and overexpression of ω3-fatty acid desaturases in seeds could be useful in genetic engineering strategies, which are aimed at modifying the fatty acid composition of seed oil.
|
44
|
Algae-mediated processes for the treatment of antiretroviral drugs in wastewater: Prospects and challenges. Chemosphere 2021; 280:130674. [PMID: 34162077 DOI: 10.1016/j.chemosphere.2021.130674] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 03/08/2021] [Revised: 04/21/2021] [Accepted: 04/22/2021] [Indexed: 06/13/2023]
Abstract
The prevalence of pharmaceuticals (PCs), especially antiretroviral (ARV) drugs, in various aquatic ecosystems has been extensively reported, wherein wastewater treatment plants (WWTPs) are identified as the primary point source. Consequently, the occurrence, ecotoxicity and treatment of ARV drugs in WWTPs have drawn much attention in recent years. Numerous studies have shown that the widely employed activated sludge-based WWTPs are incapable of removing ARV drugs efficiently from wastewater. Recently, algae-based wastewater treatment processes have shown promising results in PCs removal from wastewater, either completely or partially, through different processes such as biosorption, bioaccumulation, and intra-/inter-cellular degradation. Algal species have also been shown to tolerate higher concentrations of ARV drugs than the reported concentrations in the environmental matrices. In this review, emphasis has been given to discussing the current status of the occurrence of ARV drugs in the aquatic environment and WWTPs. Besides, the current trends and future perspectives of PCs removal by algae are critically reviewed and discussed. The potential pathways and mechanisms of ARV drugs removal by algae have also been discussed.
|
45
|
Development of molecular markers based on LTR retrotransposon in the Cleistogenes songorica genome. J Appl Genet 2021; 63:61-72. [PMID: 34554437 DOI: 10.1007/s13353-021-00658-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 10/25/2020] [Revised: 08/09/2021] [Accepted: 08/23/2021] [Indexed: 11/26/2022]
Abstract
Long terminal repeat retrotransposons (LTR-RTs) contribute a large fraction of many sequenced plant genomes and play important roles in genomic diversity and phenotypic variations. LTR-RTs are abundantly distributed in plant genomes, facilitating the development of markers based on LTR-RTs for a variety of genotyping purposes. Whole-genome analysis of LTR-RTs was performed in Cleistogenes songorica. A total of 299,079 LTR-RTs were identified and classified as Gypsy type, Copia type, or other type. LTR-RTs were widely distributed in the genome, enriched in the heterochromatic region of the chromosome, and negatively correlated with gene distribution. However, approximately one-fifth of genes were still interrupted by LTR-RTs, and these genes are annotated. Furthermore, four types of primer pairs (PPs) were designed, namely, retrotransposon-based insertion polymorphisms, inter-retrotransposon amplified polymorphisms, insertion site-based polymorphisms, and retrotransposon-microsatellite amplified polymorphisms. A total of 350 PPs were screened in 23 accessions of the genus Cleistogenes, of which 80 PPs showed polymorphism, and 72 PPs showed transferability among Gramineae and non-Gramineae species. In addition, a comparative analysis of homologous LTR-RTs was performed with other related grasses. Taken together, the study will serve as a valuable resource for genotyping applications for C. songorica and related grasses.
|
46
|
Insights from the first genome assembly of Onion (Allium cepa). G3 (Bethesda, Md.) 2021; 11. [PMID: 34544132 DOI: 10.1101/2021.03.05.434149] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 04/29/2021] [Accepted: 07/06/2021] [Indexed: 05/18/2023]
Abstract
Onion is an important vegetable crop with an estimated genome size of 16 Gb. We describe the de novo assembly and ab initio annotation of the genome of a doubled haploid onion line DHCU066619, which resulted in a final assembly of 14.9 Gb with an N50 of 464 Kb. Of this, 2.4 Gb was ordered into eight pseudomolecules using four genetic linkage maps. The remainder of the genome is available in 89.6 K scaffolds. Only 72.4% of the genome could be identified as repetitive sequences and consist, to a large extent, of (retro) transposons. In addition, an estimated 20% of the putative (retro) transposons had accumulated a large number of mutations, hampering their identification, but facilitating their assembly. These elements are probably already quite old. The ab initio gene prediction indicated 540,925 putative gene models, which is far more than expected, possibly due to the presence of pseudogenes. Of these models, 47,066 showed RNASeq support. No gene-rich regions were found; genes are uniformly distributed over the genome. Analysis of synteny with Allium sativum (garlic) showed collinearity but also major rearrangements between both species. This assembly is the first high-quality genome sequence available for the study of onion and will be a valuable resource for further research.
|
47
|
Insights from the first genome assembly of Onion (Allium cepa). G3 (Bethesda, Md.) 2021; 11:jkab243. [PMID: 34544132 PMCID: PMC8496297 DOI: 10.1093/g3journal/jkab243] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Received: 04/29/2021] [Accepted: 07/06/2021] [Indexed: 11/17/2022]
Abstract
Onion is an important vegetable crop with an estimated genome size of 16 Gb. We describe the de novo assembly and ab initio annotation of the genome of a doubled haploid onion line DHCU066619, which resulted in a final assembly of 14.9 Gb with an N50 of 464 Kb. Of this, 2.4 Gb was ordered into eight pseudomolecules using four genetic linkage maps. The remainder of the genome is available in 89.6 K scaffolds. Only 72.4% of the genome could be identified as repetitive sequences and consist, to a large extent, of (retro) transposons. In addition, an estimated 20% of the putative (retro) transposons had accumulated a large number of mutations, hampering their identification, but facilitating their assembly. These elements are probably already quite old. The ab initio gene prediction indicated 540,925 putative gene models, which is far more than expected, possibly due to the presence of pseudogenes. Of these models, 47,066 showed RNASeq support. No gene-rich regions were found; genes are uniformly distributed over the genome. Analysis of synteny with Allium sativum (garlic) showed collinearity but also major rearrangements between both species. This assembly is the first high-quality genome sequence available for the study of onion and will be a valuable resource for further research.
|
48
|
A spinach genome assembly with remarkable completeness, and its use for rapid identification of candidate genes for agronomic traits. DNA Res 2021; 28:6303609. [PMID: 34142133 PMCID: PMC8231376 DOI: 10.1093/dnares/dsab004] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 01/05/2021] [Indexed: 01/23/2023] Open
Abstract
Spinach (Spinacia oleracea) is grown as a nutritious leafy vegetable worldwide. To accelerate spinach breeding efficiency, a high-quality reference genome sequence with great completeness and continuity is needed as a basic infrastructure. Here, we used long-read and linked-read technologies to construct a de novo spinach genome assembly, designated SOL_r1.1, which was comprised of 287 scaffolds (total size: 935.7 Mb; N50 = 11.3 Mb) with a low proportion of undetermined nucleotides (Ns = 0.34%) and with high gene completeness (BUSCO complete 96.9%). A genome-wide survey of resistance gene analogues identified 695 genes encoding nucleotide-binding site domains, receptor-like protein kinases, receptor-like proteins and transmembrane-coiled coil domains. Based on a high-density double-digest restriction-site associated DNA sequencing-based linkage map, the genome assembly was anchored to six pseudomolecules representing ∼73.5% of the whole genome assembly. In addition, we used SOL_r1.1 to identify quantitative trait loci for bolting timing and fruit/seed shape, which harbour biologically plausible candidate genes, such as homologues of the FLOWERING LOCUS T and EPIDERMAL PATTERNING FACTOR-LIKE genes. The new genome assembly, SOL_r1.1, will serve as a useful resource for identifying loci associated with important agronomic traits and for developing molecular markers for spinach breeding/selection programs.
|
49
|
A comprehensive annotation dataset of intact LTR retrotransposons of 300 plant genomes. Sci Data 2021; 8:174. [PMID: 34267227 PMCID: PMC8282616 DOI: 10.1038/s41597-021-00968-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 02/23/2021] [Accepted: 06/07/2021] [Indexed: 12/11/2022] Open
Abstract
LTR retrotransposons (LTR-RTs) are ubiquitous and represent the dominant repeat element in plant genomes, playing important roles in functional variation, genome plasticity and evolution. With the advent of new sequencing technologies, a growing number of whole-genome sequences have been made publicly available, making it possible to carry out systematic analyses of LTR-RTs. However, a comprehensive and unified annotation of LTR-RTs in plant groups is still lacking. Here, we constructed a plant intact LTR-RTs dataset, which is designed to classify and annotate intact LTR-RTs with a standardized procedure. The dataset currently comprises a total of 2,593,685 intact LTR-RTs from genomes of 300 plant species representing 93 families of 46 orders. The dataset is accompanied by sequence, diverse structural and functional annotation, age determination and classification information associated with the LTR-RTs. This dataset will contribute valuable resources for investigating the evolutionary dynamics and functional implications of LTR-RTs in plant genomes.
|
50
|
Genome-wide analyses of tandem repeats and transposable elements in patchouli. Genes Genet Syst 2021; 96:81-87. [PMID: 33883323 DOI: 10.1266/ggs.20-00044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022] Open
Abstract
Patchouli, Pogostemon cablin (Blanco) Benth., is a traditional Chinese medicinal plant from the order Lamiales. It is considered a valuable herb due to its essential oil content and range of therapeutic effects. This study aimed to explore the evolutionary history of repetitive sequences in the patchouli genome by analyzing tandem repeats and transposable elements (TEs). We first retrieved genomic data for patchouli and four other Lamiales species from the GenBank database. Next, the content of tandem repeats with different period sizes was identified. Long terminal repeats (LTRs) were then identified with LTR_STRUC. Finally, the evolutionary landscape of TEs was explored using an in-house PERL program. The analysis of repetitive sequences revealed that tandem repeats constitute a higher proportion of the patchouli genome compared to the four other species. Analyses of TE families showed that most of the repetitive sequences in the patchouli genome are TEs, and that recently inserted TEs make up a comparatively larger proportion than older ones. Our analyses of LTR retrotransposons in their host genome indicated the existence of ancient LTR retrotransposon expansion, and the escape of these elements from natural selection revealed their ages. Our identification and analyses of repetitive sequences should provide new insights for further investigation of patchouli evolution.
|