1
|
The Cissus quadrangularis genome reveals its adaptive features in an arid habitat. HORTICULTURE RESEARCH 2024; 11:uhae038. [PMID: 38595910 PMCID: PMC11001597 DOI: 10.1093/hr/uhae038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 01/26/2024] [Indexed: 04/11/2024]
Abstract
Cissus quadrangularis is a tetraploid species belonging to the Vitaceae family and is known for the Crassulacean acid metabolism (CAM) pathway in the succulent stem, while the leaves perform C3 photosynthesis. Here, we report a high-quality genome of C. quadrangularis comprising a total size of 679.2 Mb which was phased into two subgenomes. Genome annotation identified 51 857 protein-coding genes, while approximately 47.75% of the genome was composed of repetitive sequences. Gene expression ratios of two subgenomes demonstrated that the sub-A genome as the dominant subgenome played a vital role during the drought tolerance. Genome divergence analysis suggests that the tetraploidization event occurred around 8.9 million years ago. Transcriptome data revealed that pathways related to cutin, suberine, and wax metabolism were enriched in the stem during drought treatment, suggesting that these genes contributed to the drought adaption. Additionally, a subset of CAM-related genes displayed diurnal expression patterns in the succulent stems but not in leaves, indicating that stem-biased expression of existing genes contributed to the CAM evolution. Our findings provide insights into the mechanisms of drought adaptation and photosynthesis transition in plants.
Collapse
|
2
|
Integrative multiomics profiling of passion fruit reveals the genetic basis for fruit color and aroma. PLANT PHYSIOLOGY 2024; 194:2491-2510. [PMID: 38039148 DOI: 10.1093/plphys/kiad640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 10/26/2023] [Accepted: 10/29/2023] [Indexed: 12/03/2023]
Abstract
Passion fruit (Passiflora edulis) possesses a complex aroma and is widely grown in tropical and subtropical areas. Here, we conducted the de novo assembly, annotation, and comparison of PPF (P. edulis Sims) and YPF (P. edulis f. flavicarpa) reference genomes using PacBio, Illumina, and Hi-C technologies. Notably, we discovered evidence of recent whole-genome duplication events in P. edulis genomes. Comparative analysis revealed 7.6∼8.1 million single nucleotide polymorphisms, 1 million insertions/deletions, and over 142 Mb presence/absence variations among different P. edulis genomes. During the ripening of yellow passion fruit, metabolites related to flavor, aroma, and color were substantially accumulated or changed. Through joint analysis of genomic variations, differentially expressed genes, and accumulated metabolites, we explored candidate genes associated with flavor, aroma, and color distinctions. Flavonoid biosynthesis pathways, anthocyanin biosynthesis pathways, and related metabolites are pivotal factors affecting the coloration of passion fruit, and terpenoid metabolites accumulated more in PPF. Finally, by heterologous expression in yeast (Saccharomyces cerevisiae), we functionally characterized 12 terpene synthases. Our findings revealed that certain TPS homologs in both YPF and PPF varieties produce identical terpene products, while others yield distinct compounds or even lose their functionality. These discoveries revealed the genetic and metabolic basis of unique characteristics in aroma and flavor between the 2 passion fruit varieties. This study provides resources for better understanding the genome architecture and accelerating genetic improvement of passion fruits.
Collapse
|
3
|
Einkorn genomics sheds light on history of the oldest domesticated wheat. Nature 2023; 620:830-838. [PMID: 37532937 PMCID: PMC10447253 DOI: 10.1038/s41586-023-06389-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Accepted: 06/29/2023] [Indexed: 08/04/2023]
Abstract
Einkorn (Triticum monococcum) was the first domesticated wheat species, and was central to the birth of agriculture and the Neolithic Revolution in the Fertile Crescent around 10,000 years ago1,2. Here we generate and analyse 5.2-Gb genome assemblies for wild and domesticated einkorn, including completely assembled centromeres. Einkorn centromeres are highly dynamic, showing evidence of ancient and recent centromere shifts caused by structural rearrangements. Whole-genome sequencing analysis of a diversity panel uncovered the population structure and evolutionary history of einkorn, revealing complex patterns of hybridizations and introgressions after the dispersal of domesticated einkorn from the Fertile Crescent. We also show that around 1% of the modern bread wheat (Triticum aestivum) A subgenome originates from einkorn. These resources and findings highlight the history of einkorn evolution and provide a basis to accelerate the genomics-assisted improvement of einkorn and bread wheat.
Collapse
|
4
|
Applying molecular and genetic methods to trees and their fungal communities. Appl Microbiol Biotechnol 2023; 107:2783-2830. [PMID: 36988668 PMCID: PMC10106355 DOI: 10.1007/s00253-023-12480-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 03/05/2023] [Accepted: 03/07/2023] [Indexed: 03/30/2023]
Abstract
Forests provide invaluable economic, ecological, and social services. At the same time, they are exposed to several threats, such as fragmentation, changing climatic conditions, or increasingly destructive pests and pathogens. Trees, the inherent species of forests, cannot be viewed as isolated organisms. Manifold (micro)organisms are associated with trees playing a pivotal role in forest ecosystems. Of these organisms, fungi may have the greatest impact on the life of trees. A multitude of molecular and genetic methods are now available to investigate tree species and their associated organisms. Due to their smaller genome sizes compared to tree species, whole genomes of different fungi are routinely compared. Such studies have only recently started in forest tree species. Here, we summarize the application of molecular and genetic methods in forest conservation genetics, tree breeding, and association genetics as well as for the investigation of fungal communities and their interrelated ecological functions. These techniques provide valuable insights into the molecular basis of adaptive traits, the impacts of forest management, and changing environmental conditions on tree species and fungal communities and can enhance tree-breeding cycles due to reduced time for field testing. It becomes clear that there are multifaceted interactions among microbial species as well as between these organisms and trees. We demonstrate the versatility of the different approaches based on case studies on trees and fungi. KEY POINTS: • Current knowledge of genetic methods applied to forest trees and associated fungi. • Genomic methods are essential in conservation, breeding, management, and research. • Important role of phytobiomes for trees and their ecosystems.
Collapse
|
5
|
PlantLTRdb: An interactive database for 195 plant species LTR-retrotransposons. FRONTIERS IN PLANT SCIENCE 2023; 14:1134627. [PMID: 36950350 PMCID: PMC10025401 DOI: 10.3389/fpls.2023.1134627] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Accepted: 02/16/2023] [Indexed: 05/29/2023]
Abstract
LTR-retrotransposons (LTR-RTs) are a large group of transposable elements that replicate through an RNA intermediate and alter genome structure. The activities of LTR-RTs in plant genomes provide helpful information about genome evolution and gene function. LTR-RTs near or within genes can directly alter gene function. This work introduces PlantLTRdb, an intact LTR-RT database for 195 plant species. Using homology- and de novo structure-based methods, a total of 150.18 Gbp representing 3,079,469 pseudomolecules/scaffolds were analyzed to identify, characterize, annotate LTR-RTs, estimate insertion ages, detect LTR-RT-gene chimeras, and determine nearby genes. Accordingly, 520,194 intact LTR-RTs were discovered, including 29,462 autonomous and 490,732 nonautonomous LTR-RTs. The autonomous LTR-RTs included 10,286 Gypsy and 19,176 Copia, while the nonautonomous were divided into 224,906 Gypsy, 218,414 Copia, 1,768 BARE-2, 3,147 TR-GAG and 4,2497 unknown. Analysis of the identified LTR-RTs located within genes showed that a total of 36,236 LTR-RTs were LTR-RT-gene chimeras and 11,619 LTR-RTs were within pseudo-genes. In addition, 50,026 genes are within 1 kbp of LTR-RTs, and 250,587 had a distance of 1 to 10 kbp from LTR-RTs. PlantLTRdb allows researchers to search, visualize, BLAST and analyze plant LTR-RTs. PlantLTRdb can contribute to the understanding of structural variations, genome organization, functional genomics, and the development of LTR-RT target markers for molecular plant breeding. PlantLTRdb is available at https://bioinformatics.um6p.ma/PlantLTRdb.
Collapse
|
6
|
The giant diploid faba genome unlocks variation in a global protein crop. Nature 2023; 615:652-659. [PMID: 36890232 PMCID: PMC10033403 DOI: 10.1038/s41586-023-05791-5] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 02/03/2023] [Indexed: 03/10/2023]
Abstract
Increasing the proportion of locally produced plant protein in currently meat-rich diets could substantially reduce greenhouse gas emissions and loss of biodiversity1. However, plant protein production is hampered by the lack of a cool-season legume equivalent to soybean in agronomic value2. Faba bean (Vicia faba L.) has a high yield potential and is well suited for cultivation in temperate regions, but genomic resources are scarce. Here, we report a high-quality chromosome-scale assembly of the faba bean genome and show that it has expanded to a massive 13 Gb in size through an imbalance between the rates of amplification and elimination of retrotransposons and satellite repeats. Genes and recombination events are evenly dispersed across chromosomes and the gene space is remarkably compact considering the genome size, although with substantial copy number variation driven by tandem duplication. Demonstrating practical application of the genome sequence, we develop a targeted genotyping assay and use high-resolution genome-wide association analysis to dissect the genetic basis of seed size and hilum colour. The resources presented constitute a genomics-based breeding platform for faba bean, enabling breeders and geneticists to accelerate the improvement of sustainable protein production across the Mediterranean, subtropical and northern temperate agroecological zones.
Collapse
|
7
|
Phylotranscriptomics and evolution of key genes for terpene biosynthesis in Pinaceae. FRONTIERS IN PLANT SCIENCE 2023; 14:1114579. [PMID: 36875589 PMCID: PMC9982022 DOI: 10.3389/fpls.2023.1114579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 02/01/2023] [Indexed: 06/18/2023]
Abstract
Pinaceae is the largest family of conifers, dominating forest ecosystems and serving as the backbone of northern, temperate and mountain forests. The terpenoid metabolism of conifers is responsive to pests, diseases, and environmental stress. Determining the phylogeny and evolution of terpene synthase genes in Pinaceae may shed light on early adaptive evolution. We used different inference methods and datasets to reconstruct the Pinaceae phylogeny based on our assembled transcriptomes. We identified the final species tree of Pinaceae by comparing and summarizing different phylogenetic trees. The genes encoding terpene synthase (TPS) and cytochrome P450 proteins in Pinaceae showed a trend of expansion compared with those in Cycas. Gene family analysis revealed that the number of TPS genes decreased while the number of P450 genes increased in loblolly pine. Expression profiles showed that TPSs and P450s were mainly expressed in leaf buds and needles, which may be the result of long-term evolution to protect these two vulnerable tissues. Our research provides insights into the phylogeny and evolution of terpene synthase genes in Pinaceae and offers some useful references for the investigation of terpenoids in conifers.
Collapse
|
8
|
Retrotransposons: How the continuous evolutionary front shapes plant genomes for response to heat stress. FRONTIERS IN PLANT SCIENCE 2022; 13:1064847. [PMID: 36570931 PMCID: PMC9780303 DOI: 10.3389/fpls.2022.1064847] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 11/21/2022] [Indexed: 05/28/2023]
Abstract
Long terminal repeat retrotransposons (LTR retrotransposons) are the most abundant group of mobile genetic elements in eukaryotic genomes and are essential in organizing genomic architecture and phenotypic variations. The diverse families of retrotransposons are related to retroviruses. As retrotransposable elements are dispersed and ubiquitous, their "copy-out and paste-in" life cycle of replicative transposition leads to new genome insertions without the excision of the original element. The overall structure of retrotransposons and the domains responsible for the various phases of their replication is highly conserved in all eukaryotes. The two major superfamilies of LTR retrotransposons, Ty1/Copia and Ty3/Gypsy, are distinguished and dispersed across the chromosomes of higher plants. Members of these superfamilies can increase in copy number and are often activated by various biotic and abiotic stresses due to retrotransposition bursts. LTR retrotransposons are important drivers of species diversity and exhibit great variety in structure, size, and mechanisms of transposition, making them important putative actors in genome evolution. Additionally, LTR retrotransposons influence the gene expression patterns of adjacent genes by modulating potential small interfering RNA (siRNA) and RNA-directed DNA methylation (RdDM) pathways. Furthermore, comparative and evolutionary analysis of the most important crop genome sequences and advanced technologies have elucidated the epigenetics and structural and functional modifications driven by LTR retrotransposon during speciation. However, mechanistic insights into LTR retrotransposons remain obscure in plant development due to a lack of advancement in high throughput technologies. In this review, we focus on the key role of LTR retrotransposons response in plants during heat stress, the role of centromeric LTR retrotransposons, and the role of LTR retrotransposon markers in genome expression and evolution.
Collapse
|
9
|
Evolution of complex genome architecture in gymnosperms. Gigascience 2022; 11:6659718. [PMID: 35946987 PMCID: PMC9364684 DOI: 10.1093/gigascience/giac078] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 06/09/2022] [Accepted: 07/15/2022] [Indexed: 11/25/2022] Open
Abstract
Gymnosperms represent an ancient lineage that diverged from early spermatophytes during the Devonian. The long fossil records and low diversity in living species prove their complex evolutionary history, which included ancient radiations and massive extinctions. Due to their ultra-large genome size, the whole-genome assembly of gymnosperms has only generated in the past 10 years and is now being further expanded into more taxonomic representations. Here, we provide an overview of the publicly available gymnosperm genome resources and discuss their assembly quality and recent findings in large genome architectures. In particular, we describe the genomic features most related to changes affecting the whole genome. We also highlight new realizations relative to repetitive sequence dynamics, paleopolyploidy, and long introns. Based on the results of relevant genomic studies of gymnosperms, we suggest additional efforts should be made toward exploring the genomes of medium-sized (5–15 gigabases) species. Lastly, more comparative analyses among high-quality assemblies are needed to understand the genomic shifts and the early species diversification of seed plants.
Collapse
|
10
|
Chromosome-level genome assembly of the aquatic plant Nymphoides indica reveals transposable element bursts and NBS-LRR gene family expansion shedding light on its invasiveness. DNA Res 2022; 29:6617837. [PMID: 35751614 PMCID: PMC9267246 DOI: 10.1093/dnares/dsac022] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Accepted: 06/24/2022] [Indexed: 11/19/2022] Open
Abstract
Nymphoides indica, an aquatic plant, is an invasive species that causes both ecological and economic damage in North America and elsewhere. However, the lack of genomic data of N. indica limits the in-depth analysis of this invasive species. Here, we report a chromosome-level genome assembly of nine pseudochromosomes of N. indica with a total size of ∼ 520 Mb. More than half of the N. indica genome consists of transposable elements (TEs), and a higher density of TEs around genes may play a significant role in response to an ever-changing environment by regulating the nearby gene. Additionally, our analysis revealed that N. indica only experienced a gamma (γ) whole-genome triplication event. Functional enrichment of the N. indica-specific and expanded gene families highlighted genes involved in the responses to hypoxia and plant–pathogen interactions, which may strengthen the ability to adapt to external challenges and improve ecological fitness. Furthermore, we identified 160 members of the nucleotide-binding site and leucine-rich repeat gene family, which may be linked to the defence response. Collectively, the high-quality N. indica genome reported here opens a novel avenue to understand the evolution and rapid invasion of Nymphoides spp.
Collapse
|
11
|
Genomic insights into the origin, adaptive evolution, and herbicide resistance of Leptochloa chinensis, a devastating tetraploid weedy grass in rice fields. MOLECULAR PLANT 2022; 15:1045-1058. [PMID: 35524410 DOI: 10.1016/j.molp.2022.05.001] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 04/30/2022] [Accepted: 05/01/2022] [Indexed: 06/14/2023]
Abstract
Chinese sprangletop (Leptochloa chinensis), belonging to the grass subfamily Chloridoideae, is one of the most notorious weeds in rice ecosystems. Here, we report a chromosome-scale reference genome assembly and a genomic variation map of the tetraploid L. chinensis. The L. chinensis genome is derived from two diploid progenitors that diverged ∼10.9 million years ago, and its two subgenomes display neither fractionation bias nor overall gene expression dominance. Comparative genomic analyses reveal substantial genome rearrangements in L. chinensis after its divergence from the common ancestor of Chloridoideae and, together with transcriptome profiling, demonstrate the important contribution of tetraploidization to the gene sources for the herbicide resistance of L. chinensis. Population genomic analyses of 89 accessions from China reveal that L. chinensis accessions collected from southern/southwestern provinces have substantially higher nucleotide diversity than those from the middle and lower reaches of the Yangtze River, suggesting that L. chinensis spread in China from the southern/southwestern provinces to the middle and lower reaches of the Yangtze River. During this spread, L. chinensis developed significantly increased herbicide resistance, accompanied by the selection of numerous genes involved in herbicide resistance. Taken together, our study generated valuable genomic resources for future fundamental research and agricultural management of L. chinensis, and provides significant new insights into the herbicide resistance as well as the origin and adaptive evolution of L. chinensis.
Collapse
|
12
|
Genomes, repeatomes and interphase chromosome organization in the meadowfoam family (Limnanthaceae, Brassicales). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:1462-1475. [PMID: 35352402 DOI: 10.1111/tpj.15750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 03/17/2022] [Accepted: 03/28/2022] [Indexed: 06/14/2023]
Abstract
The meadowfoam family (Limnanthaceae) is one of the smallest and genomically underexplored families of the Brassicales. The Limnanthaceae harbor about seven species in the genus Limnanthes (meadowfoam) and Floerkea proserpinacoides (false mermaidweed), all native to North America. Because all Limnanthes and Floerkea species have only five chromosome pairs, i.e., a chromosome number rare in Brassicales and shared with Arabidopsis thaliana (Arabidopsis), we examined the Limnanthaceae genomes as a potential model system. Using low-coverage whole-genome sequencing data, we reexamined phylogenetic relationships and characterized the repeatomes of Limnanthaceae genomes. Phylogenies based on complete chloroplast and 35S rDNA sequences corroborated the sister relationship between Floerkea and Limnanthes and two major clades in the latter genus. The genome size of Limnanthaceae species ranges from 1.5 to 2.1 Gb, apparently due to the large increase in DNA repeats, which constitute 60-70% of their genomes. Repeatomes are dominated by long terminal repeat retrotransposons, while tandem repeats represent only less than 0.5% of the genomes. The average chromosome size in Limnanthaceae species (340-420 Mb) is more than 10 times larger than in Arabidopsis (32 Mb). A three-dimensional fluorescence in situ hybridization analysis demonstrated that the five chromosome pairs in interphase nuclei of Limnanthes species adopt the Rabl-like configuration.
Collapse
|
13
|
The Chinese pine genome and methylome unveil key features of conifer evolution. Cell 2021; 185:204-217.e14. [PMID: 34965378 DOI: 10.1016/j.cell.2021.12.006] [Citation(s) in RCA: 94] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Revised: 10/23/2021] [Accepted: 12/03/2021] [Indexed: 12/30/2022]
Abstract
Conifers dominate the world's forest ecosystems and are the most widely planted tree species. Their giant and complex genomes present great challenges for assembling a complete reference genome for evolutionary and genomic studies. We present a 25.4-Gb chromosome-level assembly of Chinese pine (Pinus tabuliformis) and revealed that its genome size is mostly attributable to huge intergenic regions and long introns with high transposable element (TE) content. Large genes with long introns exhibited higher expressions levels. Despite a lack of recent whole-genome duplication, 91.2% of genes were duplicated through dispersed duplication, and expanded gene families are mainly related to stress responses, which may underpin conifers' adaptation, particularly in cold and/or arid conditions. The reproductive regulation network is distinct compared with angiosperms. Slow removal of TEs with high-level methylation may have contributed to genomic expansion. This study provides insights into conifer evolution and resources for advancing research on conifer adaptation and development.
Collapse
|
14
|
Methylation patterns of Tf2 retrotransposons linked to rapid adaptive stress response in the brown planthopper (Nilaparvata lugens). Genomics 2021; 113:4214-4226. [PMID: 34774681 DOI: 10.1016/j.ygeno.2021.11.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 10/12/2021] [Accepted: 11/07/2021] [Indexed: 11/23/2022]
Abstract
Transposable elements (TEs) exhibit vast diversity across insect orders and are one of the major factors driving insect evolution and speciation. Presence of TEs can be both beneficial and deleterious to their host. While it is well-established that TEs impact life-history traits, adaptations and survivability of insects under hostile environments, the influence of the ecological niche on TE-landscape remains unclear. Here, we analysed the dynamics of Tf2 retrotransposons in the brown planthopper (BPH), under environmental fluctuations. BPH, a major pest of rice, is found in almost all rice-growing ecosystems. We believe genome plasticity, attributed to TEs, has allowed BPH to adapt and colonise novel ecological niches. Our study revealed bimodal age-distribution for Tf2 elements in BPH, indicating the occurrence of two major transpositional events in its evolutionary history and their contribution in shaping BPH genome. While TEs can provide genome flexibility and facilitate adaptations, they impose massive load on the genome. Hence, we investigated the involvement of methylation in modulating transposition in BPH. We performed comparative analyses of the methylation patterns of Tf2 elements in BPH feeding on resistant- and susceptible-rice varieties, and also under pesticide stress, across different life-stages. Results confirmed that methylation, particularly in non-CG context, is involved in TE regulation and dynamics under stress. Furthermore, we observed differential methylation for BPH adults and nymphs, emphasising the importance of screening juvenile life-stages in understanding adaptive-stress-responses in insects. Collectively, this study enhances our understanding of the role of transposons in influencing the evolutionary trajectory and survival strategies of BPH across generations.
Collapse
|
15
|
Genome Size Doubling Arises From the Differential Repetitive DNA Dynamics in the Genus Heloniopsis (Melanthiaceae). Front Genet 2021; 12:726211. [PMID: 34552621 PMCID: PMC8450539 DOI: 10.3389/fgene.2021.726211] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 08/19/2021] [Indexed: 12/23/2022] Open
Abstract
Plant genomes are highly diverse in size and repetitive DNA composition. In the absence of polyploidy, the dynamics of repetitive elements, which make up the bulk of the genome in many species, are the main drivers underpinning changes in genome size and the overall evolution of the genomic landscape. The advent of high-throughput sequencing technologies has enabled investigation of genome evolutionary dynamics beyond model plants to provide exciting new insights in species across the biodiversity of life. Here we analyze the evolution of repetitive DNA in two closely related species of Heloniopsis (Melanthiaceae), which despite having the same chromosome number differ nearly twofold in genome size [i.e., H. umbellata (1C = 4,680 Mb), and H. koreana (1C = 2,480 Mb)]. Low-coverage genome skimming and the RepeatExplorer2 pipeline were used to identify the main repeat families responsible for the significant differences in genome sizes. Patterns of repeat evolution were found to correlate with genome size with the main classes of transposable elements identified being twice as abundant in the larger genome of H. umbellata compared with H. koreana. In addition, among the satellite DNA families recovered, a single shared satellite (HeloSAT) was shown to have contributed significantly to the genome expansion of H. umbellata. Evolutionary changes in repetitive DNA composition and genome size indicate that the differences in genome size between these species have been underpinned by the activity of several distinct repeat lineages.
Collapse
|
16
|
Genome downsizing after polyploidy: mechanisms, rates and selection pressures. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 107:1003-1015. [PMID: 34077584 DOI: 10.1111/tpj.15363] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 05/07/2021] [Accepted: 05/13/2021] [Indexed: 05/20/2023]
Abstract
An analysis of over 10 000 plant genome sizes (GSs) indicates that most species have smaller genomes than expected given the incidence of polyploidy in their ancestries, suggesting selection for genome downsizing. However, comparing ancestral GS with the incidence of ancestral polyploidy suggests that the rate of DNA loss following polyploidy is likely to have been very low (4-70 Mb/million years, 4-482 bp/generation). This poses a problem. How might such small DNA losses be visible to selection, overcome the power of genetic drift and drive genome downsizing? Here we explore that problem, focussing on the role that double-strand break (DSB) repair pathways (non-homologous end joining and homologous recombination) may have played. We also explore two hypotheses that could explain how selection might favour genome downsizing following polyploidy: to reduce (i) nitrogen (N) and phosphate (P) costs associated with nucleic acid synthesis in the nucleus and the transcriptome and (ii) the impact of scaling effects of GS on cell size, which influences CO2 uptake and water loss. We explore the hypothesis that losses of DNA must be fastest in early polyploid generations. Alternatively, if DNA loss is a more continuous process over evolutionary time, then we propose it is a byproduct of selection elsewhere, such as limiting the damaging activity of repetitive DNA. If so, then the impact of GS on photosynthesis, water use efficiency and/or nutrient costs at the nucleus level may be emergent properties, which have advantages, but not ones that could have been selected for over generational timescales.
Collapse
|
17
|
The Welwitschia genome reveals a unique biology underpinning extreme longevity in deserts. Nat Commun 2021; 12:4247. [PMID: 34253727 PMCID: PMC8275611 DOI: 10.1038/s41467-021-24528-4] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Accepted: 06/21/2021] [Indexed: 02/06/2023] Open
Abstract
The gymnosperm Welwitschia mirabilis belongs to the ancient, enigmatic gnetophyte lineage. It is a unique desert plant with extreme longevity and two ever-elongating leaves. We present a chromosome-level assembly of its genome (6.8 Gb/1 C) together with methylome and transcriptome data to explore its astonishing biology. We also present a refined, high-quality assembly of Gnetum montanum to enhance our understanding of gnetophyte genome evolution. The Welwitschia genome has been shaped by a lineage-specific ancient, whole genome duplication (~86 million years ago) and more recently (1-2 million years) by bursts of retrotransposon activity. High levels of cytosine methylation (particularly at CHH motifs) are associated with retrotransposons, whilst long-term deamination has resulted in an exceptionally GC-poor genome. Changes in copy number and/or expression of gene families and transcription factors (e.g. R2R3MYB, SAUR) controlling cell growth, differentiation and metabolism underpin the plant's longevity and tolerance to temperature, nutrient and water stress.
Collapse
|
18
|
Conversion between 100-million-year-old duplicated genes contributes to rice subspecies divergence. BMC Genomics 2021; 22:460. [PMID: 34147070 PMCID: PMC8214281 DOI: 10.1186/s12864-021-07776-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Accepted: 06/03/2021] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Duplicated gene pairs produced by ancient polyploidy maintain high sequence similarity over a long period of time and may result from illegitimate recombination between homeologous chromosomes. The genomes of Asian cultivated rice Oryza sativa ssp. indica (XI) and Oryza sativa ssp. japonica (GJ) have recently been updated, providing new opportunities for investigating ongoing gene conversion events and their impact on genome evolution. RESULTS Using comparative genomics and phylogenetic analyses, we evaluated gene conversion rates between duplicated genes produced by polyploidization 100 million years ago (mya) in GJ and XI. At least 5.19-5.77% of genes duplicated across the three rice genomes were affected by whole-gene conversion after the divergence of GJ and XI at ~ 0.4 mya, with more (7.77-9.53%) showing conversion of only portions of genes. Independently converted duplicates surviving in the genomes of different subspecies often use the same donor genes. The ongoing gene conversion frequency was higher near chromosome termini, with a single pair of homoeologous chromosomes, 11 and 12, in each rice genome being most affected. Notably, ongoing gene conversion has maintained similarity between very ancient duplicates, provided opportunities for further gene conversion, and accelerated rice divergence. Chromosome rearrangements after polyploidization are associated with ongoing gene conversion events, and they directly restrict recombination and inhibit duplicated gene conversion between homeologous regions. Furthermore, we found that the converted genes tended to have more similar expression patterns than nonconverted duplicates. Gene conversion affects biological functions associated with multiple genes, such as catalytic activity, implying opportunities for interaction among members of large gene families, such as NBS-LRR disease-resistance genes, contributing to the occurrence of the gene conversion. CONCLUSION Duplicated genes in rice subspecies generated by grass polyploidization ~ 100 mya remain affected by gene conversion at high frequency, with important implications for the divergence of rice subspecies.
Collapse
|
19
|
Gene Conversion amongst Alu SINE Elements. Genes (Basel) 2021; 12:genes12060905. [PMID: 34208107 PMCID: PMC8230782 DOI: 10.3390/genes12060905] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Revised: 05/30/2021] [Accepted: 06/08/2021] [Indexed: 11/17/2022] Open
Abstract
The process of non-allelic gene conversion acts on homologous sequences during recombination, replacing parts of one with the other to make them uniform. Such concerted evolution is best described as paralogous ribosomal RNA gene unification that serves to preserve the essential house-keeping functions of the converted genes. Transposed elements (TE), especially Alu short interspersed elements (SINE) that have more than a million copies in primate genomes, are a significant source of homologous units and a verified target of gene conversion. The consequences of such a recombination-based process are diverse, including multiplications of functional TE internal binding domains and, for evolutionists, confusing divergent annotations of orthologous transposable elements in related species. We systematically extracted and compared 68,097 Alu insertions in various primates looking for potential events of TE gene conversion and discovered 98 clear cases of Alu-Alu gene conversion, including 64 cases for which the direction of conversion was identified (e.g., AluS conversion to AluY). Gene conversion also does not necessarily affect the entire homologous sequence, and we detected 69 cases of partial gene conversion that resulted in virtual hybrids of two elements. Phylogenetic screening of gene-converted Alus revealed three clear hotspots of the process in the ancestors of Catarrhini, Hominoidea, and gibbons. In general, our systematic screening of orthologous primate loci for gene-converted TEs provides a new strategy and view of a post-integrative process that changes the identities of such elements.
Collapse
|
20
|
Horizontal Gene Transfer Involving Chloroplasts. Int J Mol Sci 2021; 22:ijms22094484. [PMID: 33923118 PMCID: PMC8123421 DOI: 10.3390/ijms22094484] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 04/22/2021] [Accepted: 04/23/2021] [Indexed: 02/04/2023] Open
Abstract
Horizontal gene transfer (HGT)- is defined as the acquisition of genetic material from another organism. However, recent findings indicate a possible role of HGT in the acquisition of traits with adaptive significance, suggesting that HGT is an important driving force in the evolution of eukaryotes as well as prokaryotes. It has been noted that, in eukaryotes, HGT is more prevalent than originally thought. Mitochondria and chloroplasts lost a large number of genes after their respective endosymbiotic events occurred. Even after this major content loss, organelle genomes still continue to lose their own genes. Many of these are subsequently acquired by intracellular gene transfer from the original plastid. The aim of our review was to elucidate the role of chloroplasts in the transfer of genes. This review also explores gene transfer involving mitochondrial and nuclear genomes, though recent studies indicate that chloroplast genomes are far more active in HGT as compared to these other two DNA-containing cellular compartments.
Collapse
|
21
|
RNA directed DNA methylation and seed plant genome evolution. PLANT CELL REPORTS 2020; 39:983-996. [PMID: 32594202 DOI: 10.1007/s00299-] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Accepted: 06/08/2020] [Indexed: 05/28/2023]
Abstract
RNA Directed DNA Methylation (RdDM) is a pathway that mediates de novo DNA methylation, an evolutionary conserved chemical modification of cytosine bases, which exists in living organisms and utilizes small interfering RNA. Plants utilize DNA methylation for transposable element (TE) repression, regulation of gene expression and developmental regulation. TE activity strongly influences genome size and evolution, therefore making DNA methylation a key component in understanding divergence in genome evolution among seed plants. Multiple proteins that have extensively been studied in model plant Arabidopsis thaliana catalyze RNA dependent DNA Methylation pathway along with small interfering RNA. Several developmental functions have also been attributed to DNA methylation. This review will highlight aspects of RdDM pathway dynamics, evolution and functions in seed plants with focus on recent findings on conserved and non-conserved attributes between angiosperms and gymnosperms to potentially explain how methylation has impacted variations in evolutionary and developmental complexity among them and advance current understanding of this crucial epigenetic pathway.
Collapse
|
22
|
RNA directed DNA methylation and seed plant genome evolution. PLANT CELL REPORTS 2020; 39:983-996. [PMID: 32594202 PMCID: PMC7359171 DOI: 10.1007/s00299-020-02558-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Accepted: 06/08/2020] [Indexed: 05/11/2023]
Abstract
RNA Directed DNA Methylation (RdDM) is a pathway that mediates de novo DNA methylation, an evolutionary conserved chemical modification of cytosine bases, which exists in living organisms and utilizes small interfering RNA. Plants utilize DNA methylation for transposable element (TE) repression, regulation of gene expression and developmental regulation. TE activity strongly influences genome size and evolution, therefore making DNA methylation a key component in understanding divergence in genome evolution among seed plants. Multiple proteins that have extensively been studied in model plant Arabidopsis thaliana catalyze RNA dependent DNA Methylation pathway along with small interfering RNA. Several developmental functions have also been attributed to DNA methylation. This review will highlight aspects of RdDM pathway dynamics, evolution and functions in seed plants with focus on recent findings on conserved and non-conserved attributes between angiosperms and gymnosperms to potentially explain how methylation has impacted variations in evolutionary and developmental complexity among them and advance current understanding of this crucial epigenetic pathway.
Collapse
|
23
|
Abstract
Transposable elements (TEs) are insertional mutagens that contribute greatly to the plasticity of eukaryotic genomes, influencing the evolution and adaptation of species as well as physiology or disease in individuals. Measuring TE expression helps to understand not only when and where TE mobilization can occur but also how this process alters gene expression, chromatin accessibility or cellular signalling pathways. Although genome-wide gene expression assays such as RNA sequencing include transposon-derived transcripts, most computational analytical tools discard or misinterpret TE-derived reads. Emerging approaches are improving the identification of expressed TE loci and helping to discriminate TE transcripts that permit TE mobilization from chimeric gene-TE transcripts or pervasive transcription. Here we review the main challenges associated with the detection of TE expression, including mappability, insertional and internal sequence polymorphisms, and the diversity of the TE transcriptional landscape, as well as the different experimental and computational strategies to solve them.
Collapse
|
24
|
Reprogramming of Retrotransposon Activity during Speciation of the Genus Citrus. Genome Biol Evol 2020; 11:3478-3495. [PMID: 31710678 PMCID: PMC7145672 DOI: 10.1093/gbe/evz246] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/04/2019] [Indexed: 12/13/2022] Open
Abstract
Speciation of the genus Citrus from a common ancestor has recently been established to begin ∼8 Ma during the late Miocene, a period of major climatic alterations. Here, we report the changes in activity of Citrus LTR retrotransposons during the process of diversification that gave rise to the current Citrus species. To reach this goal, we analyzed four pure species that diverged early during Citrus speciation, three recent admixtures derived from those species and an outgroup of the Citrus clade. More than 30,000 retrotransposons were grouped in ten linages. Estimations of LTR insertion times revealed that retrotransposon activity followed a species-specific pattern of change that could be ascribed to one of three different models. In some genomes, the expected pattern of gradual transposon accumulation was suddenly arrested during the radiation of the ancestor that gave birth to the current Citrus species. The individualized analyses of retrotransposon lineages showed that in each and every species studied, not all lineages follow the general pattern of the species itself. For instance, in most of the genomes, the retrotransposon activity of elements from the SIRE lineage reached its highest level just before Citrus speciation, while for Retrofit elements, it has been steadily growing. Based on these observations, we propose that Citrus retrotransposons may respond to stressful conditions driving speciation as a part of the genetic response involved in adaptation. This proposal implies that the evolving conditions of each species interact with the internal regulatory mechanisms of the genome controlling the proliferation of mobile elements.
Collapse
|
25
|
What Can Long Terminal Repeats Tell Us About the Age of LTR Retrotransposons, Gene Conversion and Ectopic Recombination? FRONTIERS IN PLANT SCIENCE 2020; 11:644. [PMID: 32508870 PMCID: PMC7251063 DOI: 10.3389/fpls.2020.00644] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Accepted: 04/27/2020] [Indexed: 05/10/2023]
Abstract
LTR retrotransposons constitute a significant part of plant genomes and their evolutionary dynamics play an important role in genome size changes. Current methods of LTR retrotransposon age estimation are based only on LTR (long terminal repeat) divergence. This has prompted us to analyze sequence similarity of LTRs in 25,144 LTR retrotransposons from fifteen plant species as well as formation of solo LTRs. We found that approximately one fourth of nested retrotransposons showed a higher LTR divergence than the pre-existing retrotransposons into which they had been inserted. Moreover, LTR similarity was correlated with LTR length. We propose that gene conversion can contribute to this phenomenon. Gene conversion prediction in LTRs showed potential converted regions in 25% of LTR pairs. Gene conversion was higher in species with smaller genomes while the proportion of solo LTRs did not change with genome size in analyzed species. The negative correlation between the extent of gene conversion and the abundance of solo LTRs suggests interference between gene conversion and ectopic recombination. Since such phenomena limit the traditional methods of LTR retrotransposon age estimation, we recommend an improved approach based on the exclusion of regions affected by gene conversion.
Collapse
|
26
|
Variant Calling Using Whole Genome Resequencing and Sequence Capture for Population and Evolutionary Genomic Inferences in Norway Spruce (Picea Abies). COMPENDIUM OF PLANT GENOMES 2020. [DOI: 10.1007/978-3-030-21001-4_2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
27
|
Intergenomic gene transfer in diploid and allopolyploid Gossypium. BMC PLANT BIOLOGY 2019; 19:492. [PMID: 31718541 PMCID: PMC6852956 DOI: 10.1186/s12870-019-2041-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Accepted: 09/20/2019] [Indexed: 05/03/2023]
Abstract
BACKGROUND Intergenomic gene transfer (IGT) between nuclear and organellar genomes is a common phenomenon during plant evolution. Gossypium is a useful model to evaluate the genomic consequences of IGT for both diploid and polyploid species. Here, we explore IGT among nuclear, mitochondrial, and plastid genomes of four cotton species, including two allopolyploids and their model diploid progenitors (genome donors, G. arboreum: A2 and G. raimondii: D5). RESULTS Extensive IGT events exist for both diploid and allotetraploid cotton (Gossypium) species, with the nuclear genome being the predominant recipient of transferred DNA followed by the mitochondrial genome. The nuclear genome has integrated 100 times more foreign sequences than the mitochondrial genome has in total length. In the nucleus, the integrated length of chloroplast DNA (cpDNA) was between 1.87 times (in diploids) to nearly four times (in allopolyploids) greater than that of mitochondrial DNA (mtDNA). In the mitochondrion, the length of nuclear DNA (nuDNA) was typically three times than that of cpDNA. Gossypium mitochondrial genomes integrated three nuclear retrotransposons and eight chloroplast tRNA genes, and incorporated chloroplast DNA prior to divergence between the diploids and allopolyploid formation. For mitochondrial chloroplast-tRNA genes, there were 2-6 bp conserved microhomologies flanking their insertion sites across distantly related genera, which increased to 10 bp microhomologies for the four cotton species studied. For organellar DNA sequences, there are source hotspots, e.g., the atp6-trnW intergenic region in the mitochondrion and the inverted repeat region in the chloroplast. Organellar DNAs in the nucleus were rarely expressed, and at low levels. Surprisingly, there was asymmetry in the survivorship of ancestral insertions following allopolyploidy, with most numts (nuclear mitochondrial insertions) decaying or being lost whereas most nupts (nuclear plastidial insertions) were retained. CONCLUSIONS This study characterized and compared intracellular transfer among nuclear and organellar genomes within two cultivated allopolyploids and their ancestral diploid cotton species. A striking asymmetry in the fate of IGTs in allopolyploid cotton was discovered, with numts being preferentially lost relative to nupts. Our results connect intergenomic gene transfer with allotetraploidy and provide new insight into intracellular genome evolution.
Collapse
|
28
|
Retrotransposons in Plant Genomes: Structure, Identification, and Classification through Bioinformatics and Machine Learning. Int J Mol Sci 2019; 20:E3837. [PMID: 31390781 PMCID: PMC6696364 DOI: 10.3390/ijms20153837] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 07/31/2019] [Accepted: 08/02/2019] [Indexed: 01/26/2023] Open
Abstract
Transposable elements (TEs) are genomic units able to move within the genome of virtually all organisms. Due to their natural repetitive numbers and their high structural diversity, the identification and classification of TEs remain a challenge in sequenced genomes. Although TEs were initially regarded as "junk DNA", it has been demonstrated that they play key roles in chromosome structures, gene expression, and regulation, as well as adaptation and evolution. A highly reliable annotation of these elements is, therefore, crucial to better understand genome functions and their evolution. To date, much bioinformatics software has been developed to address TE detection and classification processes, but many problematic aspects remain, such as the reliability, precision, and speed of the analyses. Machine learning and deep learning are algorithms that can make automatic predictions and decisions in a wide variety of scientific applications. They have been tested in bioinformatics and, more specifically for TEs, classification with encouraging results. In this review, we will discuss important aspects of TEs, such as their structure, importance in the evolution and architecture of the host, and their current classifications and nomenclatures. We will also address current methods and their limitations in identifying and classifying TEs.
Collapse
|
29
|
Abstract
Among the multitude of papers published yearly in scientific journals, precious few publications may be worth looking back in half a century to appreciate the significance of the discoveries that would later become common knowledge and get a chance to shape a field or several adjacent fields. Here, Kimura's fundamental concept of neutral mutation-random drift, which was published 50 years ago, is re-examined in light of its pervasive influence on comparative genomics and, more specifically, on the contribution of transposable elements to eukaryotic genome evolution.
Collapse
|
30
|
Novel Insights into Plant Genome Evolution and Adaptation as Revealed through Transposable Elements and Non-Coding RNAs in Conifers. Genes (Basel) 2019; 10:genes10030228. [PMID: 30889931 PMCID: PMC6470726 DOI: 10.3390/genes10030228] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Revised: 03/08/2019] [Accepted: 03/11/2019] [Indexed: 01/03/2023] Open
Abstract
Plant genomes are punctuated by repeated bouts of proliferation of transposable elements (TEs), and these mobile bursts are followed by silencing and decay of most of the newly inserted elements. As such, plant genomes reflect TE-related genome expansion and shrinkage. In general, these genome activities involve two mechanisms: small RNA-mediated epigenetic repression and long-term mutational decay and deletion, that is, genome-purging. Furthermore, the spatial relationships between TE insertions and genes are an important force in shaping gene regulatory networks, their downstream metabolic and physiological outputs, and thus their phenotypes. Such cascading regulations finally set up a fitness differential among individuals. This brief review demonstrates factual evidence that unifies most updated conceptual frameworks covering genome size, architecture, epigenetic reprogramming, and gene expression. It aims to give an overview of the impact that TEs may have on genome and adaptive evolution and to provide novel insights into addressing possible causes and consequences of intimidating genome sizes (20⁻30 Gb) in a taxonomic group, conifers.
Collapse
|
31
|
Exploring the loblolly pine (Pinus taeda L.) genome by BAC sequencing and Cot analysis. Gene 2018; 663:165-177. [PMID: 29655895 DOI: 10.1016/j.gene.2018.04.024] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2017] [Revised: 03/20/2018] [Accepted: 04/10/2018] [Indexed: 02/06/2023]
Abstract
Loblolly pine (LP; Pinus taeda L.) is an economically and ecologically important tree in the southeastern U.S. To advance understanding of the loblolly pine (LP; Pinus taeda L.) genome, we sequenced and analyzed 100 BAC clones and performed a Cot analysis. The Cot analysis indicates that the genome is composed of 57, 24, and 10% highly-repetitive, moderately-repetitive, and single/low-copy sequences, respectively (the remaining 9% of the genome is a combination of fold back and damaged DNA). Although single/low-copy DNA only accounts for 10% of the LP genome, the amount of single/low-copy DNA in LP is still 14 times the size of the Arabidopsis genome. Since gene numbers in LP are similar to those in Arabidopsis, much of the single/low-copy DNA of LP would appear to be composed of DNA that is both gene- and repeat-poor. Macroarrays prepared from a LP bacterial artificial chromosome (BAC) library were hybridized with probes designed from cell wall synthesis/wood development cDNAs, and 50 of the "targeted" clones were selected for further analysis. An additional 25 clones were selected because they contained few repeats, while 25 more clones were selected at random. The 100 BAC clones were Sanger sequenced and assembled. Of the targeted BACs, 80% contained all or part of the cDNA used to target them. One targeted BAC was found to contain fungal DNA and was eliminated from further analysis. Combinations of similarity-based and ab initio gene prediction approaches were utilized to identify and characterize potential coding regions in the 99 BACs containing LP DNA. From this analysis, we identified 154 gene models (GMs) representing both putative protein-coding genes and likely pseudogenes. Ten of the GMs (all of which were specifically targeted) had enough support to be classified as intact genes. Interestingly, the 154 GMs had statistically indistinguishable (α = 0.05) distributions in the targeted and random BAC clones (15.18 and 12.61 GM/Mb, respectively), whereas the low-repeat BACs contained significantly fewer GMs (7.08 GM/Mb). However, when GM length was considered, the targeted BACs had a significantly greater percentage of their length in GMs (3.26%) when compared to random (1.63%) and low-repeat (0.62%) BACs. The results of our study provide insight into LP evolution and inform ongoing efforts to produce a reference genome sequence for LP, while characterization of genes involved in cell wall production highlights carbon metabolism pathways that can be leveraged for increasing wood production.
Collapse
|
32
|
Genome Size Diversity and Its Impact on the Evolution of Land Plants. Genes (Basel) 2018; 9:E88. [PMID: 29443885 PMCID: PMC5852584 DOI: 10.3390/genes9020088] [Citation(s) in RCA: 150] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Revised: 02/02/2018] [Accepted: 02/05/2018] [Indexed: 01/09/2023] Open
Abstract
Genome size is a biodiversity trait that shows staggering diversity across eukaryotes, varying over 64,000-fold. Of all major taxonomic groups, land plants stand out due to their staggering genome size diversity, ranging ca. 2400-fold. As our understanding of the implications and significance of this remarkable genome size diversity in land plants grows, it is becoming increasingly evident that this trait plays not only an important role in shaping the evolution of plant genomes, but also in influencing plant community assemblages at the ecosystem level. Recent advances and improvements in novel sequencing technologies, as well as analytical tools, make it possible to gain critical insights into the genomic and epigenetic mechanisms underpinning genome size changes. In this review we provide an overview of our current understanding of genome size diversity across the different land plant groups, its implications on the biology of the genome and what future directions need to be addressed to fill key knowledge gaps.
Collapse
|
33
|
|