1
|
Genome-wide DNA methylation dynamics following recent polyploidy in the allotetraploid Tragopogon miscellus (Asteraceae). THE NEW PHYTOLOGIST 2024; 242:1363-1376. [PMID: 38450804 DOI: 10.1111/nph.19655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 01/15/2024] [Indexed: 03/08/2024]
Abstract
Polyploidy is an important evolutionary force, yet epigenetic mechanisms, such as DNA methylation, that regulate genome-wide expression of duplicated genes remain largely unknown. Here, we use Tragopogon (Asteraceae) as a model system to discover patterns and temporal dynamics of DNA methylation in recently formed polyploids. The naturally occurring allotetraploid Tragopogon miscellus formed in the last 95-100 yr from parental diploids Tragopogon dubius and T. pratensis. We profiled the DNA methylomes of these three species using whole-genome bisulfite sequencing. Genome-wide methylation levels in T. miscellus were intermediate between its diploid parents. However, nonadditive CG and CHG methylation occurred in transposable elements (TEs), with variation among TE types. Most differentially methylated regions (DMRs) showed parental legacy, but some novel DMRs were detected in the polyploid. Differentially methylated genes (DMGs) were also identified and characterized. This study provides the first assessment of both overall and locus-specific patterns of DNA methylation in a recent natural allopolyploid and shows that novel methylation variants can be generated rapidly after polyploid formation. Together, these results demonstrate that mechanisms to regulate duplicate gene expression may arise soon after allopolyploid formation and that these mechanisms vary among genes.
Collapse
|
2
|
Abundant incongruence in a clade endemic to a biodiversity hotspot: Phylogenetics of the scrub mint clade (Lamiaceae). Mol Phylogenet Evol 2024; 192:108014. [PMID: 38199595 DOI: 10.1016/j.ympev.2024.108014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 12/26/2023] [Accepted: 01/06/2024] [Indexed: 01/12/2024]
Abstract
The Scrub Mint clade(Lamiaceae) provides a unique system for investigating the evolutionary processes driving diversification in the North American Coastal Plain from both a systematic and biogeographic context. The clade comprisesDicerandra, Conradina, Piloblephis, Stachydeoma, and four species of the broadly defined genus Clinopodium(Mentheae; Lamiaceae), almost all of which are endemic to the North American Eastern Coastal Plain. Most species of this clade are threatened or endangered and restricted to sandhill or a mosaic of scrub habitats. We analyzed relationships in this clade to understand the evolution of the group and identify evolutionary mechanisms acting on the clade, with important implications for conservation. We used a target-capture method to sequence and analyze 238 nuclear loci across all species of scrub mints, reconstructed the phylogeny, and calculated gene tree concordance, gene tree estimation error, and reticulation indices for every node in the tree using ML methods. Phylogenetic networks were used to determine reticulation events. Our nuclear phylogenetic estimates were consistent with previous results, while greatly increasing the robustness of taxon sampling. The phylogeny resolved the full relationship between Dicerandra and Conradina and the less-studied members of the clade (Piloblephis, Stachydeoma, Clinopodium spp.). We found hotspots of gene tree discordance and reticulation throughout the tree, especially in perennial Dicerandra. Several instances of reticulation events were uncovered between annual and perennial Dicerandra, and within the Conradina + allies clade. Incomplete lineage sorting also likely contributed to phylogenetic discordance. These results clarify phylogenetic relationships in the clade and provide insight on important evolutionary drivers in the clade, such as hybridization. General relationships in the group were confirmed, while the large amount of gene tree discordance is likely due to reticulation across the phylogeny.
Collapse
|
3
|
Phylogenetic diversity of eastern Asia-eastern North America disjunct plants is mainly associated with divergence time. PLANT DIVERSITY 2023; 45:27-35. [PMID: 36876316 PMCID: PMC9975473 DOI: 10.1016/j.pld.2022.09.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Revised: 09/29/2022] [Accepted: 09/30/2022] [Indexed: 06/18/2023]
Abstract
The underlying causes of biodiversity disparities among geographic regions have long been a fundamental theme in ecology and evolution. However, the patterns of phylogenetic diversity (PD) and phylogenetic beta diversity (PBD) of congeners that are disjunctly distributed between eastern Asia-eastern North America (EA-ENA disjuncts) and their associated factors remain unknown. Here we investigated the standardized effect size of PD (SES-PD), PBD, and potentially associated factors in 11 natural mixed forest sites (five in EA and six in ENA) where abundant EA-ENA disjuncts occur. We found that the disjuncts in ENA possessed higher SES-PD than those in EA at the continental scale (1.96 vs -1.12), even though the number of disjunct species in ENA is much lower than in EA (128 vs 263). SES-PD of the EA-ENA disjuncts tended to decrease with increasing latitude in 11 sites. The latitudinal diversity gradient of SES-PD was stronger in EA sites than in ENA sites. Based on the unweighted unique fraction metric (UniFrac) distance and the phylogenetic community dissimilarity, PBD showed that the two northern sites in EA were more similar to the six-site ENA group than to the remaining southern EA sites. Based on the standardized effect size of mean pairwise distances (SES-MPD), nine of eleven studied sites showed a neutral community structure (-1.96 ≤ SES-MPD ≤ 1.96). Both Pearson's r and structural equation modeling suggested that SES-PD of the EA-ENA disjuncts was mostly associated with mean divergence time. Moreover, SES-PD of the EA-ENA disjuncts was positively correlated with temperature-related climatic factors, although negatively correlated with mean diversification rate and community structure. By applying approaches from phylogenetics and community ecology, our work sheds light on historical patterns of the EA-ENA disjunction and paves the way for further research.
Collapse
|
4
|
Plastid phylogenomic insights into relationships of all flowering plant families. BMC Biol 2021; 19:232. [PMID: 34711223 PMCID: PMC8555322 DOI: 10.1186/s12915-021-01166-2] [Citation(s) in RCA: 81] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 10/14/2021] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Flowering plants (angiosperms) are dominant components of global terrestrial ecosystems, but phylogenetic relationships at the familial level and above remain only partially resolved, greatly impeding our full understanding of their evolution and early diversification. The plastome, typically mapped as a circular genome, has been the most important molecular data source for plant phylogeny reconstruction for decades. RESULTS Here, we assembled by far the largest plastid dataset of angiosperms, composed of 80 genes from 4792 plastomes of 4660 species in 2024 genera representing all currently recognized families. Our phylogenetic tree (PPA II) is essentially congruent with those of previous plastid phylogenomic analyses but generally provides greater clade support. In the PPA II tree, 75% of nodes at or above the ordinal level and 78% at or above the familial level were resolved with high bootstrap support (BP ≥ 90). We obtained strong support for many interordinal and interfamilial relationships that were poorly resolved previously within the core eudicots, such as Dilleniales, Saxifragales, and Vitales being resolved as successive sisters to the remaining rosids, and Santalales, Berberidopsidales, and Caryophyllales as successive sisters to the asterids. However, the placement of magnoliids, although resolved as sister to all other Mesangiospermae, is not well supported and disagrees with topologies inferred from nuclear data. Relationships among the five major clades of Mesangiospermae remain intractable despite increased sampling, probably due to an ancient rapid radiation. CONCLUSIONS We provide the most comprehensive dataset of plastomes to date and a well-resolved phylogenetic tree, which together provide a strong foundation for future evolutionary studies of flowering plants.
Collapse
|
5
|
A new, simple, highly scalable, and efficient protocol for genomic DNA extraction from diverse plant taxa. APPLICATIONS IN PLANT SCIENCES 2021; 9:e11413. [PMID: 33854847 PMCID: PMC8027365 DOI: 10.1002/aps3.11413] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Accepted: 01/29/2021] [Indexed: 06/12/2023]
Abstract
PREMISE Commonly used molecular techniques such as next-generation sequencing require reliable methods to extract DNA quickly and efficiently. Secondary compounds within plant tissues make this requirement all the more challenging, often forcing researchers to test different extraction methods tailored to their particular species of interest in order to obtain large amounts of high-quality genomic DNA. The opportunities provided by high-throughput, next-generation sequencing only exacerbate these problems, especially when trying to extract DNA from multiple species at the same time. Several methods have attempted to resolve the challenges of obtaining suitable DNA from plants; however, a rapid, high-yield, high-quality, and highly scalable DNA extraction method is still needed. METHODS AND RESULTS We present a rapid DNA extraction protocol that utilizes a buffer with relatively large amounts of cetyltrimethylammonium bromide (CTAB) and sodium chloride, combined with a silica maxi-column cleanup of the extracted DNA. The new method is easy to implement using standard equipment and inexpensive reagents. The entire procedure (from grinding to the final elution) can be completed in less than two hours for a single sample and can be easily scaled to meet desired research goals. It works on diverse green plants with highly varied secondary chemistries (e.g., ferns, gymnosperms, and phylogenetically divergent angiosperms). CONCLUSIONS Application of the protocol to various plant species yielded DNA of high quality in less than two hours and can be adjusted to extract DNA at large (maxi-preps) or small (96-well minipreps) scales. We anticipate that our method will be of wide utility for rapidly isolating large quantities of quality genomic DNA from diverse plant species and will have broad applications in phylogenetic studies utilizing PCR and short-read DNA sequencing.
Collapse
|
6
|
Green giant-a tiny chloroplast genome with mighty power to produce high-value proteins: history and phylogeny. PLANT BIOTECHNOLOGY JOURNAL 2021; 19:430-447. [PMID: 33484606 PMCID: PMC7955891 DOI: 10.1111/pbi.13556] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 01/11/2021] [Accepted: 01/16/2021] [Indexed: 05/04/2023]
Abstract
Free-living cyanobacteria were entrapped by eukaryotic cells ~2 billion years ago, ultimately giving rise to chloroplasts. After a century of debate, the presence of chloroplast DNA was demonstrated in the 1960s. The first chloroplast genomes were sequenced in the 1980s, followed by ~100 vegetable, fruit, cereal, beverage, oil and starch/sugar crop chloroplast genomes in the past three decades. Foreign genes were expressed in isolated chloroplasts or intact plant cells in the late 1980s and stably integrated into chloroplast genomes, with typically maternal inheritance shown in the 1990s. Since then, chloroplast genomes conferred the highest reported levels of tolerance or resistance to biotic or abiotic stress. Although launching products with agronomic traits in important crops using this concept has been elusive, commercial products developed include enzymes used in everyday life from processing fruit juice, to enhancing water absorption of cotton fibre or removal of stains as laundry detergents and in dye removal in the textile industry. Plastid genome sequences have revealed the framework of green plant phylogeny as well as the intricate history of plastid genome transfer events to other eukaryotes. Discordant historical signals among plastid genes suggest possible variable constraints across the plastome and further understanding and mitigation of these constraints may yield new opportunities for bioengineering. In this review, we trace the evolutionary history of chloroplasts, status of autonomy and recent advances in products developed for everyday use or those advanced to the clinic, including treatment of COVID-19 patients and SARS-CoV-2 vaccine.
Collapse
|
7
|
Genetic insights into the evolution of genera with the eastern Asia-eastern North America floristic disjunction: a transcriptomics analysis. AMERICAN JOURNAL OF BOTANY 2020; 107:1736-1748. [PMID: 33280088 DOI: 10.1002/ajb2.1579] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 06/29/2020] [Indexed: 06/12/2023]
Abstract
PREMISE Large disjunctions in species distributions provide excellent opportunities to study processes that shape biogeographic patterns. One such disjunction is the eastern Asia-eastern North America (EA-ENA) floristic disjunction. For many genera with this disjunction, species richness is greater in EA than in ENA; this pattern has been attributed, in part, to higher rates of molecular evolution and speciation in EA. Longer branch lengths have been found in some EA clades, relative to their ENA sister clades, suggesting that the EA lineages have evolved at a higher rate, possibly due to environmental heterogeneity, potentially contributing to the species richness anomaly. METHODS To evaluate whether rates of molecular evolution are elevated in EA relative to ENA, we used transcriptomes from species in 11 genera displaying this disjunction. Rates of molecular evolution were estimated for up to 385 orthologous nuclear loci per genus. RESULTS No statistically significant differences were identified in pairwise comparisons between EA and ENA sister species, suggesting equal rates of molecular evolution for both species; the data also suggest similar selection pressures in both regions. For larger genera, evidence likewise argues against more species-rich clades having higher molecular evolutionary rates, regardless of region. Our results suggest that genes across multiple gene ontology categories are evolving at similar rates under purifying selection in species in both regions. CONCLUSIONS Our data support the hypothesis that greater species richness in EA than ENA is due to factors other than an overall increase in rates of molecular evolution in EA.
Collapse
|
8
|
Recent accelerated diversification in rosids occurred outside the tropics. Nat Commun 2020; 11:3333. [PMID: 32620894 PMCID: PMC7335165 DOI: 10.1038/s41467-020-17116-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Accepted: 06/05/2020] [Indexed: 01/28/2023] Open
Abstract
Conflicting relationships have been found between diversification rate and temperature across disparate clades of life. Here, we use a supermatrix comprising nearly 20,000 species of rosids-a clade of ~25% of all angiosperm species-to understand global patterns of diversification and its climatic association. Our approach incorporates historical global temperature, assessment of species' temperature niche, and two broad-scale characterizations of tropical versus non-tropical niche occupancy. We find the diversification rates of most subclades dramatically increased over the last 15 million years (Myr) during cooling associated with global expansion of temperate habitats. Climatic niche is negatively associated with diversification rates, with tropical rosids forming older communities and experiencing speciation rates ~2-fold below rosids in cooler climates. Our results suggest long-term cooling had a disproportionate effect on non-tropical diversification rates, leading to dynamic young communities outside of the tropics, while relative stability in tropical climes led to older, slower-evolving but still species-rich communities.
Collapse
|
9
|
Investigating the gene expression profiles of rehabilitated Florida manatees (Trichechus manatus latirostris) following red tide exposure. PLoS One 2020; 15:e0234150. [PMID: 32614830 PMCID: PMC7331979 DOI: 10.1371/journal.pone.0234150] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Accepted: 05/19/2020] [Indexed: 01/14/2023] Open
Abstract
To investigate a Florida manatee (Trichechus manatus latirostris) mortality event following a red tide bloom in Southwest Florida, an RNA sequencing experiment was conducted. Gene expression changes in white blood cells were assessed in manatees rescued from a red tide affected area (n = 4) and a control group (n = 7) using RNA sequencing. The genes with the largest fold changes were compared between the two groups to identify molecular pathways related to cellular and disease processes. In total, 591 genes (false discovery rate <0.05) were differentially expressed in the red tide group. Of these, 158 were upregulated and 433 were downregulated. This suggests major changes in white blood cell composition following an exposure to red tide. The most highly upregulated gene, Osteoclast associated 2C immunoglobulin-like receptor (OSCAR), was upregulated 12-fold. This gene is involved in initiating the immune response and maintaining a role in adaptive and innate immunity. The most highly downregulated gene, Piccolo presynaptic cytomatrix protein (PCLO), was downregulated by a factor of 977-fold. This gene is associated with cognitive functioning and neurotransmitter release. Downregulation of this gene in other studies was associated with neuronal loss and neuron synapse dysfunction. Among the cellular pathways that were most affected, immune response, including inflammation, wounds and injuries, cell proliferation, and apoptosis were the most predominant. The pathway with the most differentially expressed genes was the immune response pathway with 98 genes involved, many of them downregulated. Assessing the changes in gene expression associated with red tide exposure enhances our understanding of manatee immune response to the red tide toxins and will aid in the development of red tide biomarkers.
Collapse
|
10
|
Estimating rates and patterns of diversification with incomplete sampling: a case study in the rosids. AMERICAN JOURNAL OF BOTANY 2020; 107:895-909. [PMID: 32519354 PMCID: PMC7384126 DOI: 10.1002/ajb2.1479] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 03/03/2020] [Indexed: 05/03/2023]
Abstract
PREMISE Recent advances in generating large-scale phylogenies enable broad-scale estimation of species diversification. These now common approaches typically are characterized by (1) incomplete species coverage without explicit sampling methodologies and/or (2) sparse backbone representation, and usually rely on presumed phylogenetic placements to account for species without molecular data. We used empirical examples to examine the effects of incomplete sampling on diversification estimation and provide constructive suggestions to ecologists and evolutionary biologists based on those results. METHODS We used a supermatrix for rosids and one well-sampled subclade (Cucurbitaceae) as empirical case studies. We compared results using these large phylogenies with those based on a previously inferred, smaller supermatrix and on a synthetic tree resource with complete taxonomic coverage. Finally, we simulated random and representative taxon sampling and explored the impact of sampling on three commonly used methods, both parametric (RPANDA and BAMM) and semiparametric (DR). RESULTS We found that the impact of sampling on diversification estimates was idiosyncratic and often strong. Compared to full empirical sampling, representative and random sampling schemes either depressed or inflated speciation rates, depending on methods and sampling schemes. No method was entirely robust to poor sampling, but BAMM was least sensitive to moderate levels of missing taxa. CONCLUSIONS We suggest caution against uncritical modeling of missing taxa using taxonomic data for poorly sampled trees and in the use of summary backbone trees and other data sets with high representative bias, and we stress the importance of explicit sampling methodologies in macroevolutionary studies.
Collapse
|
11
|
Nuclear phylogenomic analyses of asterids conflict with plastome trees and support novel relationships among major lineages. AMERICAN JOURNAL OF BOTANY 2020; 107:790-805. [PMID: 32406108 DOI: 10.1002/ajb2.1468] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2019] [Accepted: 02/26/2020] [Indexed: 05/14/2023]
Abstract
PREMISE Discordance between nuclear and organellar phylogenies (cytonuclear discordance) is a well-documented phenomenon at shallow evolutionary levels but has been poorly investigated at deep levels of plant phylogeny. Determining the extent of cytonuclear discordance across major plant lineages is essential not only for elucidating evolutionary processes, but also for evaluating the currently used framework of plant phylogeny, which is largely based on the plastid genome. METHODS We present a phylogenomic examination of a major angiosperm clade (Asteridae) based on sequence data from the nuclear, plastid, and mitochondrial genomes as a means of evaluating currently accepted relationships inferred from the plastome and exploring potential sources of genomic conflict in this group. RESULTS We recovered at least five instances of well-supported cytonuclear discordance concerning the placements of major asterid lineages (i.e., Ericales, Oncothecaceae, Aquifoliales, Cassinopsis, and Icacinaceae). We attribute this conflict to a combination of incomplete lineage sorting and hybridization, the latter supported in part by previously inferred whole-genome duplications. CONCLUSIONS Our results challenge several long-standing hypotheses of asterid relationships and have implications for morphological character evolution and for the importance of ancient whole-genome duplications in early asterid evolution. These findings also highlight the value of reevaluating broad-scale angiosperm and green-plant phylogeny with nuclear genomic data.
Collapse
|
12
|
Genetic relationships and polyploid origins in the Lippia alba complex. AMERICAN JOURNAL OF BOTANY 2020; 107:466-476. [PMID: 32115694 DOI: 10.1002/ajb2.1443] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2019] [Accepted: 12/04/2019] [Indexed: 06/10/2023]
Abstract
PREMISE Plant genomes vary in size and complexity due in part to polyploidization. Latitudinal analyses of polyploidy are biased toward floras of temperate regions, with much less research done in the tropics. Lippia alba has been described as a tropical polyploid complex with diploid, triploid, tetraploid, and hexaploid accessions. However, no data regarding relationships among the ploidal levels and their origins have been reported. Our goals are to clarify the relationships among accessions of Lippia alba and the origins of each ploidal level. METHODS We investigated 98 samples representing all five geographical regions of Brazil and all ploidal levels using microsatellite (SSR) allelic variation and DNA sequences of ITS and trnL-F. Nine morphological structures were analyzed from 33 herbarium samples, and the chemical compounds of 78 accessions were analyzed by GC-MS. RESULTS Genetic distance analysis, the alignment block pattern, as well as RAxML and Bayesian trees showed that accessions grouped by ploidal level. The triploids form a well-defined group that originated from a single group of diploids. The tetraploids and hexaploid grouped together in SSR and trnL-F analyses. The recovered groups agree with chemical data and morphology. CONCLUSIONS The accessions grouped by ploidal level. Only one origin of triploids from a single group of diploids was observed. The tetraploid origin is uncertain; however, it appears to have contributed to the origin of the hexaploid. This framework reveals linkages among the ploidal levels, providing new insights into the evolution of a polyploid complex of tropical plants.
Collapse
|
13
|
Access to RNA-sequencing data from 1,173 plant species: The 1000 Plant transcriptomes initiative (1KP). Gigascience 2019; 8:giz126. [PMID: 31644802 PMCID: PMC6808545 DOI: 10.1093/gigascience/giz126] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Revised: 08/08/2019] [Accepted: 09/28/2019] [Indexed: 11/12/2022] Open
Abstract
BACKGROUND The 1000 Plant transcriptomes initiative (1KP) explored genetic diversity by sequencing RNA from 1,342 samples representing 1,173 species of green plants (Viridiplantae). FINDINGS This data release accompanies the initiative's final/capstone publication on a set of 3 analyses inferring species trees, whole genome duplications, and gene family expansions. These and previous analyses are based on de novo transcriptome assemblies and related gene predictions. Here, we assess their data and assembly qualities and explain how we detected potential contaminations. CONCLUSIONS These data will be useful to plant and/or evolutionary scientists with interests in particular gene families, either across the green plant tree of life or in more focused lineages.
Collapse
|
14
|
Origin of angiosperms and the puzzle of the Jurassic gap. NATURE PLANTS 2019; 5:461-470. [PMID: 31061536 DOI: 10.1038/s41477-019-0421-0] [Citation(s) in RCA: 331] [Impact Index Per Article: 66.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2018] [Accepted: 04/02/2019] [Indexed: 05/19/2023]
Abstract
Angiosperms are by far the most species-rich clade of land plants, but their origin and early evolutionary history remain poorly understood. We reconstructed angiosperm phylogeny based on 80 genes from 2,881 plastid genomes representing 85% of extant families and all orders. With a well-resolved plastid tree and 62 fossil calibrations, we dated the origin of the crown angiosperms to the Upper Triassic, with major angiosperm radiations occurring in the Jurassic and Lower Cretaceous. This estimated crown age is substantially earlier than that of unequivocal angiosperm fossils, and the difference is here termed the 'Jurassic angiosperm gap'. Our time-calibrated plastid phylogenomic tree provides a highly relevant framework for future comparative studies of flowering plant evolution.
Collapse
|
15
|
Evolutionary insights from comparative transcriptome and transcriptome-wide coalescence analyses in Tetrastigma hemsleyanum. BMC PLANT BIOLOGY 2018; 18:208. [PMID: 30249188 PMCID: PMC6154912 DOI: 10.1186/s12870-018-1429-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2017] [Accepted: 09/17/2018] [Indexed: 05/22/2023]
Abstract
BACKGROUND Tetrastigma hemsleyanum is of great medicinal importance and used as a model system to address the evolutionary history of warm-temperate evergreen (WTE) forest biomes in East Asia over Neogene time scales. However, further studies on the neutral and adaptive divergence processes of T. hemsleyanum are currently impeded by a lack of genomic resources. In this study, we de novo assembled and annotated a reference transcriptome for two cpDNA lineages (Central-South-East vs. Southwest) of T. hemsleyanum. We further used comparative genomic and multilocus coalescent approaches to investigate the tempo and mode of lineage diversification in T. hemsleyanum. RESULTS A total of 52,838 and 65,197 unigenes with an N50 of 1,667 and 1,841 bp for Central-South-East (CSE) and Southwest (SW) lineages, respectively, were recovered, and 6,692 putative orthologs were identified between the two lineages. Estimation of Ka/Ks ratios for these orthologs revealed that ten genes had Ka/Ks values significantly greater than 0.5 (P < 0.05), whereas 2,099 (Ka/Ks < 0.5, P < 0.05) were inferred to be under purifying selection. Based on three bioinformatic strategies, we identified a total of 1,018 single-copy nuclear genes (SCNGs) from the orthologs. We successfully designed eight nuclear gene primer pairs with high intraspecific variation (e.g. hT = 0.923, πT = 1.68×10-3), when surveyed across a subset of T. hemsleyanum individuals. Concordant with the previous cpDNA data, the haplotype networks constructed for most nuclear gene loci clearly identified the two lineages. A multilocus coalescence analysis suggested that the separation between the two lineages appears to have occurred during the mid-Pliocene. Despite their ancient divergence, both lineages experienced expansion at rather localized scales and have continued to exchange genes at a low rate. CONCLUSIONS This study demonstrated the utility of transcriptome sequencing as a basis for SCNG development in non-model species and the advantages of integrating multiple nuclear loci for phylogeographic and phylogenetic studies.
Collapse
|
16
|
Genome-wide association analysis of common genetic variants of resistant hypertension. THE PHARMACOGENOMICS JOURNAL 2018; 19:295-304. [PMID: 30237584 DOI: 10.1038/s41397-018-0049-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Revised: 08/02/2018] [Accepted: 08/10/2018] [Indexed: 12/24/2022]
Abstract
Resistant hypertension (RHTN), defined as uncontrolled blood pressure (BP) ≥ 140/90 using three or more drugs or controlled BP (<140/90) using four or more drugs, is associated with adverse outcomes, including decline in kidney function. We conducted a genome-wide association analysis in 1194 White and Hispanic participants with hypertension and coronary artery disease from the INternational VErapamil-SR Trandolapril STudy-GENEtic Substudy (INVEST-GENES). Top variants associated with RHTN at p < 10-4 were tested for replication in 585 White and Hispanic participants with hypertension and subcortical strokes from the Secondary Prevention of Subcortical Strokes GENEtic Substudy (SPS3-GENES). A genetic risk score for RHTN was created by summing the risk alleles of replicated RHTN signals. rs11749255 in MSX2 was associated with RHTN in INVEST (odds ratio (OR) (95% CI) = 1.50 (1.2-1.8), p = 7.3 × 10-5) and replicated in SPS3 (OR = 2.0 (1.4-2.8), p = 4.3 × 10-5), with genome-wide significance in meta-analysis (OR = 1.60 (1.3-1.9), p = 3.8 × 10-8). Other replicated signals were in IFLTD1 and PTPRD. IFLTD1 rs6487504 was associated with RHTN in INVEST (OR = 1.90 (1.4-2.5), p = 1.1 × 10-5) and SPS3 (OR = 1.70 (1.2-2.5), p = 4 × 10-3). PTPRD rs324498, a previously reported RHTN signal, was among the top signals in INVEST (OR = 1.60 (1.3-2.0), p = 3.4 × 10-5) and replicated in SPS3 (OR = 1.60 (1.1-2.4), one-sided p = 0.005). Participants with the highest number of risk alleles were at increased risk of RHTN compared to participants with a lower number (p-trend = 1.8 × 10-15). Overall, we identified and replicated associations with RHTN in the MSX2, IFLTD1, and PTPRD regions, and combined these associations to create a genetic risk score.
Collapse
|
17
|
Methods for exploring the plant tree of life. APPLICATIONS IN PLANT SCIENCES 2018; 6:e1039. [PMCID: PMC5895194 DOI: 10.1002/aps3.1039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2018] [Accepted: 03/15/2018] [Indexed: 05/24/2023]
|
18
|
Linking genome signatures of selection and adaptation in non-model plants: exploring potential and limitations in the angiosperm Amborella. CURRENT OPINION IN PLANT BIOLOGY 2018; 42:81-89. [PMID: 29694936 DOI: 10.1016/j.pbi.2018.04.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2018] [Revised: 03/24/2018] [Accepted: 04/02/2018] [Indexed: 06/08/2023]
Abstract
Selective sweeps may be caused by environmental conditions that select for a gene function or trait at one locus, causing reduced variation at neighboring sites due to linkage, with specific non-selected variants being swept along with the selected variant. For many species, genomic and environmental data are available to test hypotheses that environmental conditions are correlated with selected regions. Most genomic studies relating selection to environment use model organisms or crop species; typically, these studies have genomic data from large numbers of individuals and extensive environmental data. Here, we review studies associating selective sweeps with environment and consider the impediments to successful application of these methods to non-model species. We present an initial investigation into linking genomic regions of selection to environmental conditions in the narrowly distributed, non-model plant Amborella trichopoda (Amborellaceae), the sister species to all other living flowering plants and one of over 2500 plant species endemic to New Caledonia.
Collapse
|
19
|
Plastid phylogenomic analysis of green plants: A billion years of evolutionary history. AMERICAN JOURNAL OF BOTANY 2018; 105:291-301. [PMID: 29603143 DOI: 10.1002/ajb2.1048] [Citation(s) in RCA: 160] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2017] [Accepted: 11/13/2017] [Indexed: 05/18/2023]
Abstract
PREMISE OF THE STUDY For the past one billion years, green plants (Viridiplantae) have dominated global ecosystems, yet many key branches in their evolutionary history remain poorly resolved. Using the largest analysis of Viridiplantae based on plastid genome sequences to date, we examined the phylogeny and implications for morphological evolution at key nodes. METHODS We analyzed amino acid sequences from protein-coding genes from complete (or nearly complete) plastomes for 1879 taxa, including representatives across all major clades of Viridiplantae. Much of the data used was derived from transcriptomes from the One Thousand Plants Project (1KP); other data were taken from GenBank. KEY RESULTS Our results largely agree with previous plastid-based analyses. Noteworthy results include (1) the position of Zygnematophyceae as sister to land plants (Embryophyta), (2) a bryophyte clade (hornworts, mosses + liverworts), (3) Equisetum + Psilotaceae as sister to Marattiales + leptosporangiate ferns, (4) cycads + Ginkgo as sister to the remaining extant gymnosperms, within which Gnetophyta are placed within conifers as sister to non-Pinaceae (Gne-Cup hypothesis), and (5) Amborella, followed by water lilies (Nymphaeales), as successive sisters to all other extant angiosperms. Within angiosperms, there is support for Mesangiospermae, a clade that comprises magnoliids, Chloranthales, monocots, Ceratophyllum, and eudicots. The placements of Ceratophyllum and Dilleniaceae remain problematic. Within Pentapetalae, two major clades (superasterids and superrosids) are recovered. CONCLUSIONS This plastid data set provides an important resource for elucidating morphological evolution, dating divergence times in Viridiplantae, comparisons with emerging nuclear phylogenies, and analyses of molecular evolutionary patterns and dynamics of the plastid genome.
Collapse
|
20
|
Deep reticulation and incomplete lineage sorting obscure the diploid phylogeny of rain-lilies and allies (Amaryllidaceae tribe Hippeastreae). Mol Phylogenet Evol 2017; 111:231-247. [PMID: 28390909 DOI: 10.1016/j.ympev.2017.04.003] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Revised: 03/31/2017] [Accepted: 04/03/2017] [Indexed: 12/23/2022]
Abstract
Hybridization is a frequent and important force in plant evolution. Next-generation sequencing (NGS) methods offer new possibilities for clade resolution and ambitious sampling of gene genealogies, yet difficulty remains in detecting deep reticulation events using currently available methods. We reconstructed the phylogeny of diploid representatives of Amaryllidaceae tribe Hippeastreae to test the hypothesis of ancient hybridizations preceding the radiation of its major subclade, Hippeastrinae. Through hybrid enrichment of DNA libraries and NGS, we obtained data for 18 nuclear loci through a curated assembly approach and nearly complete plastid genomes for 35 ingroup taxa plus 5 outgroups. Additionally, we obtained alignments for 39 loci through an automated assembly algorithm. These data were analyzed with diverse phylogenetic methods, including concatenation, coalescence-based species tree estimation, Bayesian concordance analysis, and network reconstructions, to provide insights into the evolutionary relationships of Hippeastreae. Causes for gene tree heterogeneity and cytonuclear discordance were examined through a Bayesian posterior predictive approach (JML) and coalescent simulations. Two major clades were found, Hippeastrinae and Traubiinae, as previously reported. Our results suggest the presence of two major nuclear lineages in Hippeastrinae characterized by different chromosome numbers: (1) Tocantinia and Hippeastrum with 2n=22, and (2) Eithea, Habranthus, Rhodophiala, and Zephyranthes mostly with 2n=12, 14, and 18. Strong cytonuclear discordance was confirmed in Hippeastrinae, and a network scenario with at least six hybridization events is proposed to reconcile nuclear and plastid signals, along a backbone that may also have been affected by incomplete lineage sorting at the base of each major subclade.
Collapse
|
21
|
The report of my death was an exaggeration: A review for researchers using microsatellites in the 21st century. APPLICATIONS IN PLANT SCIENCES 2016; 4:apps1600025. [PMID: 27347456 PMCID: PMC4915923 DOI: 10.3732/apps.1600025] [Citation(s) in RCA: 91] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Accepted: 05/25/2016] [Indexed: 05/19/2023]
Abstract
Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers.
Collapse
|
22
|
A new resource for the development of SSR markers: Millions of loci from a thousand plant transcriptomes. APPLICATIONS IN PLANT SCIENCES 2016; 4:apps1600024. [PMID: 27347455 PMCID: PMC4915922 DOI: 10.3732/apps.1600024] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 05/09/2016] [Indexed: 05/29/2023]
Abstract
PREMISE OF THE STUDY The One Thousand Plant Transcriptomes Project (1KP, 1000+ assembled plant transcriptomes) provides an enormous resource for developing microsatellite loci across the plant tree of life. We developed loci from these transcriptomes and tested their utility. METHODS AND RESULTS Using software packages and custom scripts, we identified microsatellite loci in 1KP transcriptomes. We assessed the potential for cross-amplification and whether loci were biased toward exons, as compared to markers derived from genomic DNA. We characterized over 5.7 million simple sequence repeat (SSR) loci from 1334 plant transcriptomes. Eighteen percent of loci substantially overlapped with open reading frames (ORFs), and electronic PCR revealed that over half the loci would amplify successfully in conspecific taxa. Transcriptomic SSRs were approximately three times more likely to map to translated regions than genomic SSRs. CONCLUSIONS We believe microsatellites still have a place in the genomic age-they remain effective and cost-efficient markers. The loci presented here are a valuable resource for researchers.
Collapse
|
23
|
A new resource for the development of SSR markers: Millions of loci from a thousand plant transcriptomes. APPLICATIONS IN PLANT SCIENCES 2016. [PMID: 27347455 DOI: 10.5061/dryad.rb7h0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
PREMISE OF THE STUDY The One Thousand Plant Transcriptomes Project (1KP, 1000+ assembled plant transcriptomes) provides an enormous resource for developing microsatellite loci across the plant tree of life. We developed loci from these transcriptomes and tested their utility. METHODS AND RESULTS Using software packages and custom scripts, we identified microsatellite loci in 1KP transcriptomes. We assessed the potential for cross-amplification and whether loci were biased toward exons, as compared to markers derived from genomic DNA. We characterized over 5.7 million simple sequence repeat (SSR) loci from 1334 plant transcriptomes. Eighteen percent of loci substantially overlapped with open reading frames (ORFs), and electronic PCR revealed that over half the loci would amplify successfully in conspecific taxa. Transcriptomic SSRs were approximately three times more likely to map to translated regions than genomic SSRs. CONCLUSIONS We believe microsatellites still have a place in the genomic age-they remain effective and cost-efficient markers. The loci presented here are a valuable resource for researchers.
Collapse
|
24
|
Are microsatellite fragment lengths useful for population-level studies? The case of Polygala lewtonii (Polygalaceae). APPLICATIONS IN PLANT SCIENCES 2016; 4:apps1500115. [PMID: 26949579 PMCID: PMC4760751 DOI: 10.3732/apps.1500115] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2015] [Accepted: 01/05/2016] [Indexed: 05/10/2023]
Abstract
PREMISE OF THE STUDY Microsatellites, despite being commonly used population-level markers, contain biases because scoring relies solely on fragment length. Their complexity can lead to homoplasy, the effects of which are poorly understood. We investigated the impact of using fragment lengths, repeats, or flanking region sequences on common population-level analyses. METHODS Five polymorphic microsatellite markers amplified across the central Florida scrub endemic Polygala lewtonii (Polygalaceae) and its close, widespread congener P. polygama. We genotyped 147 individuals of P. lewtonii and 156 of P. polygama, and sequenced the amplicons of four markers across all individuals. We ran basic statistics, spatial clustering analysis, historical demographics, and migration tests. RESULTS One population of intermediate morphology was genetically clearly identified as P. polygama, making it the southernmost population of this species. Statistics were comparable between the fragment length and repeat numbers, with some notable differences. Flanking regions exhibited surprisingly high polymorphism between species, and between geographically distant conspecific populations. DISCUSSION The increasing use of markers developed in one species and amplified in another is only a good practice if precautions are taken, notably the sequencing of the fragments between species and populations. Flanking region sequences are a useful marker at the interspecific level.
Collapse
|
25
|
Abstract
Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-density coverage (∼0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, ribosomal DNA, and simple repeats) and protein-coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants.
Collapse
|
26
|
Population genetic variation, geographic structure, and multiple origins of autopolyploidy in Galax urceolata. AMERICAN JOURNAL OF BOTANY 2015; 102:973-82. [PMID: 26101421 DOI: 10.3732/ajb.1400554] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2014] [Accepted: 04/29/2015] [Indexed: 05/20/2023]
Abstract
PREMISE OF THE STUDY Whereas population genetic studies have examined allopolyploids, comparable studies of naturally occurring autopolyploids remain rare. To address fundamental questions regarding autopolyploidy, we undertook a detailed population genetic study of one of the classic examples of autopolyploidy, Galax urceolata (Diapensiaceae), which comprises diploid, triploid, and autotetraploid cytotypes. Galax is endemic to the Appalachian Mountains, the adjacent piedmont, sandhills, and coastal plain and represents perhaps the most widely known example of autopolyploidy in nature. METHODS Flow cytometry was used to diagnose ploidal level of ∼1000 individuals across 71 populations. We used 10 microsatellite markers to examine genetic variation across the geographic range of Galax and assessed multiple origins though comparisons of diploid, triploid, and tetraploid accessions using multiple analytical approaches. KEY RESULTS Tetraploids had higher levels of heterozygosity than diploids did. Genetic variation in diploid and tetraploid Galax is geographically structured among the ecoregions of the southeastern United States. Autotetraploidy in Galax urceolata has occurred independently at least 46 times, with triploidy having occurred a minimum of 31 times. CONCLUSIONS Genetic differentiation among ecoregions suggests historical patterns of local adaptation. The numerous independent origins of tetraploid Galax reported here are among the highest frequencies of independent polyploidizations ever reported for any polyploid (auto- or allopolyploid).
Collapse
|
27
|
Modified CTAB and TRIzol protocols improve RNA extraction from chemically complex Embryophyta. APPLICATIONS IN PLANT SCIENCES 2015; 3:apps1400105. [PMID: 25995975 PMCID: PMC4435465 DOI: 10.3732/apps.1400105] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 03/03/2015] [Indexed: 05/24/2023]
Abstract
PREMISE OF THE STUDY Here we present a series of protocols for RNA extraction across a diverse array of plants; we focus on woody, aromatic, aquatic, and other chemically complex taxa. METHODS AND RESULTS Ninety-one taxa were subjected to RNA extraction with three methods presented here: (1) TRIzol/TURBO DNA-free kits using the manufacturer's protocol with the addition of sarkosyl; (2) a combination method using cetyltrimethylammonium bromide (CTAB) and TRIzol/sarkosyl/TURBO DNA-free; and (3) a combination of CTAB and QIAGEN RNeasy Plant Mini Kit. Bench-ready protocols are given. CONCLUSIONS After an iterative process of working with chemically complex taxa, we conclude that the use of TRIzol supplemented with sarkosyl and the TURBO DNA-free kit is an effective, efficient, and robust method for obtaining RNA from 100 mg of leaf tissue of land plant species (Embryophyta) examined. Our protocols can be used to provide RNA of suitable stability, quantity, and quality for transcriptome sequencing.
Collapse
|
28
|
The Phenotypic and Genetic Underpinnings of Flower Size in Polemoniaceae. FRONTIERS IN PLANT SCIENCE 2015; 6:1144. [PMID: 26779209 PMCID: PMC4700140 DOI: 10.3389/fpls.2015.01144] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Accepted: 12/02/2015] [Indexed: 05/08/2023]
Abstract
Corolla length is a labile flower feature and has strong implications for pollinator success. However, the phenotypic and genetic bases of corolla elongation are not well known, largely due to a lack of good candidate genes for potential genetic exploration and functional work. We investigate both the cellular phenotypic differences in corolla length, as well as the genetic control of this trait, in Saltugilia (Polemoniaceae). Taxa in this clade exhibit a large range of flower sizes and differ dramatically in pollinator guilds. Flowers of each species were collected from multiple individuals during four stages of flower development to ascertain if cell number or cell size is more important in determining flower size. In Saltugilia, increased flower size during development appears to be driven more by cell size than cell number. Differences in flower size between species are governed by both cell size and cell number, with the large-flowered S. splendens subsp. grantii having nearly twice as many cells as the small-flowered species. Fully mature flowers of all taxa contain jigsaw cells similar to cells seen in sepals and leaves; however, these cells are not typically found in the developing flowers of most species. The proportion of this cell type in mature flowers appears to have substantial implications, comprising 17-68% of the overall flower size. To identify candidate genes responsible for differences in cell area and cell type, transcriptomes were generated for two individuals of the species with the smallest (S. australis) and largest (S. splendens subsp. grantii) flowers across the same four developmental stages visualized with confocal microscopy. Analyses identified genes associated with cell wall formation that are up-regulated in the mature flower stage compared to mid-stage flowers (75% of mature size). This developmental change is associated with the origin of jigsaw cells in the corolla tube of mature flowers. Further comparisons between mature flowers in the two species revealed 354 transcripts that are up-regulated in the large-flowered S. splendens subsp. grantii compared to the small-flowered S. australis. These results are likely broadly applicable to Polemoniaceae, a clade of nearly 400 species, with extensive variation in floral form and shape.
Collapse
|
29
|
Abstract
Reconstructing the origin and evolution of land plants and their algal relatives is a fundamental problem in plant phylogenetics, and is essential for understanding how critical adaptations arose, including the embryo, vascular tissue, seeds, and flowers. Despite advances in molecular systematics, some hypotheses of relationships remain weakly resolved. Inferring deep phylogenies with bouts of rapid diversification can be problematic; however, genome-scale data should significantly increase the number of informative characters for analyses. Recent phylogenomic reconstructions focused on the major divergences of plants have resulted in promising but inconsistent results. One limitation is sparse taxon sampling, likely resulting from the difficulty and cost of data generation. To address this limitation, transcriptome data for 92 streptophyte taxa were generated and analyzed along with 11 published plant genome sequences. Phylogenetic reconstructions were conducted using up to 852 nuclear genes and 1,701,170 aligned sites. Sixty-nine analyses were performed to test the robustness of phylogenetic inferences to permutations of the data matrix or to phylogenetic method, including supermatrix, supertree, and coalescent-based approaches, maximum-likelihood and Bayesian methods, partitioned and unpartitioned analyses, and amino acid versus DNA alignments. Among other results, we find robust support for a sister-group relationship between land plants and one group of streptophyte green algae, the Zygnematophyceae. Strong and robust support for a clade comprising liverworts and mosses is inconsistent with a widely accepted view of early land plant evolution, and suggests that phylogenetic hypotheses used to understand the evolution of fundamental plant traits should be reevaluated.
Collapse
|
30
|
Are polyploids really evolutionary dead-ends (again)? A critical reappraisal of Mayrose et al. (). THE NEW PHYTOLOGIST 2014; 202:1105-1117. [PMID: 24754325 DOI: 10.1111/nph.12756] [Citation(s) in RCA: 103] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
|
31
|
Population genetic structure, genetic diversity, and natural history of the South American species of Nothofagus subgenus Lophozonia (Nothofagaceae) inferred from nuclear microsatellite data. Ecol Evol 2014; 4:2450-71. [PMID: 25360279 PMCID: PMC4203291 DOI: 10.1002/ece3.1108] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2013] [Revised: 04/14/2014] [Accepted: 04/23/2014] [Indexed: 11/07/2022] Open
Abstract
The effect of glaciation on the levels and patterns of genetic variation has been well studied in the Northern Hemisphere. However, although glaciation has undoubtedly shaped the genetic structure of plants in the Southern Hemisphere, fewer studies have characterized the effect, and almost none of them using microsatellites. Particularly, complex patterns of genetic structure might be expected in areas such as the Andes, where both latitudinal and altitudinal glacial advance and retreat have molded modern plant communities. We therefore studied the population genetics of three closely related, hybridizing species of Nothofagus (N. obliqua, N. alpina, and N. glauca, all of subgenus Lophozonia; Nothofagaceae) from Chile. To estimate population genetic parameters and infer the influence of the last ice age on the spatial and genetic distribution of these species, we examined and analyzed genetic variability at seven polymorphic microsatellite DNA loci in 640 individuals from 40 populations covering most of the ranges of these species in Chile. Populations showed no significant inbreeding and exhibited relatively high levels of genetic diversity (HE = 0.502–0.662) and slight, but significant, genetic structure (RST = 8.7–16.0%). However, in N. obliqua, the small amount of genetic structure was spatially organized into three well-defined latitudinal groups. Our data may also suggest some introgression of N. alpina genes into N. obliqua in the northern populations. These results allowed us to reconstruct the influence of the last ice age on the genetic structure of these species, suggesting several centers of genetic diversity for N. obliqua and N. alpina, in agreement with the multiple refugia hypothesis.
Collapse
|
32
|
From algae to angiosperms-inferring the phylogeny of green plants (Viridiplantae) from 360 plastid genomes. BMC Evol Biol 2014; 14:23. [PMID: 24533922 PMCID: PMC3933183 DOI: 10.1186/1471-2148-14-23] [Citation(s) in RCA: 317] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Accepted: 01/13/2014] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND Next-generation sequencing has provided a wealth of plastid genome sequence data from an increasingly diverse set of green plants (Viridiplantae). Although these data have helped resolve the phylogeny of numerous clades (e.g., green algae, angiosperms, and gymnosperms), their utility for inferring relationships across all green plants is uncertain. Viridiplantae originated 700-1500 million years ago and may comprise as many as 500,000 species. This clade represents a major source of photosynthetic carbon and contains an immense diversity of life forms, including some of the smallest and largest eukaryotes. Here we explore the limits and challenges of inferring a comprehensive green plant phylogeny from available complete or nearly complete plastid genome sequence data. RESULTS We assembled protein-coding sequence data for 78 genes from 360 diverse green plant taxa with complete or nearly complete plastid genome sequences available from GenBank. Phylogenetic analyses of the plastid data recovered well-supported backbone relationships and strong support for relationships that were not observed in previous analyses of major subclades within Viridiplantae. However, there also is evidence of systematic error in some analyses. In several instances we obtained strongly supported but conflicting topologies from analyses of nucleotides versus amino acid characters, and the considerable variation in GC content among lineages and within single genomes affected the phylogenetic placement of several taxa. CONCLUSIONS Analyses of the plastid sequence data recovered a strongly supported framework of relationships for green plants. This framework includes: i) the placement of Zygnematophyceace as sister to land plants (Embryophyta), ii) a clade of extant gymnosperms (Acrogymnospermae) with cycads + Ginkgo sister to remaining extant gymnosperms and with gnetophytes (Gnetophyta) sister to non-Pinaceae conifers (Gnecup trees), and iii) within the monilophyte clade (Monilophyta), Equisetales + Psilotales are sister to Marattiales + leptosporangiate ferns. Our analyses also highlight the challenges of using plastid genome sequences in deep-level phylogenomic analyses, and we provide suggestions for future analyses that will likely incorporate plastid genome sequence data for thousands of species. We particularly emphasize the importance of exploring the effects of different partitioning and character coding strategies.
Collapse
|
33
|
|
34
|
Abstract
The 1,000 plants (1KP) project is an international multi-disciplinary consortium that has generated transcriptome data from over 1,000 plant species, with exemplars for all of the major lineages across the Viridiplantae (green plants) clade. Here, we describe how to access the data used in a phylogenomics analysis of the first 85 species, and how to visualize our gene and species trees. Users can develop computational pipelines to analyse these data, in conjunction with data of their own that they can upload. Computationally estimated protein-protein interactions and biochemical pathways can be visualized at another site. Finally, we comment on our future plans and how they fit within this scalable system for the dissemination, visualization, and analysis of large multi-species data sets.
Collapse
|
35
|
A targeted enrichment strategy for massively parallel sequencing of angiosperm plastid genomes. APPLICATIONS IN PLANT SCIENCES 2013; 1:apps1200497. [PMID: 25202518 PMCID: PMC4105372 DOI: 10.3732/apps.1200497] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2012] [Accepted: 12/09/2012] [Indexed: 05/02/2023]
Abstract
PREMISE OF THE STUDY We explored a targeted enrichment strategy to facilitate rapid and low-cost next-generation sequencing (NGS) of numerous complete plastid genomes from across the phylogenetic breadth of angiosperms. • METHODS AND RESULTS A custom RNA probe set including the complete sequences of 22 previously sequenced eudicot plastomes was designed to facilitate hybridization-based targeted enrichment of eudicot plastid genomes. Using this probe set and an Agilent SureSelect targeted enrichment kit, we conducted an enrichment experiment including 24 angiosperms (22 eudicots, two monocots), which were subsequently sequenced on a single lane of the Illumina GAIIx with single-end, 100-bp reads. This approach yielded nearly complete to complete plastid genomes with exceptionally high coverage (mean coverage: 717×), even for the two monocots. • CONCLUSIONS Our enrichment experiment was highly successful even though many aspects of the capture process employed were suboptimal. Hence, significant improvements to this methodology are feasible. With this general approach and probe set, it should be possible to sequence more than 300 essentially complete plastid genomes in a single Illumina GAIIx lane (achieving ∼50× mean coverage). However, given the complications of pooling numerous samples for multiplex sequencing and the limited number of barcodes (e.g., 96) available in commercial kits, we recommend 96 samples as a current practical maximum for multiplex plastome sequencing. This high-throughput approach should facilitate large-scale plastid genome sequencing at any level of phylogenetic diversity in angiosperms.
Collapse
|
36
|
Abstract
PREMISE OF THE STUDY Recent analyses employing up to five genes have provided numerous insights into angiosperm phylogeny, but many relationships have remained unresolved or poorly supported. In the hope of improving our understanding of angiosperm phylogeny, we expanded sampling of taxa and genes beyond previous analyses. METHODS We conducted two primary analyses based on 640 species representing 330 families. The first included 25260 aligned base pairs (bp) from 17 genes (representing all three plant genomes, i.e., nucleus, plastid, and mitochondrion). The second included 19846 aligned bp from 13 genes (representing only the nucleus and plastid). KEY RESULTS Many important questions of deep-level relationships in the nonmonocot angiosperms have now been resolved with strong support. Amborellaceae, Nymphaeales, and Austrobaileyales are successive sisters to the remaining angiosperms (Mesangiospermae), which are resolved into Chloranthales + Magnoliidae as sister to Monocotyledoneae + [Ceratophyllaceae + Eudicotyledoneae]. Eudicotyledoneae contains a basal grade subtending Gunneridae. Within Gunneridae, Gunnerales are sister to the remainder (Pentapetalae), which comprises (1) Superrosidae, consisting of Rosidae (including Vitaceae) and Saxifragales; and (2) Superasteridae, comprising Berberidopsidales, Santalales, Caryophyllales, Asteridae, and, based on this study, Dilleniaceae (although other recent analyses disagree with this placement). Within the major subclades of Pentapetalae, most deep-level relationships are resolved with strong support. CONCLUSIONS Our analyses confirm that with large amounts of sequence data, most deep-level relationships within the angiosperms can be resolved. We anticipate that this well-resolved angiosperm tree will be of broad utility for many areas of biology, including physiology, ecology, paleobiology, and genomics.
Collapse
|
37
|
Microsatellite marker development for the federally listed Prunus geniculata (Rosaceae). AMERICAN JOURNAL OF BOTANY 2011; 98:e58-e60. [PMID: 21613125 DOI: 10.3732/ajb.1000435] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
PREMISE OF THE STUDY Prunus geniculata is a federally listed species endemic to the central Florida scrub. To investigate levels and geographic partitioning of genetic variation in this species, we developed microsatellite markers for P. geniculata and its sister species, P. maritima. METHODS AND RESULTS Following a protocol optimized in our laboratory, we designed 36 primer pairs, of which 8 amplified across both species and showed informative polymorphism. Product size varied from 163 to 370 bp, and the number of alleles ranged from 5 to 27. He across all populations ranged from 0.604 to 0.9155. CONCLUSIONS Eight loci were useful at the population level and will be used for conservation genetic and phylogeographic studies of Prunus geniculata and P. maritima. Furthermore, from five to all eight of these loci can also be amplified in several other North American Prunus species.
Collapse
|
38
|
Isolation and characterization of novel microsatellite markers for Arctium minus (Compositae). AMERICAN JOURNAL OF BOTANY 2010; 97:e4-e6. [PMID: 21622378 DOI: 10.3732/ajb.0900376] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
PREMISE OF THE STUDY Microsatellite primers were developed for the invasive plant Arctium minus to investigate the effects of facultative self-pollination and the biannual habit on population genetic structure, as well as the colonization of the Americas by this Eurasian species. • METHODS AND RESULTS Sixteen di- and trinucleotide microsatellite loci were identified in six populations. The number of alleles per locus ranged from one to 10, observed heterozygosities ranged from 0.000 to 0.897, and the mean value of F(IS) was 0.316. • CONCLUSIONS These results indicate the utility of these loci in future studies of population genetics in A. minus.
Collapse
|
39
|
|
40
|
The utility of amplified fragment length polymorphisms in phylogenetics: a comparison of homology within and between genomes. Syst Biol 2007; 56:477-84. [PMID: 17562471 DOI: 10.1080/10635150701427077] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
The amplified fragment length polymorphism (AFLP) technique is being increasingly used in phylogenetic studies, especially in groups of rapidly radiating taxa. One of the key issues in the phylogenetic suitability of this technique is whether the DNA fragments generated via the AFLP method are homologous within and among the taxa being studied. We used a bioinformatics approach to assess homology based on both chromosomal location and sequence similarity of AFLP fragments. The AFLP technique was electronically simulated on genomes from eight organisms that represented a range of genome sizes. The results demonstrated that within a genome, the number of fragments is positively associated with genome size, and the degree of homology decreases with increasing numbers of fragments generated. The average homology of fragments was 89% for small genomes (< 400 Mb) but decreased to 59% for large genomes (> 2 Gb). Fragment homology for large genomes can be increased by excluding smaller fragments, although there is no clear upper limit for the size of fragments to exclude. A second approach is to increase the number of selective nucleotides in the final selective amplification step. For strains of the same organism, homology based on chromosome location and sequence similarity of fragments was 100%. Fragment homology for more distantly related taxa, however, decreased with greater time since divergence. We conclude that AFLP data are best suited for examining phylogeographic patterns within species and among very recently diverged species.
Collapse
|
41
|
Molecular phylogeny, biogeography, and systematics of Dicerandra (Lamiaceae), a genus endemic to the southeastern United States. AMERICAN JOURNAL OF BOTANY 2007; 94:1017-1027. [PMID: 21636471 DOI: 10.3732/ajb.94.6.1017] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Dicerandra, an endemic mint of the southeastern United States, comprises nine species, all of which are threatened or endangered and restricted to sandhill vegetation and a mosaic of scrub habitats. Molecular phylogenetic analyses of Dicerandra based on data from the nuclear and plastid genomes for all 13 taxa of the genus, identified two strongly supported clades, corresponding to the four annual and to the five perennial species of Dicerandra. However, the nuclear and plastid trees were incongruent in their placement of two perennial taxa, D. cornutissima and D. immaculata var. savannarum, perhaps due to ancient hybridization or to lineage sorting. Based on these analyses, the widespread D. linearifolia is not monophyletic, with populations of D. linearifolia var. linearifolia falling into either western or eastern clades. The western clade, comprising populations of D. linearifolia var. linearifolia and var. robustior, occurs in an area drained by rivers flowing toward the Gulf of Mexico, whereas the eastern clade, comprising populations of D. linearifolia var. linearifolia, D. densiflora, D. odoratissima, and D. radfordiana (i.e., all the annual species), occupies a region drained by rivers flowing to the Atlantic Ocean. Although this pattern of genetic differentiation between populations from these two river drainages has been documented in several animal species, it has not previously been reported for plants. A revised subgeneric classification is presented to reflect the annual and perennial clades.
Collapse
|
42
|
Patterns of genetic variation in rare and widespread plant congeners. AMERICAN JOURNAL OF BOTANY 2000. [PMID: 10860909 DOI: 10.2307/2656886] [Citation(s) in RCA: 270] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Rare species are typically considered to maintain low levels of genetic variation, and this view has been supported by several reviews of large numbers of isozyme studies. Although these reviews have provided valuable data on levels of variability in plant species in general, and rare species in particular, these broad overviews involve comparisons that may confound the effects of rarity with a multitude of other factors that affect genetic variability. Additionally, the statistical analyses employed assume the data to be independent, which is not the case for organisms that share a common phylogenetic history. As the role of evolutionary history and historical constraints has become better understood, more researchers have studied widespread congeners when investigating the genetic diversity of rare species in an effort to control for these effects. We summarize the available data from such studies, comparing for rare and widespread congeners (1) the levels of genetic variability at the population and species levels and (2) measures of population substructuring. At the population level, we summarized data for percentage polymorphic loci (%P(pop)), mean number of alleles per locus (A(pop)), and observed heterozygosity (H(o)). Species-level measures used were percentage polymorphic loci (%P(spp)), mean number of alleles per locus (A(spp)), and total genetic diversity (H(T)). Indices of population subdivision (either F(ST) or G(ST)) were also examined. Using Wilcoxon signed rank tests, we found significant, but small, differences between rare and widespread species for all diversity measures except H(T). However, there does not appear to be a difference between rare and widespread congeners in terms of how genetic variation is partitioned within and among populations. Levels of diversity, for all measures examined, between rare and widespread congeners are highly correlated.
Collapse
|
43
|
Genetics of Cronartium ribicola. IV. Population structure in western North America. ACTA ACUST UNITED AC 1998. [DOI: 10.1139/cjb-76-1-91] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
44
|
Genetics of Cronartium ribicola. IV. Population structure in western North America. ACTA ACUST UNITED AC 1998. [DOI: 10.1139/b97-167] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Population genetic parameters were estimated for six populations of Cronartium ribicola in western North America from British Columbia to the southern Sierra Nevada, and two outgroups from eastern North America, using isozyme, random amplified polymorphic DNA (RAPD) and restriction fragment length polymorphism (RFLP) markers on cultured haploid clones. Diversity was low, with only 8% polymorphism in the 212 markers identified. Each polymorphic locus had only two alleles, except for an RFLP marker in the ribosomal DNA complex with multiple alleles, that resulted from variable numbers of tandem repeats. Expected heterozygosity within populations, estimated from diploid teliospores, was only 0.025. The three types of markers were highly consistent with each other for these parameters. Yet, populations were highly differentiated; the proportion of the total variation attributable to differences among populations was 0.205. Multivariate statistical analysis as well as different clustering algorithms based on contrasting evolutionary assumptions (drift, mutation) all showed similar relationships and differences among populations. Genetic distances were not associated with geographic distances; western populations within a few kilometres of each other were often more distant from each other genetically than they were from eastern populations across the continent. The lack of pattern over the landscape of this metapopulation is consistent with aspects of the life cycle and epidemiological behavior of the pathogen, in which genetic drift appears to play a major role.Key words: white pine blister rust, isozymes, RAPDs, RFLPs, multivariate analysis, genetic drift.
Collapse
|
45
|
Abstract
Lack of genetic markers has hindered the study of the mating system of Cronartium ribicola, an exotic forest pathogen Meeting natural and cultivated white pines throughout North America. Isozymes, randomly amplified polymorphic DNA (RAPDs), and restriction length polymorphisms (RFLPs) were used to study the mating system of this rust. Heterozygosity (outcrossing) in diploid telia was demonstrated by analysis of cultures derived from the meiotic products (basidiospores) of individual telia. Families of basidiospores cultured from single telia were used to test for Mendelian segregation and for conformance of loci to Hardy–Weinberg equilibrium. A total of 18 polymorphic loci were identified with the three marker systems. All except for three RAPD loci showed Mendelian segregation in the single-telium families. To quantify the level of outcrossing, gene and genotype frequencies were calculated for families from a single population. Up to 24 families were surveyed with isozymes, 14 with RAPDs, and 18 with RFLPs. Except for one isozyme locus (MPI) in one sample, all 14 loci tested with these families were in Hardy–Weinberg equilibrium, indicating random mating. Further studies, with a different sample from the same population, showed all three isozyme loci to be in Hardy–Weinberg equilibrium. The three marker systems were consistent as to the amount of variation detected. Resistance selection and breeding programs must consider the implications of genetic recombination that outcrossing affords the rust. Keywords: isozymes, RAPDs, RFLPs, Hardy–Weinberg equilibrium, white pine blister rust.
Collapse
|