1
|
Genome-wide analysis of FRF gene family and functional identification of HvFRF9 under drought stress in barley. FRONTIERS IN PLANT SCIENCE 2024; 15:1347842. [PMID: 38328701 PMCID: PMC10847358 DOI: 10.3389/fpls.2024.1347842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 01/09/2024] [Indexed: 02/09/2024]
Abstract
FHY3 and its homologous protein FAR1 are the founding members of FRS family. They exhibited diverse and powerful physiological functions during evolution, and participated in the response to multiple abiotic stresses. FRF genes are considered to be truncated FRS family proteins. They competed with FRS for DNA binding sites to regulate gene expression. However, only few studies are available on FRF genes in plants participating in the regulation of abiotic stress. With wide adaptability and high stress-resistance, barley is an excellent candidate for the identification of stress-resistance-related genes. In this study, 22 HvFRFs were detected in barley using bioinformatic analysis from whole genome. According to evolution and conserved motif analysis, the 22 HvFRFs could be divided into subfamilies I and II. Most promoters of subfamily I members contained abscisic acid and methyl jasmonate response elements; however, a large number promoters of subfamily II contained gibberellin and salicylic acid response elements. HvFRF9, one of the members of subfamily II, exhibited a expression advantage in different tissues, and it was most significantly upregulated under drought stress. In-situ PCR revealed that HvFRF9 is mainly expressed in the root epidermal cells, as well as xylem and phloem of roots and leaves, indicating that HvFRF9 may be related to absorption and transportation of water and nutrients. The results of subcellular localization indicated that HvFRF9 was mainly expressed in the nuclei of tobacco epidermal cells and protoplast of arabidopsis. Further, transgenic arabidopsis plants with HvFRF9 overexpression were generated to verify the role of HvFRF9 in drought resistance. Under drought stress, leaf chlorosis and wilting, MDA and O2 - contents were significantly lower, meanwhile, fresh weight, root length, PRO content, and SOD, CAT and POD activities were significantly higher in HvFRF9-overexpressing arabidopsis plants than in wild-type plants. Therefore, overexpression of HvFRF9 could significantly enhance the drought resistance in arabidopsis. These results suggested that HvFRF9 may play a key role in drought resistance in barley by increasing the absorption and transportation of water and the activity of antioxidant enzymes. This study provided a theoretical basis for drought resistance in barley and provided new genes for drought resistance breeding.
Collapse
|
2
|
Plastid phylogenomics uncovers multiple species in Medicago truncatula (Fabaceae) germplasm accessions. Sci Rep 2022; 12:21172. [PMID: 36477422 PMCID: PMC9729603 DOI: 10.1038/s41598-022-25381-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Accepted: 11/29/2022] [Indexed: 12/12/2022] Open
Abstract
Medicago truncatula is a model legume that has been extensively investigated in diverse subdisciplines of plant science. Medicago littoralis can interbreed with M. truncatula and M. italica; these three closely related species form a clade, i.e. TLI clade. Genetic studies have indicated that M. truncatula accessions are heterogeneous but their taxonomic identities have not been verified. To elucidate the phylogenetic position of diverse M. truncatula accessions within the genus, we assembled 54 plastid genomes (plastomes) using publicly available next-generation sequencing data and conducted phylogenetic analyses using maximum likelihood. Five accessions showed high levels of plastid DNA polymorphism. Three of these highly polymorphic accessions contained sequences from both M. truncatula and M. littoralis. Phylogenetic analyses of sequences placed some accessions closer to distantly related species suggesting misidentification of source material. Most accessions were placed within the TLI clade and maximally supported the interrelationships of three subclades. Two Medicago accessions were placed within a M. italica subclade of the TLI clade. Plastomes with a 45-kb (rpl20-ycf1) inversion were placed within the M. littoralis subclade. Our results suggest that the M. truncatula accession genome pool represents more than one species due to possible mistaken identities and gene flow among closely related species.
Collapse
|
3
|
Born in the mitochondrion and raised in the nucleus: evolution of a novel tandem repeat family in Medicago polymorpha (Fabaceae). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:389-406. [PMID: 35061308 DOI: 10.1111/tpj.15676] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 01/13/2022] [Indexed: 06/14/2023]
Abstract
Plant nuclear genomes harbor sequence elements derived from the organelles (mitochondrion and plastid) through intracellular gene transfer (IGT). Nuclear genomes also show a dramatic range of repeat content, suggesting that any sequence can be readily amplified. These two aspects of plant nuclear genomes are well recognized but have rarely been linked. Through investigation of 31 Medicago taxa we detected exceptionally high post-IGT amplification of mitochondrial (mt) DNA sequences containing rps10 in the nuclear genome of Medicago polymorpha and closely related species. The amplified sequences were characterized as tandem arrays of five distinct repeat motifs (2157, 1064, 987, 971, and 587 bp) that have diverged from the mt genome (mitogenome) in the M. polymorpha nuclear genome. The mt rps10-like arrays were identified in seven loci (six intergenic and one telomeric) of the nuclear chromosome assemblies and were the most abundant tandem repeat family, representing 1.6-3.0% of total genomic DNA, a value approximately three-fold greater than the entire mitogenome in M. polymorpha. Compared to a typical mt gene, the mt rps10-like sequence coverage level was 691.5-7198-fold higher in M. polymorpha and closely related species. In addition to the post-IGT amplification, our analysis identified the canonical telomeric repeat and the species-specific satellite arrays that are likely attributable to an ancestral chromosomal fusion in M. polymorpha. A possible relationship between chromosomal instability and the mt rps10-like tandem repeat family in the M. polymorpha clade is discussed.
Collapse
|
4
|
The genome of a wild Medicago species provides insights into the tolerant mechanisms of legume forage to environmental stress. BMC Biol 2021; 19:96. [PMID: 33957908 PMCID: PMC8103640 DOI: 10.1186/s12915-021-01033-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 04/21/2021] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND Medicago ruthenica, a wild and perennial legume forage widely distributed in semi-arid grasslands, is distinguished by its outstanding tolerance to environmental stress. It is a close relative of commonly cultivated forage of alfalfa (Medicago sativa). The high tolerance of M. ruthenica to environmental stress makes this species a valuable genetic resource for understanding and improving traits associated with tolerance to harsh environments. RESULTS We sequenced and assembled genome of M. ruthenica using an integrated approach, including PacBio, Illumina, 10×Genomics, and Hi-C. The assembled genome was 904.13 Mb with scaffold N50 of 99.39 Mb, and 50,162 protein-coding genes were annotated. Comparative genomics and transcriptomic analyses were used to elucidate mechanisms underlying its tolerance to environmental stress. The expanded FHY3/FAR1 family was identified to be involved in tolerance of M. ruthenica to drought stress. Many genes involved in tolerance to abiotic stress were retained in M. ruthenica compared to other cultivated Medicago species. Hundreds of candidate genes associated with drought tolerance were identified by analyzing variations in single nucleotide polymorphism using accessions of M. ruthenica with varying tolerance to drought. Transcriptomic data demonstrated the involvements of genes related to transcriptional regulation, stress response, and metabolic regulation in tolerance of M. ruthenica. CONCLUSIONS We present a high-quality genome assembly and identification of drought-related genes in the wild species of M. ruthenica, providing a valuable resource for genomic studies on perennial legume forages.
Collapse
|
5
|
Phylogeny and Species Delimitation of Chinese Medicago (Leguminosae) and Its Relatives Based on Molecular and Morphological Evidence. FRONTIERS IN PLANT SCIENCE 2021; 11:619799. [PMID: 33584760 PMCID: PMC7874099 DOI: 10.3389/fpls.2020.619799] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 12/17/2020] [Indexed: 05/12/2023]
Abstract
Medicago and its relatives, Trigonella and Melilotus comprise the most important forage resources globally. The alfalfa selected from the wild relatives has been cultivated worldwide as the forage queen. In the Flora of China, 15 Medicago, eight Trigonella, and four Melilotus species are recorded, of which six Medicago and two Trigonella species are introduced. Although several studies have been conducted to investigate the phylogenetic relationship within the three genera, many Chinese naturally distributed or endemic species are not included in those studies. Therefore, the taxonomic identity and phylogenetic relationship of these species remains unclear. In this study, we collected samples representing 18 out of 19 Chinese naturally distributed species of these three genera and three introduced Medicago species, and applied an integrative approach by combining evidences from population-based morphological clusters and molecular data to investigate species boundaries. A total of 186 individuals selected from 156 populations and 454 individuals from 124 populations were collected for genetic and morphological analyses, respectively. We sequenced three commonly used DNA barcodes (trnH-psbA, trnK-matK, and ITS) and one nuclear marker (GA3ox1) for phylogenetic analyses. We found that 16 out of 21 species could be well delimited based on phylogenetic analyses and morphological clusters. Two Trigonella species may be merged as one species or treated as two subspecies, and Medicago falcata should be treated as a subspecies of the M. sativa complex. We further found that major incongruences between the chloroplast and nuclear trees mainly occurred among the deep diverging lineages, which may be resulted from hybridization, incomplete lineage sorting and/or sampling errors. Further studies involving a finer sampling of species associated with large scale genomic data should be employed to better understand the species delimitation of these three genera.
Collapse
|
6
|
MYB transcription factors in alfalfa ( Medicago sativa): genome-wide identification and expression analysis under abiotic stresses. PeerJ 2019; 7:e7714. [PMID: 31576246 PMCID: PMC6753925 DOI: 10.7717/peerj.7714] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Accepted: 08/21/2019] [Indexed: 12/13/2022] Open
Abstract
Background Alfalfa is the most widely cultivated forage legume and one of the most economically valuable crops in the world. Its survival and production are often hampered by environmental changes. However, there are few studies on stress-resistance genes in alfalfa because of its incomplete genomic information and rare expression profile data. The MYB proteins are characterized by a highly conserved DNA-binding domain, which is large, functionally diverse, and represented in all eukaryotes. The role of MYB proteins in plant development is essential; they function in diverse biological processes, including stress and defense responses, and seed and floral development. Studies on the MYB gene family have been reported in several species, but they have not been comprehensively analyzed in alfalfa. Methods To identify more comprehensive MYB transcription factor family genes, the sequences of 168 Arabidopsis thaliana, 430 Glycine max, 185 Medicago truncatula, and 130 Oryza sativa MYB proteins were downloaded from the Plant Transcription Factor Database. These sequences were used as queries in a BLAST search against the M. sativa proteome sequences provided by the Noble Research Institute. Results In the present study, a total of 265 MsMYB proteins were obtained, including 50 R1-MYB, 186 R2R3-MYB, 26 R1R2R3-MYB, and three atypical-MYB proteins. These predicted MsMYB proteins were divided into 12 subgroups by phylogenetic analysis, and gene ontology (GO) analysis indicated that most of the MsMYB genes are involved in various biological processes. The expression profiles and quantitative real-time PCR analysis indicated that some MsMYB genes might play a crucial role in the response to abiotic stresses. Additionally, a total of 170 and 914 predicted protein–protein and protein-DNA interactions were obtained, respectively. The interactions between MsMYB043 and MSAD320162, MsMYB253 and MSAD320162, and MsMYB253 and MSAD308489 were confirmed by a yeast two-hybrid system. This work provides information on the MYB family in alfalfa that was previously lacking and might promote the cultivation of stress-resistant alfalfa.
Collapse
|
7
|
Evolutionary networks from RADseq loci point to hybrid origins of Medicago carstiensis and Medicago cretacea. AMERICAN JOURNAL OF BOTANY 2019; 106:1219-1228. [PMID: 31535720 DOI: 10.1002/ajb2.1352] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Accepted: 07/12/2019] [Indexed: 06/10/2023]
Abstract
PREMISE Although hybridization has played an important role in the evolution of many plant species, phylogenetic reconstructions that include hybridizing lineages have been historically constrained by the available models and data. Restriction-site-associated DNA sequencing (RADseq) has been a popular sequencing technique for the reconstruction of hybridization in the next-generation sequencing era. However, the utility of RADseq for the reconstruction of complex evolutionary networks has not been thoroughly investigated. Conflicting phylogenetic relationships in the genus Medicago have been mainly attributed to hybridization, but the specific hybrid origins of taxa have not been yet clarified. METHODS We obtained new molecular data from diploid species of Medicago section Medicago using single-digest RADseq to reconstruct evolutionary networks from gene trees, an approach that is computationally tractable with data sets that include several species and complex hybridization patterns. RESULTS Our analyses revealed that assembly filters to exclusively select a small set of loci with high phylogenetic information led to the most-divergent network topologies. Conversely, alternative clustering thresholds or filters on the number of samples per locus had a lower impact on networks. A strong hybridization signal was detected for M. carstiensis and M. cretacea, while signals were less clear for M. rugosa, M. rhodopea, M. suffruticosa, M. marina, M. scutellata, and M. sativa. CONCLUSIONS Complex network reconstructions from RADseq gene trees were not robust under variations of the assembly parameters and filters. But when the most-divergent networks were discarded, all remaining analyses consistently supported a hybrid origin for M. carstiensis and M. cretacea.
Collapse
|
8
|
Trans-lineage polymorphism and nonbifurcating diversification of the genus Picea. THE NEW PHYTOLOGIST 2019; 222:576-587. [PMID: 30415488 DOI: 10.1111/nph.15590] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2018] [Accepted: 11/02/2018] [Indexed: 06/09/2023]
Abstract
Nonbifurcating divergence caused by introgressive hybridization is continuously reported for groups of closely related species. In this study, we aimed to reconstruct the genome-scale classification of deep lineages of the conifer genus Picea, establish their phylogenetic relationships and test the bifurcating hypothesis between deeply branching lineages based on genomic data. We sequenced the transcriptomes of 35 individuals of 27 taxa covering all main lineages of the genus. Four major lineages, comprising three to 12 taxa each, largely consistent with morphological evidence, were recovered across the coalescent and integrated nuclear phylogeny. However, many of the individual gene trees recovered contradict one another. Moreover, the well-supported coalescent tree inferred here differs from previous studies based on various DNA markers, with respect to topology and inter-lineage relationships. We identified the shared polymorphisms between four major lineages. ABBA-BABA tests confirmed the inter-lineage gene flow and thus violated the bifurcating divergence model. Gene flow occurred more frequently between lineages distributed in the same continent than those disjunct between continents. Our results indicate that introgression and nonbifurcating diversification apply, even between deeply branching lineages of the conifer genus Picea.
Collapse
|
9
|
Species delimitation and interspecific relationships of the endangered herb genus Notopterygium inferred from multilocus variations. Mol Phylogenet Evol 2019; 133:142-151. [PMID: 30639766 DOI: 10.1016/j.ympev.2019.01.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Revised: 12/28/2018] [Accepted: 01/01/2019] [Indexed: 01/29/2023]
Abstract
Species identification and discrimination is the basis of biodiversity research. In general, it is considered that numerous nucleotide variations (e.g., whole chloroplast genomes) can identify species with higher resolution than a few loci, e.g., partial chloroplast or nuclear gene fragments. In this study, we tested this hypothesis by sampling population genetics samples of the endangered herb genus Notopterygium. We sequenced the complete plastomes, five nuclear gene regions, three chloroplast DNA fragments, and a nuclear internal transcribed spacer (nrITS) region for 18 populations sampled throughout most of the geographic ranges of all six Notopterygium species. Species identification analysis showed that four DNA barcodes (matK, rbcL, trnS-trnG, and nrITS) and/or combinations of these markers achieved Notopterygium species discrimination at higher resolution than the general plastomes and nuclear gene sequences. In particular, nrITS had the highest discriminatory power among all of the individual markers. Molecular data sets and morphological evidence indicated that all six Notopterygium species could be reclassified unambiguously to four putative species clades. N. oviforme and N. franchetii had the closest relationship. Molecular dating showed that the origin and divergence of Notopterygium species was significantly associated with geological and climatic fluctuations during the middle of the Pliocene. In conclusion, our results suggest that a few nucleotide variations can achieve species discrimination with higher resolution than numerous plastomes and general nuclear gene fragments when discerning related Notopterygium species.
Collapse
|
10
|
Phylogenomics, biogeography, and adaptive radiation of grapes. Mol Phylogenet Evol 2018; 129:258-267. [DOI: 10.1016/j.ympev.2018.08.021] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2018] [Revised: 08/30/2018] [Accepted: 08/30/2018] [Indexed: 12/27/2022]
|
11
|
Using Genomic Location and Coalescent Simulation to Investigate Gene Tree Discordance in Medicago L. Syst Biol 2018; 66:934-949. [PMID: 28177088 DOI: 10.1093/sysbio/syx035] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Accepted: 02/01/2017] [Indexed: 12/28/2022] Open
Abstract
Several well-documented evolutionary processes are known to cause conflict between species-level phylogenies and gene-level phylogenies. Three of the most challenging processes for species tree inference are incomplete lineage sorting, hybridization and gene duplication, which may result in unwarranted comparisons of paralogous genes. Several existing methods have dealt with these processes but none has yet been able to untangle all three at once. Here, we propose a stepwise method by which these processes can be discerned using information on genomic location coupled with coalescent simulations. In the first step, highly discordant genes within genomic blocks (putative paralogs) are identified and excluded from the data set and, in the second step, blocks of linked genes are grouped according to their hybrid history. Existing multispecies coalescent software can then be applied to recover the principal tree(s) that make up the species tree/network without violating the underlying model. The potential of the approach is evaluated on simulated data derived from a species network composed of nine species, of which one is of hybrid origin, and displaying a single-gene duplication that leads to paralogous comparisons. We apply our method to an empirical set of 12 genes from 7 species sampled in the plant genus Medicago that display phylogenetic discordance. We identify the causes of the discordance and demonstrate that the Medicago orbicularis lineage experienced an episode of ancient hybridization. Our results show promise as a new way to explore phylogenetic sequence data that can significantly improve species tree inference in presence of hybridization and undetected paralogy or other causes leading to extremely discordant gene trees. [Coalescent simulation; gene tree; genomic location; hybridization; incomplete lineage sorting; paralogy; phylogenetic incongruence; principal tree; species tree.].
Collapse
|
12
|
Species trees from consensus single nucleotide polymorphism (SNP) data: Testing phylogenetic approaches with simulated and empirical data. Mol Phylogenet Evol 2017; 116:192-201. [DOI: 10.1016/j.ympev.2017.07.018] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Revised: 02/21/2017] [Accepted: 07/22/2017] [Indexed: 12/21/2022]
|
13
|
No evidence for adaptation to local rhizobial mutualists in the legume Medicago lupulina. Ecol Evol 2017; 7:4367-4376. [PMID: 28649348 PMCID: PMC5478075 DOI: 10.1002/ece3.3012] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Revised: 03/15/2017] [Accepted: 03/27/2017] [Indexed: 12/31/2022] Open
Abstract
Local adaptation is a common but not ubiquitous feature of species interactions, and understanding the circumstances under which it evolves illuminates the factors that influence adaptive population divergence. Antagonistic species interactions dominate the local adaptation literature relative to mutualistic ones, preventing an overall assessment of adaptation within interspecific interactions. Here, we tested whether the legume Medicago lupulina is adapted to the locally abundant species of mutualistic nitrogen-fixing rhizobial bacteria that vary in frequency across its eastern North American range. We reciprocally inoculated northern and southern M. lupulina genotypes with the northern (Ensifer medicae) or southern bacterium (E. meliloti) in a greenhouse experiment. Despite producing different numbers of root nodules (the structures in which the plants house the bacteria), neither northern nor southern plants produced more seeds, flowered earlier, or were more likely to flower when inoculated with their local rhizobia. We then used a pre-existing dataset to perform a genome scan for loci that showed elevated differentiation between field-collected plants that hosted different bacteria. None of the loci we identified belonged to the well-characterized suite of legume-rhizobia symbiosis genes, suggesting that the rhizobia do not drive genetic divergence between M. lupulina populations. Our results demonstrate that symbiont local adaptation has not evolved in this mutualism despite large-scale geographic variation in the identity of the interacting species.
Collapse
|
14
|
Specific Host-Responsive Associations Between Medicago truncatula Accessions and Sinorhizobium Strains. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2017; 30:399-409. [PMID: 28437159 DOI: 10.1094/mpmi-01-17-0009-r] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Legume plants interact with rhizobia to form nitrogen-fixing root nodules. Legume-rhizobium interactions are specific and only compatible rhizobia and plant species will lead to nodule formation. Even within compatible interactions, the genotype of both the plant and the bacterial symbiont will impact on the efficiency of nodule functioning and nitrogen-fixation activity. The model legume Medicago truncatula forms nodules with several species of the Sinorhizobium genus. However, the efficiency of these bacterial strains is highly variable. In this study, we compared the symbiotic efficiency of Sinorhizobium meliloti strains Sm1021, 102F34, and FSM-MA, and Sinorhizobium medicae strain WSM419 on the two widely used M. truncatula accessions A17 and R108. The efficiency of the interactions was determined by multiple parameters. We found a high effectiveness of the FSM-MA strain with both M. truncatula accessions. In contrast, specific highly efficient interactions were obtained for the A17-WSM419 and R108-102F34 combinations. Remarkably, the widely used Sm1021 strain performed weakly on both hosts. We showed that Sm1021 efficiently induced nodule organogenesis but cannot fully activate the differentiation of the symbiotic nodule cells, explaining its weaker performance. These results will be informative for the selection of appropriate rhizobium strains in functional studies on symbiosis using these M. truncatula accessions, particularly for research focusing on late stages of the nodulation process.
Collapse
|
15
|
Exploring structural variation and gene family architecture with De Novo assemblies of 15 Medicago genomes. BMC Genomics 2017; 18:261. [PMID: 28347275 PMCID: PMC5369179 DOI: 10.1186/s12864-017-3654-1] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2016] [Accepted: 03/22/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Previous studies exploring sequence variation in the model legume, Medicago truncatula, relied on mapping short reads to a single reference. However, read-mapping approaches are inadequate to examine large, diverse gene families or to probe variation in repeat-rich or highly divergent genome regions. De novo sequencing and assembly of M. truncatula genomes enables near-comprehensive discovery of structural variants (SVs), analysis of rapidly evolving gene families, and ultimately, construction of a pan-genome. RESULTS Genome-wide synteny based on 15 de novo M. truncatula assemblies effectively detected different types of SVs indicating that as much as 22% of the genome is involved in large structural changes, altogether affecting 28% of gene models. A total of 63 million base pairs (Mbp) of novel sequence was discovered, expanding the reference genome space for Medicago by 16%. Pan-genome analysis revealed that 42% (180 Mbp) of genomic sequences is missing in one or more accession, while examination of de novo annotated genes identified 67% (50,700) of all ortholog groups as dispensable - estimates comparable to recent studies in rice, maize and soybean. Rapidly evolving gene families typically associated with biotic interactions and stress response were found to be enriched in the accession-specific gene pool. The nucleotide-binding site leucine-rich repeat (NBS-LRR) family, in particular, harbors the highest level of nucleotide diversity, large effect single nucleotide change, protein diversity, and presence/absence variation. However, the leucine-rich repeat (LRR) and heat shock gene families are disproportionately affected by large effect single nucleotide changes and even higher levels of copy number variation. CONCLUSIONS Analysis of multiple M. truncatula genomes illustrates the value of de novo assemblies to discover and describe structural variation, something that is often under-estimated when using read-mapping approaches. Comparisons among the de novo assemblies also indicate that different large gene families differ in the architecture of their structural variation.
Collapse
|
16
|
Analysis of phylogenetic relationships and genome size evolution of the Amaranthus genus using GBS indicates the ancestors of an ancient crop. Mol Phylogenet Evol 2017; 109:80-92. [PMID: 28057554 DOI: 10.1016/j.ympev.2016.12.029] [Citation(s) in RCA: 70] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2016] [Revised: 12/22/2016] [Accepted: 12/25/2016] [Indexed: 11/19/2022]
Abstract
The genus Amaranthus consists of 50-70 species and harbors several cultivated and weedy species of great economic importance. A small number of suitable traits, phenotypic plasticity, gene flow and hybridization made it difficult to establish the taxonomy and phylogeny of the whole genus despite various studies using molecular markers. We inferred the phylogeny of the Amaranthus genus using genotyping by sequencing (GBS) of 94 genebank accessions representing 35 Amaranthus species and measured their genome sizes. SNPs were called by de novo and reference-based methods, for which we used the distant sugarbeet Beta vulgaris and the closely related Amaranthus hypochondriacus as references. SNP counts and proportions of missing data differed between methods, but the resulting phylogenetic trees were highly similar. A distance-based neighbor joining tree of individual accessions and a species tree calculated with the multispecies coalescent supported a previous taxonomic classification into three subgenera although the subgenus A. Acnida consists of two highly differentiated clades. The analysis of the Hybridus complex within the A. Amaranthus subgenus revealed insights on the history of cultivated grain amaranths. The complex includes the three cultivated grain amaranths and their wild relatives and was well separated from other species in the subgenus. Wild and cultivated amaranth accessions did not differentiate according to the species assignment but clustered by their geographic origin from South and Central America. Different geographically separated populations of Amaranthus hybridus appear to be the common ancestors of the three cultivated grain species and A. quitensis might be additionally be involved in the evolution of South American grain amaranth (A. caudatus). We also measured genome sizes of the species and observed little variation with the exception of two lineages that showed evidence for a recent polyploidization. With the exception of two lineages, genome sizes are quite similar and indicate that polyploidization did not play a major role in the history of the genus.
Collapse
|
17
|
Different cytokinin histidine kinase receptors regulate nodule initiation as well as later nodule developmental stages in Medicago truncatula. PLANT, CELL & ENVIRONMENT 2016; 39:2198-209. [PMID: 27341695 DOI: 10.1111/pce.12779] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Accepted: 06/11/2016] [Indexed: 05/08/2023]
Abstract
Legume plants adapt to low nitrogen by developing an endosymbiosis with nitrogen-fixing soil bacteria to form a new specific organ: the nitrogen-fixing nodule. In the Medicago truncatula model legume, the MtCRE1 cytokinin receptor is essential for this symbiotic interaction. As three other putative CHASE-domain containing histidine kinase (CHK) cytokinin receptors exist in M. truncatula, we determined their potential contribution to this symbiotic interaction. The four CHKs have extensive redundant expression patterns at early nodulation stages but diverge in differentiated nodules, even though MtCHK1/MtCRE1 has the strongest expression at all stages. Mutant and knock-down analyses revealed that other CHKs than MtCHK1/CRE1 are positively involved in nodule initiation, which explains the delayed nodulation phenotype of the chk1/cre1 mutant. In addition, cre1 nodules exhibit an increased growth, whereas other chk mutants have no detectable phenotype, and the maintained nitrogen fixation capacity in cre1 requires other CHK genes. Interestingly, an AHK4/CRE1 genomic locus from the aposymbiotic Arabidopsis plant rescues nodule initiation but not the nitrogen fixation capacity. This indicates that different CHK cytokinin signalling pathways regulate not only nodule initiation but also later developmental stages, and that legume-specific determinants encoded by the MtCRE1 gene are required for later nodulation stages than initiation.
Collapse
|
18
|
Species Delimitation and Interspecific Relationships of the Genus Orychophragmus (Brassicaceae) Inferred from Whole Chloroplast Genomes. FRONTIERS IN PLANT SCIENCE 2016; 7:1826. [PMID: 27999584 PMCID: PMC5138468 DOI: 10.3389/fpls.2016.01826] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Accepted: 11/21/2016] [Indexed: 05/20/2023]
Abstract
Genetic variations from few chloroplast DNA fragments show lower discriminatory power in the delimitation of closely related species and less resolution ability in discerning interspecific relationships than from nrITS. Here we use Orychophragmus (Brassicaceae) as a model system to test the hypothesis that the whole chloroplast genomes (plastomes), with accumulation of more variations despite the slow evolution, can overcome these weaknesses. We used Illumina sequencing technology via a reference-guided assembly to construct complete plastomes of 17 individuals from six putatively assumed species in the genus. All plastomes are highly conserved in genome structure, gene order, and orientation, and they are around 153 kb in length and contain 113 unique genes. However, nucleotide variations are quite substantial to support the delimitation of all sampled species and to resolve interspecific relationships with high statistical supports. As expected, the estimated divergences between major clades and species are lower than those estimated from nrITS probably due to the slow substitution rate of the plastomes. However, the plastome and nrITS phylogenies were contradictory in the placements of most species, thus suggesting that these species may have experienced complex non-bifurcating evolutions with incomplete lineage sorting and/or hybrid introgressions. Overall, our case study highlights the importance of using plastomes to examine species boundaries and establish an independent phylogeny to infer the speciation history of plants.
Collapse
|
19
|
Impact of gene family evolutionary histories on phylogenetic species tree inference by gene tree parsimony. Mol Phylogenet Evol 2015; 96:9-16. [PMID: 26702957 DOI: 10.1016/j.ympev.2015.12.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Revised: 10/11/2015] [Accepted: 12/03/2015] [Indexed: 11/21/2022]
Abstract
Complicated history of gene duplication and loss brings challenge to molecular phylogenetic inference, especially in deep phylogenies. However, phylogenomic approaches, such as gene tree parsimony (GTP), show advantage over some other approaches in its ability to use gene families with duplications. GTP searches the 'optimal' species tree by minimizing the total cost of biological events such as duplications, but accuracy of GTP and phylogenetic signal in the context of different gene families with distinct histories of duplication and loss are unclear. To evaluate how different evolutionary properties of different gene families can impact on species tree inference, 3900 gene families from seven angiosperms encompassing a wide range of gene content, lineage-specific expansions and contractions were analyzed. It was found that the gene content and total duplication number in a gene family strongly influence species tree inference accuracy, with the highest accuracy achieved at either very low or very high gene content (or duplication number) and lowest accuracy centered in intermediate gene content (or duplication number), as the relationship can fit a binomial regression. Besides, for gene families of similar level of average gene content, those with relatively higher lineage-specific expansion or duplication rates tend to show lower accuracy. Additional correlation tests support that high accuracy for those gene families with large gene content may rely on abundant ancestral copies to provide many subtrees to resolve conflicts, whereas high accuracy for single or low copy gene families are just subject to sequence substitution per se. Very low accuracy reached by gene families of intermediate gene content or duplication number can be due to insufficient subtrees to resolve the conflicts from loss of alternative copies. As these evolutionary properties can significantly influence species tree accuracy, I discussed the potential weighting of the duplication cost by evolutionary properties of gene families in future GTP analyses.
Collapse
|
20
|
Phylogenomic reconstruction supports supercontinent origins for Leishmania. INFECTION GENETICS AND EVOLUTION 2015; 38:101-109. [PMID: 26708057 DOI: 10.1016/j.meegid.2015.11.030] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Revised: 11/25/2015] [Accepted: 11/26/2015] [Indexed: 11/23/2022]
Abstract
Leishmania, a genus of parasites transmitted to human hosts and mammalian/reptilian reservoirs by an insect vector, is the causative agent of the human disease complex leishmaniasis. The evolutionary relationships within the genus Leishmania and its origins are the source of ongoing debate, reflected in conflicting phylogenetic and biogeographic reconstructions. This study employs a recently described bioinformatics method, SISRS, to identify over 200,000 informative sites across the genome from newly sequenced and publicly available Leishmania data. This dataset is used to reconstruct the evolutionary relationships of this genus. Additionally, we constructed a large multi-gene dataset, using it to reconstruct the phylogeny and estimate divergence dates for species. We conclude that the genus Leishmania evolved at least 90-100 million years ago, supporting a modified version of the Multiple Origins hypothesis that we call the Supercontinent hypothesis. According to this scenario, separate Leishmania clades emerged prior to, and during, the breakup of Gondwana. Additionally, we confirm that reptile-infecting Leishmania are derived from mammalian forms and that the species that infect porcupines and sloths form a clade long separated from other species. Finally, we firmly place the guinea-pig infecting species, Leishmaniaenriettii, the globally dispersed Leishmaniasiamensis, and the newly identified Australian species from a kangaroo, as sibling species whose distribution arises from the ancient connection between Australia, Antarctica, and South America.
Collapse
|
21
|
Short Tree, Long Tree, Right Tree, Wrong Tree: New Acquisition Bias Corrections for Inferring SNP Phylogenies. Syst Biol 2015; 64:1032-47. [PMID: 26227865 PMCID: PMC4604835 DOI: 10.1093/sysbio/syv053] [Citation(s) in RCA: 201] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2015] [Accepted: 07/24/2015] [Indexed: 01/01/2023] Open
Abstract
Single nucleotide polymorphisms (SNPs) are useful markers for phylogenetic studies owing in part to their ubiquity throughout the genome and ease of collection. Restriction site associated DNA sequencing (RADseq) methods are becoming increasingly popular for SNP data collection, but an assessment of the best practises for using these data in phylogenetics is lacking. We use computer simulations, and new double digest RADseq (ddRADseq) data for the lizard family Phrynosomatidae, to investigate the accuracy of RAD loci for phylogenetic inference. We compare the two primary ways RAD loci are used during phylogenetic analysis, including the analysis of full sequences (i.e., SNPs together with invariant sites), or the analysis of SNPs on their own after excluding invariant sites. We find that using full sequences rather than just SNPs is preferable from the perspectives of branch length and topological accuracy, but not of computational time. We introduce two new acquisition bias corrections for dealing with alignments composed exclusively of SNPs, a conditional likelihood method and a reconstituted DNA approach. The conditional likelihood method conditions on the presence of variable characters only (the number of invariant sites that are unsampled but known to exist is not considered), while the reconstituted DNA approach requires the user to specify the exact number of unsampled invariant sites prior to the analysis. Under simulation, branch length biases increase with the amount of missing data for both acquisition bias correction methods, but branch length accuracy is much improved in the reconstituted DNA approach compared to the conditional likelihood approach. Phylogenetic analyses of the empirical data using concatenation or a coalescent-based species tree approach provide strong support for many of the accepted relationships among phrynosomatid lizards, suggesting that RAD loci contain useful phylogenetic signal across a range of divergence times despite the presence of missing data. Phylogenetic analysis of RAD loci requires careful attention to model assumptions, especially if downstream analyses depend on branch lengths.
Collapse
|
22
|
A composite genome approach to identify phylogenetically informative data from next-generation sequencing. BMC Bioinformatics 2015; 16:193. [PMID: 26062548 PMCID: PMC4464851 DOI: 10.1186/s12859-015-0632-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2014] [Accepted: 05/29/2015] [Indexed: 11/16/2022] Open
Abstract
Background Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS, avoids the time consuming steps of de novo whole genome assembly, multiple genome alignment, and annotation. Results For simulations SISRS is able to identify large numbers of loci containing variable sites with phylogenetic signal. For genomic data from apes, SISRS identified thousands of variable sites, from which we produced an accurate phylogeny. Finally, we used SISRS to identify phylogenetic markers that we used to estimate the phylogeny of placental mammals. We recovered eight phylogenies that resolved the basal relationships among mammals using datasets with different levels of missing data. The three alternate resolutions of the basal relationships are consistent with the major hypotheses for the relationships among mammals, all of which have been supported previously by different molecular datasets. Conclusions SISRS has the potential to transform phylogenetic research. This method eliminates the need for expensive marker development in many studies by using whole genome shotgun sequence data directly. SISRS is open source and freely available at https://github.com/rachelss/SISRS/releases. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0632-y) contains supplementary material, which is available to authorized users.
Collapse
|
23
|
Variable mating behaviors and the maintenance of tropical biodiversity. Front Genet 2015; 6:183. [PMID: 26042148 PMCID: PMC4437050 DOI: 10.3389/fgene.2015.00183] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Accepted: 04/30/2015] [Indexed: 12/25/2022] Open
Abstract
Current theoretical studies on mechanisms promoting species co-existence in diverse communities assume that species are fixed in their mating behavior. Each species is a discrete evolutionary unit, even though most empirical evidence indicates that inter-specific gene flow occurs in plant and animal groups. Here, in a data-driven meta-community model of species co-existence, we allow mating behavior to respond to local species composition and abundance. While individuals primarily out-cross, species maintain a diminished capacity for selfing and hybridization. Mate choice is treated as a variable behavior, which responds to intrinsic traits determining mate choice and the density and availability of sympatric inter-fertile individuals. When mate choice is strongly limited, even low survivorship of selfed offspring can prevent extinction of rare species. With increasing mate choice, low hybridization success rates maintain community level diversity for extended periods of time. In high diversity tropical tree communities, competition among sympatric congeneric species is negligible, because direct spatial proximity with close relatives is infrequent. Therefore, the genomic donorship presents little cost. By incorporating variable mating behavior into evolutionary models of diversification, we also discuss how participation in a syngameon may be selectively advantageous. We view this behavior as a genomic mutualism, where maintenance of genomic structure and diminished inter-fertility, allows each species in the syngameon to benefit from a greater effective population size during episodes of selective disadvantage. Rare species would play a particularly important role in these syngameons as they are more likely to produce heterospecific crosses and transgressive phenotypes. We propose that inter-specific gene flow can play a critical role by allowing genomic mutualists to avoid extinction and gain local adaptations.
Collapse
|
24
|
Deep phylogenetic incongruence in the angiosperm clade Rosidae. Mol Phylogenet Evol 2015; 83:156-66. [DOI: 10.1016/j.ympev.2014.11.003] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Revised: 11/01/2014] [Accepted: 11/05/2014] [Indexed: 10/24/2022]
|
25
|
Abstract
A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution.
Collapse
|
26
|
The ecological genomic basis of salinity adaptation in Tunisian Medicago truncatula. BMC Genomics 2014; 15:1160. [PMID: 25534372 PMCID: PMC4410866 DOI: 10.1186/1471-2164-15-1160] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2014] [Accepted: 12/12/2014] [Indexed: 11/10/2022] Open
Abstract
Background As our world becomes warmer, agriculture is increasingly impacted by rising soil salinity and understanding plant adaptation to salt stress can help enable effective crop breeding. Salt tolerance is a complex plant phenotype and we know little about the pathways utilized by naturally tolerant plants. Legumes are important species in agricultural and natural ecosystems, since they engage in symbiotic nitrogen-fixation, but are especially vulnerable to salinity stress. Results Our studies of the model legume Medicago truncatula in field and greenhouse settings demonstrate that Tunisian populations are locally adapted to saline soils at the metapopulation level and that saline origin genotypes are less impacted by salt than non-saline origin genotypes; these populations thus likely contain adaptively diverged alleles. Whole genome resequencing of 39 wild accessions reveals ongoing migration and candidate genomic regions that assort non-randomly with soil salinity. Consistent with natural selection acting at these sites, saline alleles are typically rare in the range-wide species' gene pool and are also typically derived relative to the sister species M. littoralis. Candidate regions for adaptation contain genes that regulate physiological acclimation to salt stress, such as abscisic acid and jasmonic acid signaling, including a novel salt-tolerance candidate orthologous to the uncharacterized gene AtCIPK21. Unexpectedly, these regions also contain biotic stress genes and flowering time pathway genes. We show that flowering time is differentiated between saline and non-saline populations and may allow salt stress escape. Conclusions This work nominates multiple potential pathways of adaptation to naturally stressful environments in a model legume. These candidates point to the importance of both tolerance and avoidance in natural legume populations. We have uncovered several promising targets that could be used to breed for enhanced salt tolerance in crop legumes to enhance food security in an era of increasing soil salinization. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1160) contains supplementary material, which is available to authorized users.
Collapse
|
27
|
Phylogenetic properties of 50 nuclear loci in Medicago (Leguminosae) generated using multiplexed sequence capture and next-generation sequencing. PLoS One 2014; 9:e109704. [PMID: 25329401 PMCID: PMC4201463 DOI: 10.1371/journal.pone.0109704] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2014] [Accepted: 09/10/2014] [Indexed: 11/18/2022] Open
Abstract
Next-generation sequencing technology has increased the capacity to generate molecular data for plant biological research, including phylogenetics, and can potentially contribute to resolving complex phylogenetic problems. The evolutionary history of Medicago L. (Leguminosae: Trifoliae) remains unresolved due to incongruence between published phylogenies. Identification of the processes causing this genealogical incongruence is essential for the inference of a correct species phylogeny of the genus and requires that more molecular data, preferably from low-copy nuclear genes, are obtained across different species. Here we report the development of 50 novel LCN markers in Medicago and assess the phylogenetic properties of each marker. We used the genomic resources available for Medicago truncatula Gaertn., hybridisation-based gene enrichment (sequence capture) techniques and Next-Generation Sequencing to generate sequences. This alternative proves to be a cost-effective approach to amplicon sequencing in phylogenetic studies at the genus or tribe level and allows for an increase in number and size of targeted loci. Substitution rate estimates for each of the 50 loci are provided, and an overview of the variation in substitution rates among a large number of low-copy nuclear genes in plants is presented for the first time. Aligned sequences of major species lineages of Medicago and its sister genus are made available and can be used in further probe development for sequence-capture of the same markers.
Collapse
|
28
|
Genomic characterization of the LEED..PEEDs, a gene family unique to the medicago lineage. G3 (BETHESDA, MD.) 2014; 4:2003-12. [PMID: 25155275 PMCID: PMC4199706 DOI: 10.1534/g3.114.011874] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/25/2014] [Accepted: 08/18/2014] [Indexed: 12/18/2022]
Abstract
The LEED..PEED (LP) gene family in Medicago truncatula (A17) is composed of 13 genes coding small putatively secreted peptides with one to two conserved domains of negatively charged residues. This family is not present in the genomes of Glycine max, Lotus japonicus, or the IRLC species Cicer arietinum. LP genes were also not detected in a Trifolium pratense draft genome or Pisum sativum nodule transcriptome, which were sequenced de novo in this study, suggesting that the LP gene family arose within the past 25 million years. M. truncatula accession HM056 has 13 LP genes with high similarity to those in A17, whereas M. truncatula ssp. tricycla (R108) and M. sativa have 11 and 10 LP gene copies, respectively. In M. truncatula A17, 12 LP genes are located on chromosome 7 within a 93-kb window, whereas one LP gene copy is located on chromosome 4. A phylogenetic analysis of the gene family is consistent with most gene duplications occurring prior to Medicago speciation events, mainly through local tandem duplications and one distant duplication across chromosomes. Synteny comparisons between R108 and A17 confirm that gene order is conserved between the two subspecies, although a further duplication occurred solely in A17. In M. truncatula A17, all 13 LPs are exclusively transcribed in nodules and absent from other plant tissues, including roots, leaves, flowers, seeds, seed shells, and pods. The recent expansion of LP genes in Medicago spp. and their timing and location of expression suggest a novel function in nodulation, possibly as an aftermath of the evolution of bacteroid terminal differentiation or potentially associated with rhizobial-host specificity.
Collapse
|
29
|
Molecular data do not provide unambiguous support for the monophyly of flatfishes (Pleuronectiformes): A reply to Betancur-R and Ortí. Mol Phylogenet Evol 2014; 75:149-53. [DOI: 10.1016/j.ympev.2014.02.011] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2014] [Revised: 02/11/2014] [Accepted: 02/16/2014] [Indexed: 11/24/2022]
|
30
|
Genome variations account for different response to three mineral elements between Medicago truncatula ecotypes Jemalong A17 and R108. BMC PLANT BIOLOGY 2014; 14:122. [PMID: 24885873 PMCID: PMC4031900 DOI: 10.1186/1471-2229-14-122] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2014] [Accepted: 04/30/2014] [Indexed: 05/26/2023]
Abstract
BACKGROUND Resequencing can be used to identify genome variations underpinning many morphological and physiological phenotypes. Legume model plant Medicago truncatula ecotypes Jemalong A17 (J. A17) and R108 differ in their responses to mineral toxicity of aluminum and sodium, and mineral deficiency of iron in growth medium. The difference may result from their genome variations, but no experimental evidence supports this hypothesis. RESULTS A total of 12,750 structure variations, 135,045 short insertions/deletions and 764,154 single nucleotide polymorphisms were identified by resequencing the genome of R108. The suppressed expression of MtAACT that encodes a putative aluminum-induced citrate efflux transporter by deletion of partial sequence of the second intron may account for the less aluminum-induced citrate exudation and greater accumulation of aluminum in roots of R108 than in roots of J. A17, thus rendering R108 more sensitive to aluminum toxicity. The higher expression-level of MtZpt2-1 encoding a TFIIIA-related transcription factor in J. A17 than R108 under conditions of salt stress can be explained by the greater number of stress-responsive elements in its promoter sequence, thus conferring J. A17 more tolerant to salt stress than R108 plants by activating the expression of downstream stress-responsive genes. YSLs (Yellow Stripe-Likes) are involved in long-distance transport of iron in plants. We found that an YSL gene was deleted in the genome of R108 plants, thus rendering R108 less tolerance to iron deficiency than J. A17 plants. CONCLUSIONS The deletion or change in several genes may account for the different responses of M. truncatula ecotypes J. A17 and R108 to mineral toxicity of aluminum and sodium as well as iron deficiency. Uncovering genome variations by resequencing is an effective method to identify different traits between species/ecotypes that are genetically related. These findings demonstrate that analyses of genome variations by resequencing can shed important light on differences in responses of M. truncatula ecotypes to abiotic stress in general and mineral stress in particular.
Collapse
|
31
|
Disentangling Methodological and Biological Sources of Gene Tree Discordance on Oryza (Poaceae) Chromosome 3. Syst Biol 2014; 63:645-59. [DOI: 10.1093/sysbio/syu027] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
32
|
High-density genome-wide association mapping implicates an F-box encoding gene in Medicago truncatula resistance to Aphanomyces euteiches. THE NEW PHYTOLOGIST 2014; 201:1328-1342. [PMID: 24283472 DOI: 10.1111/nph.12611] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2013] [Accepted: 10/23/2013] [Indexed: 05/18/2023]
Abstract
• The use of quantitative disease resistance (QDR) is a promising strategy for promoting durable resistance to plant pathogens, but genes involved in QDR are largely unknown. To identify genetic components and accelerate improvement of QDR in legumes to the root pathogen Aphanomyces euteiches, we took advantage of both the recently generated massive genomic data for Medicago truncatula and natural variation of this model legume. • A high-density (≈5.1 million single nucleotide polymorphisms (SNPs)) genome-wide association study (GWAS) was performed with both in vitro and glasshouse phenotyping data collected for 179 lines. • GWAS identified several candidate genes and pinpointed two independent major loci on the top of chromosome 3 that were detected in both phenotyping methods. Candidate SNPs in the most significant locus (σ(A)²= 23%) were in the promoter and coding regions of an F-box protein coding gene. Subsequent qRT-PCR and bioinformatic analyses performed on 20 lines demonstrated that resistance is associated with mutations directly affecting the interaction domain of the F-box protein rather than gene expression. • These results refine the position of previously identified QTL to specific candidate genes, suggest potential molecular mechanisms, and identify new loci explaining QDR against A. euteiches.
Collapse
|
33
|
Selection, genome-wide fitness effects and evolutionary rates in the model legume Medicago truncatula. Mol Ecol 2013; 22:3525-38. [PMID: 23773281 DOI: 10.1111/mec.12329] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2012] [Revised: 02/22/2013] [Accepted: 03/12/2013] [Indexed: 12/15/2022]
Abstract
Sequence data for >20 000 annotated genes from 56 accessions of Medicago truncatula were used to identify potential targets of positive selection, the determinants of evolutionary rate variation and the relative importance of positive and purifying selection in shaping nucleotide diversity. Based upon patterns of intraspecific diversity and interspecific divergence, c. 50-75% of nonsynonymous polymorphisms are subject to strong purifying selection and 1% of the sampled genes harbour a signature of positive selection. Combining polymorphism with expression data, we estimated the distribution of fitness effects and found that the proportion of deleterious mutations is significantly greater for expressed genes than for genes with undetected transcripts (nonexpressed) in a previous RNA-seq experiment and greater for broadly expressed genes than those expressed in only a single tissue. Expression level is the strongest correlate of evolutionary rates at nonsynonymous sites, and despite multiple genomic features being significantly correlated with evolutionary rates, they explain less than 20% of the variation in nonsynonymous rates (dN) and <15% of the variation in either synonymous rates (dS) or dN:dS. Among putative targets of selection were genes involved in defence against pathogens and herbivores, genes with roles in mediating the relationship with rhizobial symbionts and one-third of annotated histone-lysine methyltransferases. Adaptive evolution of the methyltransferases suggests that positive selection in gene expression may have occurred through evolution of enzymes involved in epigenetic modification.
Collapse
|
34
|
A change in SHATTERPROOF protein lies at the origin of a fruit morphological novelty and a new strategy for seed dispersal in medicago genus. PLANT PHYSIOLOGY 2013; 162:907-17. [PMID: 23640757 PMCID: PMC3668079 DOI: 10.1104/pp.113.217570] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Angiosperms are the most diverse and numerous group of plants, and it is generally accepted that this evolutionary success owes in part to the diversity found in fruits, key for protecting the developing seeds and ensuring seed dispersal. Although studies on the molecular basis of morphological innovations are few, they all illustrate the central role played by transcription factors acting as developmental regulators. Here, we show that a small change in the protein sequence of a MADS-box transcription factor correlates with the origin of a highly modified fruit morphology and the change in seed dispersal strategies that occurred in Medicago, a genus belonging to the large legume family. This protein sequence modification alters the functional properties of the protein, affecting the affinities for other protein partners involved in high-order complexes. Our work illustrates that variation in coding regions can generate evolutionary novelties not based on gene duplication/subfunctionalization but by interactions in complex networks, contributing also to the current debate on the relative importance of changes in regulatory or coding regions of master regulators in generating morphological novelties.
Collapse
|