1
|
A phylogenomic study of Iridaceae Juss. based on complete plastid genome sequences. FRONTIERS IN PLANT SCIENCE 2023; 14:1066708. [PMID: 36844099 PMCID: PMC9948625 DOI: 10.3389/fpls.2023.1066708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 01/09/2023] [Indexed: 06/18/2023]
Abstract
The plastid genome has proven to be an effective tool for examining deep correlations in plant phylogenetics, owing to its highly conserved structure, uniparental inheritance, and limited variation in evolutionary rates. Iridaceae, comprising more than 2,000 species, includes numerous economically significant taxa that are frequently utilized in food industries and medicines and for ornamental and horticulture purposes. Molecular studies on chloroplast DNA have confirmed the position of this family in the order Asparagales with non-asparagoids. The current subfamilial classification of Iridaceae recognizes seven subfamilies-Isophysioideae, Nivenioideae, Iridoideae, Crocoideae, Geosiridaceae, Aristeoideae, and Patersonioideae-which are supported by limited plastid DNA regions. To date, no comparative phylogenomic studies have been conducted on the family Iridaceae. We assembled and annotated (de novo) the plastid genomes of 24 taxa together with seven published species representing all the seven subfamilies of Iridaceae and performed comparative genomics using the Illumina MiSeq platform. The plastomes of the autotrophic Iridaceae represent 79 protein-coding, 30 tRNA, and four rRNA genes, with lengths ranging from 150,062 to 164,622 bp. The phylogenetic analysis of the plastome sequences based on maximum parsimony, maximum likelihood, and Bayesian inference analyses suggested that Watsonia and Gladiolus were closely related, supported by strong support values, which differed considerably from recent phylogenetic studies. In addition, we identified genomic events, such as sequence inversions, deletions, mutations, and pseudogenization, in some species. Furthermore, the largest nucleotide variability was found in the seven plastome regions, which can be used in future phylogenetic studies. Notably, three subfamilies-Crocoideae, Nivenioideae, and Aristeoideae-shared a common ycf2 gene locus deletion. Our study is a preliminary report of a comparative study of the complete plastid genomes of 7/7 subfamilies and 9/10 tribes, elucidating the structural characteristics and shedding light on plastome evolution and phylogenetic relationships within Iridaceae. Additionally, further research is required to update the relative position of Watsonia within the tribal classification of the subfamily Crocoideae.
Collapse
|
2
|
The evolution of multi-gene families and metabolic pathways in the evening primroses (Oenothera: Onagraceae): A comparative transcriptomics approach. PLoS One 2022; 17:e0269307. [PMID: 35749399 PMCID: PMC9231714 DOI: 10.1371/journal.pone.0269307] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Accepted: 05/18/2022] [Indexed: 12/02/2022] Open
Abstract
The plant genus Oenothera has played an important role in the study of plant evolution of genomes and plant defense and reproduction. Here, we build on the 1kp transcriptomic dataset by creating 44 new transcriptomes and analyzing a total of 63 transcriptomes to present a large-scale comparative study across 29 Oenothera species. Our dataset included 30.4 million reads per individual and 2.3 million transcripts on average. We used this transcriptome resource to examine genome-wide evolutionary patterns and functional diversification by searching for orthologous genes and performed gene family evolution analysis. We found wide heterogeneity in gene family evolution across the genus, with section Oenothera exhibiting the most pronounced evolutionary changes. Overall, more significant gene family expansions occurred than contractions. We also analyzed the molecular evolution of phenolic metabolism by retrieving proteins annotated for phenolic enzymatic complexes. We identified 1,568 phenolic genes arranged into 83 multigene families that varied widely across the genus. All taxa experienced rapid phenolic evolution (fast rate of genomic turnover) involving 33 gene families, which exhibited large expansions, gaining about 2-fold more genes than they lost. Upstream enzymes phenylalanine ammonia-lyase (PAL) and 4-coumaroyl: CoA ligase (4CL) accounted for most of the significant expansions and contractions. Our results suggest that adaptive and neutral evolutionary processes have contributed to Oenothera diversification and rapid gene family evolution.
Collapse
|
3
|
The Chloroplast Phylogenomics and Systematics of Zoysia (Poaceae). PLANTS 2021; 10:plants10081517. [PMID: 34451562 PMCID: PMC8400354 DOI: 10.3390/plants10081517] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Revised: 07/13/2021] [Accepted: 07/22/2021] [Indexed: 11/16/2022]
Abstract
The genus Zoysia Willd. (Chloridoideae) is widely distributed from the temperate regions of Northeast Asia—including China, Japan, and Korea—to the tropical regions of Southeast Asia. Among these, four species—Zoysia japonica Steud., Zoysia sinica Hance, Zoysia tenuifolia Thiele, and Zoysia macrostachya Franch. & Sav.—are naturally distributed in the Korean Peninsula. In this study, we report the complete plastome sequences of these Korean Zoysia species (NCBI acc. nos. MF953592, MF967579~MF967581). The length of Zoysia plastomes ranges from 135,854 to 135,904 bp, and the plastomes have a typical quadripartite structure, which consists of a pair of inverted repeat regions (20,962~20,966 bp) separated by a large (81,348~81,392 bp) and a small (12,582~12,586 bp) single-copy region. In terms of gene order and structure, Zoysia plastomes are similar to the typical plastomes of Poaceae. The plastomes encode 110 genes, of which 76 are protein-coding genes, 30 are tRNA genes, and four are rRNA genes. Fourteen genes contain single introns and one gene has two introns. Three evolutionary hotspot spacer regions—atpB~rbcL, rps16~rps3, and rpl32~trnL-UAG—were recognized among six analyzed Zoysia species. The high divergences in the atpB~rbcL spacer and rpl16~rpl3 region are primarily due to the differences in base substitutions and indels. In contrast, the high divergence between rpl32~trnL-UAG spacers is due to a small inversion with a pair of 22 bp stem and an 11 bp loop. Simple sequence repeats (SSRs) were identified in 59 different locations in Z. japonica, 63 in Z. sinica, 62 in Z. macrostachya, and 63 in Z. tenuifolia plastomes. Phylogenetic analysis showed that the Zoysia (Zoysiinae) forms a monophyletic group, which is sister to Sporobolus (Sporobolinae), with 100% bootstrap support. Within the Zoysia clade, the relationship of (Z. sinica, Z japonica), (Z. tenuifolia, Z. matrella), (Z. macrostachya, Z. macrantha) was suggested.
Collapse
|
4
|
Horizontal Gene Transfer Involving Chloroplasts. Int J Mol Sci 2021; 22:ijms22094484. [PMID: 33923118 PMCID: PMC8123421 DOI: 10.3390/ijms22094484] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 04/22/2021] [Accepted: 04/23/2021] [Indexed: 02/04/2023] Open
Abstract
Horizontal gene transfer (HGT)- is defined as the acquisition of genetic material from another organism. However, recent findings indicate a possible role of HGT in the acquisition of traits with adaptive significance, suggesting that HGT is an important driving force in the evolution of eukaryotes as well as prokaryotes. It has been noted that, in eukaryotes, HGT is more prevalent than originally thought. Mitochondria and chloroplasts lost a large number of genes after their respective endosymbiotic events occurred. Even after this major content loss, organelle genomes still continue to lose their own genes. Many of these are subsequently acquired by intracellular gene transfer from the original plastid. The aim of our review was to elucidate the role of chloroplasts in the transfer of genes. This review also explores gene transfer involving mitochondrial and nuclear genomes, though recent studies indicate that chloroplast genomes are far more active in HGT as compared to these other two DNA-containing cellular compartments.
Collapse
|
5
|
Differentiation of Hedyotis diffusa and Common Adulterants Based on Chloroplast Genome Sequencing and DNA Barcoding Markers. PLANTS (BASEL, SWITZERLAND) 2021; 10:161. [PMID: 33467716 PMCID: PMC7829813 DOI: 10.3390/plants10010161] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 01/05/2021] [Accepted: 01/13/2021] [Indexed: 12/21/2022]
Abstract
Chinese herbal tea, also known as Liang Cha or cooling beverage, is popular in South China. It is regarded as a quick-fix remedy to relieve minor health problems. Hedyotis diffusa Willd. (colloquially Baihuasheshecao) is a common ingredient of cooling beverages. H. diffusa is also used to treat cancer and bacterial infections. Owing to the high demand for H. diffusa, two common adulterants, Hedyotis brachypoda (DC.) Sivar and Biju (colloquially Nidingjingcao) and Hedyotis corymbosa (L.) Lam. (colloquially Shuixiancao), are commonly encountered in the market. Owing to the close similarity of their morphological characteristics, it is difficult to differentiate them. Here, we sequenced the complete chloroplast genomes of the three species of Hedyotis using next-generation sequencing (NGS). By comparing the complete chloroplast genomes, we found that they are closely related in the subfamily Rubioideae. We also discovered that there are significant differences in the number and repeating motifs of microsatellites and complex repeats and revealed three divergent hotspots, rps16-trnQ intergenic spacer, ndhD and ycf1. By using these species-specific sequences, we propose new DNA barcoding markers for the authentication of H. diffusa and its two common adulterants.
Collapse
|
6
|
Evolutionary dynamics of the chloroplast genome sequences of six Colobanthus species. Sci Rep 2020; 10:11522. [PMID: 32661280 PMCID: PMC7359349 DOI: 10.1038/s41598-020-68563-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Accepted: 06/25/2020] [Indexed: 11/08/2022] Open
Abstract
The complete plastome sequences of six species were sequenced to better understand the evolutionary relationships and mutation patterns in the chloroplast genome of the genus Colobanthus. The length of the chloroplast genome sequences of C. acicularis, C. affinis, C. lycopodioides, C. nivicola, C. pulvinatus and C. subulatus ranged from 151,050 to 151,462 bp. The quadripartite circular structure of these genome sequences has the same overall organization and gene content with 73 protein-coding genes, 30 tRNA genes, four rRNA genes and five conserved chloroplast open reading frames. A total of 153 repeat sequences were revealed. Forward repeats were dominant, whereas complementary repeats were found only in C. pulvinatus. The mononucleotide SSRs composed of A/T units were most common, and hexanucleotide SSRs were detected least often. Eleven highly variable regions which could be utilized as potential markers for phylogeny reconstruction, species identification or phylogeography were identified within Colobanthus chloroplast genomes. Seventy-three protein-coding genes were used in phylogenetic analyses. Reconstructed phylogeny was consistent with the systematic position of the studied species, and the representatives of the same genus were grouped in one clade. All studied Colobanthus species formed a single group and C. lycopodioides was least similar to the remaining species.
Collapse
|
7
|
Plastome Structural Conservation and Evolution in the Clusioid Clade of Malpighiales. Sci Rep 2020; 10:9091. [PMID: 32499506 PMCID: PMC7272398 DOI: 10.1038/s41598-020-66024-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 05/14/2020] [Indexed: 11/17/2022] Open
Abstract
The clusioid clade of Malpighiales is comprised of five families: Bonnetiaceae, Calophyllaceae, Clusiaceae, Hypericaceae and Podostemaceae. Recent studies have found the plastome structure of Garcinia mangostana L. from Clusiaceae was conserved, while plastomes of five riverweed species from Podostemaceae showed significant structural variations. The diversification pattern of plastome structure of the clusioid clade worth a thorough investigation. Here we determined five complete plastomes representing four families of the clusioid clade. Our results found that the plastomes of the early diverged three families (Clusiaceae, Bonnetiaceae and Calophyllaceae) in the clusioid clade are relatively conserved, while the plastomes of the other two families show significant variations. The Inverted Repeat (IR) regions of Tristicha trifaria and Marathrum foeniculaceum (Podostemaceae) are greatly reduced following the loss of the ycf1 and ycf2 genes. An inversion over 50 kb spanning from trnK-UUU to rbcL in the LSC region is shared by Cratoxylum cochinchinense (Hypericaceae), T. trifaria and Ma. foeniculaceum (Podostemaceae). The large inversed colinear block in Hypericaceae and Podostemaceae contains all the genes in the 50-kb inversed colinear block in a clade of Papilionoideae, with two extra genes (trnK-UUU and matK) at one end. Another endpoint of both inversions in the two clusioids families and Papilionoideae is located between rbcL and accD. This study greatly helped to clarify the plastome evolution in the clusioid clade.
Collapse
|
8
|
Plastome sequencing of Myripnois dioica and comparison within Asteraceae. PLANT DIVERSITY 2019; 41:315-322. [PMID: 31934676 PMCID: PMC6951274 DOI: 10.1016/j.pld.2019.07.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 07/05/2019] [Accepted: 07/09/2019] [Indexed: 06/10/2023]
Abstract
Myripnois is a monotypic shrub genus in the daisy family constricted to northern China. Although wild populations of Myripnois dioica are relatively rare, this plant may potentially be cultured as a fine ornamental. In the present study, we sequenced the complete plastome of M. dioica, generating the first plastome sequences of the subfamily Pertyoideae. The plastome of M. dioica has a typical quadripartite circular structure. A large ∼20-kb and a small ∼3-kb inversion were detected in the large single copy (LSC) region and shared by other Asteraceae species. Plastome phylogenomic analyses based on 78 Asteraceae species and three outgroups revealed four groups, corresponding to four Asteraceae subfamilies: Asteroideae, Cichorioideae, Pertyoideae and Carduoideae. Among these four subfamilies, Pertyoideae is sister to Asteroideae + Cichorioideae; Carduoideae is the most basal clade. In addition, we characterized 13 simple sequence repeats (SSRs) that may be useful in future studies on population genetics.
Collapse
|
9
|
Complete chloroplast genome of seven Fritillaria species, variable DNA markers identification and phylogenetic relationships within the genus. PLoS One 2018; 13:e0194613. [PMID: 29543905 PMCID: PMC5854438 DOI: 10.1371/journal.pone.0194613] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2017] [Accepted: 03/06/2018] [Indexed: 02/07/2023] Open
Abstract
Fritillaria spp. constitute important traditional Chinese medicinal plants. Xinjiang is one of two diversity hotspots in China in which eight Fritillaria species occur, two of which are endemic to the region. Furthermore, the phylogenetic relationships of Xinjiang Fritillaria species (including F. yuminensis) within the genus are unclear. In the present study, we sequenced the chloroplast (cp) genomes of seven Fritillaria species in Xinjiang using the Illumina HiSeq platform, with the aim of assessing the global structural patterns of the seven cp genomes and identifying highly variable cp DNA sequences. These were compared to previously sequenced Fritillaria cp genomes. Phylogenetic analysis was then used to evaluate the relationships of the Xinjiang species and assess the evolution of an undivided stigma. The seven cp genomes ranged from 151,764 to 152,112 bp, presenting a traditional quadripartite structure. The gene order and gene content of the seven cp genomes were identical. A comparison of the 13 cp genomes indicated that the structure is highly conserved. Ten highly divergent regions were identified that could be valuable in phylogenetic and population genetic studies. The phylogenetic relationships of the 13 Fritillaria species inferred from the protein-coding genes, large single-copy, small single-copy, and inverted repeat regions were identical and highly resolved. The phylogenetic relationships of the species corresponded with their geographic distribution patterns, in that the north group (consisting of eight species from Xinjiang and Heilongjiang in North China) and the south group (including six species from South China) were basically divided at 40°N. Species with an undivided stigma were not monophyletic, suggesting that this trait might have evolved several times in the genus.
Collapse
|
10
|
Genus-Wide Screening Reveals Four Distinct Types of Structural Plastid Genome Organization in Pelargonium (Geraniaceae). Genome Biol Evol 2018; 9:64-76. [PMID: 28172771 PMCID: PMC5381562 DOI: 10.1093/gbe/evw271] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/10/2016] [Indexed: 12/22/2022] Open
Abstract
Geraniaceae are known for their unusual plastid genomes (plastomes), with the genus Pelargonium being most conspicuous with regard to plastome size and gene organization as judged by the sequenced plastomes of P. x hortorum and P. alternans. However, the hybrid origin of P. x hortorum and the uncertain phylogenetic position of P. alternans obscure the events that led to these extraordinary plastomes. Here, we examine all plastid reconfiguration hotspots for 60 Pelargonium species across all subgenera using a PCR and sequencing approach. Our reconstruction of the rearrangement history revealed four distinct plastome types. The ancestral plastome configuration in the two subgenera Magnipetala and Pelargonium is consistent with that of the P. alternans plastome, whereas that of the subgenus Parvulipetala deviates from this organization by one synapomorphic inversion in the trnNGUU–ndhF region. The plastome of P. x hortorum resembles those of one group of the subgenus Paucisignata, but differs from a second group by another inversion in the psaI–psaJ region. The number of microstructural changes and amount of repetitive DNA are generally elevated in all inverted regions. Nucleotide substitution rates correlate positively with the number of indels in all regions across the different subgenera. We also observed lineage- and species-specific changes in the gene content, including gene duplications and fragmentations. For example, the plastid rbcL–psaI region of Pelargonium contains a highly variable accD-like region. Our results suggest alternative evolutionary paths under possibly changing modes of plastid transmission and indicate the non-functionalization of the plastid accD gene in Pelargonium.
Collapse
|
11
|
Characterization of the complete chloroplast genome of Arabis stellari and comparisons with related species. PLoS One 2017; 12:e0183197. [PMID: 28809950 PMCID: PMC5557495 DOI: 10.1371/journal.pone.0183197] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 07/31/2017] [Indexed: 01/25/2023] Open
Abstract
Arabis stellari var. japonica is an ornamental plant of the Brassicaceae family, and is widely distributed in South Korea. However, no information is available about its molecular biology and no genomic study has been performed on A. stellari. In this paper, the authors report the complete chloroplast genome sequence of A. stellari. The plastome of A. stellari was 153,683 bp in length with 36.4% GC and included a pair of inverted repeats (IRs) of 26,423 bp that separated a large single-copy (LSC) region of 82,807 bp and a small single-copy (SSC) region of 18,030 bp. It was also found to contain 113 unique genes, of which 79 were protein-coding genes, 30 were transfer RNAs, and four were ribosomal RNAs. The gene content and organization of the A. stellari chloroplast genome were similar to those of other Brassicaceae genomes except for the absence of the rps16 protein-coding gene. A total of 991 SSRs were identified in the genome. The chloroplast genome of A. stellari was compared with closely related species of the Brassicaceae family. Comparative analysis showed a minor divergence occurred in the protein-coding matK, ycf1, ccsA, accD and rpl22 genes and that the KA/KS nucleotide substitution ratio of the ndhA genes of A. stellari and A. hirsuta was 1.35135. The genes infA and rps16 were absent in the Arabis genus and phylogenetic evolutionary studies revealed that these genes evolved independently. However, phylogenetic analysis showed that the positions of Brassicaceae species are highly conserved. The present study provides A. stellari genomic information that may be found useful in conservation and molecular phylogenetic studies on Brassicaceae.
Collapse
|
12
|
The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes. PLoS One 2017; 12:e0172924. [PMID: 28241056 PMCID: PMC5328339 DOI: 10.1371/journal.pone.0172924] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Accepted: 02/10/2017] [Indexed: 11/18/2022] Open
Abstract
The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.
Collapse
|
13
|
Phylogenetic Resolution in Juglans Based on Complete Chloroplast Genomes and Nuclear DNA Sequences. FRONTIERS IN PLANT SCIENCE 2017; 8:1148. [PMID: 28713409 PMCID: PMC5492656 DOI: 10.3389/fpls.2017.01148] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2017] [Accepted: 06/15/2017] [Indexed: 05/19/2023]
Abstract
Walnuts (Juglans of the Juglandaceae) are well-known economically important resource plants for the edible nuts, high-quality wood, and medicinal use, with a distribution from tropical to temperate zones and from Asia to Europe and Americas. There are about 21 species in Juglans. Classification of Juglans at section level is problematic, because the phylogenetic position of Juglans cinerea is disputable. Lacking morphological and DNA markers severely inhibited the development of related researches. In this study, the complete chloroplast genomes and two nuclear DNA regions (the internal transcribed spacer and ubiquitin ligase gene) of 10 representative taxa of Juglans were used for comparative genomic analyses in order to deepen the understanding on the application value of genetic information for inferring the phylogenetic relationship of the genus. The Juglans chloroplast genomes possessed the typical quadripartite structure of angiosperms, consisting of a pair of inverted repeat regions separated by a large single-copy region and a small single-copy region. All the 10 chloroplast genomes possessed 112 unique genes arranged in the same order, including 78 protein-coding, 30 tRNA, and 4 rRNA genes. A combined sequence data set from two nuclear DNA regions revealed that Juglans plants could be classified into three branches: (1) section Juglans, (2) section Cardiocaryon including J. cinerea which is closer to J. mandshurica, and (3) section Rhysocaryon. However, three branches with a different phylogenetic topology were recognized in Juglans using the complete chloroplast genome sequences: (1) section Juglans, (2) section Cardiocaryon, and (3) section Rhysocaryon plus J. cinerea. The molecular taxonomy of Juglans is almost compatible to the morphological taxonomy except J. cinerea (section Trachycaryon). Based on the complete chloroplast genome sequence data, the divergence time between section Juglans and section Cardiocaryon was 44.77 Mya, while section Rhysocaryon diverged from other sections in the genus Juglans was 47.61 Mya. Eleven of the 12 small inversions in the chloroplast genomes provided valuable phylogenetic information for classification of walnut plants at section and species levels. Our results are valuable for future studies on Juglans genetic diversity and will enhance the understanding on the phylogenetic evolution of Juglandaceae.
Collapse
|
14
|
Microsatellites for Oenothera gayleana and O. hartwegii subsp. filifolia (Onagraceae), and their utility in section Calylophus. APPLICATIONS IN PLANT SCIENCES 2016; 4:apps1500107. [PMID: 26949578 PMCID: PMC4760750 DOI: 10.3732/apps.1500107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 09/18/2015] [Accepted: 10/16/2015] [Indexed: 06/05/2023]
Abstract
PREMISE OF THE STUDY Eleven nuclear and four plastid microsatellite markers were screened for two gypsum endemic species, Oenothera gayleana and O. hartwegii subsp. filifolia, and tested for cross-amplification in the remaining 11 taxa within Oenothera sect. Calylophus (Onagraceae). METHODS AND RESULTS Microsatellite markers were tested in two to three populations spanning the ranges of both O. gayleana and O. hartwegii subsp. filifolia. The nuclear microsatellite loci consisted of both di- and trinucleotide repeats with one to 17 alleles per population. Several loci showed significant deviation from Hardy-Weinberg equilibrium, which may be evidence of chromosomal rings. The plastid microsatellite markers identified one to seven haplotypes per population. The transferability of these markers was confirmed in all 11 taxa within Oenothera sect. Calylophus. CONCLUSIONS The microsatellite loci characterized here are the first developed and tested in Oenothera sect. Calylophus. These markers will be used to assess whether pollinator foraging distance influences population genetic parameters in predictable ways.
Collapse
|
15
|
Analysis of the Complete Chloroplast Genome of a Medicinal Plant, Dianthus superbus var. longicalyncinus, from a Comparative Genomics Perspective. PLoS One 2015; 10:e0141329. [PMID: 26513163 PMCID: PMC4626046 DOI: 10.1371/journal.pone.0141329] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2015] [Accepted: 10/06/2015] [Indexed: 11/18/2022] Open
Abstract
Dianthus superbus var. longicalycinus is an economically important traditional Chinese medicinal plant that is also used for ornamental purposes. In this study, D. superbus was compared to its closely related family of Caryophyllaceae chloroplast (cp) genomes such as Lychnis chalcedonica and Spinacia oleracea. D. superbus had the longest large single copy (LSC) region (82,805 bp), with some variations in the inverted repeat region A (IRA)/LSC regions. The IRs underwent both expansion and constriction during evolution of the Caryophyllaceae family; however, intense variations were not identified. The pseudogene ribosomal protein subunit S19 (rps19) was identified at the IRA/LSC junction, but was not present in the cp genome of other Caryophyllaceae family members. The translation initiation factor IF-1 (infA) and ribosomal protein subunit L23 (rpl23) genes were absent from the Dianthus cp genome. When the cp genome of Dianthus was compared with 31 other angiosperm lineages, the infA gene was found to have been lost in most members of rosids, solanales of asterids and Lychnis of Caryophyllales, whereas rpl23 gene loss or pseudogization had occurred exclusively in Caryophyllales. Nevertheless, the cp genome of Dianthus and Spinacia has two introns in the proteolytic subunit of ATP-dependent protease (clpP) gene, but Lychnis has lost introns from the clpP gene. Furthermore, phylogenetic analysis of individual protein-coding genes infA and rpl23 revealed that gene loss or pseudogenization occurred independently in the cp genome of Dianthus. Molecular phylogenetic analysis also demonstrated a sister relationship between Dianthus and Lychnis based on 78 protein-coding sequences. The results presented herein will contribute to studies of the evolution, molecular biology and genetic engineering of the medicinal and ornamental plant, D. superbus var. longicalycinus.
Collapse
|
16
|
The Chloroplast Genome of Elaeagnus macrophylla and trnH Duplication Event in Elaeagnaceae. PLoS One 2015; 10:e0138727. [PMID: 26394223 PMCID: PMC4579063 DOI: 10.1371/journal.pone.0138727] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2015] [Accepted: 09/02/2015] [Indexed: 11/18/2022] Open
Abstract
Elaeagnaceae, which harbor nitrogen-fixing actinomycetes, is a plant family of the Rosales and sister to Rhamnaceae, Barbeyaceae and Dirachmaceae. The results of previous molecular studies have not strongly supported the families of Elaeagnaceae, Rhamnaceae, Barbeyaceae and Dirachmaceae. However, chloroplast genome studies provide valuable phylogenetic information; therefore, we determined the chloroplast genome of Elaeaganus macrophylla and compared it to that of Rosales such as IR junction and infA gene. The chloroplast genome of Elaeagnus macrophylla is 152,224 bp in length and the infA gene of E. macrophylla was psuedogenation. Phylogenetic analyses based on 79 genes in 30 species revealed that Elaeagnus was closely related to Morus. Comparison of the IR junction in six other rosids revealed that the trnH gene contained the LSC region, whereas E. macrophylla contained a trnH gene duplication in the IR region. Comparison of the LSC/IRb (JLB) and the IRa/LSC (JLA) regions of Elaeagnaceae (Elaeagnus and Shephedia) and Rhamnaceae (Rhamnus) showed that trnH gene duplication only occurred in the Elaeagnaceae. The complete chloroplast genome of Elaeagnus macrophylla provides unique characteristics in rosids. The infA gene has been lost or transferred to the nucleus in rosids, while E. macrophylla lost the infA gene. Evaluation of the chloroplast genome of Elaeagnus revealed trnH gene duplication for the first time in rosids. The availability of Elaeagnus cp genomes provides valuable information describing the relationship of Elaeagnaceae, Barbeyaceae and Dirachmaceae, IR junction that will be valuable to future systematics studies.
Collapse
|
17
|
Complete plastid genome sequence of the chickpea (Cicer arietinum) and the phylogenetic distribution of rps12 and clpP intron losses among legumes (Leguminosae). Mol Phylogenet Evol 2008; 48:1204-17. [PMID: 18638561 DOI: 10.1016/j.ympev.2008.06.013] [Citation(s) in RCA: 123] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2008] [Revised: 06/12/2008] [Accepted: 06/15/2008] [Indexed: 01/06/2023]
Abstract
Chickpea (Cicerarietinum, Leguminosae), an important grain legume, is widely used for food and fodder throughout the world. We sequenced the complete plastid genome of chickpea, which is 125,319bp in size, and contains only one copy of the inverted repeat (IR). The genome encodes 108 genes, including 4 rRNAs, 29 tRNAs, and 75 proteins. The genes rps16, infA, and ycf4 are absent in the chickpea plastid genome, and ndhB has an internal stop codon in the 5'exon, similar to other legumes. Two genes have lost their introns, one in the 3'exon of the transpliced gene rps12, and the one between exons 1 and 2 of clpP; this represents the first documented case of the loss of introns from both of these genes in the same plastid genome. An extensive phylogenetic survey of these intron losses was performed on 302 taxa across legumes and the related family Polygalaceae. The clpP intron has been lost exclusively in taxa from the temperate "IR-lacking clade" (IRLC), whereas the rps12 intron has been lost in most members of the IRLC (with the exception of Wisteria, Callerya, Afgekia, and certain species of Millettia, which represent the earliest diverging lineages of this clade), and in the tribe Desmodieae, which is closely related to the tribes Phaseoleae and Psoraleeae. Data provided here suggest that the loss of the rps12 intron occurred after the loss of the IR. The two new genomic changes identified in the present study provide additional support of the monophyly of the IR-loss clade, and resolution of the pattern of the earliest-branching lineages in this clade. The availability of the complete chickpea plastid genome sequence also provides valuable information on intergenic spacer regions among legumes and endogenous regulatory sequences for plastid genetic engineering.
Collapse
|
18
|
The complete nucleotide sequences of the five genetically distinct plastid genomes of Oenothera, subsection Oenothera: I. sequence evaluation and plastome evolution. Nucleic Acids Res 2008; 36:2366-78. [PMID: 18299283 PMCID: PMC2367718 DOI: 10.1093/nar/gkn081] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2007] [Revised: 02/01/2008] [Accepted: 02/08/2008] [Indexed: 12/02/2022] Open
Abstract
The flowering plant genus Oenothera is uniquely suited for studying molecular mechanisms of speciation. It assembles an intriguing combination of genetic features, including permanent translocation heterozygosity, biparental transmission of plastids, and a general interfertility of well-defined species. This allows an exchange of plastids and nuclei between species often resulting in plastome-genome incompatibility. For evaluation of its molecular determinants we present the complete nucleotide sequences of the five basic, genetically distinguishable plastid chromosomes of subsection Oenothera (=Euoenothera) of the genus, which are associated in distinct combinations with six basic genomes. Sizes of the chromosomes range from 163 365 bp (plastome IV) to 165 728 bp (plastome I), display between 96.3% and 98.6% sequence similarity and encode a total of 113 unique genes. Plastome diversification is caused by an abundance of nucleotide substitutions, small insertions, deletions and repetitions. The five plastomes deviate from the general ancestral design of plastid chromosomes of vascular plants by a subsection-specific 56 kb inversion within the large single-copy segment. This inversion disrupted operon structures and predates the divergence of the subsection presumably 1 My ago. Phylogenetic relationships suggest plastomes I-III in one clade, while plastome IV appears to be closest to the common ancestor.
Collapse
|
19
|
Extensive rearrangements in the chloroplast genome of Trachelium caeruleum are associated with repeats and tRNA genes. J Mol Evol 2008; 66:350-61. [PMID: 18330485 DOI: 10.1007/s00239-008-9086-4] [Citation(s) in RCA: 179] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2007] [Revised: 01/27/2008] [Accepted: 02/08/2008] [Indexed: 11/28/2022]
Abstract
Chloroplast genome organization, gene order, and content are highly conserved among land plants. We sequenced the chloroplast genome of Trachelium caeruleum L. (Campanulaceae), a member of an angiosperm family known for highly rearranged genomes. The total genome size is 162,321 bp, with an inverted repeat (IR) of 27,273 bp, large single-copy (LSC) region of 100,114 bp, and small single-copy (SSC) region of 7,661 bp. The genome encodes 112 different genes, with 17 duplicated in the IR, a tRNA gene (trnI-cau) duplicated once in the LSC region, and a protein-coding gene (psbJ) with two duplicate copies, for a total of 132 putatively intact genes. ndhK may be a pseudogene with internal stop codons, and clpP, ycf1, and ycf2 are so highly diverged that they also may be pseudogenes. ycf15, rpl23, infA, and accD are truncated and likely nonfunctional. The most conspicuous feature of the Trachelium genome is the presence of 18 internally unrearranged blocks of genes inverted or relocated within the genome relative to the ancestral gene order of angiosperm chloroplast genomes. Recombination between repeats or tRNA genes has been suggested as a mechanism of chloroplast genome rearrangements. The Trachelium chloroplast genome shares with Pelargonium and Jasminum both a higher number of repeats and larger repeated sequences in comparison to eight other angiosperm chloroplast genomes, and these are concentrated near rearrangement endpoints. Genes for tRNAs occur at many but not all inversion endpoints, so some combination of repeats and tRNA genes may have mediated these rearrangements.
Collapse
|
20
|
Dynamics and evolution of the inverted repeat-large single copy junctions in the chloroplast genomes of monocots. BMC Evol Biol 2008; 8:36. [PMID: 18237435 PMCID: PMC2275221 DOI: 10.1186/1471-2148-8-36] [Citation(s) in RCA: 270] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2007] [Accepted: 01/31/2008] [Indexed: 11/24/2022] Open
Abstract
Background Various expansions or contractions of inverted repeats (IRs) in chloroplast genomes led to fluxes in the IR-LSC (large single copy) junctions. Previous studies revealed that some monocot IRs contain a trnH-rps19 gene cluster, and it has been speculated that this may be an evidence of a duplication event prior to the divergence of monocot lineages. Therefore, we compared the organizations of genes flanking two IR-LSC junctions in 123 angiosperm representatives to uncover the evolutionary dynamics of IR-LSC junctions in basal angiosperms and monocots. Results The organizations of genes flanking IR-LSC junctions in angiosperms can be classified into three types. Generally each IR of monocots contains a trnH-rps19 gene cluster near the IR-LSC junctions, which differs from those in non-monocot angiosperms. Moreover, IRs expanded more progressively in monocots than in non-monocot angiosperms. IR-LSC junctions commonly occurred at polyA tract or A-rich regions in angiosperms. Our RT-PCR assays indicate that in monocot IRA the trnH-rps19 gene cluster is regulated by two opposing promoters, S10A and psbA. Conclusion Two hypotheses are proposed to account for the evolution of IR expansions in monocots. Based on our observations, the inclusion of a trnH-rps19 cluster in majority of monocot IRs could be reasonably explained by the hypothesis that a DSB event first occurred at IRB and led to the expansion of IRs to trnH, followed by a successive DSB event within IRA and lead to the expansion of IRs to rps19 or to rpl22 so far. This implies that the duplication of trnH-rps19 gene cluster was prior to the diversification of extant monocot lineages. The duplicated trnH genes in the IRB of most monocots and non-monocot angiosperms have distinct fates, which are likely regulated by different expression levels of S10A and S10B promoters. Further study is needed to unravel the evolutionary significance of IR expansion in more recently diverged monocots.
Collapse
|
21
|
Phylogenetic and evolutionary implications of complete chloroplast genome sequences of four early-diverging angiosperms: Buxus (Buxaceae), Chloranthus (Chloranthaceae), Dioscorea (Dioscoreaceae), and Illicium (Schisandraceae). Mol Phylogenet Evol 2007; 45:547-63. [PMID: 17644003 DOI: 10.1016/j.ympev.2007.06.004] [Citation(s) in RCA: 112] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2007] [Revised: 06/05/2007] [Accepted: 06/11/2007] [Indexed: 10/23/2022]
Abstract
We have determined the complete chloroplast genome sequences of four early-diverging lineages of angiosperms, Buxus (Buxaceae), Chloranthus (Chloranthaceae), Dioscorea (Dioscoreaceae), and Illicium (Schisandraceae), to examine the organization and evolution of plastid genomes and to estimate phylogenetic relationships among angiosperms. For the most part, the organization of these plastid genomes is quite similar to the ancestral angiosperm plastid genome with a few notable exceptions. Dioscorea has lost one protein-coding gene, rps16; this gene loss has also happened independently in four other land plant lineages, liverworts, conifers, Populus, and legumes. There has also been a small expansion of the inverted repeat (IR) in Dioscorea that has duplicated trnH-GUG. This event has also occurred multiple times in angiosperms, including in monocots, and in the two basal angiosperms Nuphar and Drimys. The Illicium chloroplast genome is unusual by having a 10 kb contraction of the IR. The four taxa sequenced represent key groups in resolving phylogenetic relationships among angiosperms. Illicium is one of the basal angiosperms in the Austrobaileyales, Chloranthus (Chloranthales) remains unplaced in angiosperm classifications, and Buxus and Dioscorea are early-diverging eudicots and monocots, respectively. We have used sequences for 61 shared protein-coding genes from these four genomes and combined them with sequences from 35 other genomes to estimate phylogenetic relationships using parsimony, likelihood, and Bayesian methods. There is strong congruence among the trees generated by the three methods, and most nodes have high levels of support. The results indicate that Amborella alone is sister to the remaining angiosperms; the Nymphaeales represent the next-diverging clade followed by Illicium; Chloranthus is sister to the magnoliids and together this group is sister to a large clade that includes eudicots and monocots; and Dioscorea represents an early-diverging lineage of monocots just internal to Acorus.
Collapse
|
22
|
Rapid evolutionary change of common bean (Phaseolus vulgaris L) plastome, and the genomic diversification of legume chloroplasts. BMC Genomics 2007; 8:228. [PMID: 17623083 PMCID: PMC1940014 DOI: 10.1186/1471-2164-8-228] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2007] [Accepted: 07/10/2007] [Indexed: 12/05/2022] Open
Abstract
Background Fabaceae (legumes) is one of the largest families of flowering plants, and some members are important crops. In contrast to what we know about their great diversity or economic importance, our knowledge at the genomic level of chloroplast genomes (cpDNAs or plastomes) for these crops is limited. Results We sequenced the complete genome of the common bean (Phaseolus vulgaris cv. Negro Jamapa) chloroplast. The plastome of P. vulgaris is a 150,285 bp circular molecule. It has gene content similar to that of other legume plastomes, but contains two pseudogenes, rpl33 and rps16. A distinct inversion occurred at the junction points of trnH-GUG/rpl14 and rps19/rps8, as in adzuki bean [1]. These two pseudogenes and the inversion were confirmed in 10 varieties representing the two domestication centers of the bean. Genomic comparative analysis indicated that inversions generally occur in legume plastomes and the magnitude and localization of insertions/deletions (indels) also vary. The analysis of repeat sequences demonstrated that patterns and sequences of tandem repeats had an important impact on sequence diversification between legume plastomes and tandem repeats did not belong to dispersed repeats. Interestingly, P. vulgaris plastome had higher evolutionary rates of change on both genomic and gene levels than G. max, which could be the consequence of pressure from both mutation and natural selection. Conclusion Legume chloroplast genomes are widely diversified in gene content, gene order, indel structure, abundance and localization of repetitive sequences, intracellular sequence exchange and evolutionary rates. The P. vulgaris plastome is a rapidly evolving genome.
Collapse
|
23
|
A comparative analysis of the Lactuca and Helianthus (Asteraceae) plastid genomes: identification of divergent regions and categorization of shared repeats. AMERICAN JOURNAL OF BOTANY 2007; 94:302-12. [PMID: 21636403 DOI: 10.3732/ajb.94.3.302] [Citation(s) in RCA: 168] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
We have sequenced two complete chloroplast genomes in the Asteraceae, Helianthus annuus (sunflower), and Lactuca sativa (lettuce), which belong to the distantly related subfamilies, Asteroideae and Cichorioideae, respectively. The Helianthus chloroplast genome is 151 104 bp and the Lactuca genome is 152 772 bp long, which is within the usual size range for chloroplast genomes in flowering plants. When compared to tobacco, both genomes have two inversions: a large 22.8-kb inversion and a smaller 3.3-kb inversion nested within it. Pairwise sequence divergence across all genes, introns, and spacers in Helianthus and Lactuca has resulted in the discovery of new, fast-evolving DNA sequences for use in species-level phylogenetics, such as the trnY-rpoB, trnL-rpl32, and ndhC-trnV spacers. Analysis and categorization of shared repeats resulted in seven classes useful for future repeat studies: double tandem repeats, three or more tandem repeats, direct repeats dispersed in the genome, repeats found in reverse complement orientation, hairpin loops, runs of A's or T's in excess of 12 bp, and gene or tRNA similarity. Results from BLAST searches of our genomic sequence against expressed sequence tag (EST) databases for both genomes produced eight likely RNA edited sites (C → U changes). These detailed analyses in Asteraceae contribute to a broader understanding of plastid evolution across flowering plants.
Collapse
|
24
|
Rosales sister to Fabales: towards resolving the rosid puzzle. Mol Phylogenet Evol 2006; 44:488-93. [PMID: 17196401 DOI: 10.1016/j.ympev.2006.11.014] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2006] [Revised: 11/02/2006] [Accepted: 11/13/2006] [Indexed: 11/26/2022]
|
25
|
Complete plastid genome sequences of Drimys, Liriodendron, and Piper: implications for the phylogenetic relationships of magnoliids. BMC Evol Biol 2006; 6:77. [PMID: 17020608 PMCID: PMC1626487 DOI: 10.1186/1471-2148-6-77] [Citation(s) in RCA: 105] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2006] [Accepted: 10/04/2006] [Indexed: 11/20/2022] Open
Abstract
Background The magnoliids with four orders, 19 families, and 8,500 species represent one of the largest clades of early diverging angiosperms. Although several recent angiosperm phylogenetic analyses supported the monophyly of magnoliids and suggested relationships among the orders, the limited number of genes examined resulted in only weak support, and these issues remain controversial. Furthermore, considerable incongruence resulted in phylogenetic reconstructions supporting three different sets of relationships among magnoliids and the two large angiosperm clades, monocots and eudicots. We sequenced the plastid genomes of three magnoliids, Drimys (Canellales), Liriodendron (Magnoliales), and Piper (Piperales), and used these data in combination with 32 other angiosperm plastid genomes to assess phylogenetic relationships among magnoliids and to examine patterns of variation of GC content. Results The Drimys, Liriodendron, and Piper plastid genomes are very similar in size at 160,604, 159,886 bp, and 160,624 bp, respectively. Gene content and order are nearly identical to many other unrearranged angiosperm plastid genomes, including Calycanthus, the other published magnoliid genome. Overall GC content ranges from 34–39%, and coding regions have a substantially higher GC content than non-coding regions. Among protein-coding genes, GC content varies by codon position with 1st codon > 2nd codon > 3rd codon, and it varies by functional group with photosynthetic genes having the highest percentage and NADH genes the lowest. Phylogenetic analyses using parsimony and likelihood methods and sequences of 61 protein-coding genes provided strong support for the monophyly of magnoliids and two strongly supported groups were identified, the Canellales/Piperales and the Laurales/Magnoliales. Strong support is reported for monocots and eudicots as sister clades with magnoliids diverging before the monocot-eudicot split. The trees also provided moderate or strong support for the position of Amborella as sister to a clade including all other angiosperms. Conclusion Evolutionary comparisons of three new magnoliid plastid genome sequences, combined with other published angiosperm genomes, confirm that GC content is unevenly distributed across the genome by location, codon position, and functional group. Furthermore, phylogenetic analyses provide the strongest support so far for the hypothesis that the magnoliids are sister to a large clade that includes both monocots and eudicots.
Collapse
|
26
|
An exceptional horizontal gene transfer in plastids: gene replacement by a distant bacterial paralog and evidence that haptophyte and cryptophyte plastids are sisters. BMC Biol 2006; 4:31. [PMID: 16956407 PMCID: PMC1570145 DOI: 10.1186/1741-7007-4-31] [Citation(s) in RCA: 121] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2006] [Accepted: 09/06/2006] [Indexed: 11/30/2022] Open
Abstract
Background Horizontal gene transfer (HGT) to the plant mitochondrial genome has recently been shown to occur at a surprisingly high rate; however, little evidence has been found for HGT to the plastid genome, despite extensive sequencing. In this study, we analyzed all genes from sequenced plastid genomes to unearth any neglected cases of HGT and to obtain a measure of the overall extent of HGT to the plastid. Results Although several genes gave strongly supported conflicting trees under certain conditions, we are confident of HGT in only a single case beyond the rubisco HGT already reported. Most of the conflicts involved near neighbors connected by long branches (e.g. red algae and their secondary hosts), where phylogenetic methods are prone to mislead. However, three genes – clpP, ycf2, and rpl36 – provided strong support for taxa moving far from their organismal position. Further taxon sampling of clpP and ycf2 resulted in rejection of HGT due to long-branch attraction and a serious error in the published plastid genome sequence of Oenothera elata, respectively. A single new case, a bacterial rpl36 gene transferred into the ancestor of the cryptophyte and haptophyte plastids, appears to be a true HGT event. Interestingly, this rpl36 gene is a distantly related paralog of the rpl36 type found in other plastids and most eubacteria. Moreover, the transferred gene has physically replaced the native rpl36 gene, yet flanking genes and intergenic regions show no sign of HGT. This suggests that gene replacement somehow occurred by recombination at the very ends of rpl36, without the level and length of similarity normally expected to support recombination. Conclusion The rpl36 HGT discovered in this study is of considerable interest in terms of both molecular mechanism and phylogeny. The plastid acquisition of a bacterial rpl36 gene via HGT provides the first strong evidence for a sister-group relationship between haptophyte and cryptophyte plastids to the exclusion of heterokont and alveolate plastids. Moreover, the bacterial gene has replaced the native plastid rpl36 gene by an uncertain mechanism that appears inconsistent with existing models for the recombinational basis of gene conversion.
Collapse
|
27
|
Complete plastid genome sequence of Daucus carota: implications for biotechnology and phylogeny of angiosperms. BMC Genomics 2006; 7:222. [PMID: 16945140 PMCID: PMC1579219 DOI: 10.1186/1471-2164-7-222] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2006] [Accepted: 08/31/2006] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Carrot (Daucus carota) is a major food crop in the US and worldwide. Its capacity for storage and its lifecycle as a biennial make it an attractive species for the introduction of foreign genes, especially for oral delivery of vaccines and other therapeutic proteins. Until recently efforts to express recombinant proteins in carrot have had limited success in terms of protein accumulation in the edible tap roots. Plastid genetic engineering offers the potential to overcome this limitation, as demonstrated by the accumulation of BADH in chromoplasts of carrot taproots to confer exceedingly high levels of salt resistance. The complete plastid genome of carrot provides essential information required for genetic engineering. Additionally, the sequence data add to the rapidly growing database of plastid genomes for assessing phylogenetic relationships among angiosperms. RESULTS The complete carrot plastid genome is 155,911 bp in length, with 115 unique genes and 21 duplicated genes within the IR. There are four ribosomal RNAs, 30 distinct tRNA genes and 18 intron-containing genes. Repeat analysis reveals 12 direct and 2 inverted repeats > or = 30 bp with a sequence identity > or = 90%. Phylogenetic analysis of nucleotide sequences for 61 protein-coding genes using both maximum parsimony (MP) and maximum likelihood (ML) were performed for 29 angiosperms. Phylogenies from both methods provide strong support for the monophyly of several major angiosperm clades, including monocots, eudicots, rosids, asterids, eurosids II, euasterids I, and euasterids II. CONCLUSION The carrot plastid genome contains a number of dispersed direct and inverted repeats scattered throughout coding and non-coding regions. This is the first sequenced plastid genome of the family Apiaceae and only the second published genome sequence of the species-rich euasterid II clade. Both MP and ML trees provide very strong support (100% bootstrap) for the sister relationship of Daucus with Panax in the euasterid II clade. These results provide the best taxon sampling of complete chloroplast genomes and the strongest support yet for the sister relationship of Caryophyllales to the asterids. The availability of the complete plastid genome sequence should facilitate improved transformation efficiency and foreign gene expression in carrot through utilization of endogenous flanking sequences and regulatory elements.
Collapse
|
28
|
A new test of phylogenetic model fitness addresses the issue of the basal angiosperm phylogeny. Gene 2006; 381:81-91. [PMID: 16959440 DOI: 10.1016/j.gene.2006.07.002] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2006] [Revised: 06/28/2006] [Accepted: 07/02/2006] [Indexed: 10/24/2022]
Abstract
We readdress the issue of phylogeny of the basal extant angiosperms employing a source previously not systematically investigated, specifically, the non-coding sequences of cpDNA. Comparison of trees with and without grasses or the outgroup (Pinus) in our analyses revealed no rearrangements in tree topology that might be expected if LBA were distorting the position of the magnoliids. For each model applied, irrespective of whether monocots or ANITA members appeared basally divergent, the orchid Phalaenopsis assumed the same position on the trees with the reduced taxon set as did the branch bearing the orchid plus the grasses in the full alignment. However, our new test of model fitness revealed a different flaw influencing the placement of monocots, which is related to model mis-specification. This flaw similarly affects the full alignment and the alignment with grasses removed. In both cases the models favouring a relatively derived position for the monocots and basal placement of the branch of Amborella plus Nymphaea provide better overall prediction of the observed data structure. In the view of apparent unsuitability of the bootstrap method for large data sets, our novel test provides a new means of exploring conflicts caused by systematic errors in phylogenetic analyses.
Collapse
|
29
|
Phylogenetic analyses of Vitis (Vitaceae) based on complete chloroplast genome sequences: effects of taxon sampling and phylogenetic methods on resolving relationships among rosids. BMC Evol Biol 2006; 6:32. [PMID: 16603088 PMCID: PMC1479384 DOI: 10.1186/1471-2148-6-32] [Citation(s) in RCA: 148] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2005] [Accepted: 04/09/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The Vitaceae (grape) is an economically important family of angiosperms whose phylogenetic placement is currently unresolved. Recent phylogenetic analyses based on one to several genes have suggested several alternative placements of this family, including sister to Caryophyllales, asterids, Saxifragales, Dilleniaceae or to rest of rosids, though support for these different results has been weak. There has been a recent interest in using complete chloroplast genome sequences for resolving phylogenetic relationships among angiosperms. These studies have clarified relationships among several major lineages but they have also emphasized the importance of taxon sampling and the effects of different phylogenetic methods for obtaining accurate phylogenies. We sequenced the complete chloroplast genome of Vitis vinifera and used these data to assess relationships among 27 angiosperms, including nine taxa of rosids. RESULTS The Vitis vinifera chloroplast genome is 160,928 bp in length, including a pair of inverted repeats of 26,358 bp that are separated by small and large single copy regions of 19,065 bp and 89,147 bp, respectively. The gene content and order of Vitis is identical to many other unrearranged angiosperm chloroplast genomes, including tobacco. Phylogenetic analyses using maximum parsimony and maximum likelihood were performed on DNA sequences of 61 protein-coding genes for two datasets with 28 or 29 taxa, including eight or nine taxa from four of the seven currently recognized major clades of rosids. Parsimony and likelihood phylogenies of both data sets provide strong support for the placement of Vitaceae as sister to the remaining rosids. However, the position of the Myrtales and support for the monophyly of the eurosid I clade differs between the two data sets and the two methods of analysis. In parsimony analyses, the inclusion of Gossypium is necessary to obtain trees that support the monophyly of the eurosid I clade. However, maximum likelihood analyses place Cucumis as sister to the Myrtales and therefore do not support the monophyly of the eurosid I clade. CONCLUSION Phylogenies based on DNA sequences from complete chloroplast genome sequences provide strong support for the position of the Vitaceae as the earliest diverging lineage of rosids. Our phylogenetic analyses support recent assertions that inadequate taxon sampling and incorrect model specification for concatenated multi-gene data sets can mislead phylogenetic inferences when using whole chloroplast genomes for phylogeny reconstruction.
Collapse
|
30
|
The chloroplast genome of Nicotiana sylvestris and Nicotiana tomentosiformis: complete sequencing confirms that the Nicotiana sylvestris progenitor is the maternal genome donor of Nicotiana tabacum. Mol Genet Genomics 2006; 275:367-73. [PMID: 16435119 DOI: 10.1007/s00438-005-0092-6] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2005] [Accepted: 12/10/2005] [Indexed: 10/25/2022]
Abstract
The tobacco cultivar Nicotiana tabacum is a natural amphidiploid that is thought to be derived from ancestors of Nicotiana sylvestris and Nicotiana tomentosiformis. To compare these chloroplast genomes, DNA was prepared from isolated chloroplasts from green leaves of N. sylvestris and N. tomentosiformis, and subjected to whole-genome shotgun sequencing. The N. sylvestris chloroplast genome comprises of 155,941 bp and shows identical gene organization with that of N. tabacum, except one ORF. Detailed comparison revealed only seven different sites between N. tabacum and N. sylvestris; three in introns, two in spacer regions and two in coding regions. The chloroplast DNA of N. tomentosiformis is 155,745 bp long and possesses also identical gene organization with that of N. tabacum, except four ORFs and one pseudogene. However, 1,194 sites differ between these two species. Compared with N. tabacum, the nucleotide substitution in the inverted repeat was much lower than that in the single-copy region. The present work confirms that the chloroplast genome from N. tabacum was derived from an ancestor of N. sylvestris, and suggests that the rate of nucleotide substitution of the chloroplast genomes from N. tabacum and N. sylvestris is very low.
Collapse
|
31
|
The complete chloroplast genome sequence of Gossypium hirsutum: organization and phylogenetic relationships to other angiosperms. BMC Genomics 2006; 7:61. [PMID: 16553962 PMCID: PMC1513215 DOI: 10.1186/1471-2164-7-61] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2005] [Accepted: 03/23/2006] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Cotton (Gossypium hirsutum) is the most important fiber crop grown in 90 countries. In 2004-2005, US farmers planted 79% of the 5.7-million hectares of nuclear transgenic cotton. Unfortunately, genetically modified cotton has the potential to hybridize with other cultivated and wild relatives, resulting in geographical restrictions to cultivation. However, chloroplast genetic engineering offers the possibility of containment because of maternal inheritance of transgenes. The complete chloroplast genome of cotton provides essential information required for genetic engineering. In addition, the sequence data were used to assess phylogenetic relationships among the major clades of rosids using cotton and 25 other completely sequenced angiosperm chloroplast genomes. RESULTS The complete cotton chloroplast genome is 160,301 bp in length, with 112 unique genes and 19 duplicated genes within the IR, containing a total of 131 genes. There are four ribosomal RNAs, 30 distinct tRNA genes and 17 intron-containing genes. The gene order in cotton is identical to that of tobacco but lacks rpl22 and infA. There are 30 direct and 24 inverted repeats 30 bp or longer with a sequence identity > or = 90%. Most of the direct repeats are within intergenic spacer regions, introns and a 72 bp-long direct repeat is within the psaA and psaB genes. Comparison of protein coding sequences with expressed sequence tags (ESTs) revealed nucleotide substitutions resulting in amino acid changes in ndhC, rpl23, rpl20, rps3 and clpP. Phylogenetic analysis of a data set including 61 protein-coding genes using both maximum likelihood and maximum parsimony were performed for 28 taxa, including cotton and five other angiosperm chloroplast genomes that were not included in any previous phylogenies. CONCLUSION Cotton chloroplast genome lacks rpl22 and infA and contains a number of dispersed direct and inverted repeats. RNA editing resulted in amino acid changes with significant impact on their hydropathy. Phylogenetic analysis provides strong support for the position of cotton in the Malvales in the eurosids II clade sister to Arabidopsis in the Brassicales. Furthermore, there is strong support for the placement of the Myrtales sister to the eurosid I clade, although expanded taxon sampling is needed to further test this relationship.
Collapse
|
32
|
The chloroplast genome of Phalaenopsis aphrodite (Orchidaceae): comparative analysis of evolutionary rate with that of grasses and its phylogenetic implications. Mol Biol Evol 2005; 23:279-91. [PMID: 16207935 DOI: 10.1093/molbev/msj029] [Citation(s) in RCA: 233] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Whether the Amborella/Amborella-Nymphaeales or the grass lineage diverged first within the angiosperms has recently been debated. Central to this issue has been focused on the artifacts that might result from sampling only grasses within the monocots. We therefore sequenced the entire chloroplast genome (cpDNA) of Phalaenopsis aphrodite, Taiwan moth orchid. The cpDNA is a circular molecule of 148,964 bp with a comparatively short single-copy region (11,543 bp) due to the unusual loss and truncation/scattered deletion of certain ndh subunits. An open reading frame, orf91, located in the complementary strand of the rrn23 was reported for the first time. A comparison of nucleotide substitutions between P. aphrodite and the grasses indicates that only the plastid expression genes have a strong positive correlation between nonsynonymous (Ka) and synonymous (Ks) substitutions per site, providing evidence for a generation time effect, mainly across these genes. Among the intron-containing protein-coding genes of the sampled monocots, the Ks of the genes are significantly correlated to transitional substitutions of their introns. We compiled a concatenated 61 protein-coding gene alignment for the available 20 cpDNAs of vascular plants and analyzed the data set using Bayesian inference, maximum parsimony, and neighbor-joining (NJ) methods. The analyses yielded robust support for the Amborella/Amborella-Nymphaeales-basal hypothesis and for the orchid and grasses together being a monophyletic group nested within the remaining angiosperms. However, the NJ analysis using Ka, the first two codon positions, or amino acid sequences, respectively, supports the monocots-basal hypothesis. We demonstrated that these conflicting angiosperm phylogenies are most probably linked to the transitional sites at all codon positions, especially at the third one where the strong base-composition bias and saturation effect take place.
Collapse
|
33
|
Identifying the basal angiosperm node in chloroplast genome phylogenies: sampling one's way out of the Felsenstein zone. Mol Biol Evol 2005; 22:1948-63. [PMID: 15944438 DOI: 10.1093/molbev/msi191] [Citation(s) in RCA: 173] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
While there has been strong support for Amborella and Nymphaeales (water lilies) as branching from basal-most nodes in the angiosperm phylogeny, this hypothesis has recently been challenged by phylogenetic analyses of 61 protein-coding genes extracted from the chloroplast genome sequences of Amborella, Nymphaea, and 12 other available land plant chloroplast genomes. These character-rich analyses placed the monocots, represented by three grasses (Poaceae), as sister to all other extant angiosperm lineages. We have extracted protein-coding regions from draft sequences for six additional chloroplast genomes to test whether this surprising result could be an artifact of long-branch attraction due to limited taxon sampling. The added taxa include three monocots (Acorus, Yucca, and Typha), a water lily (Nuphar), a ranunculid (Ranunculus), and a gymnosperm (Ginkgo). Phylogenetic analyses of the expanded DNA and protein data sets together with microstructural characters (indels) provided unambiguous support for Amborella and the Nymphaeales as branching from the basal-most nodes in the angiosperm phylogeny. However, their relative positions proved to be dependent on the method of analysis, with parsimony favoring Amborella as sister to all other angiosperms and maximum likelihood (ML) and neighbor-joining methods favoring an Amborella + Nymphaeales clade as sister. The ML phylogeny supported the later hypothesis, but the likelihood for the former hypothesis was not significantly different. Parametric bootstrap analysis, single-gene phylogenies, estimated divergence dates, and conflicting indel characters all help to illuminate the nature of the conflict in resolution of the most basal nodes in the angiosperm phylogeny. Molecular dating analyses provided median age estimates of 161 MYA for the most recent common ancestor (MRCA) of all extant angiosperms and 145 MYA for the MRCA of monocots, magnoliids, and eudicots. Whereas long sequences reduce variance in branch lengths and molecular dating estimates, the impact of improved taxon sampling on the rooting of the angiosperm phylogeny together with the results of parametric bootstrap analyses demonstrate how long-branch attraction might mislead genome-scale phylogenetic analyses.
Collapse
|
34
|
Abstract
Determining the phylogenetic relationships among the major lines of angiosperms is a long-standing problem, yet the uncertainty as to the phylogenetic affinity of these lines persists. While a number of studies have suggested that the ANITA (Amborella-Nymphaeales-Illiciales-Trimeniales-Aristolochiales) grade is basal within angiosperms, studies of complete chloroplast genome sequences also suggested an alternative tree, wherein the line leading to the grasses branches first among the angiosperms. To improve taxon sampling in the existing chloroplast genome data, we sequenced the chloroplast genome of the monocot Acorus calamus. We generated a concatenated alignment (89,436 positions for 15 taxa), encompassing almost all sequences usable for phylogeny reconstruction within spermatophytes. The data still contain support for both the ANITA-basal and grasses-basal hypotheses. Using simulations we can show that were the ANITA-basal hypothesis true, parsimony (and distance-based methods with many models) would be expected to fail to recover it. The self-evident explanation for this failure appears to be a long-branch attraction (LBA) between the clade of grasses and the out-group. However, this LBA cannot explain the discrepancies observed between tree topology recovered using the maximum likelihood (ML) method and the topologies recovered using the parsimony and distance-based methods when grasses are deleted. Furthermore, the fact that neither maximum parsimony nor distance methods consistently recover the ML tree, when according to the simulations they would be expected to, when the out-group (Pinus) is deleted, suggests that either the generating tree is not correct or the best symmetric model is misspecified (or both). We demonstrate that the tree recovered under ML is extremely sensitive to model specification and that the best symmetric model is misspecified. Hence, we remain agnostic regarding phylogenetic relationships among basal angiosperm lineages.
Collapse
|
35
|
Abstract
During the past decade, there has been a rapid increase in our understanding of plastid genome organization and evolution due to the availability of many new completely sequenced genomes. There are 45 complete genomes published and ongoing projects are likely to increase this sampling to nearly 200 genomes during the next 5 years. Several groups of researchers including ours have been developing new techniques for gathering and analyzing entire plastid genome sequences and details of these developments are summarized in this chapter. The most important developments that enhance our ability to generate whole chloroplast genome sequences involve the generation of pure fractions of chloroplast genomes by whole genome amplification using rolling circle amplification, cloning genomes into Fosmid or bacterial artificial chromosome (BAC) vectors, and the development of an organellar annotation program (Dual Organellar GenoMe Annotator [DOGMA]). In addition to providing details of these methods, we provide an overview of methods for analyzing complete plastid genome sequences for repeats and gene content, as well as approaches for using gene order and sequence data for phylogeny reconstruction. This explosive increase in the number of sequenced plastid genomes and improved computational tools will provide many insights into the evolution of these genomes and much new data for assessing relationships at deep nodes in plants and other photosynthetic organisms.
Collapse
|
36
|
The complete nucleotide sequence of wild rice (Oryza nivara) chloroplast genome: first genome wide comparative sequence analysis of wild and cultivated rice. Gene 2004; 340:133-9. [PMID: 15556301 DOI: 10.1016/j.gene.2004.06.008] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2004] [Revised: 05/15/2004] [Accepted: 06/01/2004] [Indexed: 10/26/2022]
Abstract
We determined the complete nucleotide sequence of the chloroplast genome of wild rice, Oryza nivara and compared it with the corresponding published sequence of relative cultivated rice, Oryza sativa. The genome was 134,494 bp long with a large single-copy region of 80,544 bp, a small single-copy region of 12,346 bp and two inverted repeats of 20,802 bp each. The overall A+T content was 61.0%. The O. nivara chloroplast genome encoded identical functional genes to O. sativa in the same order along the genome. On the other hand, detailed analysis revealed 57 insertion, 61 deletion and 159 base substitution events in the entire chloroplast genome of O. nivara. Among substitutions, transversions were much higher than transitions with the former even more frequent than the latter in the coding region. Most of the insertions/deletions were single-base but a few large length mutations were also detected. The frequency of insertion/deletion events was more in the coding region within inverted repeats. In contrast, a very few substitution events were identified in the coding region. Polymorphism was observed among rice cultivars at loci of large insertion/deletion events. This is the first report describing comparative and genome wide chloroplast analysis between a wild and cultivated crop.
Collapse
|
37
|
Dating the monocot-dicot divergence and the origin of core eudicots using whole chloroplast genomes. J Mol Evol 2004; 58:424-41. [PMID: 15114421 DOI: 10.1007/s00239-003-2564-9] [Citation(s) in RCA: 320] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2003] [Accepted: 10/23/2003] [Indexed: 11/30/2022]
Abstract
We estimated the dates of the monocot-dicot split and the origin of core eudicots using a large chloroplast (cp) genomic dataset. Sixty-one protein-coding genes common to the 12 completely sequenced cp genomes of land plants were concatenated and analyzed. Three reliable split events were used as calibration points and for cross references. Both the method based on the assumption of a constant rate and the Li-Tanimura unequal-rate method were used to estimate divergence times. The phylogenetic analyses indicated that nonsynonymous substitution rates of cp genomes are unequal among tracheophyte lineages. For this reason, the constant-rate method gave overestimates of the monocot-dicot divergence and the age of core eudicots, especially when fast-evolving monocots were included in the analysis. In contrast, the Li-Tanimura method gave estimates consistent with the known evolutionary sequence of seed plant lineages and with known fossil records. Combining estimates calibrated by two known fossil nodes and the Li-Tanimura method, we propose that monocots branched off from dicots 140-150 Myr ago (late Jurassic-early Cretaceous), at least 50 Myr younger than previous estimates based on the molecular clock hypothesis, and that the core eudicots diverged 100-115 Myr ago (Albian-Aptian of the Cretaceous). These estimates indicate that both the monocot-dicot divergence and the core eudicot's age are older than their respective fossil records.
Collapse
|
38
|
Dating the monocot-dicot divergence and the origin of core eudicots using whole chloroplast genomes. J Mol Evol 2004. [PMID: 15114421 DOI: 10.1007/s00239‐003‐2564‐9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We estimated the dates of the monocot-dicot split and the origin of core eudicots using a large chloroplast (cp) genomic dataset. Sixty-one protein-coding genes common to the 12 completely sequenced cp genomes of land plants were concatenated and analyzed. Three reliable split events were used as calibration points and for cross references. Both the method based on the assumption of a constant rate and the Li-Tanimura unequal-rate method were used to estimate divergence times. The phylogenetic analyses indicated that nonsynonymous substitution rates of cp genomes are unequal among tracheophyte lineages. For this reason, the constant-rate method gave overestimates of the monocot-dicot divergence and the age of core eudicots, especially when fast-evolving monocots were included in the analysis. In contrast, the Li-Tanimura method gave estimates consistent with the known evolutionary sequence of seed plant lineages and with known fossil records. Combining estimates calibrated by two known fossil nodes and the Li-Tanimura method, we propose that monocots branched off from dicots 140-150 Myr ago (late Jurassic-early Cretaceous), at least 50 Myr younger than previous estimates based on the molecular clock hypothesis, and that the core eudicots diverged 100-115 Myr ago (Albian-Aptian of the Cretaceous). These estimates indicate that both the monocot-dicot divergence and the core eudicot's age are older than their respective fossil records.
Collapse
|
39
|
Effects of selective inactivation of individual genes for low-molecular-mass subunits on the assembly of photosystem II, as revealed by chloroplast transformation: the psbEFLJoperon in Nicotiana tabacum. Mol Genet Genomics 2003; 268:699-710. [PMID: 12655396 DOI: 10.1007/s00438-002-0791-1] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2002] [Accepted: 11/14/2002] [Indexed: 12/31/2022]
Abstract
Photosystem (PSII) is a supramolecular polypeptide complex found in oxygenic photosynthetic membranes, which is capable of extracting electrons from water for the reduction of plastoquinone. An intriguing feature of this assembly is the fact that it includes more than a dozen low-mass polypeptides of generally unknown function. Using a transplastomic approach, we have individually disrupted the genes of the psbEFLJoperon in Nicotiana tabacum, which encode four such polypeptides, without impairing expression of downstream loci of the operon. All four mutants exhibited distinct phenotypes; none of them was capable of photoautotrophic growth. All mutants bleached rapidly in the light. Disruption of psbEand psbF, which code for the alpha and beta apoproteins of cytochrome b(559), abolished PSII activity, as expected; Delta psbL and Delta psbJ plants displayed residual PSII activity in young leaves. Controlled partial solubilisation of thylakoid membranes uncovered surprisingly severe impairment of PSII structure, with subunit and assembly patterns varying depending on the mutant considered. In the Delta psbL mutant PSII was assembled primarily in a monomeric form, the homodimeric form was preponderant in Delta psbJ, and, unlike the case in Delta psbZ, the thylakoids of both mutants released some PSII supercomplexes. On the other hand, Photosystem I (PSI), the cytochrome b(6)f complex, ATP synthase, LHCII, and CP24/CP26/CP29 antennae were present in near wild-type levels. The data are discussed in terms of their implications for structural, biogenetic and functional aspects of PSII.
Collapse
|
40
|
Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus. Proc Natl Acad Sci U S A 2002; 99:12246-51. [PMID: 12218172 PMCID: PMC129430 DOI: 10.1073/pnas.182432999] [Citation(s) in RCA: 741] [Impact Index Per Article: 33.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Chloroplasts were once free-living cyanobacteria that became endosymbionts, but the genomes of contemporary plastids encode only approximately 5-10% as many genes as those of their free-living cousins, indicating that many genes were either lost from plastids or transferred to the nucleus during the course of plant evolution. Previous estimates have suggested that between 800 and perhaps as many as 2,000 genes in the Arabidopsis genome might come from cyanobacteria, but genome-wide phylogenetic surveys that could provide direct estimates of this number are lacking. We compared 24,990 proteins encoded in the Arabidopsis genome to the proteins from three cyanobacterial genomes, 16 other prokaryotic reference genomes, and yeast. Of 9,368 Arabidopsis proteins sufficiently conserved for primary sequence comparison, 866 detected homologues only among cyanobacteria and 834 other branched with cyanobacterial homologues in phylogenetic trees. Extrapolating from these conserved proteins to the whole genome, the data suggest that approximately 4,500 of Arabidopsis protein-coding genes ( approximately 18% of the total) were acquired from the cyanobacterial ancestor of plastids. These proteins encompass all functional classes, and the majority of them are targeted to cell compartments other than the chloroplast. Analysis of 15 sequenced chloroplast genomes revealed 117 nuclear-encoded proteins that are also still present in at least one chloroplast genome. A phylogeny of chloroplast genomes inferred from 41 proteins and 8,303 amino acids sites indicates that at least two independent secondary endosymbiotic events have occurred involving red algae and that amino acid composition bias in chloroplast proteins strongly affects plastid genome phylogeny.
Collapse
|