1
Giorgashvili E, Reichel K, Caswara C, Kerimov V, Borsch T, Gruenstaeudl M. Software Choice and Sequencing Coverage Can Impact Plastid Genome Assembly-A Case Study in the Narrow Endemic Calligonum bakuense. Frontiers in Plant Science 2022; 13:779830. [PMID: 35874012 PMCID: PMC9296850 DOI: 10.3389/fpls.2022.779830] [Received: 09/19/2021] [Accepted: 06/13/2022] [Indexed: 06/15/2023]
Abstract
Most plastid genome sequences are assembled from short-read whole-genome sequencing data, yet the impact that sequencing coverage and the choice of assembly software can have on the accuracy of the resulting assemblies is poorly understood. In this study, we test the impact of both factors on plastid genome assembly in the threatened and rare endemic shrub Calligonum bakuense. We aim to characterize the differences across plastid genome assemblies generated by different assembly software tools and levels of sequencing coverage and to determine if these differences are large enough to affect the phylogenetic position inferred for C. bakuense compared to congeners. Four assembly software tools (FastPlast, GetOrganelle, IOGA, and NOVOPlasty) and seven levels of sequencing coverage across the plastid genome (original sequencing depth, 2,000x, 1,000x, 500x, 250x, 100x, and 50x) are compared in our analyses. The resulting assemblies are evaluated with regard to reproducibility, contig number, gene complement, inverted repeat length, and computation time; the impact of sequence differences on phylogenetic reconstruction is assessed. Our results show that software choice can have a considerable impact on the accuracy and reproducibility of plastid genome assembly and that GetOrganelle produces the most consistent assemblies for C. bakuense. Moreover, we demonstrate that a sequencing coverage between 500x and 100x can reduce both the sequence variability across assembly contigs and computation time. When comparing the most reliable plastid genome assemblies of C. bakuense, a sequence difference in only three nucleotide positions is detected, which is less than the difference potentially introduced through software choice.
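As a rough illustration of the coverage levels compared in the abstract above, the following sketch estimates a mean sequencing depth and the fraction of reads one would retain to reach each target depth. The function names, read count, read length, and genome length are hypothetical values chosen for illustration; this is not the authors' procedure.

```python
# Hypothetical helper names and input values; not taken from the study.

def mean_depth(n_reads: int, read_length: int, genome_length: int) -> float:
    """Approximate mean depth as total sequenced bases over genome length."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return n_reads * read_length / genome_length


def keep_fraction(current_depth: float, target_depth: float) -> float:
    """Fraction of reads to retain so mean depth falls to target_depth."""
    if current_depth <= 0 or target_depth <= 0:
        raise ValueError("depths must be positive")
    # Subsampling cannot raise depth, so cap the fraction at 1.0.
    return min(1.0, target_depth / current_depth)


# Target depths listed in the abstract (the "original depth" level is omitted).
TARGET_DEPTHS = [2000, 1000, 500, 250, 100, 50]

if __name__ == "__main__":
    # Assumed example: 4 million plastid-mapping reads of 150 bp, 160 kb genome.
    depth = mean_depth(4_000_000, 150, 160_000)
    for target in TARGET_DEPTHS:
        print(f"{target}x: keep {keep_fraction(depth, target):.4f} of reads")
```

The fraction is capped at 1.0 because random subsampling can only lower depth; a target above the original depth simply keeps all reads.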
Affiliation(s)
- Eka Giorgashvili
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
- Katja Reichel
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
- Calvinna Caswara
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
- Vuqar Kerimov
  - Institute of Botany, Azerbaijan National Academy of Sciences (ANAS), Baku, Azerbaijan
- Thomas Borsch
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
  - Botanischer Garten und Botanisches Museum Berlin, Freie Universität Berlin, Berlin, Germany
- Michael Gruenstaeudl
  - Systematische Botanik und Pflanzengeographie, Institut für Biologie, Freie Universität Berlin, Berlin, Germany
2
Guo M, Pang X, Xu Y, Jiang W, Liao B, Yu J, Xu J, Song J, Chen S. Plastid genome data provide new insights into the phylogeny and evolution of the genus Epimedium. J Adv Res 2022; 36:175-185. [PMID: 35127172 PMCID: PMC8799909 DOI: 10.1016/j.jare.2021.06.020] [Received: 08/14/2020] [Revised: 05/14/2021] [Accepted: 06/26/2021] [Indexed: 10/25/2022]
3
Sun J, Wang Y, Garran TA, Qiao P, Wang M, Yuan Q, Guo L, Huang L. Heterogeneous Genetic Diversity Estimation of a Promising Domestication Medicinal Motherwort Leonurus cardiaca Based on Chloroplast Genome Resources. Front Genet 2021; 12:721022. [PMID: 34603384 PMCID: PMC8479170 DOI: 10.3389/fgene.2021.721022] [Received: 06/05/2021] [Accepted: 09/01/2021] [Indexed: 11/30/2022]
Abstract
Leonurus cardiaca has a long history of use in western herbal medicine and is applied for the treatment of gynaecological conditions, anxiety, and heart diseases. Because of its botanical relationship to the primary Chinese species, L. japonicus, and extensive medical indications that go beyond the traditional indications for the Chinese species, it is a promising medicinal resource. Therefore, the features of genetic diversity and variability in the species have been prioritized. To explore these issues, we sequenced the chloroplast genomes of 22 accessions of L. cardiaca from different geographical locations worldwide using high-throughput sequencing. The results indicate that the chloroplast genomes of L. cardiaca have a typical quadripartite structure and range from 151,236 bp to 151,831 bp in size, forming eight haplotypes. The genomes all contain 114 distinct genes, including 80 protein-coding genes, 30 transfer RNA genes and four ribosomal RNA genes. Comparative analysis showed abundant diversity of single nucleotide polymorphisms (SNPs), indels, and simple sequence repeats (SSRs) among the 22 accessions. Codon usage was highly similar across the L. cardiaca accessions. The phylogenetic and network analyses indicated that the 22 accessions form four clades that are partly related to geographical distribution. In summary, our study highlights the advantage of chloroplast genomes with large data sets in intraspecific diversity evaluation and provides a new tool to facilitate medicinal plant conservation and domestication.
Affiliation(s)
- Jiahui Sun
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
- Yiheng Wang
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
- Thomas Avery Garran
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
- Ping Qiao
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
  - Academician workstation, Jiangxi University of Traditional Chinese Medicine, Nanchang, China
- Mengli Wang
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
- Qingjun Yuan
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
- Lanping Guo
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
- Luqi Huang
  - National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
4
Park J, Xi H, Son J, Shin HT, Kang H, Park S. The complete chloroplast genome of Castanopsis sieboldii (Makino) Hatus (Fagaceae). Mitochondrial DNA Part B-Resources 2021; 6:2743-2745. [PMID: 34447890 PMCID: PMC8386698 DOI: 10.1080/23802359.2021.1966339] [Indexed: 11/18/2022]
Abstract
Castanopsis sieboldii (Makino) Hatus is an evergreen tree distributed in eastern Asia, including islands of Korea and Japan. The chloroplast genome of C. sieboldii was successfully sequenced. It is 160,705 bp long (GC ratio 36.8%) and has four subregions: a large single copy region of 90,821 bp (34.6%) and a small single copy region of 19,014 bp (30.8%) are separated by inverted repeat regions of 25,075 bp (42.8%). The genome includes 134 genes (89 protein-coding genes, eight rRNAs, and 37 tRNAs). Interspecific variations of Castanopsis are at a moderate level in comparison to those of the other genera. Phylogenetic trees show that the C. sieboldii chloroplast genome clusters with those of the other two Castanopsis species.
Affiliation(s)
- Jongsun Park
  - InfoBoss Inc, Seoul, Republic of Korea
  - InfoBoss Research Center, Seoul, Republic of Korea
- Hong Xi
  - InfoBoss Inc, Seoul, Republic of Korea
  - InfoBoss Research Center, Seoul, Republic of Korea
- Janghyuk Son
  - InfoBoss Inc, Seoul, Republic of Korea
  - InfoBoss Research Center, Seoul, Republic of Korea
- Hyun Tak Shin
  - DMZ Botanic Garden, Korea National Arboretum, Yanggu, South Korea
- Hyunmi Kang
  - Department of Landscape Architecture, Mokpo National University, Muan, Republic of Korea
- Seokgon Park
  - Division of Forest Resources and Landscape Architecture, Sunchon National University, Suncheon, Republic of Korea
5
Šlenker M, Kantor A, Marhold K, Schmickl R, Mandáková T, Lysak MA, Perný M, Caboňová M, Slovák M, Zozomová-Lihová J. Allele Sorting as a Novel Approach to Resolving the Origin of Allotetraploids Using Hyb-Seq Data: A Case Study of the Balkan Mountain Endemic Cardamine barbaraeoides. Frontiers in Plant Science 2021; 12:659275. [PMID: 33995457 PMCID: PMC8115912 DOI: 10.3389/fpls.2021.659275] [Received: 01/27/2021] [Accepted: 03/10/2021] [Indexed: 05/19/2023]
Abstract
Mountains of the Balkan Peninsula are significant biodiversity hotspots with great species richness and a large proportion of narrow endemics. Processes that have driven the evolution of the rich Balkan mountain flora, however, are still insufficiently explored and understood. Here we focus on a group of Cardamine (Brassicaceae) perennials growing in wet, mainly mountainous habitats. It comprises several Mediterranean endemics, including those restricted to the Balkan Peninsula. We used target enrichment with genome skimming (Hyb-Seq) to infer their phylogenetic relationships, and, along with genomic in situ hybridization (GISH), to resolve the origin of tetraploid Cardamine barbaraeoides endemic to the Southern Pindos Mts. (Greece). We also explored the challenges of phylogenomic analyses of polyploid species and developed a new approach of allele sorting into homeologs that allows identifying subgenomes inherited from different progenitors. We obtained a robust phylogenetic reconstruction for diploids based on 1,168 low-copy nuclear genes, which suggested both allopatric and ecological speciation events. In addition, cases of plastid-nuclear discordance, in agreement with divergent nuclear ribosomal DNA (nrDNA) copy variants in some species, indicated traces of interspecific gene flow. Our results also support biogeographic links between the Balkan and Anatolian-Caucasus regions and illustrate the contribution of the latter region to high Balkan biodiversity. An allopolyploid origin was inferred for C. barbaraeoides, which highlights the role of mountains in the Balkan Peninsula both as refugia and melting pots favoring species contacts and polyploid evolution in response to Pleistocene climate-induced range dynamics. Overall, our study demonstrates the importance of a thorough phylogenomic approach when studying the evolution of recently diverged species complexes affected by reticulation events at both diploid and polyploid levels. We emphasize the significance of retrieving allelic and homeologous variation from nuclear genes, as well as multiple nrDNA copy variants from genome skim data.
Affiliation(s)
- Marek Šlenker
  - Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
  - Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Adam Kantor
  - Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
- Karol Marhold
  - Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
  - Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Roswitha Schmickl
  - Department of Botany, Faculty of Science, Charles University, Prague, Czechia
  - Institute of Botany, The Czech Academy of Sciences, Průhonice, Czechia
- Terezie Mandáková
  - Central European Institute of Technology, Masaryk University, Brno, Czechia
  - Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
- Martin A. Lysak
  - Central European Institute of Technology, Masaryk University, Brno, Czechia
  - National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno, Czechia
- Michaela Caboňová
  - Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
- Marek Slovák
  - Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
  - Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Judita Zozomová-Lihová
  - Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
6
del Valle JC, Herman JA, Whittall JB. Genome skimming and microsatellite analysis reveal contrasting patterns of genetic diversity in a rare sandhill endemic (Erysimum teretifolium, Brassicaceae). PLoS One 2020; 15:e0227523. [PMID: 32459825 PMCID: PMC7252598 DOI: 10.1371/journal.pone.0227523] [Received: 12/19/2019] [Accepted: 04/28/2020] [Indexed: 11/19/2022]
Abstract
Barriers between islands often inhibit gene flow, creating patterns of isolation by distance. In island species, the majority of genetic diversity should be distributed among isolated populations. However, a self-incompatible mating system leads to higher genetic variation within populations and very little between-population subdivision. We examine these two contrasting predictions in Erysimum teretifolium, a rare self-incompatible plant endemic to island-like sandhill habitats in Santa Cruz County, California. We used genome skimming and nuclear microsatellites to assess the distribution of genetic diversity within and among eight of the 13 remaining populations. Phylogenetic analyses of the chloroplast genomes revealed a deep separation of three of the eight populations. The nuclear ribosomal DNA cistron showed no genetic subdivision. Nuclear microsatellites suggest 83% of genetic variation resides within populations. Despite this, 18 of 28 between-population comparisons exhibited significant population structure (mean FST = 0.153). No isolation by distance existed among all populations; however, when one outlier population was removed from the analysis due to uncertain provenance, significant isolation by distance emerged (r² = 0.5611, p = 0.005). Population census size did not correlate with allelic richness as predicted on islands. Bayesian population assignment detected six genetic groupings with substantial admixture. Unique genetic clusters were concentrated at the periphery of the species’ range. Since the overall distribution of nuclear genetic diversity reflects E. teretifolium’s self-incompatible mating system, the vast majority of genetic variation could be sampled within any individual population. Yet, the chloroplast genome results suggest a deep split and some of the nuclear microsatellite analyses indicate some island-like patterns of genetic diversity. Restoration efforts intending to maximize genetic variation should include representatives from both lineages of the chloroplast genome and, for maximum nuclear genetic diversity, should include representatives of the smaller, peripheral populations.
Affiliation(s)
- José Carlos del Valle
  - Department of Molecular Biology and Biochemical Engineering, Pablo de Olavide University, Seville, Spain
- Julie A. Herman
  - Department of Biology, Santa Clara University, Santa Clara, CA, United States of America
- Justen B. Whittall
  - Department of Biology, Santa Clara University, Santa Clara, CA, United States of America