1
|
Yan Y, da Fonseca RR, Rahbek C, Borregaard MK, Davis CC. A new nuclear phylogeny of the tea family (Theaceae) unravels rapid radiations in genus Camellia. Mol Phylogenet Evol 2024; 196:108089. [PMID: 38679302 DOI: 10.1016/j.ympev.2024.108089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Revised: 03/08/2024] [Accepted: 04/25/2024] [Indexed: 05/01/2024]
Abstract
Molecular analyses of rapidly radiating groups often reveal incongruence between gene trees. This mainly results from incomplete lineage sorting, introgression, and gene tree estimation error, which complicate the estimation of phylogenetic relationships. In this study, we reconstruct the phylogeny of Theaceae using 348 nuclear loci from 68 individuals and two outgroup taxa. Sequence data were obtained by target enrichment using the recently released Angiosperm 353 universal probe set applied to herbarium specimens. The robustness of the topologies to variation in data quality was established under a range of different filtering schemes, using both coalescent and concatenation approaches. Our results confirmed most of the previously hypothesized relationships among tribes and genera, while clarifying additional interspecific relationships within the rapidly radiating genus Camellia. We recovered a remarkably high degree of gene tree heterogeneity indicative of rapid radiation in the group and observed cytonuclear conflicts, especially within Camellia. This was especially pronounced around short branches, which we primarily associate with gene tree estimation error. Our analysis also indicates that incomplete lineage sorting (ILS) contributed to gene-tree conflicts and accounted for approximately 14 % of the explained variation, whereas inferred introgression levels were low. Our study advances the understanding of the evolution of this important plant family and provides guidance on the application of target capture methods and the evaluation of key processes that influence phylogenetic discordances.
Collapse
Affiliation(s)
- Yujing Yan
- Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark; Department of Organismic and Evolutionary Biology, Harvard University Herbaria, 22 Divinity Ave, Cambridge, MA 02138, USA.
| | - Rute R da Fonseca
- Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
| | - Carsten Rahbek
- Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark; Center for Global Mountain Biodiversity, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark; Department of Life Sciences, Imperial College London, Silkwood Park campus, Ascot SL5 7PY, UK; Danish Institute for Advanced Study, University of Southern Denmark, 5230 Odense M, Denmark
| | - Michael K Borregaard
- Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
| | - Charles C Davis
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, 22 Divinity Ave, Cambridge, MA 02138, USA
| |
Collapse
|
2
|
Zhang G, Ma H. Nuclear phylogenomics of angiosperms and insights into their relationships and evolution. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2024; 66:546-578. [PMID: 38289011 DOI: 10.1111/jipb.13609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 01/03/2024] [Indexed: 02/06/2024]
Abstract
Angiosperms (flowering plants) are by far the most diverse land plant group with over 300,000 species. The sudden appearance of diverse angiosperms in the fossil record was referred to by Darwin as the "abominable mystery," hence contributing to the heightened interest in angiosperm evolution. Angiosperms display wide ranges of morphological, physiological, and ecological characters, some of which have probably influenced their species richness. The evolutionary analyses of these characteristics help to address questions of angiosperm diversification and require well resolved phylogeny. Following the great successes of phylogenetic analyses using plastid sequences, dozens to thousands of nuclear genes from next-generation sequencing have been used in angiosperm phylogenomic analyses, providing well resolved phylogenies and new insights into the evolution of angiosperms. In this review we focus on recent nuclear phylogenomic analyses of large angiosperm clades, orders, families, and subdivisions of some families and provide a summarized Nuclear Phylogenetic Tree of Angiosperm Families. The newly established nuclear phylogenetic relationships are highlighted and compared with previous phylogenetic results. The sequenced genomes of Amborella, Nymphaea, Chloranthus, Ceratophyllum, and species of monocots, Magnoliids, and basal eudicots, have facilitated the phylogenomics of relationships among five major angiosperms clades. All but one of the 64 angiosperm orders were included in nuclear phylogenomics with well resolved relationships except the placements of several orders. Most families have been included with robust and highly supported placements, especially for relationships within several large and important orders and families. Additionally, we examine the divergence time estimation and biogeographic analyses of angiosperm on the basis of the nuclear phylogenomic frameworks and discuss the differences compared with previous analyses. Furthermore, we discuss the implications of nuclear phylogenomic analyses on ancestral reconstruction of morphological, physiological, and ecological characters of angiosperm groups, limitations of current nuclear phylogenomic studies, and the taxa that require future attention.
Collapse
Affiliation(s)
- Guojin Zhang
- College of Life Sciences, Hunan Normal University, Changsha, 410081, China
- Department of Biology, 510 Mueller Laboratory, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Hong Ma
- Department of Biology, 510 Mueller Laboratory, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, 16802, USA
| |
Collapse
|
3
|
Li J, Han G, Tian X, Liang D, Zhang P. UPrimer: A Clade-Specific Primer Design Program Based on Nested-PCR Strategy and Its Applications in Amplicon Capture Phylogenomics. Mol Biol Evol 2023; 40:msad230. [PMID: 37832226 PMCID: PMC10630340 DOI: 10.1093/molbev/msad230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 09/12/2023] [Accepted: 10/09/2023] [Indexed: 10/15/2023] Open
Abstract
Amplicon capture is a promising target sequence capture approach for phylogenomic analyses, and the design of clade-specific nuclear protein-coding locus (NPCL) amplification primers is crucial for its successful application. In this study, we developed a primer design program called UPrimer that can quickly design clade-specific NPCL amplification primers based on genome data, without requiring manual intervention. Unlike other available primer design programs, UPrimer uses a nested-PCR strategy that greatly improves the amplification success rate of the designed primers. We examined all available metazoan genome data deposited in NCBI and developed NPCL primer sets for 21 metazoan groups with UPrimer, covering a wide range of taxa, including arthropods, mollusks, cnidarians, echinoderms, and vertebrates. On average, each clade-specific NPCL primer set comprises ∼1,000 NPCLs. PCR amplification tests were performed in 6 metazoan groups, and the developed primers showed a PCR success rate exceeding 95%. Furthermore, we demonstrated a phylogenetic case study in Lepidoptera, showing how NPCL primers can be used for phylogenomic analyses with amplicon capture. Our results indicated that using 100 NPCL probes recovered robust high-level phylogenetic relationships among butterflies, highlighting the utility of the newly designed NPCL primer sets for phylogenetic studies. We anticipate that the automated tool UPrimer and the developed NPCL primer sets for 21 metazoan groups will enable researchers to obtain phylogenomic data more efficiently and cost-effectively and accelerate the resolution of various parts of the Tree of Life.
Collapse
Affiliation(s)
- JiaXuan Li
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| | - GuangCheng Han
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| | - Xiao Tian
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| | - Dan Liang
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| | - Peng Zhang
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| |
Collapse
|
4
|
Yisilam G, Liu Z, Turdi R, Chu Z, Luo W, Tian X. Assembly and comparative analysis of the complete mitochondrial genome of Isopyrum anemonoides (Ranunculaceae). PLoS One 2023; 18:e0286628. [PMID: 37796878 PMCID: PMC10553351 DOI: 10.1371/journal.pone.0286628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Accepted: 05/19/2023] [Indexed: 10/07/2023] Open
Abstract
Ranunculaceae is a large family of angiosperms comprising 2500 known species-a few with medicinal and ornamental values. Despite this, only two mitochondrial genomes (mitogenomes) of the family have been released in GenBank. Isopyrum anemonoides is a medicinal plant belonging to the family Ranunculaceae, and its chloroplast genome has recently been reported; however, its mitogenome remains unexplored. In this study, we assembled and analyzed the complete mitochondrial genome of I. anemonoides and performed a comparative analysis against different Ranunculaceae species, reconstructing the phylogenetic framework of Isopyrum. The circular mitogenome of I. anemonoides has a length of 206,722 bp, with a nucleotide composition of A (26.4%), T (26.4%), C (23.6%), and G (23.6%), and contains 62 genes, comprising 37 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, and three ribosomal RNA (rRNA) genes. Abundantly interspersed repetitive and simple sequence repeat (SSR) loci were detected in the I. anemonoides mitogenome, with tetranucleotide repeats accounting for the highest proportion of SSRs. By detecting gene migration, we observed gene exchange between the chloroplast and mitogenome in I. anemonoides, including six intact tRNA genes, six PCG fragments, and fragments from two rRNA genes. Comparative mitogenome analysis of three Ranunculaceae species indicated that the PCG contents were conserved and the GC contents were similar. Selective pressure analysis revealed that only two genes (nad1 and rpl5) were under positive selection during their evolution in Ranunculales, and two specific RNA editing sites (atp6 and mttB) were detected in the I. anemonoides mitogenome. Moreover, a phylogenetic analysis based on the mitogenomes of I. anemonoides and the other 15 taxa accurately reflected the evolutionary and taxonomic status of I. anemonoides. Overall, this study provides new insights into the genetics, systematics, and evolution of mitochondrial evolution in Ranunculaceae, particularly I. anemonoides.
Collapse
Affiliation(s)
- Gulbar Yisilam
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life science and Technology, Xinjiang University, Urumqi, Xinjiang, China
| | - Zhiyou Liu
- City Management and Service Centre of Tiemenguan, Xinjiang, China
| | - Rayhangul Turdi
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life science and Technology, Xinjiang University, Urumqi, Xinjiang, China
| | - Zhenzhou Chu
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life science and Technology, Xinjiang University, Urumqi, Xinjiang, China
| | - Wei Luo
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life science and Technology, Xinjiang University, Urumqi, Xinjiang, China
| | - Xinmin Tian
- Xinjiang Key Laboratory of Biological Resources and Genetic Engineering, College of Life science and Technology, Xinjiang University, Urumqi, Xinjiang, China
- Key Laboratory of Ecology of Rare and Endangered Species and Environmental Protection, College of Life Science, Guangxi Normal University, Ministry of Education, Guilin, 541004, China
| |
Collapse
|
5
|
Benítez-Villaseñor A, Granados Mendoza C, Wanke S, Peñafiel Cevallos M, Freire ME, Lemmon EM, Lemmon AR, Magallón S. The use of Anchored Hybrid Enrichment data to resolve higher-level phylogenetic relationships: A proof-of-concept applied to Asterales (Eudicotyledoneae; Angiosperms). Mol Phylogenet Evol 2023; 181:107714. [PMID: 36708940 DOI: 10.1016/j.ympev.2023.107714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 10/28/2022] [Accepted: 01/18/2023] [Indexed: 01/26/2023]
Abstract
Anchored Hybrid Enrichment (AHE) is a tool for capturing orthologous regions of the nuclear genome shared in low or single copy across lineages. Despite the increasing number of studies using this method, its usefulness to estimate relationships at deeper taxonomic levels in plants has not been fully explored. Here we present a proof of concept about the performance of nuclear loci obtained with AHE to infer phylogenetic relationships and explore the use of gene sampling schemes to estimate divergence times in Asterales. We recovered low-copy nuclear loci using the AHE method from herbarium material and silica-preserved samples. Maximum likelihood, Bayesian inference, and coalescence approaches were used to reconstruct phylogenomic relationships. Dating analyses were conducted under a multispecies coalescent approach by jointly inferring species tree and divergence times with random gene sampling schemes and multiple calibrations. We recovered 403 low-copy nuclear loci for 63 species representing nine out of eleven families of Asterales. Phylogenetic hypotheses were congruent among the applied methods and previously published results. Analyses with concatenated datasets were strongly supported, but coalescence-based analyses showed low support for the phylogenetic position of families Argophyllaceae and Alseuosmiaceae. Estimated family ages were congruent among gene sampling schemes, with the mean age for Asterales around 130 Myr. Our study documents the usefulness of AHE for resolving phylogenetic relationships at deep phylogenetic levels in Asterales. Observed phylogenetic inconsistencies were possibly due to the non-inclusion of families Phellinceae and Pentaphragmataceae. Random gene sampling schemes produced consistent age estimates with coalescence and species tree relaxed clock approaches.
Collapse
Affiliation(s)
- Adriana Benítez-Villaseñor
- Posgrado en Ciencias Biológicas, Instituto de Biología, Universidad Nacional Autónoma de México, A. P. 70-153, C.P.04510 Ciudad de México, Mexico.
| | - Carolina Granados Mendoza
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, 3er Circuito de Ciudad Universitaria, Coyoacán, Mexico City 04510, Mexico; Institut für Botanik, Technische Universität Dresden, Zellescher Weg 20, 01217 Dresden, Germany.
| | - Stefan Wanke
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, 3er Circuito de Ciudad Universitaria, Coyoacán, Mexico City 04510, Mexico; Institut für Botanik, Technische Universität Dresden, Zellescher Weg 20, 01217 Dresden, Germany.
| | - Marcia Peñafiel Cevallos
- Herbario Nacional del Ecuador (QCNE), Instituto Nacional de Biodiversidad, Quito 170135, Ecuador.
| | - M Efraín Freire
- Herbario Nacional del Ecuador (QCNE), Instituto Nacional de Biodiversidad, Quito 170135, Ecuador.
| | - Emily Moriarty Lemmon
- Department of Biology, Florida State University 319 Stadium Drive, P.O. Box 3064295, Tallahassee, FL 32306-4295, United States.
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University 400 Dirac Science Library, Tallahassee, FL 32306-4120, United States.
| | - Susana Magallón
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, 3er Circuito de Ciudad Universitaria, Coyoacán, Mexico City 04510, Mexico.
| |
Collapse
|
6
|
Das S, Greenbaum E, Meiri S, Bauer AM, Burbrink FT, Raxworthy CJ, Weinell JL, Brown RM, Brecko J, Pauwels OSG, Rabibisoa N, Raselimanana AP, Merilä J. Ultraconserved elements-based phylogenomic systematics of the snake superfamily Elapoidea, with the description of a new Afro-Asian family. Mol Phylogenet Evol 2023; 180:107700. [PMID: 36603697 DOI: 10.1016/j.ympev.2022.107700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2022] [Revised: 12/27/2022] [Accepted: 12/29/2022] [Indexed: 01/04/2023]
Abstract
The highly diverse snake superfamily Elapoidea is considered to be a classic example of ancient, rapid radiation. Such radiations are challenging to fully resolve phylogenetically, with the highly diverse Elapoidea a case in point. Previous attempts at inferring a phylogeny of elapoids produced highly incongruent estimates of their evolutionary relationships, often with very low statistical support. We sought to resolve this situation by sequencing over 4,500 ultraconserved element loci from multiple representatives of every elapoid family/subfamily level taxon and inferring their phylogenetic relationships with multiple methods. Concatenation and multispecies coalescent based species trees yielded largely congruent and well-supported topologies. Hypotheses of a hard polytomy were not retained for any deep branches. Our phylogenies recovered Cyclocoridae and Elapidae as diverging early within Elapoidea. The Afro-Malagasy radiation of elapoid snakes, classified as multiple subfamilies of an inclusive Lamprophiidae by some earlier authors, was found to be monophyletic in all analyses. The genus Micrelaps was consistently recovered as sister to Lamprophiidae. We establish a new family, Micrelapidae fam. nov., for Micrelaps and assign Brachyophis to this family based on cranial osteological synapomorphy. We estimate that Elapoidea originated in the early Eocene and rapidly diversified into all the major lineages during this epoch. Ecological opportunities presented by the post-Cretaceous-Paleogene mass extinction event may have promoted the explosive radiation of elapoid snakes.
Collapse
Affiliation(s)
- Sunandan Das
- Ecological Genetics Research Unit, Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, FI-00014 University of Helsinki, Finland.
| | - Eli Greenbaum
- Department of Biological Sciences, University of Texas at El Paso, 500 W. University Avenue, El Paso, TX 79968, USA
| | - Shai Meiri
- School of Zoology, Tel Aviv University, Tel Aviv, Israel; The Steinhardt Museum of Natural History, Tel Aviv University, Tel Aviv, Israel
| | - Aaron M Bauer
- Department of Biology and Center for Biodiversity and Ecosystem Stewardship, Villanova University, 800 Lancaster Avenue, Villanova, PA 19085, USA
| | - Frank T Burbrink
- Department of Herpetology, American Museum of Natural History, 200 Central Park West, New York, NY 10024-5192, USA
| | - Christopher J Raxworthy
- Department of Herpetology, American Museum of Natural History, 200 Central Park West, New York, NY 10024-5192, USA
| | - Jeffrey L Weinell
- Department of Herpetology, American Museum of Natural History, 200 Central Park West, New York, NY 10024-5192, USA; Department of Ecology and Evolutionary Biology and Biodiversity Institute, University of Kansas, Lawrence, KS 66045, USA
| | - Rafe M Brown
- Department of Ecology and Evolutionary Biology and Biodiversity Institute, University of Kansas, Lawrence, KS 66045, USA
| | - Jonathan Brecko
- Royal Belgian Institute of Natural Sciences, Rue Vautier 29, B-1000 Brussels, Belgium; Royal Museum for Central Africa, Tervuren, Belgium
| | - Olivier S G Pauwels
- Royal Belgian Institute of Natural Sciences, Rue Vautier 29, B-1000 Brussels, Belgium
| | - Nirhy Rabibisoa
- Sciences de la Vie et de l'Environnement, Faculté des Sciences, de Technologies et de l'Environnement, Université de Mahajanga, Campus Universitaire d'Ambondrona, BP 652, Mahajanga 401, Madagascar
| | - Achille P Raselimanana
- Zoologie et Biodiversité Animale, Faculté des Sciences, Université d'Antananarivo, BP 906, Antananarivo 101, Madagascar
| | - Juha Merilä
- Ecological Genetics Research Unit, Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, FI-00014 University of Helsinki, Finland; Area of Ecology and Biodiversity, School of Biological Sciences, Kadoorie Biological Sciences Building, Pokfulam Road, The University of Hong Kong, Hong Kong Special Administrative Region
| |
Collapse
|
7
|
Salino A, Rouhan G, Kuo LY, Almeida TE. Editorial: Biology, systematics, and evolution of ferns and lycophytes in the omics era. FRONTIERS IN PLANT SCIENCE 2023; 14:1146829. [PMID: 36909387 PMCID: PMC9996449 DOI: 10.3389/fpls.2023.1146829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 02/14/2023] [Indexed: 06/18/2023]
Affiliation(s)
- Alexandre Salino
- Programa de Pós-Graduação em Biologia Vegetal, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Germinal Rouhan
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum national d’Histoire naturelle, Sorbonne Université, EPHE, UA, CNRS, Paris, France
| | - Li-Yaung Kuo
- Institute of Molecular & Cellular Biology, National Tsing Hua University, Hsinchu, Taiwan
| | - Thaís Elias Almeida
- Universidade Federal de Pernambuco, Departamento de Botânica, Centro de Biociências, Recife, Brazil
| |
Collapse
|
8
|
Guo C, Luo Y, Gao LM, Yi TS, Li HT, Yang JB, Li DZ. Phylogenomics and the flowering plant tree of life. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2023; 65:299-323. [PMID: 36416284 DOI: 10.1111/jipb.13415] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 11/22/2022] [Indexed: 06/16/2023]
Abstract
The advances accelerated by next-generation sequencing and long-read sequencing technologies continue to provide an impetus for plant phylogenetic study. In the past decade, a large number of phylogenetic studies adopting hundreds to thousands of genes across a wealth of clades have emerged and ushered plant phylogenetics and evolution into a new era. In the meantime, a roadmap for researchers when making decisions across different approaches for their phylogenomic research design is imminent. This review focuses on the utility of genomic data (from organelle genomes, to both reduced representation sequencing and whole-genome sequencing) in phylogenetic and evolutionary investigations, describes the baseline methodology of experimental and analytical procedures, and summarizes recent progress in flowering plant phylogenomics at the ordinal, familial, tribal, and lower levels. We also discuss the challenges, such as the adverse impact on orthology inference and phylogenetic reconstruction raised from systematic errors, and underlying biological factors, such as whole-genome duplication, hybridization/introgression, and incomplete lineage sorting, together suggesting that a bifurcating tree may not be the best model for the tree of life. Finally, we discuss promising avenues for future plant phylogenomic studies.
Collapse
Affiliation(s)
- Cen Guo
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Yang Luo
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Lian-Ming Gao
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- Lijiang Forest Diversity National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, 674100, China
| | - Ting-Shuang Yi
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Hong-Tao Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - Jun-Bo Yang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
| | - De-Zhu Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, the Chinese Academy of Sciences, Kunming, 650201, China
- Lijiang Forest Diversity National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, 674100, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650201, China
| |
Collapse
|
9
|
Zhou W, Jenny Xiang QY. Phylogenomics and Biogeography of Castanea (Chestnut) and Hamamelis (Witch-hazel) - Choosing between RAD-seq and Hyb-Seq Approaches. Mol Phylogenet Evol 2022; 176:107592. [DOI: 10.1016/j.ympev.2022.107592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 06/18/2022] [Accepted: 07/20/2022] [Indexed: 10/31/2022]
|
10
|
Liu BB, Ren C, Kwak M, Hodel RGJ, Xu C, He J, Zhou WB, Huang CH, Ma H, Qian GZ, Hong DY, Wen J. Phylogenomic conflict analyses in the apple genus Malus s.l. reveal widespread hybridization and allopolyploidy driving diversification, with insights into the complex biogeographic history in the Northern Hemisphere. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2022; 64:1020-1043. [PMID: 35274452 DOI: 10.1111/jipb.13246] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Accepted: 03/08/2022] [Indexed: 06/14/2023]
Abstract
Phylogenomic evidence from an increasing number of studies has demonstrated that different data sets and analytical approaches often reconstruct strongly supported but conflicting relationships. In this study, 785 single-copy nuclear genes and 75 complete plastomes were used to infer the phylogenetic relationships and estimate the historical biogeography of the apple genus Malus sensu lato, an economically important lineage disjunctly distributed in the Northern Hemisphere and involved in known and suspected hybridization and allopolyploidy events. The nuclear phylogeny recovered the monophyly of Malus s.l. (including Docynia); however, the genus was supported to be biphyletic in the plastid phylogeny. An ancient chloroplast capture event in the Eocene in western North America best explains the cytonuclear discordance. Our conflict analysis demonstrated that ILS, hybridization, and allopolyploidy could explain the widespread nuclear gene tree discordance. One deep hybridization event (Malus doumeri) and one recent event (Malus coronaria) were detected in Malus s.l. Furthermore, our historical biogeographic analysis integrating living and fossil data supported a widespread East Asian-western North American origin of Malus s.l. in the Eocene, followed by several extinction and dispersal events in the Northern Hemisphere. We also propose a general workflow for assessing phylogenomic discordance and biogeographic analysis using deep genome skimming data sets.
Collapse
Affiliation(s)
- Bin-Bin Liu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, the Chinese Academy of Sciences, Beijing, 100093, China
- Department of Botany, National Museum of Natural History, Smithsonian Institution, PO Box 37012, Washington, 20013-7012, DC, USA
| | - Chen Ren
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, The Chinese Academy of Sciences, Guangzhou, 510650, China
- Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, The Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Myounghai Kwak
- National Institute of Biological Resources, Incheon, 22689, South Korea
| | - Richard G J Hodel
- Department of Botany, National Museum of Natural History, Smithsonian Institution, PO Box 37012, Washington, 20013-7012, DC, USA
| | - Chao Xu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, the Chinese Academy of Sciences, Beijing, 100093, China
| | - Jian He
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, 100083, China
| | - Wen-Bin Zhou
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, 27965, NC, USA
| | - Chien-Hsun Huang
- State Key Laboratory of Genetic Engineering and Collaborative Innovation Center of Genetics and Development, Ministry of Education Key Laboratory of Biodiversity and Ecological Engineering, Institute of Plant Biology, Center of Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Hong Ma
- Department of Biology, Huck Institutes of the Life Sciences, Pennsylvania State University, 510D Mueller Laboratory, University Park, Pennsylvania, 16802, USA
| | - Guan-Ze Qian
- College of Life Sciences, Liaocheng University, Liaocheng, 252059, China
| | - De-Yuan Hong
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, the Chinese Academy of Sciences, Beijing, 100093, China
| | - Jun Wen
- Department of Botany, National Museum of Natural History, Smithsonian Institution, PO Box 37012, Washington, 20013-7012, DC, USA
| |
Collapse
|
11
|
Phylogenomics, floral evolution, and biogeography of Lithospermum L. (Boraginaceae). Mol Phylogenet Evol 2021; 166:107317. [PMID: 34547439 DOI: 10.1016/j.ympev.2021.107317] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 08/29/2021] [Accepted: 09/15/2021] [Indexed: 11/23/2022]
Abstract
Lithospermum (Boraginaceae), a geographically cosmopolitan medium-sized genus, includes diverse floral morphology, with variation in corolla size and shape and in breeding system. Over the past decade, multiple studies have examined the evolutionary history of Lithospermum, with most utilizing DNA regions from the plastid genome and/or the nuclear ribosomal internal transcribed spacer. These studies have, in general, not resulted in well-resolved and well-supported phylogenies. In the present study, 298 nuclear DNA regions, amplified via target sequence capture, were utilized for phylogenetic reconstruction for Lithospermum and relatives in Boraginaceae, and patterns of floral evolution, species diversification, and biogeography were examined. Based on multiple phylogenetic methods, Lithospermum is resolved as monophyletic, and the New World species of the genus are also monophyletic. While minimal phylogenetic incongruence is resolved within the nuclear genome, incongruence between the nuclear and plastid genomes is recovered. This is likely due to incomplete lineage sorting during early diversification of the genus in the Americas approximately 7.8 million years ago. At least four shifts to longer corollas are identified throughout Lithospermum, and this may be due to selection for hummingbird-pollinated flowers, particularly for species in Mexico and the southwestern United States. In the New World, one clade of species of the genus diversified primarily across the United States and Canada, and another radiated throughout the mountains of Mexico.
Collapse
|
12
|
Duchêne DA, Mather N, Van Der Wal C, Ho SYW. Excluding loci with substitution saturation improves inferences from phylogenomic data. Syst Biol 2021; 71:676-689. [PMID: 34508605 PMCID: PMC9016599 DOI: 10.1093/sysbio/syab075] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Accepted: 09/07/2021] [Indexed: 11/21/2022] Open
Abstract
The historical signal in nucleotide sequences becomes eroded over time by substitutions occurring repeatedly at the same sites. This phenomenon, known as substitution saturation, is recognized as one of the primary obstacles to deep-time phylogenetic inference using genome-scale data sets. We present a new test of substitution saturation and demonstrate its performance in simulated and empirical data. For some of the 36 empirical phylogenomic data sets that we examined, we detect substitution saturation in around 50% of loci. We found that saturation tends to be flagged as problematic in loci with highly discordant phylogenetic signals across sites. Within each data set, the loci with smaller numbers of informative sites are more likely to be flagged as containing problematic levels of saturation. The entropy saturation test proposed here is sensitive to high evolutionary rates relative to the evolutionary timeframe, while also being sensitive to several factors known to mislead phylogenetic inference, including short internal branches relative to external branches, short nucleotide sequences, and tree imbalance. Our study demonstrates that excluding loci with substitution saturation can be an effective means of mitigating the negative impact of multiple substitutions on phylogenetic inferences. [Phylogenetic model performance; phylogenomics; substitution model; substitution saturation; test statistics.]
Collapse
Affiliation(s)
- David A Duchêne
- Centre for Evolutionary Hologenomics, University of Copenhagen, 1352 Copenhagen, Denmark
| | - Niklas Mather
- School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia
| | - Cara Van Der Wal
- School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia
| | - Simon Y W Ho
- School of Life and Environmental Sciences, University of Sydney, Sydney, NSW 2006, Australia
| |
Collapse
|
13
|
Vankan M, Ho SYW, Duchêne DA. Evolutionary Rate Variation Among Lineages in Gene Trees has a Negative Impact on Species-Tree Inference. Syst Biol 2021; 71:490-500. [PMID: 34255084 PMCID: PMC8830059 DOI: 10.1093/sysbio/syab051] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 06/18/2021] [Indexed: 11/12/2022] Open
Abstract
Phylogenetic analyses of genomic data provide a powerful means of reconstructing the evolutionary relationships among organisms, yet such analyses are often hindered by conflicting phylogenetic signals among loci. Identifying the signals that are most influential to species-tree estimation can help to inform the choice of data for phylogenomic analysis. We investigated this in an analysis of 30 phylogenomic data sets. For each data set, we examined the association between several branch-length characteristics of gene trees and the distance between these gene trees and the corresponding species trees. We found that the distance of each gene tree to the species tree inferred from the full data set was positively associated with variation in root-to-tip distances and negatively associated with mean branch support. However, no such associations were found for gene-tree length, a measure of the overall substitution rate at each locus. We further explored the usefulness of the best-performing branch-based characteristics for selecting loci for phylogenomic analyses. We found that loci that yield gene trees with high variation in root-to-tip distances have a disproportionately distant signal of tree topology compared with the complete data sets. These results suggest that rate variation across lineages should be taken into consideration when exploring and even selecting loci for phylogenomic analysis.[Branch support; data filtering; nucleotide substitution model; phylogenomics; substitution rate; summary coalescent methods.]
Collapse
Affiliation(s)
- Mezzalina Vankan
- School of Life and Environmental Sciences, University of Sydney, NSW 2006, Australia.,Research School of Biology, Australian National University, ACT 2601, Australia
| | - Simon Y W Ho
- School of Life and Environmental Sciences, University of Sydney, NSW 2006, Australia
| | - David A Duchêne
- Research School of Biology, Australian National University, ACT 2601, Australia.,Centre for Evolutionary Hologenomics, University of Copenhagen, Copenhagen 1352, Denmark
| |
Collapse
|
14
|
Ottenlips MV, Mansfield DH, Buerki S, Feist MAE, Downie SR, Dodsworth S, Forest F, Plunkett GM, Smith JF. Resolving species boundaries in a recent radiation with the Angiosperms353 probe set: the Lomatium packardiae/L. anomalum clade of the L. triternatum (Apiaceae) complex. AMERICAN JOURNAL OF BOTANY 2021; 108:1217-1233. [PMID: 34105148 PMCID: PMC8362113 DOI: 10.1002/ajb2.1676] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 02/26/2021] [Indexed: 05/29/2023]
Abstract
PREMISE Speciation not associated with morphological shifts is challenging to detect unless molecular data are employed. Using Sanger-sequencing approaches, the Lomatium packardiae/L. anomalum subcomplex within the larger Lomatium triternatum complex could not be resolved. Therefore, we attempt to resolve these boundaries here. METHODS The Angiosperms353 probe set was employed to resolve the ambiguity within Lomatium triternatum species complex using 48 accessions assigned to L. packardiae, L. anomalum, or L. triternatum. In addition to exon data, 54 nuclear introns were extracted and were complete for all samples. Three approaches were used to estimate evolutionary relationships and define species boundaries: STACEY, a Bayesian coalescent-based species tree analysis that takes incomplete lineage sorting into account; ASTRAL-III, another coalescent-based species tree analysis; and a concatenated approach using MrBayes. Climatic factors, morphological characters, and soil variables were measured and analyzed to provide additional support for recovered groups. RESULTS The STACEY analysis recovered three major clades and seven subclades, all of which are geographically structured, and some correspond to previously named taxa. No other analysis had full agreement between recovered clades and other parameters. Climatic niche and leaflet width and length provide some predictive ability for the major clades. CONCLUSIONS The results suggest that these groups are in the process of incipient speciation and incomplete lineage sorting has been a major barrier to resolving boundaries within this lineage previously. These results are hypothesized through sequencing of multiple loci and analyzing data using coalescent-based processes.
Collapse
Affiliation(s)
| | | | - Sven Buerki
- Department of Biological SciencesBoise State UniversityBoiseID83725USA
| | | | - Stephen R. Downie
- Department of Plant BiologyUniversity of Illinois at Urbana‐ChampaignUrbanaIL61801USA
| | - Steven Dodsworth
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUK
- School of Life SciencesUniversity of BedfordshireLutonLU1 3JUUK
| | - Félix Forest
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUK
| | - Gregory M. Plunkett
- Cullman Program for Molecular SystematicsNew York Botanical Garden2900 Southern BoulevardBronxNY10458USA
| | - James F. Smith
- Department of Biological SciencesBoise State UniversityBoiseID83725USA
| |
Collapse
|
15
|
Shah T, Schneider JV, Zizka G, Maurin O, Baker W, Forest F, Brewer GE, Savolainen V, Darbyshire I, Larridon I. Joining forces in Ochnaceae phylogenomics: a tale of two targeted sequencing probe kits. AMERICAN JOURNAL OF BOTANY 2021; 108:1201-1216. [PMID: 34180046 DOI: 10.1002/ajb2.1682] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 02/23/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Both universal and family-specific targeted sequencing probe kits are becoming widely used for reconstruction of phylogenetic relationships in angiosperms. Within the pantropical Ochnaceae, we show that with careful data filtering, universal kits are equally as capable in resolving intergeneric relationships as custom probe kits. Furthermore, we show the strength in combining data from both kits to mitigate bias and provide a more robust result to resolve evolutionary relationships. METHODS We sampled 23 Ochnaceae genera and used targeted sequencing with two probe kits, the universal Angiosperms353 kit and a family-specific kit. We used maximum likelihood inference with a concatenated matrix of loci and multispecies-coalescence approaches to infer relationships in the family. We explored phylogenetic informativeness and the impact of missing data on resolution and tree support. RESULTS For the Angiosperms353 data set, the concatenation approach provided results more congruent with those of the Ochnaceae-specific data set. Filtering missing data was most impactful on the Angiosperms353 data set, with a relaxed threshold being the optimum scenario. The Ochnaceae-specific data set resolved consistent topologies using both inference methods, and no major improvements were obtained after data filtering. Merging of data obtained with the two kits resulted in a well-supported phylogenetic tree. CONCLUSIONS The Angiosperms353 data set improved upon data filtering, and missing data played an important role in phylogenetic reconstruction. The Angiosperms353 data set resolved the phylogenetic backbone of Ochnaceae as equally well as the family specific data set. All analyses indicated that both Sauvagesia L. and Campylospermum Tiegh. as currently circumscribed are polyphyletic and require revised delimitation.
Collapse
Affiliation(s)
- Toral Shah
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | - Julio V Schneider
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
| | - Georg Zizka
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
- Institute of Ecology, Evolution and Diversity, Goethe University, Max-von-Laue-Str. 13, Frankfurt am Main, 60438, Germany
| | - Olivier Maurin
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - William Baker
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Grace E Brewer
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Vincent Savolainen
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | | | - Isabel Larridon
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Systematic and Evolutionary Botany Lab, Department of Biology, Ghent University, K.L., Ledeganckstraat 35, Gent, 9000, Belgium
| |
Collapse
|
16
|
Phylogenomic reconstruction addressing the Peltigeralean backbone (Lecanoromycetes, Ascomycota). FUNGAL DIVERS 2021. [DOI: 10.1007/s13225-021-00476-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
17
|
Zhou W, Soghigian J, Xiang QYJ. A New Pipeline for Removing Paralogs in Target Enrichment Data. Syst Biol 2021; 71:410-425. [PMID: 34146111 PMCID: PMC8974407 DOI: 10.1093/sysbio/syab044] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 06/04/2021] [Accepted: 06/12/2021] [Indexed: 12/30/2022] Open
Abstract
Target enrichment (such as Hyb-Seq) is a well-established high throughput sequencing
method that has been increasingly used for phylogenomic studies. Unfortunately, current
widely used pipelines for analysis of target enrichment data do not have a vigorous
procedure to remove paralogs in target enrichment data. In this study, we develop a
pipeline we call Putative Paralogs Detection (PPD) to better address putative paralogs
from enrichment data. The new pipeline is an add-on to the existing HybPiper pipeline, and
the entire pipeline applies criteria in both sequence similarity and heterozygous sites at
each locus in the identification of paralogs. Users may adjust the thresholds of sequence
identity and heterozygous sites to identify and remove paralogs according to the level of
phylogenetic divergence of their group of interest. The new pipeline also removes highly
polymorphic sites attributed to errors in sequence assembly and gappy regions in the
alignment. We demonstrated the value of the new pipeline using empirical data generated
from Hyb-Seq and the Angiosperms353 kit for two woody genera Castanea
(Fagaceae, Fagales) and Hamamelis (Hamamelidaceae, Saxifragales).
Comparisons of data sets showed that the PPD identified many more putative paralogs than
the popular method HybPiper. Comparisons of tree topologies and divergence times showed
evident differences between data from HybPiper and data from our new PPD pipeline. We
further evaluated the accuracy and error rates of PPD by BLAST mapping of putative
paralogous and orthologous sequences to a reference genome sequence of Castanea
mollissima. Compared to HybPiper alone, PPD identified substantially more
paralogous gene sequences that mapped to multiple regions of the reference genome (31
genes for PPD compared with 4 genes for HybPiper alone). In conjunction with HybPiper,
paralogous genes identified by both pipelines can be removed resulting in the construction
of more robust orthologous gene data sets for phylogenomic and divergence time analyses.
Our study demonstrates the value of Hyb-Seq with data derived from the Angiosperms353
probe set for elucidating species relationships within a genus, and argues for the
importance of additional steps to filter paralogous genes and poorly aligned regions
(e.g., as occur through assembly errors), such as our new PPD pipeline described in this
study. [Angiosperms353; Castanea; divergence time;
Hamamelis; Hyb-Seq, paralogs, phylogenomics.]
Collapse
Affiliation(s)
- Wenbin Zhou
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27965, USA
| | - John Soghigian
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC 27965, USA
| | - Qiu-Yun Jenny Xiang
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27965, USA
| |
Collapse
|
18
|
Ren C, Wang L, Nie ZL, Johnson G, Yang QE, Wen J. Development and phylogenetic utilities of a new set of single-/low-copy nuclear genes in Senecioneae (Asteraceae), with new insights into the tribal position and the relationships within subtribe Tussilagininae. Mol Phylogenet Evol 2021; 162:107202. [PMID: 33992786 DOI: 10.1016/j.ympev.2021.107202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 04/13/2021] [Accepted: 05/06/2021] [Indexed: 11/26/2022]
Abstract
The tribe Senecioneae is one of the largest tribes in Asteraceae, with a nearly cosmopolitan distribution. Despite great efforts devoted to elucidate the evolution of Senecioneae, many questions still remain concerning the systematics of this group, from the tribal circumscription and position to species relationships in many genera. The hybridization-based target enrichment method of next-generation sequencing has been accepted as a promising approach to resolve phylogenetic problems. We herein develop a set of single-/low-copy genes for Senecioneae, and test their phylogenetic utilities. Our results demonstrate that these genes work highly efficiently for Senecioneae, with a high average gene recovery of 98.8% across the tribe and recovering robust phylogenetic hypotheses at different levels. In particular, the delimitation of the Senecioneae has been confirmed to include Abrotanella and exclude Doronicum, with the former sister to core Senecioneae and the latter shown to be more closely related to Calenduleae. Moreover, Doronicum and Calenduleae are inferred to be the closest relatives of Senecioneae, which is a new hypothesis well supported by statistical topology tests, morphological evidence, and the profile of pyrrolizidine alkaloids, a special kind of chemical characters generally used to define Senecioneae. Furthermore, this study suggests a complex reticulation history in the diversification of Senecioneae, accounting for the prevalence of polyploid groups in the tribe. With subtribe Tussilagininae s.str. as a case study showing a more evident pattern of gene duplication, we further explored reconstructing the phylogeny in the groups with high ploidy levels. Our results also demonstrate that tree topologies based on sorted paralogous copies are stable across different methods of phylogenetic inference, and more congruent with the morphological evidence and the results of previous phylogenetic studies.
Collapse
Affiliation(s)
- Chen Ren
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China; Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China
| | - Long Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China
| | - Ze-Long Nie
- Key Laboratory of Plant Resources Conservation and Utilization, College of Biology and Environmental Sciences, Jishou University, Jishou, Hunan 416000, China
| | - Gabriel Johnson
- Department of Botany, National Museum of Natural History, MRC 166, Smithsonian Institution, Washington, DC 20013-7012, USA
| | - Qin-Er Yang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China; Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China; Key Laboratory of Digital Botanical Garden of Guangdong Province, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China.
| | - Jun Wen
- Department of Botany, National Museum of Natural History, MRC 166, Smithsonian Institution, Washington, DC 20013-7012, USA.
| |
Collapse
|
19
|
Osuna-Mascaró C, Rubio de Casas R, Landis JB, Perfectti F. Genomic Resources for Erysimum spp. (Brassicaceae): Transcriptome and Chloroplast Genomes. Front Ecol Evol 2021. [DOI: 10.3389/fevo.2021.620601] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
20
|
Arcila D, Hughes LC, Meléndez-Vazquez F, Baldwin CC, White W, Carpenter K, Williams JT, Santos MD, Pogonoski J, Miya M, Ortí G, Betancur-R R. Testing the utility of alternative metrics of branch support to address the ancient evolutionary radiation of tunas, stromateoids, and allies (Teleostei: Pelagiaria). Syst Biol 2021; 70:1123-1144. [PMID: 33783539 DOI: 10.1093/sysbio/syab018] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 03/13/2021] [Indexed: 12/19/2022] Open
Abstract
The use of high-throughput sequencing technologies to produce genome-scale datasets was expected to settle some long-standing controversies across the Tree of Life, particularly in areas where short branches occur at deep timescales. Instead, these datasets have often yielded many well-supported but conflicting topologies, and highly variable gene-tree distributions. A variety of branch-support metrics beyond the nonparametric bootstrap are now available to assess how robust a phylogenetic hypothesis may be, as well as new methods to quantify gene-tree discordance. We applied multiple branch support metrics to an ancient group of marine fishes (Teleostei: Pelagiaria) whose interfamilial relationships have proven difficult to resolve due to a rapid accumulation of lineages very early in its history. We analyzed hundreds of loci including published UCE data and newly generated exonic data along with their flanking regions to represent all 16 extant families for more than 150 out of 284 valid species in the group. Branch support was lower for interfamilial relationships (except the SH-like aLRT and aBayes methods) regardless of the type of marker used. Several nodes that were highly supported with bootstrap had very low site and gene-tree concordance, revealing underlying conflict. Despite this conflict, we were able to identify four consistent interfamilial clades, each comprised of two or three families. Combining exons with their flanking regions also produced increased branch lengths in the deep branches of the pelagiarian tree. Our results demonstrate the limitations of employing current metrics of branch support and species-tree estimation when assessing the confidence of ancient evolutionary radiations and emphasize the necessity to embrace alternative measurements to explore phylogenetic uncertainty and discordance in phylogenomic datasets.
Collapse
Affiliation(s)
- Dahiana Arcila
- Department of Ichthyology, Sam Noble Oklahoma Museum of Natural History, Norman, Oklahoma, U.S.A.,Department of Biology, University of Oklahoma, Norman, Oklahoma, U.S.A
| | - Lily C Hughes
- Department of Biological Sciences, The George Washington University, Washington, District of Columbia, U.S.A.,Department of Organismal Biology and Anatomy, The University of Chicago, Illinois, Chicago, U.S.A.,Department of Vertebrate Zoology, Smithsonian Institution National Museum of Natural History, Washington, District of Columbia, U.S.A
| | - Fernando Meléndez-Vazquez
- Department of Ichthyology, Sam Noble Oklahoma Museum of Natural History, Norman, Oklahoma, U.S.A.,Department of Biology, University of Oklahoma, Norman, Oklahoma, U.S.A
| | - Carole C Baldwin
- Department of Vertebrate Zoology, Smithsonian Institution National Museum of Natural History, Washington, District of Columbia, U.S.A
| | - William White
- CSIRO Australian National Fish Collection, National Research Collections Australia, Hobart, Hobart, Tasmania, Australia
| | - Kent Carpenter
- Department of Biological Sciences, Old Dominion University, Norfolk, Virginia, U.S.A
| | - Jeffrey T Williams
- Department of Vertebrate Zoology, Smithsonian Institution National Museum of Natural History, Washington, District of Columbia, U.S.A
| | | | - John Pogonoski
- CSIRO Australian National Fish Collection, National Research Collections Australia, Hobart, Hobart, Tasmania, Australia
| | - Masaki Miya
- Natural History Museum and Institute, Chiba, Aoba-cho, Chuo-ku, Chiba, Japan
| | - Guillermo Ortí
- Department of Biological Sciences, The George Washington University, Washington, District of Columbia, U.S.A.,Department of Vertebrate Zoology, Smithsonian Institution National Museum of Natural History, Washington, District of Columbia, U.S.A
| | | |
Collapse
|
21
|
Serna-Sánchez MA, Pérez-Escobar OA, Bogarín D, Torres-Jimenez MF, Alvarez-Yela AC, Arcila-Galvis JE, Hall CF, de Barros F, Pinheiro F, Dodsworth S, Chase MW, Antonelli A, Arias T. Plastid phylogenomics resolves ambiguous relationships within the orchid family and provides a solid timeframe for biogeography and macroevolution. Sci Rep 2021; 11:6858. [PMID: 33767214 PMCID: PMC7994851 DOI: 10.1038/s41598-021-83664-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 12/30/2020] [Indexed: 11/29/2022] Open
Abstract
Recent phylogenomic analyses based on the maternally inherited plastid organelle have enlightened evolutionary relationships between the subfamilies of Orchidaceae and most of the tribes. However, uncertainty remains within several subtribes and genera for which phylogenetic relationships have not ever been tested in a phylogenomic context. To address these knowledge-gaps, we here provide the most extensively sampled analysis of the orchid family to date, based on 78 plastid coding genes representing 264 species, 117 genera, 18 tribes and 28 subtribes. Divergence times are also provided as inferred from strict and relaxed molecular clocks and birth-death tree models. Our taxon sampling includes 51 newly sequenced plastid genomes produced by a genome skimming approach. We focus our sampling efforts on previously unplaced clades within tribes Cymbidieae and Epidendreae. Our results confirmed phylogenetic relationships in Orchidaceae as recovered in previous studies, most of which were recovered with maximum support (209 of the 262 tree branches). We provide for the first time a clear phylogenetic placement for Codonorchideae within subfamily Orchidoideae, and Podochilieae and Collabieae within subfamily Epidendroideae. We also identify relationships that have been persistently problematic across multiple studies, regardless of the different details of sampling and genomic datasets used for phylogenetic reconstructions. Our study provides an expanded, robust temporal phylogenomic framework of the Orchidaceae that paves the way for biogeographical and macroevolutionary studies.
Collapse
Affiliation(s)
- Maria Alejandra Serna-Sánchez
- Laboratorio de Biología Comparativa, Corporación Para Investigaciones Biológicas (CIB), Cra. 72 A No. 78 B 141, Medellín, Colombia
- Biodiversity, Evolution and Conservation, EAFIT University, Cra. 49, No. 7 sur 50, Medellín, Colombia
| | | | - Diego Bogarín
- Jardín Botánico Lankester, Universidad de Costa Rica, P. O. Box 302-7050, Cartago, Costa Rica
- Endless Forms Group, Naturalis Biodiversity Center, P.O. Box 9517, 2300 RA, Leiden, The Netherlands
| | - María Fernanda Torres-Jimenez
- Gothenburg Global Biodiversity Centre, Department of Biological and Environmental Sciences, University of Gothenburg, 405 30, Gothenburg, Sweden
| | - Astrid Catalina Alvarez-Yela
- Centro de Bioinformática y Biología Computacional (BIOS), Ecoparque Los Yarumos Edificio BIOS, Manizales, Colombia
| | - Juliana E Arcila-Galvis
- Laboratorio de Biología Comparativa, Corporación Para Investigaciones Biológicas (CIB), Cra. 72 A No. 78 B 141, Medellín, Colombia
| | - Climbie F Hall
- Instituto de Botânica, Núcleo de Pesquisa Orquídario Do Estado, Postal 68041, São Paulo, SP, 04045-972, Brasil
| | - Fábio de Barros
- Instituto de Botânica, Núcleo de Pesquisa Orquídario Do Estado, Postal 68041, São Paulo, SP, 04045-972, Brasil
| | - Fábio Pinheiro
- Instituto de Biologia, Departamento de Biologia Vegetal, Universidade Estadual de Campinas, Campinas, SP, 13083-862, Brazil
| | - Steven Dodsworth
- School of Life Sciences, University of Bedfordshire, University Square, Luton, LU1 3JU, UK
| | | | - Alexandre Antonelli
- Royal Botanic Gardens Kew, London, TW9 3AE, UK
- Gothenburg Global Biodiversity Centre, Department of Biological and Environmental Sciences, University of Gothenburg, 405 30, Gothenburg, Sweden
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, OX1 3RB, UK
| | - Tatiana Arias
- Laboratorio de Biología Comparativa, Corporación Para Investigaciones Biológicas (CIB), Cra. 72 A No. 78 B 141, Medellín, Colombia.
- Centro de Bioinformática y Biología Computacional (BIOS), Ecoparque Los Yarumos Edificio BIOS, Manizales, Colombia.
- Tecnológico de Antioquia, Calle 78B NO. 72A - 220, Medellín, Colombia.
| |
Collapse
|
22
|
Loeuille B, Thode V, Siniscalchi C, Andrade S, Rossi M, Pirani JR. Extremely low nucleotide diversity among thirty-six new chloroplast genome sequences from Aldama (Heliantheae, Asteraceae) and comparative chloroplast genomics analyses with closely related genera. PeerJ 2021; 9:e10886. [PMID: 33665028 PMCID: PMC7912680 DOI: 10.7717/peerj.10886] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Accepted: 01/12/2021] [Indexed: 01/23/2023] Open
Abstract
Aldama (Heliantheae, Asteraceae) is a diverse genus in the sunflower family. To date, nearly 200 Asteraceae chloroplast genomes have been sequenced, but the plastomes of Aldama remain undescribed. Plastomes in Asteraceae usually show little sequence divergence, consequently, our hypothesis is that species of Aldama will be overall conserved. In this study, we newly sequenced 36 plastomes of Aldama and of five species belonging to other Heliantheae genera selected as outgroups (i.e., Dimerostemma asperatum, Helianthus tuberosus, Iostephane heterophylla, Pappobolus lanatus var. lanatus, and Tithonia diversifolia). We analyzed the structure and gene content of the assembled plastomes and performed comparative analyses within Aldama and with other closely related genera. As expected, Aldama plastomes are very conserved, with the overall gene content and orientation being similar in all studied species. The length of the plastome is also consistent and the junction between regions usually contain the same genes and have similar lengths. A large ∼20 kb and a small ∼3 kb inversion were detected in the Large Single Copy (LSC) regions of all assembled plastomes, similarly to other Asteraceae species. The nucleotide diversity is very low, with only 1,509 variable sites in 127,466 bp (i.e., 1.18% of the sites in the alignment of 36 Aldama plastomes, with one of the IRs removed, is variable). Only one gene, rbcL, shows signatures of positive selection. The plastomes of the selected outgroups feature a similar gene content and structure compared to Aldama and also present the two inversions in the LSC region. Deletions of different lengths were observed in the gene ycf2. Multiple SSRs were identified for the sequenced Aldama and outgroups. The phylogenetic analysis shows that Aldama is not monophyletic due to the position of the Mexican species A. dentata. All Brazilian species form a strongly supported clade. Our results bring new understandings into the evolution and diversity of plastomes at the species level.
Collapse
Affiliation(s)
- Benoit Loeuille
- Departamento de Botânica, Universidade Federal de Pernambuco, Recife, Pernambuco, Brazil
| | - Verônica Thode
- Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Porto Alegre, Rio Grande do Sul, Brazil
| | - Carolina Siniscalchi
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, United States of America
| | - Sonia Andrade
- Departamento de Genética e Biologia Evolutiva, Universidade de São Paulo, São Paulo, São Paulo, Brazil
| | - Magdalena Rossi
- Departamento de Botânica, Universidade de São Paulo, São Paulo, São Paulo, Brazil
| | - José Rubens Pirani
- Departamento de Botânica, Universidade de São Paulo, São Paulo, São Paulo, Brazil
| |
Collapse
|
23
|
Peakall R, Wong DCJ, Phillips RD, Ruibal M, Eyles R, Rodriguez-Delgado C, Linde CC. A multitiered sequence capture strategy spanning broad evolutionary scales: Application for phylogenetic and phylogeographic studies of orchids. Mol Ecol Resour 2021; 21:1118-1140. [PMID: 33453072 DOI: 10.1111/1755-0998.13327] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 12/22/2020] [Accepted: 01/05/2021] [Indexed: 11/30/2022]
Abstract
With over 25,000 species, the drivers of diversity in the Orchidaceae remain to be fully understood. Here, we outline a multitiered sequence capture strategy aimed at capturing hundreds of loci to enable phylogenetic resolution from subtribe to subspecific levels in orchids of the tribe Diurideae. For the probe design, we mined subsets of 18 transcriptomes, to give five target sequence sets aimed at the tribe (Sets 1 & 2), subtribe (Set 3), and within subtribe levels (Sets 4 & 5). Analysis included alternative de novo and reference-guided assembly, before target sequence extraction, annotation and alignment, and application of a homology-aware k-mer block phylogenomic approach, prior to maximum likelihood and coalescence-based phylogenetic inference. Our evaluation considered 87 taxa in two test data sets: 67 samples spanning the tribe, and 72 samples involving 24 closely related Caladenia species. The tiered design achieved high target loci recovery (>89%), with the median number of recovered loci in Sets 1-5 as follows: 212, 219, 816, 1024, and 1009, respectively. Interestingly, as a first test of the homologous k-mer approach for targeted sequence capture data, our study revealed its potential for enabling robust phylogenetic species tree inferences. Specifically, we found matching, and in one case improved phylogenetic resolution within species complexes, compared to conventional phylogenetic analysis involving target gene extraction. Our findings indicate that a customized multitiered sequence capture strategy, in combination with promising yet underutilized phylogenomic approaches, will be effective for groups where interspecific divergence is recent, but information on deeper phylogenetic relationships is also required.
Collapse
Affiliation(s)
- Rod Peakall
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Darren C J Wong
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Ryan D Phillips
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia.,Department of Ecology, Environment and Evolution, La Trobe University, Melbourne, Vic., Australia
| | - Monica Ruibal
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Rodney Eyles
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Claudia Rodriguez-Delgado
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Celeste C Linde
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| |
Collapse
|
24
|
Ogutcen E, Christe C, Nishii K, Salamin N, Möller M, Perret M. Phylogenomics of Gesneriaceae using targeted capture of nuclear genes. Mol Phylogenet Evol 2021; 157:107068. [PMID: 33422648 DOI: 10.1016/j.ympev.2021.107068] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 12/30/2020] [Accepted: 01/04/2021] [Indexed: 01/07/2023]
Abstract
Gesneriaceae (ca. 3400 species) is a pantropical plant family with a wide range of growth form and floral morphology that are associated with repeated adaptations to different environments and pollinators. Although Gesneriaceae systematics has been largely improved by the use of Sanger sequencing data, our understanding of the evolutionary history of the group is still far from complete due to the limited number of informative characters provided by this type of data. To overcome this limitation, we developed here a Gesneriaceae-specific gene capture kit targeting 830 single-copy loci (776,754 bp in total), including 279 genes from the Universal Angiosperms-353 kit. With an average of 557,600 reads and 87.8% gene recovery, our target capture was successful across the family Gesneriaceae and also in other families of Lamiales. From our bait set, we selected the most informative 418 loci to resolve phylogenetic relationships across the entire Gesneriaceae family using maximum likelihood and coalescent-based methods. Upon testing the phylogenetic performance of our baits on 78 taxa representing 20 out of 24 subtribes within the family, we showed that our data provided high support for the phylogenetic relationships among the major lineages, and were able to provide high resolution within more recent radiations. Overall, the molecular resources we developed here open new perspectives for the study of Gesneriaceae phylogeny at different taxonomical levels and the identification of the factors underlying the diversification of this plant group.
Collapse
Affiliation(s)
- Ezgi Ogutcen
- Conservatoire et Jardin botaniques de la Ville de Genève and Department of Botany and Plant Biology, University of Geneva, 1292 Chambésy, Switzerland
| | - Camille Christe
- Conservatoire et Jardin botaniques de la Ville de Genève and Department of Botany and Plant Biology, University of Geneva, 1292 Chambésy, Switzerland
| | - Kanae Nishii
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh EH3 5LR, Scotland, UK; Kanagawa University, 2946, Tsuchiya, Hiratsuka-shi, Kanagawa 259-1293, Japan
| | - Nicolas Salamin
- Department of Computational Biology, University of Lausanne, 1015 Lausanne, Switzerland
| | - Michael Möller
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh EH3 5LR, Scotland, UK
| | - Mathieu Perret
- Conservatoire et Jardin botaniques de la Ville de Genève and Department of Botany and Plant Biology, University of Geneva, 1292 Chambésy, Switzerland.
| |
Collapse
|
25
|
Banker SE, Lemmon AR, Hassinger AB, Dye M, Holland SD, Kortyna ML, Ospina OE, Ralicki H, Lemmon EM. Hierarchical Hybrid Enrichment: Multitiered Genomic Data Collection Across Evolutionary Scales, With Application to Chorus Frogs (Pseudacris). Syst Biol 2021; 69:756-773. [PMID: 31886503 PMCID: PMC7302053 DOI: 10.1093/sysbio/syz074] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2018] [Revised: 10/26/2019] [Accepted: 11/04/2019] [Indexed: 01/25/2023] Open
Abstract
Determining the optimal targets of genomic subsampling for phylogenomics, phylogeography, and population genomics remains a challenge for evolutionary biologists. Of the available methods for subsampling the genome, hybrid enrichment (sequence capture) has become one of the primary means of data collection for systematics, due to the flexibility and cost efficiency of this approach. Despite the utility of this method, information is lacking as to what genomic targets are most appropriate for addressing questions at different evolutionary scales. In this study, first, we compare the benefits of target loci developed for deep- and shallow scales by comparing these loci at each of three taxonomic levels: within a genus (phylogenetics), within a species (phylogeography), and within a hybrid zone (population genomics). Specifically, we target evolutionarily conserved loci that are appropriate for deeper phylogenetic scales and more rapidly evolving loci that are informative for phylogeographic and population genomic scales. Second, we assess the efficacy of targeting multiple-locus sets for different taxonomic levels in the same hybrid enrichment reaction, an approach we term hierarchical hybrid enrichment. Third, we apply this approach to the North American chorus frog genus Pseudacris to answer key evolutionary questions across taxonomic and temporal scales. We demonstrate that in this system the type of genomic target that produces the most resolved gene trees differs depending on the taxonomic level, although the potential for error is substantially lower for the deep-scale loci at all levels. We successfully recover data for the two different locus sets with high efficiency. Using hierarchical data targeting deep and shallow levels: we 1) resolve the phylogeny of the genus Pseudacris and introduce a novel visual and hypothesis testing method that uses nodal heat maps to examine the robustness of branch support values to the removal of sites and loci; 2) estimate the phylogeographic history of Pseudacris feriarum, which reveals up to five independent invasions leading to sympatry with congener Pseudacris nigrita to form replicated reinforcement contact zones with ongoing gene flow into sympatry; and 3) quantify with high confidence the frequency of hybridization in one of these zones between P. feriarum and P. nigrita, which is lower than microsatellite-based estimates. We find that the hierarchical hybrid enrichment approach offers an efficient, multitiered data collection method for simultaneously addressing questions spanning multiple evolutionary scales. [Anchored hybrid enrichment; heat map; hybridization; phylogenetics; phylogeography; population genomics; reinforcement; reproductive character displacement.].
Collapse
Affiliation(s)
- Sarah E Banker
- Department of Biological Science, Florida State University, 319 Stadium Drive, Tallahassee, FL 32306, USA.,Department of Integrative Biology, University of California, Berkeley, #3160 Berkeley, CA 94720-3160, USA
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University 400 Dirac Science Library, Tallahassee, FL 32306, USA
| | - Alyssa Bigelow Hassinger
- Department of Biological Science, Florida State University, 319 Stadium Drive, Tallahassee, FL 32306, USA.,Department of Evolution, Ecology, and Organismal Biology, Ohio State University, 318 W. 12th Avenue, 300 Aronoff Laboratory, Columbus, OH 43210, USA
| | - Mysia Dye
- Department of Biological Science, Florida State University, 319 Stadium Drive, Tallahassee, FL 32306, USA
| | - Sean D Holland
- Department of Biological Science, Florida State University, 319 Stadium Drive, Tallahassee, FL 32306, USA
| | - Michelle L Kortyna
- Department of Biological Science, Florida State University, 319 Stadium Drive, Tallahassee, FL 32306, USA
| | - Oscar E Ospina
- Department of Biological Science, Florida State University, 319 Stadium Drive, Tallahassee, FL 32306, USA
| | - Hannah Ralicki
- Department of Biological Science, Florida State University, 319 Stadium Drive, Tallahassee, FL 32306, USA.,Department of Biological Science, University of Connecticut, 91 North Eagleville Road, Storrs, CT 06268, USA
| | - Emily Moriarty Lemmon
- Department of Biological Science, Florida State University, 319 Stadium Drive, Tallahassee, FL 32306, USA
| |
Collapse
|
26
|
Orr RJS, Sannum MM, Boessenkool S, Di Martino E, Gordon DP, Mello HL, Obst M, Ramsfjell MH, Smith AM, Liow LH. A molecular phylogeny of historical and contemporary specimens of an under-studied micro-invertebrate group. Ecol Evol 2021; 11:309-320. [PMID: 33437431 PMCID: PMC7790615 DOI: 10.1002/ece3.7042] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Revised: 10/08/2020] [Accepted: 10/28/2020] [Indexed: 11/06/2022] Open
Abstract
Resolution of relationships at lower taxonomic levels is crucial for answering many evolutionary questions, and as such, sufficiently varied species representation is vital. This latter goal is not always achievable with relatively fresh samples. To alleviate the difficulties in procuring rarer taxa, we have seen increasing utilization of historical specimens in building molecular phylogenies using high throughput sequencing. This effort, however, has mainly focused on large-bodied or well-studied groups, with small-bodied and under-studied taxa under-prioritized. Here, we utilize both historical and contemporary specimens, to increase the resolution of phylogenetic relationships among a group of under-studied and small-bodied metazoans, namely, cheilostome bryozoans. In this study, we pioneer the sequencing of air-dried cheilostomes, utilizing a recently developed library preparation method for low DNA input. We evaluate a de novo mitogenome assembly and two iterative methods, using the sequenced target specimen as a reference for mapping, for our sequences. In doing so, we present mitochondrial and ribosomal RNA sequences of 43 cheilostomes representing 37 species, including 14 from historical samples ranging from 50 to 149 years old. The inferred phylogenetic relationships of these samples, analyzed together with publicly available sequence data, are shown in a statistically well-supported 65 taxa and 17 genes cheilostome tree, which is also the most broadly sampled and largest to date. The robust phylogenetic placement of historical samples whose contemporary conspecifics and/or congenerics have been sequenced verifies the appropriateness of our workflow and gives confidence in the phylogenetic placement of those historical samples for which there are no close relatives sequenced. The success of our workflow is highlighted by the circularization of a total of 27 mitogenomes, seven from historical cheilostome samples. Our study highlights the potential of utilizing DNA from micro-invertebrate specimens stored in natural history collections for resolving phylogenetic relationships among species.
Collapse
Affiliation(s)
| | | | - Sanne Boessenkool
- Department of BiosciencesCentre for Ecological and Evolutionary SynthesisUniversity of OsloOsloNorway
| | | | - Dennis P. Gordon
- National Institute of Water and Atmospheric ResearchWellingtonNew Zealand
| | - Hannah L. Mello
- Department of Marine ScienceUniversity of OtagoDunedinNew Zealand
| | - Matthias Obst
- Department of Marine SciencesUniversity of GothenburgGothenburgSweden
| | | | - Abigail M. Smith
- Department of Marine ScienceUniversity of OtagoDunedinNew Zealand
| | - Lee Hsiang Liow
- Natural History MuseumUniversity of OsloOsloNorway
- Department of BiosciencesCentre for Ecological and Evolutionary SynthesisUniversity of OsloOsloNorway
| |
Collapse
|
27
|
Chan KO, Hutter CR, Wood PL, Grismer LL, Brown RM. Target-capture phylogenomics provide insights on gene and species tree discordances in Old World treefrogs (Anura: Rhacophoridae). Proc Biol Sci 2020; 287:20202102. [PMID: 33290680 PMCID: PMC7739936 DOI: 10.1098/rspb.2020.2102] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 11/13/2020] [Indexed: 11/12/2022] Open
Abstract
Genome-scale data have greatly facilitated the resolution of recalcitrant nodes that Sanger-based datasets have been unable to resolve. However, phylogenomic studies continue to use traditional methods such as bootstrapping to estimate branch support; and high bootstrap values are still interpreted as providing strong support for the correct topology. Furthermore, relatively little attention has been given to assessing discordances between gene and species trees, and the underlying processes that produce phylogenetic conflict. We generated novel genomic datasets to characterize and determine the causes of discordance in Old World treefrogs (Family: Rhacophoridae)-a group that is fraught with conflicting and poorly supported topologies among major clades. Additionally, a suite of data filtering strategies and analytical methods were applied to assess their impact on phylogenetic inference. We showed that incomplete lineage sorting was detected at all nodes that exhibited high levels of discordance. Those nodes were also associated with extremely short internal branches. We also clearly demonstrate that bootstrap values do not reflect uncertainty or confidence for the correct topology and, hence, should not be used as a measure of branch support in phylogenomic datasets. Overall, we showed that phylogenetic discordances in Old World treefrogs resulted from incomplete lineage sorting and that species tree inference can be improved using a multi-faceted, total-evidence approach, which uses the most amount of data and considers results from different analytical methods and datasets.
Collapse
Affiliation(s)
- Kin Onn Chan
- Lee Kong Chian Natural History Museum, National University of Singapore, 2 Conservatory Drive, Singapore 117377, Republic of Singapore
| | - Carl R. Hutter
- Museum of Natural Sciences and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Perry L. Wood
- Department of Biological Sciences and Museum of Natural History, Auburn University, Auburn, AL 36849, USA
| | - L. Lee Grismer
- Herpetology Laboratory, Department of Biology, La Sierra University, Riverside, CA 92505, USA
| | - Rafe M. Brown
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA
| |
Collapse
|
28
|
Straub SCK, Boutte J, Fishbein M, Livshultz T. Enabling evolutionary studies at multiple scales in Apocynaceae through Hyb-Seq. APPLICATIONS IN PLANT SCIENCES 2020; 8:e11400. [PMID: 33304663 PMCID: PMC7705337 DOI: 10.1002/aps3.11400] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 09/12/2020] [Indexed: 05/07/2023]
Abstract
PREMISE Apocynaceae is the 10th largest flowering plant family and a focus for study of plant-insect interactions, especially as mediated by secondary metabolites. However, it has few genomic resources relative to its size. Target capture sequencing is a powerful approach for genome reduction that facilitates studies requiring data from the nuclear genome in non-model taxa, such as Apocynaceae. METHODS Transcriptomes were used to design probes for targeted sequencing of putatively single-copy nuclear genes across Apocynaceae. The sequences obtained were used to assess the success of the probe design, the intrageneric and intraspecific variation in the targeted genes, and the utility of the genes for inferring phylogeny. RESULTS From 853 candidate nuclear genes, 835 were consistently recovered in single copy and were variable enough for phylogenomics. The inferred gene trees were useful for coalescent-based species tree analysis, which showed all subfamilies of Apocynaceae as monophyletic, while also resolving relationships among species within the genus Apocynum. Intraspecific comparison of Elytropus chilensis individuals revealed numerous single-nucleotide polymorphisms with potential for use in population-level studies. DISCUSSION Community use of this Hyb-Seq probe set will facilitate and promote progress in the study of Apocynaceae across scales from population genomics to phylogenomics.
Collapse
Affiliation(s)
- Shannon C. K. Straub
- Department of BiologyHobart and William Smith Colleges300 Pulteney StreetGenevaNew York14456USA
| | - Julien Boutte
- Department of BiologyHobart and William Smith Colleges300 Pulteney StreetGenevaNew York14456USA
| | - Mark Fishbein
- Department of Plant Biology, Ecology, and EvolutionOklahoma State University301 Physical SciencesStillwaterOklahoma74078USA
| | - Tatyana Livshultz
- Department of Biodiversity, Earth, and Environmental Sciences and the Academy of Natural SciencesDrexel University1900 Benjamin Franklin ParkwayPhiladelphiaPennsylvania19103USA
| |
Collapse
|
29
|
Gonçalves DJP, Jansen RK, Ruhlman TA, Mandel JR. Under the rug: Abandoning persistent misconceptions that obfuscate organelle evolution. Mol Phylogenet Evol 2020; 151:106903. [PMID: 32628998 DOI: 10.1016/j.ympev.2020.106903] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 06/29/2020] [Indexed: 02/01/2023]
Abstract
The advent and advance of next generation sequencing over the past two decades made it possible to accumulate large quantities of sequence reads that could be used to assemble complete or nearly complete organelle genomes (plastome or mitogenome). The result has been an explosive increase in the availability of organelle genome sequences with over 4000 different species of green plants currently available on GenBank. During the same time period, plant molecular biologists greatly enhanced the understanding of the structure, repair, replication, recombination, transcription and translation, and inheritance of organelle DNA. Unfortunately many plant evolutionary biologists are unaware of or have overlooked this knowledge, resulting in misrepresentation of several phenomena that are critical for phylogenetic and evolutionary studies using organelle genomes. We believe that confronting these misconceptions about organelle genome organization, composition, and inheritance will improve our understanding of the evolutionary processes that underly organelle evolution. Here we discuss four misconceptions that can limit evolutionary biology studies and lead to inaccurate phylogenies and incorrect structure of the organellar DNA used to infer organelle evolution.
Collapse
Affiliation(s)
- Deise J P Gonçalves
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA; Department of Integrative Biology, The University of Texas at Austin, Austin, TX 78713, USA.
| | - Robert K Jansen
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX 78713, USA; Center of Excellence for Bionanoscience Research, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Tracey A Ruhlman
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX 78713, USA
| | - Jennifer R Mandel
- Department of Biological Sciences, Center for Biodiversity Research, University of Memphis, Memphis, TN 38152, USA
| |
Collapse
|
30
|
Rose JP, Toledo CAP, Lemmon EM, Lemmon AR, Sytsma KJ. Out of Sight, Out of Mind: Widespread Nuclear and Plastid-Nuclear Discordance in the Flowering Plant Genus Polemonium (Polemoniaceae) Suggests Widespread Historical Gene Flow Despite Limited Nuclear Signal. Syst Biol 2020; 70:162-180. [PMID: 32617587 DOI: 10.1093/sysbio/syaa049] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2019] [Revised: 06/10/2020] [Accepted: 06/23/2020] [Indexed: 12/13/2022] Open
Abstract
Phylogenomic data from a rapidly increasing number of studies provide new evidence for resolving relationships in recently radiated clades, but they also pose new challenges for inferring evolutionary histories. Most existing methods for reconstructing phylogenetic hypotheses rely solely on algorithms that only consider incomplete lineage sorting (ILS) as a cause of intra- or intergenomic discordance. Here, we utilize a variety of methods, including those to infer phylogenetic networks, to account for both ILS and introgression as a cause for nuclear and cytoplasmic-nuclear discordance using phylogenomic data from the recently radiated flowering plant genus Polemonium (Polemoniaceae), an ecologically diverse genus in Western North America with known and suspected gene flow between species. We find evidence for widespread discordance among nuclear loci that can be explained by both ILS and reticulate evolution in the evolutionary history of Polemonium. Furthermore, the histories of organellar genomes show strong discordance with the inferred species tree from the nuclear genome. Discordance between the nuclear and plastid genome is not completely explained by ILS, and only one case of discordance is explained by detected introgression events. Our results suggest that multiple processes have been involved in the evolutionary history of Polemonium and that the plastid genome does not accurately reflect species relationships. We discuss several potential causes for this cytoplasmic-nuclear discordance, which emerging evidence suggests is more widespread across the Tree of Life than previously thought. [Cyto-nuclear discordance, genomic discordance, phylogenetic networks, plastid capture, Polemoniaceae, Polemonium, reticulations.].
Collapse
Affiliation(s)
- Jeffrey P Rose
- Department of Botany, University of Wisconsin-Madison, Madison, WI 53706, USA.,Department of Biology, University of Nebraska at Kearney, Kearney, NE 68849, USA
| | - Cassio A P Toledo
- Programa de Pós-Graduação em Biologia Vegetal, Instituto de Biolgia, Universidade Estadual de Campinas-UNICAMP, Rua Monteiro Lobato, 255, Campinas, SP. CEP: 13083-862, Brazil
| | - Emily Moriarty Lemmon
- Department of Biological Science, Florida State University, Tallahassee, FL 32306, USA
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University, Tallahassee, FL 32306, USA
| | - Kenneth J Sytsma
- Department of Botany, University of Wisconsin-Madison, Madison, WI 53706, USA
| |
Collapse
|
31
|
Chan KO, Hutter CR, Wood PL, Grismer LL, Brown RM. Larger, unfiltered datasets are more effective at resolving phylogenetic conflict: Introns, exons, and UCEs resolve ambiguities in Golden-backed frogs (Anura: Ranidae; genus Hylarana). Mol Phylogenet Evol 2020; 151:106899. [PMID: 32590046 DOI: 10.1016/j.ympev.2020.106899] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 05/18/2020] [Accepted: 06/17/2020] [Indexed: 01/01/2023]
Abstract
Using FrogCap, a recently-developed sequence-capture protocol, we obtained >12,000 highly informative exons, introns, and ultraconserved elements (UCEs), which we used to illustrate variation in evolutionary histories of these classes of markers, and to resolve long-standing systematic problems in Southeast Asian Golden-backed frogs of the genus-complex Hylarana. We also performed a comprehensive suite of analyses to assess the relative performance of different genetic markers, data filtering strategies, tree inference methods, and different measures of branch support. To reduce gene tree estimation error, we filtered the data using different thresholds of taxon completeness (missing data) and parsimony informative sites (PIS). We then estimated species trees using concatenated datasets and Maximum Likelihood (IQ-TREE) in addition to summary (ASTRAL-III), distance-based (ASTRID), and site-based (SVDQuartets) multispecies coalescent methods. Topological congruence and branch support were examined using traditional bootstrap, local posterior probabilities, gene concordance factors, quartet frequencies, and quartet scores. Our results did not yield a single concordant topology. Instead, introns, exons, and UCEs clearly possessed different phylogenetic signals, resulting in conflicting, yet strongly-supported phylogenetic estimates. However, a combined analysis comprising the most informative introns, exons, and UCEs converged on a similar topology across all analyses, with the exception of SVDQuartets. Bootstrap values were consistently high despite high levels of incongruence and high proportions of gene trees supporting conflicting topologies. Although low bootstrap values did indicate low heuristic support, high bootstrap support did not necessarily reflect congruence or support for the correct topology. This study reiterates findings of some previous studies, which demonstrated that traditional bootstrap values can produce positively misleading measures of support in large phylogenomic datasets. We also showed a remarkably strong positive relationship between branch length and topological congruence across all datasets, implying that very short internodes remain a challenge to resolve, even with orders of magnitude more data than ever before. Overall, our results demonstrate that more data from unfiltered or combined datasets produced superior results. Although data filtering reduced gene tree incongruence, decreased amounts of data also biased phylogenetic estimation. A point of diminishing returns was evident, at which higher congruence (from more stringent filtering) at the expense of amount of data led to topological error as assessed by comparison to more complete datasets across different genomic markers. Additionally, we showed that applying a parameter-rich model to a partitioned analysis of concatenated data produces better results compared to unpartitioned, or even partitioned analysis using model selection. Despite some lingering uncertainties, a combined analysis of our genomic data and sequences supplemented from GenBank (on the basis of a few gene regions) revealed highly supported novel systematic arrangements. Based on these new findings, we transfer Amnirana nicobariensis into the genus Indosylvirana; and I. milleti and Hylarana celebensis to the genus Papurana. We also provisionally place H. attigua in the genus Papurana pending verification from positively identified (voucher substantiated) samples.
Collapse
Affiliation(s)
- Kin Onn Chan
- Lee Kong Chian National History Museum, Faculty of Science, National University of Singapore, 2 Conservatory Drive, 117377, Singapore.
| | - Carl R Hutter
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA; Museum of Natural Sciences and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Perry L Wood
- Museum of Natural Sciences and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; Department of Biological Sciences & Museum of Natural History, Auburn University, Auburn, AL 36849, USA
| | - L Lee Grismer
- Herpetology Laboratory, Department of Biology, La Sierra University, 4500 Riverwalk Parkway, Riverside, CA 92505, USA
| | - Rafe M Brown
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA
| |
Collapse
|
32
|
Crowl AA, Manos PS, McVay JD, Lemmon AR, Lemmon EM, Hipp AL. Uncovering the genomic signature of ancient introgression between white oak lineages (Quercus). THE NEW PHYTOLOGIST 2020; 226:1158-1170. [PMID: 30963585 DOI: 10.1111/nph.15842] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 03/29/2019] [Indexed: 05/10/2023]
Abstract
Botanists have long recognised interspecific gene flow as a common occurrence within white oaks (Quercus section Quercus). Historical allele exchange, however, has not been fully characterised and the complex genomic signals resulting from the combination of vertical and horizontal gene transmission may confound phylogenetic inference and obscure our ability to accurately infer the deep evolutionary history of oaks. Using anchored enrichment, we obtained a phylogenomic dataset consisting of hundreds of single-copy nuclear loci. Concatenation, species-tree and network analyses were carried out in an attempt to uncover the genomic signal of ancient introgression and infer the divergent phylogenetic topology for the white oak clade. Locus and site-level likelihood comparisons were then conducted to further explore the introgressed signal within our dataset. Historical, intersectional gene flow is suggested to have occurred between an ancestor of the Eurasian Roburoid lineage and Quercus pontica and North American Dumosae and Prinoideae lineages. Despite extensive time past, our approach proved successful in detecting the genomic signature of ancient introgression. Our results, however, highlight the importance of sampling and the use of a plurality of analytical tools and methods to sufficiently explore genomic datasets, uncover this signal, and accurately infer evolutionary history.
Collapse
Affiliation(s)
- Andrew A Crowl
- Department of Biology, Duke University, Durham, NC, 27708, USA
| | - Paul S Manos
- Department of Biology, Duke University, Durham, NC, 27708, USA
| | - John D McVay
- Department of Biology, Duke University, Durham, NC, 27708, USA
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University, Dirac Science Library, Tallahassee, FL, 32317, USA
| | - Emily Moriarty Lemmon
- Department of Biological Science, Florida State University, 89 Chieftan Way, Tallahassee, FL, 32317, USA
| | - Andrew L Hipp
- The Morton Arboretum, 4100 Illinois Route 53, Lisle, IL, 60532, USA
- The Field Museum, 1400 S Lake Shore Drive, Chicago, IL, 60605, USA
| |
Collapse
|
33
|
Wolfe JM, Breinholt JW, Crandall KA, Lemmon AR, Lemmon EM, Timm LE, Siddall ME, Bracken-Grissom HD. A phylogenomic framework, evolutionary timeline and genomic resources for comparative studies of decapod crustaceans. Proc Biol Sci 2020; 286:20190079. [PMID: 31014217 DOI: 10.1098/rspb.2019.0079] [Citation(s) in RCA: 66] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Comprising over 15 000 living species, decapods (crabs, shrimp and lobsters) are the most instantly recognizable crustaceans, representing a considerable global food source. Although decapod systematics have received much study, limitations of morphological and Sanger sequence data have yet to produce a consensus for higher-level relationships. Here, we introduce a new anchored hybrid enrichment kit for decapod phylogenetics designed from genomic and transcriptomic sequences that we used to capture new high-throughput sequence data from 94 species, including 58 of 179 extant decapod families, and 11 of 12 major lineages. The enrichment kit yields 410 loci (greater than 86 000 bp) conserved across all lineages of Decapoda, more clade-specific molecular data than any prior study. Phylogenomic analyses recover a robust decapod tree of life strongly supporting the monophyly of all infraorders, and monophyly of each of the reptant, 'lobster' and 'crab' groups, with some results supporting pleocyemate monophyly. We show that crown decapods diverged in the Late Ordovician and most crown lineages diverged in the Triassic-Jurassic, highlighting a cryptic Palaeozoic history, and post-extinction diversification. New insights into decapod relationships provide a phylogenomic window into morphology and behaviour, and a basis to rapidly and cheaply expand sampling in this economically and ecologically significant invertebrate clade.
Collapse
Affiliation(s)
- Joanna M Wolfe
- 1 Division of Invertebrate Zoology and Sackler Institute of Comparative Genomics, American Museum of Natural History , New York, NY 10024 , USA.,2 Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology , Cambridge, MA 02139 , USA.,3 Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University , Cambridge, MA 02138 , USA
| | - Jesse W Breinholt
- 4 Florida Museum of Natural History, University of Florida , Gainesville, FL 32611 , USA.,5 RAPiD Genomics , Gainesville, FL 32601 , USA
| | - Keith A Crandall
- 6 Computational Biology Institute, The George Washington University , Ashburn, VA 20147 , USA.,7 Department of Invertebrate Zoology, National Museum of Natural History, Smithsonian Institution , Washington, DC 20012 , USA
| | - Alan R Lemmon
- 8 Department of Scientific Computing, Florida State University , Dirac Science Library, Tallahassee, FL 32306 , USA
| | - Emily Moriarty Lemmon
- 9 Department of Biological Science, Florida State University , Tallahassee, FL 32306 , USA
| | - Laura E Timm
- 10 Department of Biological Sciences, Florida International University , North Miami, FL 33181 , USA
| | - Mark E Siddall
- 1 Division of Invertebrate Zoology and Sackler Institute of Comparative Genomics, American Museum of Natural History , New York, NY 10024 , USA
| | - Heather D Bracken-Grissom
- 10 Department of Biological Sciences, Florida International University , North Miami, FL 33181 , USA
| |
Collapse
|
34
|
Bagley JC, Uribe-Convers S, Carlsen MM, Muchhala N. Utility of targeted sequence capture for phylogenomics in rapid, recent angiosperm radiations: Neotropical Burmeistera bellflowers as a case study. Mol Phylogenet Evol 2020; 152:106769. [PMID: 32081762 DOI: 10.1016/j.ympev.2020.106769] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Revised: 02/10/2020] [Accepted: 02/12/2020] [Indexed: 02/06/2023]
Abstract
Targeted sequence capture is a promising approach for large-scale phylogenomics. However, rapid evolutionary radiations pose significant challenges for phylogenetic inference (e.g. incomplete lineages sorting (ILS), phylogenetic noise), and the ability of targeted nuclear loci to resolve species trees despite such issues remains poorly studied. We test the utility of targeted sequence capture for inferring phylogenetic relationships in rapid, recent angiosperm radiations, focusing on Burmeistera bellflowers (Campanulaceae), which diversified into ~130 species over less than 3 million years. We compared phylogenies estimated from supercontig (exons plus flanking sequences), exon-only, and flanking-only datasets with 506-546 loci (~4.7 million bases) for 46 Burmeistera species/lineages and 10 outgroup taxa. Nuclear loci resolved backbone nodes and many congruent internal relationships with high support in concatenation and coalescent-based species tree analyses, and inferences were largely robust to effects of missing taxa and base composition biases. Nevertheless, species trees were incongruent between datasets, and gene trees exhibited remarkably high levels of conflict (~4-60% congruence, ~40-99% conflict) not simply driven by poor gene tree resolution. Higher gene tree heterogeneity at shorter branches suggests an important role of ILS, as expected for rapid radiations. Phylogenetic informativeness analyses also suggest this incongruence has resulted from low resolving power at short internal branches, consistent with ILS, and homoplasy at deeper nodes, with exons exhibiting much greater risk of incorrect topologies due to homoplasy than other datasets. Our findings suggest that targeted sequence capture is feasible for resolving rapid, recent angiosperm radiations, and that results based on supercontig alignments containing nuclear exons and flanking sequences have higher phylogenetic utility and accuracy than either alone. We use our results to make practical recommendations for future target capture-based studies of Burmeistera and other rapid angiosperm radiations, including that such studies should analyze supercontigs to maximize the phylogenetic information content of loci.
Collapse
Affiliation(s)
- Justin C Bagley
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO 63121, USA; Department of Biology, Virginia Commonwealth University, Richmond, VA 23284, USA.
| | - Simon Uribe-Convers
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO 63121, USA
| | - Mónica M Carlsen
- Research Department, Science and Conservation Division, Missouri Botanical Garden, St. Louis, MO 63110, USA
| | - Nathan Muchhala
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO 63121, USA
| |
Collapse
|
35
|
Granados Mendoza C, Jost M, Hágsater E, Magallón S, van den Berg C, Lemmon EM, Lemmon AR, Salazar GA, Wanke S. Target Nuclear and Off-Target Plastid Hybrid Enrichment Data Inform a Range of Evolutionary Depths in the Orchid Genus Epidendrum. FRONTIERS IN PLANT SCIENCE 2020; 10:1761. [PMID: 32063915 PMCID: PMC7000662 DOI: 10.3389/fpls.2019.01761] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Accepted: 12/16/2019] [Indexed: 05/12/2023]
Abstract
Universal angiosperm enrichment probe sets designed to enrich hundreds of putatively orthologous nuclear single-copy loci are increasingly being applied to infer phylogenetic relationships of different lineages of angiosperms at a range of evolutionary depths. Studies applying such probe sets have focused on testing the universality and performance of the target nuclear loci, but they have not taken advantage of off-target data from other genome compartments generated alongside the nuclear loci. Here we do so to infer phylogenetic relationships in the orchid genus Epidendrum and closely related genera of subtribe Laeliinae. Our aims are to: 1) test the technical viability of applying the plant anchored hybrid enrichment (AHE) method (Angiosperm v.1 probe kit) to our focal group, 2) mine plastid protein coding genes from off-target reads; and 3) evaluate the performance of the target nuclear and off-target plastid loci in resolving and supporting phylogenetic relationships along a range of taxonomical depths. Phylogenetic relationships were inferred from the nuclear data set through coalescent summary and site-based methods, whereas plastid loci were analyzed in a concatenated partitioned matrix under maximum likelihood. The usefulness of target and flanking non-target nuclear regions and plastid loci was assessed through the estimation of their phylogenetic informativeness. Our study successfully applied the plant AHE probe kit to Epidendrum, supporting the universality of this kit in angiosperms. Moreover, it demonstrated the feasibility of mining plastome loci from off-target reads generated with the Angiosperm v.1 probe kit to obtain additional, uniparentally inherited sequence data at no extra sequencing cost. Our analyses detected some strongly supported incongruences between nuclear and plastid data sets at shallow divergences, an indication of potential lineage sorting, hybridization, or introgression events in the group. Lastly, we found that the per site phylogenetic informativeness of the ycf1 plastid gene surpasses that of all other plastid genes and several nuclear loci, making it an excellent candidate for assessing phylogenetic relationships at medium to low taxonomic levels in orchids.
Collapse
Affiliation(s)
- Carolina Granados Mendoza
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Matthias Jost
- Institut für Botanik, Technische Universität Dresden, Dresden, Germany
| | - Eric Hágsater
- Herbario AMO, Instituto Chinoin, A.C., Mexico City, Mexico
| | - Susana Magallón
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Cássio van den Berg
- Departamento de Ciências Biológicas, Universidade Estadual de Feira de Santana, Feira de Santana, Brazil
| | - Emily Moriarty Lemmon
- Department of Biological Science, Florida State University, Tallahassee, FL, United States
| | - Alan R. Lemmon
- Department of Scientific Computing, Florida State University, Tallahassee, FL, United States
| | - Gerardo A. Salazar
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Stefan Wanke
- Institut für Botanik, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
36
|
Starr JR, Léveillé-Bourret É, Tài VA, Thị Kim Thanh NN, Ford BA. The rediscovery of the rare Vietnamese endemic Eriophorum scabriculme redefines generic limits in the Scirpo-Caricoid Clade (Cyperaceae). PeerJ 2019; 7:e7538. [PMID: 31579569 PMCID: PMC6765354 DOI: 10.7717/peerj.7538] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Accepted: 07/23/2019] [Indexed: 11/20/2022] Open
Abstract
For those familiar with boreal bogs and wet tundra, species of Eriophorum (“the cotton grasses”) will undoubtedly represent some of the most striking and memorable taxa they have encountered. This small genus of 20 Holarctic sedge species (Cyperaceae) is remarkable because its inflorescences produce large, brilliantly white to rusty-red cottony masses when its flowers develop a perianth of highly elongated bristles after anthesis. In this study, we document the rediscovery of Eriophorum scabriculme, a narrow Vietnamese endemic known from only two collections made approximately 7 km apart near Sa Pa in Lào Cai Province over 75 years ago. Using plastid DNA sequences (matK, ndhF), embryology, and morphology, we test whether E. scabriculme is aligned within the Scirpo-Caricoid Clade (genus Khaosokia and tribes Cariceae, Dulichieae, Scirpeae, and Sumatroscirpeae) or the Ficinia Clade (Cypereae), and we determine whether its unique character combinations (≥10 elongated bristles, reduced sheathing basal leaves, 1–4 spikelets) could be evidence for a new genus or simply mark it as an unusual species within currently recognised genera. In addition, we document the discovery of seven new populations, and we extend its range westward to Lai Châu Province and southward in Lào Cai Province by more than 47 km. Our results demonstrate that Eriophorum scabriculme is best treated in the genus Trichophorum, thus re-circumscribing both genera and their limits with Scirpus s.str. In addition, we emend the description of Trichophorum scabriculme (Beetle) J.R.Starr, Lév.-Bourret & B.A. Ford, provide the first pictures and accurate illustration of the species, and assess its conservation status in Vietnam (VU, Vulnerable). Our study corroborates the fact that in such a diverse and taxonomically difficult family like the sedges, conspicuous characters like highly elongated bristles may be useful for dividing diversity, but they are no guarantee that the groups they mark are natural.
Collapse
Affiliation(s)
- Julian R Starr
- Department of Biology, University of Ottawa, Ottawa, Ontario, Canada
| | - Étienne Léveillé-Bourret
- Department of Biology, University of Ottawa, Ottawa, Ontario, Canada.,Institute of Systematic and Evolutionary Botany, University of Zürich, Zürich, Switzerland
| | - Vũ Anh Tài
- Institute of Geography, Vietnam Academy of Science and Technology, Hanoi, Vietnam
| | | | - Bruce A Ford
- Department of Biological Sciences, University of Manitoba, Winnipeg, Manitoba, Canada
| |
Collapse
|
37
|
Quartet-Based Computations of Internode Certainty Provide Robust Measures of Phylogenetic Incongruence. Syst Biol 2019; 69:308-324. [DOI: 10.1093/sysbio/syz058] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Accepted: 08/26/2019] [Indexed: 11/14/2022] Open
Abstract
Abstract
Incongruence, or topological conflict, is prevalent in genome-scale data sets. Internode certainty (IC) and related measures were recently introduced to explicitly quantify the level of incongruence of a given internal branch among a set of phylogenetic trees and complement regular branch support measures (e.g., bootstrap, posterior probability) that instead assess the statistical confidence of inference. Since most phylogenomic studies contain data partitions (e.g., genes) with missing taxa and IC scores stem from the frequencies of bipartitions (or splits) on a set of trees, IC score calculation typically requires adjusting the frequencies of bipartitions from these partial gene trees. However, when the proportion of missing taxa is high, the scores yielded by current approaches that adjust bipartition frequencies in partial gene trees differ substantially from each other and tend to be overestimates. To overcome these issues, we developed three new IC measures based on the frequencies of quartets, which naturally apply to both complete and partial trees. Comparison of our new quartet-based measures to previous bipartition-based measures on simulated data shows that: (1) on complete data sets, both quartet-based and bipartition-based measures yield very similar IC scores; (2) IC scores of quartet-based measures on a given data set with and without missing taxa are more similar than the scores of bipartition-based measures; and (3) quartet-based measures are more robust to the absence of phylogenetic signal and errors in phylogenetic inference than bipartition-based measures. Additionally, the analysis of an empirical mammalian phylogenomic data set using our quartet-based measures reveals the presence of substantial levels of incongruence for numerous internal branches. An efficient open-source implementation of these quartet-based measures is freely available in the program QuartetScores (https://github.com/lutteropp/QuartetScores).
Collapse
|
38
|
Johnson MG, Pokorny L, Dodsworth S, Botigué LR, Cowan RS, Devault A, Eiserhardt WL, Epitawalage N, Forest F, Kim JT, Leebens-Mack JH, Leitch IJ, Maurin O, Soltis DE, Soltis PS, Wong GKS, Baker WJ, Wickett NJ. A Universal Probe Set for Targeted Sequencing of 353 Nuclear Genes from Any Flowering Plant Designed Using k-Medoids Clustering. Syst Biol 2019; 68:594-606. [PMID: 30535394 PMCID: PMC6568016 DOI: 10.1093/sysbio/syy086] [Citation(s) in RCA: 205] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Revised: 11/29/2018] [Accepted: 12/03/2018] [Indexed: 01/31/2023] Open
Abstract
Sequencing of target-enriched libraries is an efficient and cost-effective method for obtaining DNA sequence data from hundreds of nuclear loci for phylogeny reconstruction. Much of the cost of developing targeted sequencing approaches is associated with the generation of preliminary data needed for the identification of orthologous loci for probe design. In plants, identifying orthologous loci has proven difficult due to a large number of whole-genome duplication events, especially in the angiosperms (flowering plants). We used multiple sequence alignments from over 600 angiosperms for 353 putatively single-copy protein-coding genes identified by the One Thousand Plant Transcriptomes Initiative to design a set of targeted sequencing probes for phylogenetic studies of any angiosperm group. To maximize the phylogenetic potential of the probes, while minimizing the cost of production, we introduce a k-medoids clustering approach to identify the minimum number of sequences necessary to represent each coding sequence in the final probe set. Using this method, 5-15 representative sequences were selected per orthologous locus, representing the sequence diversity of angiosperms more efficiently than if probes were designed using available sequenced genomes alone. To test our approximately 80,000 probes, we hybridized libraries from 42 species spanning all higher-order groups of angiosperms, with a focus on taxa not present in the sequence alignments used to design the probes. Out of a possible 353 coding sequences, we recovered an average of 283 per species and at least 100 in all species. Differences among taxa in sequence recovery could not be explained by relatedness to the representative taxa selected for probe design, suggesting that there is no phylogenetic bias in the probe set. Our probe set, which targeted 260 kbp of coding sequence, achieved a median recovery of 137 kbp per taxon in coding regions, a maximum recovery of 250 kbp, and an additional median of 212 kbp per taxon in flanking non-coding regions across all species. These results suggest that the Angiosperms353 probe set described here is effective for any group of flowering plants and would be useful for phylogenetic studies from the species level to higher-order groups, including the entire angiosperm clade itself.
Collapse
Affiliation(s)
- Matthew G Johnson
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL 60022, USA
| | - Lisa Pokorny
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Steven Dodsworth
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- School of Life Sciences, University of Bedfordshire, University Square, Luton LU1 3JU, UK
| | - Laura R Botigué
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- Centre for Research in Agricultural Genomics, Campus UAB, Edifici CRAG, Bellaterra Cerdanyola del Vallès, 08193 Barcelona, Spain
| | - Robyn S Cowan
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Alison Devault
- Arbor Biosciences, 5840 Interface Dr, Suite 101, Ann Arbor, MI 48103, USA
| | - Wolf L Eiserhardt
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- Department of Bioscience, Aarhus University, 8000 Aarhus C, Denmark
| | - Niroshini Epitawalage
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Félix Forest
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Jan T Kim
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - James H Leebens-Mack
- Department of Plant Biology, University of Georgia, 2502 Miller Plant Sciences, Athens, GA 30602, USA
| | - Ilia J Leitch
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Olivier Maurin
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Douglas E Soltis
- Department of Biology, University of Florida, 220 Bartram Hall, Gainesville, FL 32611-8525, USA
- Florida Museum of Natural History, University of Florida, 3215 Hull Road, Gainesville, FL 32611-2710, USA
| | - Pamela S Soltis
- Department of Biology, University of Florida, 220 Bartram Hall, Gainesville, FL 32611-8525, USA
- Florida Museum of Natural History, University of Florida, 3215 Hull Road, Gainesville, FL 32611-2710, USA
| | - Gane Ka-shu Wong
- BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
- Department of Medicine, University of Alberta, Edmonton, AB T6G 2E1, Canada
| | - William J Baker
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Norman J Wickett
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL 60022, USA
- Program in Plant Biology and Conservation, Northwestern University, 2205 Tech Drive, Evanston, IL 60208, USA
| |
Collapse
|
39
|
Soto Gomez M, Pokorny L, Kantar MB, Forest F, Leitch IJ, Gravendeel B, Wilkin P, Graham SW, Viruel J. A customized nuclear target enrichment approach for developing a phylogenomic baseline for Dioscorea yams (Dioscoreaceae). APPLICATIONS IN PLANT SCIENCES 2019; 7:e11254. [PMID: 31236313 PMCID: PMC6580989 DOI: 10.1002/aps3.11254] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2018] [Accepted: 04/01/2019] [Indexed: 05/14/2023]
Abstract
PREMISE We developed a target enrichment panel for phylogenomic studies of Dioscorea, an economically important genus with incompletely resolved relationships. METHODS Our bait panel comprises 260 low- to single-copy nuclear genes targeted to work in Dioscorea, assessed here using a preliminary taxon sampling that includes both distantly and closely related taxa, including several yam crops and potential crop wild relatives. We applied coalescent-based and maximum likelihood phylogenomic inference approaches to the pilot taxon set, incorporating new and published transcriptome data from additional species. RESULTS The custom panel retrieved ~94% of targets and >80% of full gene length from 88% and 68% of samples, respectively. It has minimal gene overlap with existing panels designed for angiosperm-wide studies and generally recovers longer and more variable targets. Pilot phylogenomic analyses consistently resolve most deep and recent relationships with strong support across analyses and point to revised relationships between the crop species D. alata and candidate crop wild relatives. DISCUSSION Our customized panel reliably retrieves targeted loci from Dioscorea, is informative for resolving relationships in denser samplings, and is suitable for refining our understanding of the independent origins of cultivated yam species; the panel likely has broader promise for phylogenomic studies across Dioscoreales.
Collapse
Affiliation(s)
- Marybel Soto Gomez
- Department of BotanyUniversity of British Columbia6270 University BoulevardVancouverBritish ColumbiaV6T 1Z4Canada
- UBC Botanical Garden and Centre for Plant ResearchUniversity of British Columbia6804 Marine Drive SWVancouverBritish ColumbiaV6T 1Z4Canada
| | - Lisa Pokorny
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| | - Michael B. Kantar
- Department of Tropical Plant and Soil SciencesUniversity of Hawai‘i at ManoaHonoluluHawai‘i96822USA
| | - Félix Forest
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| | - Ilia J. Leitch
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| | - Barbara Gravendeel
- Naturalis Biodiversity CenterEndless FormsSylviusweg 72Leiden2333 BEThe Netherlands
- Institute Biology LeidenLeiden UniversitySylviusweg 72Leiden2333 BEThe Netherlands
- Faculty of Science and TechnologyUniversity of Applied Sciences LeidenZernikedreef 11Leiden2333 CKThe Netherlands
| | - Paul Wilkin
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| | - Sean W. Graham
- Department of BotanyUniversity of British Columbia6270 University BoulevardVancouverBritish ColumbiaV6T 1Z4Canada
- UBC Botanical Garden and Centre for Plant ResearchUniversity of British Columbia6804 Marine Drive SWVancouverBritish ColumbiaV6T 1Z4Canada
| | - Juan Viruel
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| |
Collapse
|
40
|
Gonçalves DJP, Simpson BB, Ortiz EM, Shimizu GH, Jansen RK. Incongruence between gene trees and species trees and phylogenetic signal variation in plastid genes. Mol Phylogenet Evol 2019; 138:219-232. [PMID: 31146023 DOI: 10.1016/j.ympev.2019.05.022] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Revised: 05/20/2019] [Accepted: 05/21/2019] [Indexed: 10/26/2022]
Abstract
The current classification of angiosperms is based primarily on concatenated plastid markers and maximum likelihood (ML) inference. This approach has been justified by the assumption that plastid DNA (ptDNA) is inherited as a single locus and that its individual genes produce congruent trees. However, structural and functional characteristics of ptDNA suggest that plastid genes may not evolve as a single locus and are experiencing different evolutionary forces. To examine this idea, we produced new complete plastid genome (plastome) sequences of 27 species and combined these data with publicly available sequences to produce a final dataset that includes 78 plastid genes for 89 species of rosids and five outgroups. We used four data matrices (i.e., gene, exon, codon-aligned, and amino acid) to infer species and gene trees using ML and multispecies coalescent (MSC) methods. Rosids include about one third of all angiosperms and their two major clades, fabids and malvids, were recovered in almost all analyses. However, we detected incongruence between species trees inferred with different matrices and methods and previously published plastid and nuclear phylogenies. We visualized and tested the significance of incongruence between gene trees and species trees. We then measured the distribution of phylogenetic signal across sites and genes supporting alternative placements of five controversial nodes at different taxonomic levels. Gene trees inferred with plastid data often disagree with species trees inferred using both ML (with unpartitioned or partitioned data) and MSC. Species trees inferred with both methods produced alternative topologies for a few taxa. Our results show that, in a phylogenetic context, plastid protein-coding genes may not be fully linked and behaving as a single locus. Furthermore, concatenated matrices may produce highly supported phylogenies that are discordant with individual gene trees. We also show that phylogenies inferred with MSC are accurate. We therefore emphasize the importance of considering variation in phylogenetic signal across plastid genes and the exploration of plastome data to increase accuracy of estimating relationships. We also support the use of MSC with plastome matrices in future phylogenomic investigations.
Collapse
Affiliation(s)
- Deise J P Gonçalves
- Department of Integrative Biology, The University of Texas at Austin, 2415 Speedway #C0930, Austin, TX 78713, USA.
| | - Beryl B Simpson
- Department of Integrative Biology, The University of Texas at Austin, 2415 Speedway #C0930, Austin, TX 78713, USA
| | - Edgardo M Ortiz
- Department of Integrative Biology, The University of Texas at Austin, 2415 Speedway #C0930, Austin, TX 78713, USA; Department of Ecology & Ecosystem Management, Plant Biodiversity Research, Technical University of Munich, Emil-Ramann Strasse 2, Freising D-85354, Germany
| | - Gustavo H Shimizu
- Department of Plant Biology, University of Campinas, 13083-970 Campinas, SP, Brazil
| | - Robert K Jansen
- Department of Integrative Biology, The University of Texas at Austin, 2415 Speedway #C0930, Austin, TX 78713, USA; Genomics and Biotechnology Research Group, Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| |
Collapse
|
41
|
Chan KO, Grismer LL. To split or not to split? Multilocus phylogeny and molecular species delimitation of southeast Asian toads (family: Bufonidae). BMC Evol Biol 2019; 19:95. [PMID: 31023232 PMCID: PMC6485082 DOI: 10.1186/s12862-019-1422-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Accepted: 04/12/2019] [Indexed: 11/10/2022] Open
Abstract
Background Recent studies have demonstrated that Bayesian species delimitation based on the multispecies coalescent model can produce inaccurate results by misinterpreting population splits as species divergences. An approach based on the genealogical divergence index (gdi) was shown to be a viable alternative, especially for delimiting allopatric populations where gene flow is low. We implemented these analyses to assess species boundaries in Southeast Asian toads, a group that is understudied and characterized by numerous unresolved species complexes. Results Multilocus phylogenetic analyses showed that deep evolutionary relationships including the genera Sigalegalephrynus, Ghatophryne, Parapelophryne, Leptophryne, Pseudobufo, Rentapia, and Phrynoides remain unresolved. Comparison of genetic divergences revealed that intraspecific divergences among allopatric populations of Pelophyrne signata (Borneo vs. Peninsular Malaysia), Ingerophrynus parvus (Peninsular Malaysia vs. Myanmar), and Leptophryne borbonica (Peninsular Malaysia, Java, Borneo, and Sumatra) are consistent with interspecific divergences of other Southeast Asian bufonid taxa. Conversely, interspecific divergences between Pelophryne guentheri/P. api, Ansonia latiffi/A. leptopus, and I. gollum/I. divergens were low (< 3%) and consistent with intraspecific divergences of other closely related taxa. The BPP analysis produced variable results depending on prior settings and priors estimated from empirical data produced the best results that were also congruent with the gdi analysis. Conclusions This study showed that the evolutionary history of Southeast Asian toads is difficult to resolve and numerous relationships remain ambiguous. Although some results from the species delimitation analyses were inconclusive, they were nevertheless efficacious at identifying potential new species and taxonomic incompatibilities for future in-depth investigation. We also demonstrated the sensitivity of BPP to different priors and that careful selection priors based on empirical data can greatly improve the analysis. Finally, the gdi can be a robust tool to complement other species delimitation methods. Electronic supplementary material The online version of this article (10.1186/s12862-019-1422-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kin Onn Chan
- Lee Kong Chian Natural History Museum, National University of Singapore, 2 Conservatory Drive, Singapore, 117377, Singapore.
| | - L Lee Grismer
- Herpetology Laboratory, Department of Biology, La Sierra University, 4500 Riverwalk Parkway, Riverside, CA, 92515, USA
| |
Collapse
|
42
|
Liu Y, Johnson MG, Cox CJ, Medina R, Devos N, Vanderpoorten A, Hedenäs L, Bell NE, Shevock JR, Aguero B, Quandt D, Wickett NJ, Shaw AJ, Goffinet B. Resolution of the ordinal phylogeny of mosses using targeted exons from organellar and nuclear genomes. Nat Commun 2019; 10:1485. [PMID: 30940807 PMCID: PMC6445109 DOI: 10.1038/s41467-019-09454-w] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2018] [Accepted: 03/07/2019] [Indexed: 11/21/2022] Open
Abstract
Mosses are a highly diverse lineage of land plants, whose diversification, spanning at least 400 million years, remains phylogenetically ambiguous due to the lack of fossils, massive early extinctions, late radiations, limited morphological variation, and conflicting signal among previously used markers. Here, we present phylogenetic reconstructions based on complete organellar exomes and a comparable set of nuclear genes for this major lineage of land plants. Our analysis of 142 species representing 29 of the 30 moss orders reveals that relative average rates of non-synonymous substitutions in nuclear versus plastid genes are much higher in mosses than in seed plants, consistent with the emerging concept of evolutionary dynamism in mosses. Our results highlight the evolutionary significance of taxa with reduced morphologies, shed light on the relative tempo and mechanisms underlying major cladogenic events, and suggest hypotheses for the relationships and delineation of moss orders.
Collapse
Affiliation(s)
- Yang Liu
- Fairy Lake Botanical Garden & Chinese Academy of Sciences, Shenzhen, 518004, China
- BGI-Shenzhen, Shenzhen, 518120, China
| | | | - Cymon J Cox
- Centro de Ciências do Mar, Universidade do Algarve, Gambelas, 8005-319, Faro, Portugal
| | - Rafael Medina
- Department of Biology, Augustana College, Rock Island, IL, 61201, USA
| | - Nicolas Devos
- Department of Biology, Duke University, Durham, NC, 27708, USA
| | | | - Lars Hedenäs
- Department of Botany, Swedish Museum of Natural History, Stockholm, Box 50007, 10405, Sweden
| | - Neil E Bell
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh, EH3 5LR, UK
| | - James R Shevock
- California Academy of Sciences, San Francisco, CA, 94118, USA
| | - Blanka Aguero
- Department of Biology, Duke University, Durham, NC, 27708, USA
| | - Dietmar Quandt
- Nees Institute for Biodiversity of Plants, University of Bonn, Bonn, 53115, Germany
| | | | - A Jonathan Shaw
- Department of Biology, Duke University, Durham, NC, 27708, USA
| | - Bernard Goffinet
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269, USA.
| |
Collapse
|
43
|
Wielstra B, McCartney-Melstad E, Arntzen J, Butlin R, Shaffer H. Phylogenomics of the adaptive radiation of Triturus newts supports gradual ecological niche expansion towards an incrementally aquatic lifestyle. Mol Phylogenet Evol 2019; 133:120-127. [DOI: 10.1016/j.ympev.2018.12.032] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Revised: 12/30/2018] [Accepted: 12/30/2018] [Indexed: 11/29/2022]
|
44
|
Li X, Feng T, Randle C, Schneeweiss GM. Phylogenetic Relationships in Orobanchaceae Inferred From Low-Copy Nuclear Genes: Consolidation of Major Clades and Identification of a Novel Position of the Non-photosynthetic Orobanche Clade Sister to All Other Parasitic Orobanchaceae. FRONTIERS IN PLANT SCIENCE 2019; 10:902. [PMID: 31379896 PMCID: PMC6646720 DOI: 10.3389/fpls.2019.00902] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Accepted: 06/26/2019] [Indexed: 05/18/2023]
Abstract
Molecular phylogenetic analyses have greatly advanced our understanding of phylogenetic relationships in Orobanchaceae, a model system to study parasitism in angiosperms. As members of this group may lack some genes widely used for phylogenetic analysis and exhibit varying degrees of accelerated base substitution in other genes, relationships among major clades identified previously remain contentious. To improve inferences of phylogenetic relationships in Orobanchaceae, we used two pentatricopeptide repeat (PPR) and three low-copy nuclear (LCN) genes, two of which have been developed for this study. Resolving power and level of support strongly differed among markers. Despite considerable incongruence among newly and previously sequenced markers, monophyly of major clades identified in previous studies was confirmed and, especially in analyses of concatenated data, strongly supported after the exclusion of a small group of East Asian genera (Pterygiella and Phtheirospermum) from the Euphrasia-Rhinanthus clade. The position of the Orobanche clade sister to all other parasitic Orobanchaceae may indicate that the shift to holoparasitism occurred early in the evolution of the family. Although well supported in analyses of concatenated data comprising ten loci (five newly and five previously sequenced), relationships among major clades, most prominently the Striga-Alectra clade, the Euphrasia-Rhinanthus clade, and the Castilleja-Pedicularis clade, were uncertain because of strongly supported incongruence also among well-resolving loci. Despite the limitations of using a few selected loci, congruence among markers with respect to circumscription of major clades of Orobanchaceae renders those frameworks for detailed, species-level, phylogenetic studies.
Collapse
Affiliation(s)
- Xi Li
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Tao Feng
- Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
| | - Chris Randle
- Department of Biological Sciences, Sam Houston State University, Huntsville, TX, United States
| | - Gerald M. Schneeweiss
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
- *Correspondence: Gerald M. Schneeweiss,
| |
Collapse
|
45
|
Larridon I, Villaverde T, Zuntini AR, Pokorny L, Brewer GE, Epitawalage N, Fairlie I, Hahn M, Kim J, Maguilla E, Maurin O, Xanthos M, Hipp AL, Forest F, Baker WJ. Tackling Rapid Radiations With Targeted Sequencing. FRONTIERS IN PLANT SCIENCE 2019; 10:1655. [PMID: 31998342 PMCID: PMC6962237 DOI: 10.3389/fpls.2019.01655] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Accepted: 11/22/2019] [Indexed: 05/19/2023]
Abstract
In phylogenetic studies across angiosperms, at various taxonomic levels, polytomies have persisted despite efforts to resolve them by increasing sampling of taxa and loci. The large amount of genomic data now available and statistical tools to analyze them provide unprecedented power for phylogenetic inference. Targeted sequencing has emerged as a strong tool for estimating species trees in the face of rapid radiations, lineage sorting, and introgression. Evolutionary relationships in Cyperaceae have been studied mostly using Sanger sequencing until recently. Despite ample taxon sampling, relationships in many genera remain poorly understood, hampered by diversification rates that outpace mutation rates in the loci used. The C4 Cyperus clade of the genus Cyperus has been particularly difficult to resolve. Previous studies based on a limited set of markers resolved relationships among Cyperus species using the C3 photosynthetic pathway, but not among C4 Cyperus clade taxa. We test the ability of two targeted sequencing kits to resolve relationships in the C4 Cyperus clade, the universal Angiosperms-353 kit and a Cyperaceae-specific kit. Sequences of the targeted loci were recovered from data generated with both kits and used to investigate overlap in data between kits and relative efficiency of the general and custom approaches. The power to resolve shallow-level relationships was tested using a summary species tree method and a concatenated maximum likelihood approach. High resolution and support are obtained using both approaches, but high levels of missing data disproportionately impact the latter. Targeted sequencing provides new insights into the evolution of morphology in the C4 Cyperus clade, demonstrating for example that the former segregate genus Alinula is polyphyletic despite its seeming morphological integrity. An unexpected result is that the Cyperus margaritaceus-Cyperus niveus complex comprises a clade separate from and sister to the core C4 Cyperus clade. Our results demonstrate that data generated with a family-specific kit do not necessarily have more power than those obtained with a universal kit, but that data generated with different targeted sequencing kits can often be merged for downstream analyses. Moreover, our study contributes to the growing consensus that targeted sequencing data are a powerful tool in resolving rapid radiations.
Collapse
Affiliation(s)
- Isabel Larridon
- Royal Botanic Gardens, Kew, Surrey, United Kingdom
- Systematic and Evolutionary Botany Lab, Department of Biology, Ghent University, Ghent, Belgium
- *Correspondence: Isabel Larridon, ; Tamara Villaverde,
| | - Tamara Villaverde
- Real Jardín Botánico (RJB-CSIC), Madrid, Spain
- The Morton Arboretum, Lisle, IL, United States
- The Field Museum, Chicago, IL, United States
- *Correspondence: Isabel Larridon, ; Tamara Villaverde,
| | | | - Lisa Pokorny
- Royal Botanic Gardens, Kew, Surrey, United Kingdom
- Real Jardín Botánico (RJB-CSIC), Madrid, Spain
- Centre for Plant Biotechnology and Genomics (CBGP, UPM-INIA), Madrid, Spain
| | | | | | - Isabel Fairlie
- Royal Botanic Gardens, Kew, Surrey, United Kingdom
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | | | - Jan Kim
- Royal Botanic Gardens, Kew, Surrey, United Kingdom
| | - Enrique Maguilla
- The Morton Arboretum, Lisle, IL, United States
- Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain
| | | | | | - Andrew L. Hipp
- The Morton Arboretum, Lisle, IL, United States
- The Field Museum, Chicago, IL, United States
| | - Félix Forest
- Royal Botanic Gardens, Kew, Surrey, United Kingdom
| | | |
Collapse
|
46
|
Anchored hybrid enrichment generated nuclear, plastid and mitochondrial markers resolve the Lepanthes horrida (Orchidaceae: Pleurothallidinae) species complex. Mol Phylogenet Evol 2018; 129:27-47. [DOI: 10.1016/j.ympev.2018.07.014] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Revised: 06/25/2018] [Accepted: 07/15/2018] [Indexed: 11/20/2022]
|
47
|
Comparative assessment shows the reliability of chloroplast genome assembly using RNA-seq. Sci Rep 2018; 8:17404. [PMID: 30479362 PMCID: PMC6258696 DOI: 10.1038/s41598-018-35654-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 11/09/2018] [Indexed: 11/08/2022] Open
Abstract
Chloroplast genomes (cp genomes) are widely used in comparative genomics, population genetics, and phylogenetic studies. Obtaining chloroplast genomes from RNA-Seq data seems feasible due to the almost full transcription of cpDNA. However, the reliability of chloroplast genomes assembled from RNA-Seq instead of genomic DNA libraries remains to be thoroughly verified. In this study, we assembled chloroplast genomes for three Erysimum (Brassicaceae) species from three RNA-Seq replicas and from one genomic library of each species, using a streamlined bioinformatics protocol. We compared these assembled genomes, confirming that assembled cp genomes from RNA-Seq data were highly similar to each other and to those from genomic libraries in terms of overall structure, size, and composition. Although post-transcriptional modifications, such as RNA-editing, may introduce variations in the RNA-seq data, the assembly of cp genomes from RNA-seq appeared to be reliable. Moreover, RNA-Seq assembly was less sensitive to sources of error such as the recovery of nuclear plastid DNAs (NUPTs). Although some precautions should be taken when producing reference genomes in non-model plants, we conclude that assembling cp genomes from RNA-Seq data is a fast, accurate, and reliable strategy.
Collapse
|
48
|
Givnish TJ, Zuluaga A, Spalink D, Soto Gomez M, Lam VKY, Saarela JM, Sass C, Iles WJD, de Sousa DJL, Leebens-Mack J, Chris Pires J, Zomlefer WB, Gandolfo MA, Davis JI, Stevenson DW, dePamphilis C, Specht CD, Graham SW, Barrett CF, Ané C. Monocot plastid phylogenomics, timeline, net rates of species diversification, the power of multi-gene analyses, and a functional model for the origin of monocots. AMERICAN JOURNAL OF BOTANY 2018; 105:1888-1910. [PMID: 30368769 DOI: 10.1002/ajb2.1178] [Citation(s) in RCA: 107] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Accepted: 08/03/2018] [Indexed: 05/03/2023]
Abstract
PREMISE OF THE STUDY We present the first plastome phylogeny encompassing all 77 monocot families, estimate branch support, and infer monocot-wide divergence times and rates of species diversification. METHODS We conducted maximum likelihood analyses of phylogeny and BAMM studies of diversification rates based on 77 plastid genes across 545 monocots and 22 outgroups. We quantified how branch support and ascertainment vary with gene number, branch length, and branch depth. KEY RESULTS Phylogenomic analyses shift the placement of 16 families in relation to earlier studies based on four plastid genes, add seven families, date the divergence between monocots and eudicots+Ceratophyllum at 136 Mya, successfully place all mycoheterotrophic taxa examined, and support recognizing Taccaceae and Thismiaceae as separate families and Arecales and Dasypogonales as separate orders. Only 45% of interfamilial divergences occurred after the Cretaceous. Net species diversification underwent four large-scale accelerations in PACMAD-BOP Poaceae, Asparagales sister to Doryanthaceae, Orchidoideae-Epidendroideae, and Araceae sister to Lemnoideae, each associated with specific ecological/morphological shifts. Branch ascertainment and support across monocots increase with gene number and branch length, and decrease with relative branch depth. Analysis of entire plastomes in Zingiberales quantifies the importance of non-coding regions in identifying and supporting short, deep branches. CONCLUSIONS We provide the first resolved, well-supported monocot phylogeny and timeline spanning all families, and quantify the significant contribution of plastome-scale data to resolving short, deep branches. We outline a new functional model for the evolution of monocots and their diagnostic morphological traits from submersed aquatic ancestors, supported by convergent evolution of many of these traits in aquatic Hydatellaceae (Nymphaeales).
Collapse
Affiliation(s)
- Thomas J Givnish
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, 53706, USA
| | | | - Daniel Spalink
- Department of Ecosystem Science, Texas A&M University, College Station, Texas, 77840, USA
| | - Marybel Soto Gomez
- Department of Botany, University of British Columbia, Vancouver, British Columbia, V6T 1Z4, Canada
| | - Vivienne K Y Lam
- Department of Botany, University of British Columbia, Vancouver, British Columbia, V6T 1Z4, Canada
| | | | - Chodon Sass
- The University and Jepson Herbarium, University of California-Berkeley, Berkeley, California, 94720, USA
| | - William J D Iles
- Department of Earth and Environmental Sciences, University of Michigan, Ann Arbor, Michigan, 48109, USA
| | - Danilo José Lima de Sousa
- Departamento de Ciéncias Biológicas, Universidade Estadual de Feira de Santana, Feira de Santana, Bahia, 44036-900, Brazil
| | - James Leebens-Mack
- Department of Plant Biology, University of Georgia, Athens, Georgia, 30602, USA
| | - J Chris Pires
- Division of Biological Sciences, University of Missouri-Columbia, Columbia, Missouri, 65211, USA
| | - Wendy B Zomlefer
- Department of Plant Biology, University of Georgia, Athens, Georgia, 30602, USA
| | - Maria A Gandolfo
- School of Integrative Plant Sciences and L.H. Bailey Hortorium, Cornell University, Ithaca, New York, 14853, USA
| | - Jerrold I Davis
- School of Integrative Plant Sciences and L.H. Bailey Hortorium, Cornell University, Ithaca, New York, 14853, USA
| | | | - Claude dePamphilis
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, 16802, USA
| | - Chelsea D Specht
- School of Integrative Plant Sciences and L.H. Bailey Hortorium, Cornell University, Ithaca, New York, 14853, USA
| | - Sean W Graham
- Department of Botany, University of British Columbia, Vancouver, British Columbia, V6T 1Z4, Canada
| | - Craig F Barrett
- Department of Biology, West Virginia University, Morgantown, West Virginia, 26506, USA
| | - Cécile Ané
- Department of Botany, University of Wisconsin-Madison, Madison, Wisconsin, 53706, USA
- Department of Statistics, University of Wisconsin-Madison, Madison, Wisconsin, 53706, USA
| |
Collapse
|
49
|
Annenkova NV, Ahrén D, Logares R, Kremp A, Rengefors K. Delineating closely related dinoflagellate lineages using phylotranscriptomics. JOURNAL OF PHYCOLOGY 2018; 54:571-576. [PMID: 29676790 DOI: 10.1111/jpy.12748] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Accepted: 04/05/2018] [Indexed: 06/08/2023]
Abstract
Recently radiated dinoflagellates Apocalathium aciculiferum (collected in Lake Erken, Sweden), Apocalathium malmogiense (Baltic Sea) and Apocalathium aff. malmogiense (Highway Lake, Antarctica) represent a lineage with an unresolved phylogeny. We determined their phylogenetic relationships using phylotranscriptomics based on 792 amino acid sequences. Our results showed that A. aciculiferum diverged from the other two closely related lineages, consistent with their different morphologies in cell size, relative cell length and presence of spines. We hypothesized that A. aff. malmogiense and A. malmogiense, which inhabit different hemispheres, are evolutionarily more closely related because they diverged from a marine common ancestor, adapting to a wide salinity range, while A. aciculiferum colonized a freshwater habitat, by acquiring adaptations to this environment, in particular, salinity intolerance. We show that phylotranscriptomics can resolve the phylogeny of recently diverged protists. This has broad relevance, given that many phytoplankton species are morphologically very similar, and single genes sometimes lack the information to determine species' relationships.
Collapse
Affiliation(s)
- Nataliia V Annenkova
- Limnological Institute Siberian Branch of the Russian Academy of Sciences 3, Ulan-Batorskaya St., 664033, Irkutsk, Russia
| | - Dag Ahrén
- Microbial Ecology Group, Department of Biology, Lund University, Ecology Building, SE-223 62, Lund, Sweden
- Bioinformatics Infrastructures for Life Sciences (BILS), Department of Biology, Lund University, Ecology Building, SE-223 62, Lund, Sweden
| | - Ramiro Logares
- Department of Marine Biology and Oceanography, Institute of Marine Science (ICM)-Consejo Superior de Investigaciones Científicas (CSIC), Passeig Marítim de la Barceloneta 37-49, E08003, Barcelona, Spain
| | - Anke Kremp
- Marine Research Centre, Finnish Environment Institute, Erik Palmenin aukio 1, 00560, Helsinki, Finland
| | - Karin Rengefors
- Aquatic Ecology, Department of Biology, Lund University, Ecology Building, SE-223 62, Lund, Sweden
| |
Collapse
|
50
|
Herrando-Moraira S. Exploring data processing strategies in NGS target enrichment to disentangle radiations in the tribe Cardueae (Compositae). Mol Phylogenet Evol 2018; 128:69-87. [PMID: 30036700 DOI: 10.1016/j.ympev.2018.07.012] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2018] [Revised: 07/13/2018] [Accepted: 07/14/2018] [Indexed: 12/17/2022]
Abstract
Target enrichment is a cost-effective sequencing technique that holds promise for elucidating evolutionary relationships in fast-evolving lineages. However, potential biases and impact of bioinformatic sequence treatments in phylogenetic inference have not been thoroughly explored yet. Here, we investigate this issue with an ultimate goal to shed light into a highly diversified group of Compositae (Asteraceae) constituted by four main genera: Arctium, Cousinia, Saussurea, and Jurinea. Specifically, we compared sequence data extraction methods implemented in two easy-to-use workflows, PHYLUCE and HybPiper, and assessed the impact of two filtering practices intended to reduce phylogenetic noise. In addition, we compared two phylogenetic inference methods: (1) the concatenation approach, in which all loci were concatenated in a supermatrix; and (2) the coalescence approach, in which gene trees were produced independently and then used to construct a species tree under coalescence assumptions. Here we confirm the usefulness of the set of 1061 COS targets (a nuclear conserved orthology loci set developed for the Compositae) across a variety of taxonomic levels. Intergeneric relationships were completely resolved: there are two sister groups, Arctium-Cousinia and Saussurea-Jurinea, which are in agreement with a morphological hypothesis. Intrageneric relationships among species of Arctium, Cousinia, and Saussurea are also well defined. Conversely, conflicting species relationships remain for Jurinea. Methodological choices significantly affected phylogenies in terms of topology, branch length, and support. Across all analyses, the phylogeny obtained using HybPiper and the strictest scheme of removing fast-evolving sites was estimated as the optimal. Regarding methodological choices, we conclude that: (1) trees obtained under the coalescence approach are topologically more congruent between them than those inferred using the concatenation approach; (2) refining treatments only improved support values under the concatenation approach; and (3) branch support values are maximized when fast-evolving sites are removed in the concatenation approach, and when a higher number of loci is analyzed in the coalescence approach.
Collapse
Affiliation(s)
- Sonia Herrando-Moraira
- Botanic Institute of Barcelona (IBB, CSIC-ICUB), Pg. del Migdia, s.n., 08038 Barcelona, Spain.
| | | |
Collapse
|