1
|
Ball LD, Bedoya AM, Taylor CM, Lagomarsino LP. A target enrichment probe set for resolving phylogenetic relationships in the coffee family, Rubiaceae. Appl Plant Sci 2023; 11:e11554. [PMID: 38106541 PMCID: PMC10719880 DOI: 10.1002/aps3.11554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 05/08/2023] [Accepted: 05/13/2023] [Indexed: 12/19/2023]
Abstract
Premise Rubiaceae is among the most species-rich plant families, as well as one of the most morphologically and geographically diverse. Currently available phylogenies have mostly relied on few genomic and plastid loci, as opposed to large-scale genomic data. Target enrichment provides the ability to generate sequence data for hundreds to thousands of phylogenetically informative, single-copy loci, which often leads to improved phylogenetic resolution at both shallow and deep taxonomic scales; however, a publicly accessible Rubiaceae-specific probe set that allows for comparable phylogenetic inference across clades is lacking. Methods Here, we use publicly accessible genomic resources to identify putatively single-copy nuclear loci for target enrichment in two Rubiaceae groups: tribe Hillieae (Cinchonoideae) and tribal complex Palicoureeae+Psychotrieae (Rubioideae). We sequenced 2270 exonic regions corresponding to 1059 loci in our target clades and generated in silico target enrichment sequences for other Rubiaceae taxa using our designed probe set. To test the utility of our probe set for phylogenetic inference across Rubiaceae, we performed a coalescent-aware phylogenetic analysis using a subset of 27 Rubiaceae taxa from 10 different tribes and three subfamilies, and one outgroup in Apocynaceae. Results We recovered an average of 75% and 84% of targeted exons and loci, respectively, per Rubiaceae sample. Probes designed using genomic resources from a particular subfamily were most efficient at targeting sequences from taxa in that subfamily. The number of paralogs recovered during assembly varied for each clade. Phylogenetic inference of Rubiaceae with our target regions resolves relationships at various scales. Relationships are largely consistent with previous studies of relationships in the family with high support (≥0.98 local posterior probability) at nearly all nodes and evidence of gene tree discordance. Discussion Our probe set, which we call Rubiaceae2270x, was effective for targeting loci in species across and even outside of Rubiaceae. This probe set will facilitate phylogenomic studies in Rubiaceae and advance systematics and macroevolutionary studies in the family.
Collapse
Affiliation(s)
- Laymon D Ball
- Department of Biological Sciences Louisiana State University Baton Rouge Louisiana 70803 USA
| | - Ana M Bedoya
- Department of Biological Sciences Louisiana State University Baton Rouge Louisiana 70803 USA
| | - Charlotte M Taylor
- Missouri Botanical Garden 4344 Shaw Blvd. Saint Louis Missouri 63110 USA
| | - Laura P Lagomarsino
- Department of Biological Sciences Louisiana State University Baton Rouge Louisiana 70803 USA
| |
Collapse
|
2
|
Fonseca LHM, Carlsen MM, Fine PVA, Lohmann LG. A nuclear target sequence capture probe set for phylogeny reconstruction of the charismatic plant family Bignoniaceae. Front Genet 2023; 13:1085692. [PMID: 36699458 PMCID: PMC9869424 DOI: 10.3389/fgene.2022.1085692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 12/12/2022] [Indexed: 01/11/2023] Open
Abstract
The plant family Bignoniaceae is a conspicuous and charismatic element of the tropical flora. The family has a complex taxonomic history, with substantial changes in the classification of the group during the past two centuries. Recent re-classifications at the tribal and generic levels have been largely possible by the availability of molecular phylogenies reconstructed using Sanger sequencing data. However, our complete understanding of the systematics, evolution, and biogeography of the family remains incomplete, especially due to the low resolution and support of different portions of the Bignoniaceae phylogeny. To overcome these limitations and increase the amount of molecular data available for phylogeny reconstruction within this plant family, we developed a bait kit targeting 762 nuclear genes, including 329 genes selected specifically for the Bignoniaceae; 348 genes obtained from the Angiosperms353 with baits designed specifically for the family; and, 85 low-copy genes of known function. On average, 77.4% of the reads mapped to the targets, and 755 genes were obtained per species. After removing genes with putative paralogs, 677 loci were used for phylogenetic analyses. On-target genes were compared and combined in the Exon-Only dataset, and on-target + off-target regions were combined in the Supercontig dataset. We tested the performance of the bait kit at different taxonomic levels, from family to species-level, using 38 specimens of 36 different species of Bignoniaceae, representing: 1) six (out of eight) tribal level-clades (e.g., Bignonieae, Oroxyleae, Tabebuia Alliance, Paleotropical Clade, Tecomeae, and Jacarandeae), only Tourrettieae and Catalpeae were not sampled; 2) all 20 genera of Bignonieae; 3) seven (out of nine) species of Dolichandra (e.g., D. chodatii, D. cynanchoides, D. dentata, D. hispida, D. quadrivalvis, D. uncata, and D. uniguis-cati), only D. steyermarkii and D. unguiculata were not sampled; and 4) three individuals of Dolichandra unguis-cati. Our data reconstructed a well-supported phylogeny of the Bignoniaceae at different taxonomic scales, opening new perspectives for a comprehensive phylogenetic framework for the family as a whole.
Collapse
Affiliation(s)
- Luiz Henrique M. Fonseca
- Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil,Systematic and Evolutionary Botany Laboratory, Department of Biology, Ghent University, Ghent, Belgium,*Correspondence: Luiz Henrique M. Fonseca, ; Lúcia G. Lohmann,
| | | | - Paul V. A. Fine
- University and Jepson Herbaria, and Department of Integrative Biology, University of California, Berkeley, Berkeley, CA, United States
| | - Lúcia G. Lohmann
- Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil,University and Jepson Herbaria, and Department of Integrative Biology, University of California, Berkeley, Berkeley, CA, United States,*Correspondence: Luiz Henrique M. Fonseca, ; Lúcia G. Lohmann,
| |
Collapse
|
3
|
Crone MK, Biddinger DJ, Grozinger CM. Wild Bee Nutritional Ecology: Integrative Strategies to Assess Foraging Preferences and Nutritional Requirements. Front Sustain Food Syst 2022. [DOI: 10.3389/fsufs.2022.847003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Bees depend on flowering plants for their nutrition, and reduced availability of floral resources is a major driver of declines in both managed and wild bee populations. Understanding the nutritional needs of different bee species, and how these needs are met by the varying nutritional resources provided by different flowering plant taxa, can greatly inform land management recommendations to support bee populations and their associated ecosystem services. However, most bee nutrition research has focused on the three most commonly managed and commercially reared bee taxa—honey bees, bumble bees, and mason bees—with fewer studies focused on wild bees and other managed species, such as leafcutting bees, stingless bees, and alkali bees. Thus, we have limited information about the nutritional requirements and foraging preferences of the vast majority of bee species. Here, we discuss the approaches traditionally used to understand bee nutritional ecology: identification of floral visitors of selected focal plant species, evaluation of the foraging preferences of adults in selected focal bee species, evaluation of the nutritional requirements of focal bee species (larvae or adults) in controlled settings, and examine how these methods may be adapted to study a wider range of bee species. We also highlight emerging technologies that have the potential to greatly facilitate studies of the nutritional ecology of wild bee species, as well as evaluate bee nutritional ecology at significantly larger spatio-temporal scales than were previously feasible. While the focus of this review is on bee species, many of these techniques can be applied to other pollinator taxa as well.
Collapse
|
4
|
Yardeni G, Viruel J, Paris M, Hess J, Groot Crego C, de La Harpe M, Rivera N, Barfuss MHJ, Till W, Guzmán-Jacob V, Krömer T, Lexer C, Paun O, Leroy T. Taxon-specific or universal? Using target capture to study the evolutionary history of rapid radiations. Mol Ecol Resour 2021; 22:927-945. [PMID: 34606683 PMCID: PMC9292372 DOI: 10.1111/1755-0998.13523] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 09/09/2021] [Accepted: 09/22/2021] [Indexed: 12/20/2022]
Abstract
Target capture has emerged as an important tool for phylogenetics and population genetics in nonmodel taxa. Whereas developing taxon‐specific capture probes requires sustained efforts, available universal kits may have a lower power to reconstruct relationships at shallow phylogenetic scales and within rapidly radiating clades. We present here a newly developed target capture set for Bromeliaceae, a large and ecologically diverse plant family with highly variable diversification rates. The set targets 1776 coding regions, including genes putatively involved in key innovations, with the aim to empower testing of a wide range of evolutionary hypotheses. We compare the relative power of this taxon‐specific set, Bromeliad1776, to the universal Angiosperms353 kit. The taxon‐specific set results in higher enrichment success across the entire family; however, the overall performance of both kits to reconstruct phylogenetic trees is relatively comparable, highlighting the vast potential of universal kits for resolving evolutionary relationships. For more detailed phylogenetic or population genetic analyses, for example the exploration of gene tree concordance, nucleotide diversity or population structure, the taxon‐specific capture set presents clear benefits. We discuss the potential lessons that this comparative study provides for future phylogenetic and population genetic investigations, in particular for the study of evolutionary radiations.
Collapse
Affiliation(s)
- Gil Yardeni
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | | | - Margot Paris
- Unit of Ecology & Evolution, Department of Biology, University of Fribourg, Fribourg, Switzerland
| | - Jaqueline Hess
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria.,Department of Soil Ecology, Helmholtz Centre for Environmental Research, UFZ, Halle (Saale), Germany
| | - Clara Groot Crego
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria.,Vienna Graduate School of Population Genetics, Vienna, Austria
| | - Marylaure de La Harpe
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Norma Rivera
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Michael H J Barfuss
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Walter Till
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Valeria Guzmán-Jacob
- Biodiversity, Macroecology and Biogeography, University of Goettingen, Göttingen, Germany
| | - Thorsten Krömer
- Centro de Investigaciones Tropicales, Universidad Veracruzana, Xalapa, Mexico
| | - Christian Lexer
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Ovidiu Paun
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Thibault Leroy
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| |
Collapse
|
5
|
Reichelt N, Wen J, Pätzold C, Appelhans MS. Target enrichment improves phylogenetic resolution in the genus Zanthoxylum (Rutaceae) and indicates both incomplete lineage sorting and hybridization events. Ann Bot 2021; 128:497-510. [PMID: 34250543 PMCID: PMC8414929 DOI: 10.1093/aob/mcab092] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Accepted: 07/09/2021] [Indexed: 05/14/2023]
Abstract
BACKGROUND AND AIMS Zanthoxylum is the only pantropical genus within Rutaceae, with a few species native to temperate eastern Asia and North America. Efforts using Sanger sequencing failed to resolve the backbone phylogeny of Zanthoxylum. In this study, we employed target-enrichment high-throughput sequencing to improve resolution. Gene trees were examined for concordance and sectional classifications of Zanthoxylum were evaluated. Off-target reads were investigated to identify putative single-copy markers for bait refinement, and low-copy markers for evidence of putative hybridization events. METHODS A custom bait set targeting 354 genes, with a median of 321 bp, was designed for Zanthoxylum and applied to 44 Zanthoxylum species and one Tetradium species as the outgroup. Illumina reads were processed via the HybPhyloMaker pipeline. Phylogenetic inferences were conducted using coalescent and maximum likelihood methods based on concatenated datasets. Concordance was assessed using quartet sampling. Additional phylogenetic analyses were performed on putative single and low-copy genes extracted from off-target reads. KEY RESULTS Four major clades are supported within Zanthoxylum: the African clade, the Z. asiaticum clade, the Asian-Pacific-Australian clade and the American-eastern Asian clade. While overall support has improved, regions of conflict are similar to those previously observed. Gene tree discordances indicate a hybridization event in the ancestor of the Hawaiian lineage, and incomplete lineage sorting in the American backbone. Off-target putative single-copy genes largely confirm on-target results, and putative low-copy genes provide additional evidence for hybridization in the Hawaiian lineage. Only two of the five sections of Zanthoxylum are resolved as monophyletic. CONCLUSIONS Target enrichment is suitable for assessing phylogenetic relationships in Zanthoxylum. Our phylogenetic analyses reveal that current sectional classifications need revision. Quartet tree concordance indicates several instances of reticulate evolution. Off-target reads are proven useful to identify additional phylogenetically informative regions for bait refinement or gene tree based approaches.
Collapse
Affiliation(s)
- Niklas Reichelt
- Department of Systematics, Biodiversity and Evolution of Plants, Albrecht-von-Haller Institute of Plant Sciences, University of Goettingen, Untere Karspuele 2, 37073 Goettingen, Germany
- Department of Botany, National Museum of Natural History, Smithsonian Institution, P.O. Box 37012, MRC 166, Washington, DC 20013-7012, USA
- Pharmaceutical Biology, Julius-von-Sachs-Institute for Biosciences, University of Wuerzburg, Julius-von-Sachs-Platz 2, 97082 Wuerzburg, Germany
| | - Jun Wen
- Department of Botany, National Museum of Natural History, Smithsonian Institution, P.O. Box 37012, MRC 166, Washington, DC 20013-7012, USA
| | - Claudia Pätzold
- Department of Systematics, Biodiversity and Evolution of Plants, Albrecht-von-Haller Institute of Plant Sciences, University of Goettingen, Untere Karspuele 2, 37073 Goettingen, Germany
- Department Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, 60325 Frankfurt am Main, Germany
| | - Marc S Appelhans
- Department of Systematics, Biodiversity and Evolution of Plants, Albrecht-von-Haller Institute of Plant Sciences, University of Goettingen, Untere Karspuele 2, 37073 Goettingen, Germany
- Department of Botany, National Museum of Natural History, Smithsonian Institution, P.O. Box 37012, MRC 166, Washington, DC 20013-7012, USA
- For correspondence. E-mail
| |
Collapse
|
6
|
McLay TGB, Birch JL, Gunn BF, Ning W, Tate JA, Nauheimer L, Joyce EM, Simpson L, Schmidt‐Lebuhn AN, Baker WJ, Forest F, Jackson CJ. New targets acquired: Improving locus recovery from the Angiosperms353 probe set. Appl Plant Sci 2021; 9:APS311420. [PMID: 34336399 PMCID: PMC8312740 DOI: 10.1002/aps3.11420] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 03/15/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Universal target enrichment kits maximize utility across wide evolutionary breadth while minimizing the number of baits required to create a cost-efficient kit. The Angiosperms353 kit has been successfully used to capture loci throughout the angiosperms, but the default target reference file includes sequence information from only 6-18 taxa per locus. Consequently, reads sequenced from on-target DNA molecules may fail to map to references, resulting in fewer on-target reads for assembly, and reducing locus recovery. METHODS We expanded the Angiosperms353 target file, incorporating sequences from 566 transcriptomes to produce a 'mega353' target file, with each locus represented by 17-373 taxa. This mega353 file is a drop-in replacement for the original Angiosperms353 file in HybPiper analyses. We provide tools to subsample the file based on user-selected taxon groups, and to incorporate other transcriptome or protein-coding gene data sets. RESULTS Compared to the default Angiosperms353 file, the mega353 file increased the percentage of on-target reads by an average of 32%, increased locus recovery at 75% length by 49%, and increased the total length of the concatenated loci by 29%. DISCUSSION Increasing the phylogenetic density of the target reference file results in improved recovery of target capture loci. The mega353 file and associated scripts are available at: https://github.com/chrisjackson-pellicle/NewTargets.
Collapse
Affiliation(s)
- Todd G. B. McLay
- National Herbarium of VictoriaRoyal Botanic Gardens VictoriaMelbourneAustralia
- School of BiosciencesUniversity of MelbourneMelbourneAustralia
- Centre for Australian National Biodiversity ResearchCSIROCanberraAustralia
| | - Joanne L. Birch
- School of BiosciencesUniversity of MelbourneMelbourneAustralia
| | - Bee F. Gunn
- National Herbarium of VictoriaRoyal Botanic Gardens VictoriaMelbourneAustralia
- School of BiosciencesUniversity of MelbourneMelbourneAustralia
| | - Weixuan Ning
- School of Fundamental SciencesMassey UniversityPalmerston NorthNew Zealand
| | - Jennifer A. Tate
- School of Fundamental SciencesMassey UniversityPalmerston NorthNew Zealand
| | - Lars Nauheimer
- James Cook UniversityCairnsAustralia
- Australian Tropical HerbariumJames Cook UniversityCairnsAustralia
| | - Elizabeth M. Joyce
- James Cook UniversityCairnsAustralia
- Australian Tropical HerbariumJames Cook UniversityCairnsAustralia
| | - Lalita Simpson
- James Cook UniversityCairnsAustralia
- Australian Tropical HerbariumJames Cook UniversityCairnsAustralia
| | | | | | - Félix Forest
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
| | - Chris J. Jackson
- National Herbarium of VictoriaRoyal Botanic Gardens VictoriaMelbourneAustralia
| |
Collapse
|
7
|
Ufimov R, Zeisek V, Píšová S, Baker WJ, Fér T, van Loo M, Dobeš C, Schmickl R. Relative performance of customized and universal probe sets in target enrichment: A case study in subtribe Malinae. Appl Plant Sci 2021; 9:e11442. [PMID: 34336405 PMCID: PMC8312748 DOI: 10.1002/aps3.11442] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Accepted: 04/09/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Custom probe design for target enrichment in phylogenetics is tedious and often hinders broader phylogenetic synthesis. The universal angiosperm probe set Angiosperms353 may be the solution. Here, we test the relative performance of Angiosperms353 on the Rosaceae subtribe Malinae in comparison with custom probes that we specifically designed for this clade. We then address the impact of bioinformatically altering the performance of Angiosperms353 by replacing the original probe sequences with orthologs extracted from the Malus domestica genome. METHODS To evaluate the relative performance of these probe sets, we compared the enrichment efficiency, locus recovery, alignment length, proportion of parsimony-informative sites, proportion of potential paralogs, the topology and support of the resulting species trees, and the gene tree discordance. RESULTS Locus recovery was highest for our custom Malinae probe set, and replacing the original Angiosperms353 sequences with a Malus representative improved the locus recovery relative to Angiosperms353. The proportion of parsimony-informative sites was similar between all probe sets, while the gene tree discordance was lower in the case of the custom probes. DISCUSSION A custom probe set benefits from data completeness and can be tailored toward the specificities of the project of choice; however, Angiosperms353 was equally as phylogenetically informative as the custom probes. We therefore recommend using both a custom probe set and Angiosperms353 to facilitate large-scale systematic studies, where financially possible.
Collapse
Affiliation(s)
- Roman Ufimov
- Department of Forest Growth, Silviculture and GeneticsAustrian Research Centre for ForestsSeckendorff‐Gudent‐Weg 8Vienna1130Austria
- Komarov Botanical InstituteRussian Academy of Sciencesul. Prof. Popova 2St. Petersburg197376Russian Federation
| | - Vojtěch Zeisek
- Institute of BotanyThe Czech Academy of SciencesZámek 1Průhonice252 43Czech Republic
- Department of BotanyFaculty of ScienceCharles UniversityBenátská 2Prague128 01Czech Republic
| | - Soňa Píšová
- Department of Forest Growth, Silviculture and GeneticsAustrian Research Centre for ForestsSeckendorff‐Gudent‐Weg 8Vienna1130Austria
- Institute of BotanyThe Czech Academy of SciencesZámek 1Průhonice252 43Czech Republic
| | | | - Tomáš Fér
- Department of BotanyFaculty of ScienceCharles UniversityBenátská 2Prague128 01Czech Republic
| | - Marcela van Loo
- Department of Forest Growth, Silviculture and GeneticsAustrian Research Centre for ForestsSeckendorff‐Gudent‐Weg 8Vienna1130Austria
| | - Christoph Dobeš
- Department of Forest Growth, Silviculture and GeneticsAustrian Research Centre for ForestsSeckendorff‐Gudent‐Weg 8Vienna1130Austria
| | - Roswitha Schmickl
- Institute of BotanyThe Czech Academy of SciencesZámek 1Průhonice252 43Czech Republic
- Department of BotanyFaculty of ScienceCharles UniversityBenátská 2Prague128 01Czech Republic
| |
Collapse
|
8
|
Shah T, Schneider JV, Zizka G, Maurin O, Baker W, Forest F, Brewer GE, Savolainen V, Darbyshire I, Larridon I. Joining forces in Ochnaceae phylogenomics: a tale of two targeted sequencing probe kits. Am J Bot 2021; 108:1201-1216. [PMID: 34180046 DOI: 10.1002/ajb2.1682] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 02/23/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Both universal and family-specific targeted sequencing probe kits are becoming widely used for reconstruction of phylogenetic relationships in angiosperms. Within the pantropical Ochnaceae, we show that with careful data filtering, universal kits are equally as capable in resolving intergeneric relationships as custom probe kits. Furthermore, we show the strength in combining data from both kits to mitigate bias and provide a more robust result to resolve evolutionary relationships. METHODS We sampled 23 Ochnaceae genera and used targeted sequencing with two probe kits, the universal Angiosperms353 kit and a family-specific kit. We used maximum likelihood inference with a concatenated matrix of loci and multispecies-coalescence approaches to infer relationships in the family. We explored phylogenetic informativeness and the impact of missing data on resolution and tree support. RESULTS For the Angiosperms353 data set, the concatenation approach provided results more congruent with those of the Ochnaceae-specific data set. Filtering missing data was most impactful on the Angiosperms353 data set, with a relaxed threshold being the optimum scenario. The Ochnaceae-specific data set resolved consistent topologies using both inference methods, and no major improvements were obtained after data filtering. Merging of data obtained with the two kits resulted in a well-supported phylogenetic tree. CONCLUSIONS The Angiosperms353 data set improved upon data filtering, and missing data played an important role in phylogenetic reconstruction. The Angiosperms353 data set resolved the phylogenetic backbone of Ochnaceae as equally well as the family specific data set. All analyses indicated that both Sauvagesia L. and Campylospermum Tiegh. as currently circumscribed are polyphyletic and require revised delimitation.
Collapse
Affiliation(s)
- Toral Shah
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | - Julio V Schneider
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
| | - Georg Zizka
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
- Institute of Ecology, Evolution and Diversity, Goethe University, Max-von-Laue-Str. 13, Frankfurt am Main, 60438, Germany
| | - Olivier Maurin
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - William Baker
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Grace E Brewer
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Vincent Savolainen
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | | | - Isabel Larridon
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Systematic and Evolutionary Botany Lab, Department of Biology, Ghent University, K.L., Ledeganckstraat 35, Gent, 9000, Belgium
| |
Collapse
|
9
|
Baker WJ, Bailey P, Barber V, Barker A, Bellot S, Bishop D, Botigué LR, Brewer G, Carruthers T, Clarkson JJ, Cook J, Cowan RS, Dodsworth S, Epitawalage N, Françoso E, Gallego B, Johnson MG, Kim JT, Leempoel K, Maurin O, McGinnie C, Pokorny L, Roy S, Stone M, Toledo E, Wickett NJ, Zuntini AR, Eiserhardt WL, Kersey PJ, Leitch IJ, Forest F. A Comprehensive Phylogenomic Platform for Exploring the Angiosperm Tree of Life. Syst Biol 2021; 71:301-319. [PMID: 33983440 PMCID: PMC8830076 DOI: 10.1093/sysbio/syab035] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 05/06/2021] [Accepted: 05/08/2021] [Indexed: 12/22/2022] Open
Abstract
The tree of life is the fundamental biological roadmap for navigating the evolution and properties of life on Earth, and yet remains largely unknown. Even angiosperms (flowering plants) are fraught with data gaps, despite their critical role in sustaining terrestrial life. Today, high-throughput sequencing promises to significantly deepen our understanding of evolutionary relationships. Here, we describe a comprehensive phylogenomic platform for exploring the angiosperm tree of life, comprising a set of open tools and data based on the 353 nuclear genes targeted by the universal Angiosperms353 sequence capture probes. The primary goals of this article are to (i) document our methods, (ii) describe our first data release, and (iii) present a novel open data portal, the Kew Tree of Life Explorer (https://treeoflife.kew.org). We aim to generate novel target sequence capture data for all genera of flowering plants, exploiting natural history collections such as herbarium specimens, and augment it with mined public data. Our first data release, described here, is the most extensive nuclear phylogenomic data set for angiosperms to date, comprising 3099 samples validated by DNA barcode and phylogenetic tests, representing all 64 orders, 404 families (96\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
}{}$\%$\end{document}) and 2333 genera (17\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
}{}$\%$\end{document}). A “first pass” angiosperm tree of life was inferred from the data, which totaled 824,878 sequences, 489,086,049 base pairs, and 532,260 alignment columns, for interactive presentation in the Kew Tree of Life Explorer. This species tree was generated using methods that were rigorous, yet tractable at our scale of operation. Despite limitations pertaining to taxon and gene sampling, gene recovery, models of sequence evolution and paralogy, the tree strongly supports existing taxonomy, while challenging numerous hypothesized relationships among orders and placing many genera for the first time. The validated data set, species tree and all intermediates are openly accessible via the Kew Tree of Life Explorer and will be updated as further data become available. This major milestone toward a complete tree of life for all flowering plant species opens doors to a highly integrated future for angiosperm phylogenomics through the systematic sequencing of standardized nuclear markers. Our approach has the potential to serve as a much-needed bridge between the growing movement to sequence the genomes of all life on Earth and the vast phylogenomic potential of the world’s natural history collections. [Angiosperms; Angiosperms353; genomics; herbariomics; museomics; nuclear phylogenomics; open access; target sequence capture; tree of life.]
Collapse
Affiliation(s)
- William J Baker
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Paul Bailey
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Vanessa Barber
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Abigail Barker
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Sidonie Bellot
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - David Bishop
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Laura R Botigué
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,Centre for Research in Agricultural Genomics, Campus UAB, Edifici CRAG, Bellaterra Cerdanyola del Vallès, 08193 Barcelona, Spain
| | - Grace Brewer
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Tom Carruthers
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - James J Clarkson
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Jeffrey Cook
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Robyn S Cowan
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Steven Dodsworth
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,School of Life Sciences, University of Bedfordshire, University Square, Luton LU1 3JU, United Kingdom
| | | | - Elaine Françoso
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Berta Gallego
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Matthew G Johnson
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
| | - Jan T Kim
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,Department of Computer Science, School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield, Hertfordshire, AL10 9AB, United Kingdom
| | - Kevin Leempoel
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Olivier Maurin
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | | | - Lisa Pokorny
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,Centre for Plant Biotechnology and Genomics (CBGP) UPM-INIA, 28223 Pozuelo de Alarcón (Madrid), Spain
| | - Shyamali Roy
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Malcolm Stone
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Eduardo Toledo
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Norman J Wickett
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL 60022, USA
| | | | - Wolf L Eiserhardt
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,Department of Biology, Aarhus University, 8000 Aarhus C, Denmark
| | - Paul J Kersey
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Ilia J Leitch
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| |
Collapse
|
10
|
Straub SCK, Boutte J, Fishbein M, Livshultz T. Enabling evolutionary studies at multiple scales in Apocynaceae through Hyb-Seq. Appl Plant Sci 2020; 8:e11400. [PMID: 33304663 PMCID: PMC7705337 DOI: 10.1002/aps3.11400] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 09/12/2020] [Indexed: 05/07/2023]
Abstract
PREMISE Apocynaceae is the 10th largest flowering plant family and a focus for study of plant-insect interactions, especially as mediated by secondary metabolites. However, it has few genomic resources relative to its size. Target capture sequencing is a powerful approach for genome reduction that facilitates studies requiring data from the nuclear genome in non-model taxa, such as Apocynaceae. METHODS Transcriptomes were used to design probes for targeted sequencing of putatively single-copy nuclear genes across Apocynaceae. The sequences obtained were used to assess the success of the probe design, the intrageneric and intraspecific variation in the targeted genes, and the utility of the genes for inferring phylogeny. RESULTS From 853 candidate nuclear genes, 835 were consistently recovered in single copy and were variable enough for phylogenomics. The inferred gene trees were useful for coalescent-based species tree analysis, which showed all subfamilies of Apocynaceae as monophyletic, while also resolving relationships among species within the genus Apocynum. Intraspecific comparison of Elytropus chilensis individuals revealed numerous single-nucleotide polymorphisms with potential for use in population-level studies. DISCUSSION Community use of this Hyb-Seq probe set will facilitate and promote progress in the study of Apocynaceae across scales from population genomics to phylogenomics.
Collapse
Affiliation(s)
- Shannon C. K. Straub
- Department of BiologyHobart and William Smith Colleges300 Pulteney StreetGenevaNew York14456USA
| | - Julien Boutte
- Department of BiologyHobart and William Smith Colleges300 Pulteney StreetGenevaNew York14456USA
| | - Mark Fishbein
- Department of Plant Biology, Ecology, and EvolutionOklahoma State University301 Physical SciencesStillwaterOklahoma74078USA
| | - Tatyana Livshultz
- Department of Biodiversity, Earth, and Environmental Sciences and the Academy of Natural SciencesDrexel University1900 Benjamin Franklin ParkwayPhiladelphiaPennsylvania19103USA
| |
Collapse
|
11
|
Jantzen JR, Amarasinghe P, Folk RA, Reginato M, Michelangeli FA, Soltis DE, Cellinese N, Soltis PS. A two-tier bioinformatic pipeline to develop probes for target capture of nuclear loci with applications in Melastomataceae. Appl Plant Sci 2020; 8:e11345. [PMID: 32477841 PMCID: PMC7249273 DOI: 10.1002/aps3.11345] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Accepted: 12/20/2019] [Indexed: 05/21/2023]
Abstract
PREMISE Putatively single-copy nuclear (SCN) loci, which are identified using genomic resources of closely related species, are ideal for phylogenomic inference. However, suitable genomic resources are not available for many clades, including Melastomataceae. We introduce a versatile approach to identify SCN loci for clades with few genomic resources and use it to develop probes for target enrichment in the distantly related Memecylon and Tibouchina (Melastomataceae). METHODS We present a two-tiered pipeline. First, we identified putatively SCN loci using MarkerMiner and transcriptomes from distantly related species in Melastomataceae. Published loci and genes of functional significance were then added (384 total loci). Second, using HybPiper, we retrieved 689 homologous template sequences for these loci using genome-skimming data from within the focal clades. RESULTS We sequenced 193 loci common to Memecylon and Tibouchina. Probes designed from 56 template sequences successfully targeted sequences in both clades. Probes designed from genome-skimming data within a focal clade were more successful than probes designed from other sources. DISCUSSION Our pipeline successfully identified and targeted SCN loci in Memecylon and Tibouchina, enabling phylogenomic studies in both clades and potentially across Melastomataceae. This pipeline could be easily applied to other clades with few genomic resources.
Collapse
Affiliation(s)
- Johanna R. Jantzen
- Department of BiologyUniversity of FloridaGainesvilleFlorida32611USA
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Prabha Amarasinghe
- Department of BiologyUniversity of FloridaGainesvilleFlorida32611USA
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Ryan A. Folk
- Department of Biological SciencesMississippi State UniversityStarkvilleMississippi39762USA
| | - Marcelo Reginato
- Institute of Systematic BotanyThe New York Botanical GardenBronxNew York10458USA
- Universidade Federal do Rio Grande do SulPorto AlegreRio Grande do Sul90040‐060Brazil
| | | | - Douglas E. Soltis
- Department of BiologyUniversity of FloridaGainesvilleFlorida32611USA
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Nico Cellinese
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| | - Pamela S. Soltis
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFlorida32611USA
| |
Collapse
|
12
|
Shee ZQ, Frodin DG, Cámara-Leret R, Pokorny L. Reconstructing the Complex Evolutionary History of the Papuasian Schefflera Radiation Through Herbariomics. Front Plant Sci 2020; 11:258. [PMID: 32265950 PMCID: PMC7099051 DOI: 10.3389/fpls.2020.00258] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Accepted: 02/19/2020] [Indexed: 05/19/2023]
Abstract
With its large proportion of endemic taxa, complex geological past, and location at the confluence of the highly diverse Malesian and Australian floristic regions, Papuasia - the floristic region comprising the Bismarck Archipelago, New Guinea, and the Solomon Islands - represents an ideal natural experiment in plant biogeography. However, scattered knowledge of its flora and limited representation in herbaria have hindered our understanding of the drivers of its diversity. Focusing on the woody angiosperm genus Schefflera (Araliaceae), we ask whether its morphologically defined infrageneric groupings are monophyletic, when these lineages diverged, and where (within Papuasia or elsewhere) they diversified. To address these questions, we use a high-throughput sequencing approach (Hyb-Seq) which combines target capture (with an angiosperm-wide bait kit targeting 353 single-copy nuclear loci) and genome shotgun sequencing (which allows retrieval of regions in high-copy number, e.g., organellar DNA) of historical herbarium collections. To reconstruct the evolutionary history of the genus we reconstruct molecular phylogenies with Bayesian inference, maximum likelihood, and pseudo-coalescent approaches, and co-estimate divergence times and ancestral areas in a Bayesian framework. We find strong support for most infrageneric morphological groupings, as currently circumscribed, and we show the efficacy of the Angiosperms-353 probe kit in resolving both deep and shallow phylogenetic relationships. We infer a sequence of colonization to explain the present-day distribution of Schefflera in Papuasia: from the Sunda Shelf, Schefflera arrived to the Woodlark plate (present-day eastern New Guinea) in the late Oligocene (when most of New Guinea was submerged) and, subsequently (throughout the Miocene), it migrated westwards (to the Maoke and Bird's Head Plates and thereon) and further diversified, in agreement with previous reconstructions.
Collapse
Affiliation(s)
- Zhi Qiang Shee
- Royal Botanic Gardens, Kew, Richmond, United Kingdom
- Singapore Botanic Gardens, Singapore, Singapore
| | | | - Rodrigo Cámara-Leret
- Royal Botanic Gardens, Kew, Richmond, United Kingdom
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Bren School of Environmental Science and Management, University of California, Santa Barbara, Santa Barbara, CA, United States
| | - Lisa Pokorny
- Royal Botanic Gardens, Kew, Richmond, United Kingdom
- Centre for Plant Biotechnology and Genomics (CBGP UPM-INIA), Madrid, Spain
- Real Jardín Botánico (RJB-CSIC), Madrid, Spain
| |
Collapse
|
13
|
Jones KE, Fér T, Schmickl RE, Dikow RB, Funk VA, Herrando‐Moraira S, Johnston PR, Kilian N, Siniscalchi CM, Susanna A, Slovák M, Thapa R, Watson LE, Mandel JR. An empirical assessment of a single family-wide hybrid capture locus set at multiple evolutionary timescales in Asteraceae. Appl Plant Sci 2019; 7:e11295. [PMID: 31667023 PMCID: PMC6814182 DOI: 10.1002/aps3.11295] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Accepted: 09/05/2019] [Indexed: 05/23/2023]
Abstract
PREMISE Hybrid capture with high-throughput sequencing (Hyb-Seq) is a powerful tool for evolutionary studies. The applicability of an Asteraceae family-specific Hyb-Seq probe set and the outcomes of different phylogenetic analyses are investigated here. METHODS Hyb-Seq data from 112 Asteraceae samples were organized into groups at different taxonomic levels (tribe, genus, and species). For each group, data sets of non-paralogous loci were built and proportions of parsimony informative characters estimated. The impacts of analyzing alternative data sets, removing long branches, and type of analysis on tree resolution and inferred topologies were investigated in tribe Cichorieae. RESULTS Alignments of the Asteraceae family-wide Hyb-Seq locus set were parsimony informative at all taxonomic levels. Levels of resolution and topologies inferred at shallower nodes differed depending on the locus data set and the type of analysis, and were affected by the presence of long branches. DISCUSSION The approach used to build a Hyb-Seq locus data set influenced resolution and topologies inferred in phylogenetic analyses. Removal of long branches improved the reliability of topological inferences in maximum likelihood analyses. The Astereaceae Hyb-Seq probe set is applicable at multiple taxonomic depths, which demonstrates that probe sets do not necessarily need to be lineage-specific.
Collapse
Affiliation(s)
- Katy E. Jones
- Botanischer Garten und Botanisches Museum BerlinFreie Universität BerlinKönigin‐Luise‐Str. 6–814195BerlinGermany
| | - Tomáš Fér
- Department of BotanyFaculty of ScienceCharles UniversityBenátská 2CZ 12800PragueCzech Republic
| | - Roswitha E. Schmickl
- Department of BotanyFaculty of ScienceCharles UniversityBenátská 2CZ 12800PragueCzech Republic
- Institute of BotanyThe Czech Academy of SciencesZámek 1CZ 25243PrůhoniceCzech Republic
| | - Rebecca B. Dikow
- Data Science LabOffice of the Chief Information OfficerSmithsonian InstitutionWashingtonD.C.20013‐7012USA
| | - Vicki A. Funk
- Department of BotanyNational Museum of Natural HistorySmithsonian InstitutionWashingtonD.C.20013‐7012USA
| | | | - Paul R. Johnston
- Freie Universität BerlinEvolutionary BiologyBerlinGermany
- Berlin Center for Genomics in Biodiversity ResearchBerlinGermany
- Leibniz‐Institute of Freshwater Ecology and Inland Fisheries (IGB)BerlinGermany
| | - Norbert Kilian
- Botanischer Garten und Botanisches Museum BerlinFreie Universität BerlinKönigin‐Luise‐Str. 6–814195BerlinGermany
| | - Carolina M. Siniscalchi
- Department of Biological SciencesUniversity of MemphisMemphisTennessee38152USA
- Center for BiodiversityUniversity of MemphisMemphisTennessee38152USA
| | - Alfonso Susanna
- Botanic Institute of Barcelona (IBB‐CSIC‐ICUB)Pg. del Migdia s.n.ES 08038BarcelonaSpain
| | - Marek Slovák
- Department of BotanyFaculty of ScienceCharles UniversityBenátská 2CZ 12800PragueCzech Republic
- Plant Science and Biodiversity CentreSlovak Academy of SciencesSK‐84523BratislavaSlovakia
| | - Ramhari Thapa
- Department of Biological SciencesUniversity of MemphisMemphisTennessee38152USA
- Center for BiodiversityUniversity of MemphisMemphisTennessee38152USA
| | - Linda E. Watson
- Department of Plant Biology, Ecology, and EvolutionOklahoma State UniversityStillwaterOklahoma74078USA
| | - Jennifer R. Mandel
- Department of Biological SciencesUniversity of MemphisMemphisTennessee38152USA
- Center for BiodiversityUniversity of MemphisMemphisTennessee38152USA
| |
Collapse
|
14
|
Choi B, Crisp MD, Cook LG, Meusemann K, Edwards RD, Toon A, Külheim C. Identifying genetic markers for a range of phylogenetic utility-From species to family level. PLoS One 2019; 14:e0218995. [PMID: 31369563 DOI: 10.1371/journal.pone.0218995] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 06/13/2019] [Indexed: 12/03/2022] Open
Abstract
Resolving the phylogenetic relationships of closely related species using a small set of loci is challenging as sufficient information may not be captured from a limited sample of the genome. Relying on few loci can also be problematic when conflict between gene-trees arises from incomplete lineage sorting and/or ongoing hybridization, problems especially likely in recently diverged lineages. Here, we developed a method using limited genomic resources that allows identification of many low copy candidate loci from across the nuclear and chloroplast genomes, design probes for target capture and sequence the captured loci. To validate our method we present data from Eucalyptus and Melaleuca, two large and phylogenetically problematic genera within the Myrtaceae family. With one annotated genome, one transcriptome and two whole-genome shotgun sequences of one Eucalyptus and four Melaleuca species, respectively, we identified 212 loci representing 263 kbp for targeted sequence capture and sequencing. Of these, 209 were successfully tested from 47 samples across five related genera of Myrtaceae. The average percentage of reads mapped back to the reference was 57.6% with coverage of more than 20 reads per position across 83.5% of the data. The methods developed here should be applicable across a large range of taxa across all kingdoms. The core methods are very flexible, providing a platform for various genomic resource availabilities and are useful from shallow to deep phylogenies.
Collapse
|
15
|
Johnson MG, Pokorny L, Dodsworth S, Botigué LR, Cowan RS, Devault A, Eiserhardt WL, Epitawalage N, Forest F, Kim JT, Leebens-Mack JH, Leitch IJ, Maurin O, Soltis DE, Soltis PS, Wong GKS, Baker WJ, Wickett NJ. A Universal Probe Set for Targeted Sequencing of 353 Nuclear Genes from Any Flowering Plant Designed Using k-Medoids Clustering. Syst Biol 2019; 68:594-606. [PMID: 30535394 PMCID: PMC6568016 DOI: 10.1093/sysbio/syy086] [Citation(s) in RCA: 188] [Impact Index Per Article: 37.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Revised: 11/29/2018] [Accepted: 12/03/2018] [Indexed: 01/31/2023] Open
Abstract
Sequencing of target-enriched libraries is an efficient and cost-effective method for obtaining DNA sequence data from hundreds of nuclear loci for phylogeny reconstruction. Much of the cost of developing targeted sequencing approaches is associated with the generation of preliminary data needed for the identification of orthologous loci for probe design. In plants, identifying orthologous loci has proven difficult due to a large number of whole-genome duplication events, especially in the angiosperms (flowering plants). We used multiple sequence alignments from over 600 angiosperms for 353 putatively single-copy protein-coding genes identified by the One Thousand Plant Transcriptomes Initiative to design a set of targeted sequencing probes for phylogenetic studies of any angiosperm group. To maximize the phylogenetic potential of the probes, while minimizing the cost of production, we introduce a k-medoids clustering approach to identify the minimum number of sequences necessary to represent each coding sequence in the final probe set. Using this method, 5-15 representative sequences were selected per orthologous locus, representing the sequence diversity of angiosperms more efficiently than if probes were designed using available sequenced genomes alone. To test our approximately 80,000 probes, we hybridized libraries from 42 species spanning all higher-order groups of angiosperms, with a focus on taxa not present in the sequence alignments used to design the probes. Out of a possible 353 coding sequences, we recovered an average of 283 per species and at least 100 in all species. Differences among taxa in sequence recovery could not be explained by relatedness to the representative taxa selected for probe design, suggesting that there is no phylogenetic bias in the probe set. Our probe set, which targeted 260 kbp of coding sequence, achieved a median recovery of 137 kbp per taxon in coding regions, a maximum recovery of 250 kbp, and an additional median of 212 kbp per taxon in flanking non-coding regions across all species. These results suggest that the Angiosperms353 probe set described here is effective for any group of flowering plants and would be useful for phylogenetic studies from the species level to higher-order groups, including the entire angiosperm clade itself.
Collapse
Affiliation(s)
- Matthew G Johnson
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL 60022, USA
| | - Lisa Pokorny
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Steven Dodsworth
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- School of Life Sciences, University of Bedfordshire, University Square, Luton LU1 3JU, UK
| | - Laura R Botigué
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- Centre for Research in Agricultural Genomics, Campus UAB, Edifici CRAG, Bellaterra Cerdanyola del Vallès, 08193 Barcelona, Spain
| | - Robyn S Cowan
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Alison Devault
- Arbor Biosciences, 5840 Interface Dr, Suite 101, Ann Arbor, MI 48103, USA
| | - Wolf L Eiserhardt
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- Department of Bioscience, Aarhus University, 8000 Aarhus C, Denmark
| | - Niroshini Epitawalage
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Félix Forest
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Jan T Kim
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - James H Leebens-Mack
- Department of Plant Biology, University of Georgia, 2502 Miller Plant Sciences, Athens, GA 30602, USA
| | - Ilia J Leitch
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Olivier Maurin
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Douglas E Soltis
- Department of Biology, University of Florida, 220 Bartram Hall, Gainesville, FL 32611-8525, USA
- Florida Museum of Natural History, University of Florida, 3215 Hull Road, Gainesville, FL 32611-2710, USA
| | - Pamela S Soltis
- Department of Biology, University of Florida, 220 Bartram Hall, Gainesville, FL 32611-8525, USA
- Florida Museum of Natural History, University of Florida, 3215 Hull Road, Gainesville, FL 32611-2710, USA
| | - Gane Ka-shu Wong
- BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
- Department of Medicine, University of Alberta, Edmonton, AB T6G 2E1, Canada
| | - William J Baker
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Norman J Wickett
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL 60022, USA
- Program in Plant Biology and Conservation, Northwestern University, 2205 Tech Drive, Evanston, IL 60208, USA
| |
Collapse
|
16
|
Vargas OM, Heuertz M, Smith SA, Dick CW. Target sequence capture in the Brazil nut family (Lecythidaceae): Marker selection and in silico capture from genome skimming data. Mol Phylogenet Evol 2019; 135:98-104. [DOI: 10.1016/j.ympev.2019.02.020] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Revised: 02/22/2019] [Accepted: 02/23/2019] [Indexed: 12/20/2022]
|
17
|
Soto Gomez M, Pokorny L, Kantar MB, Forest F, Leitch IJ, Gravendeel B, Wilkin P, Graham SW, Viruel J. A customized nuclear target enrichment approach for developing a phylogenomic baseline for Dioscorea yams (Dioscoreaceae). Appl Plant Sci 2019; 7:e11254. [PMID: 31236313 PMCID: PMC6580989 DOI: 10.1002/aps3.11254] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2018] [Accepted: 04/01/2019] [Indexed: 05/14/2023]
Abstract
PREMISE We developed a target enrichment panel for phylogenomic studies of Dioscorea, an economically important genus with incompletely resolved relationships. METHODS Our bait panel comprises 260 low- to single-copy nuclear genes targeted to work in Dioscorea, assessed here using a preliminary taxon sampling that includes both distantly and closely related taxa, including several yam crops and potential crop wild relatives. We applied coalescent-based and maximum likelihood phylogenomic inference approaches to the pilot taxon set, incorporating new and published transcriptome data from additional species. RESULTS The custom panel retrieved ~94% of targets and >80% of full gene length from 88% and 68% of samples, respectively. It has minimal gene overlap with existing panels designed for angiosperm-wide studies and generally recovers longer and more variable targets. Pilot phylogenomic analyses consistently resolve most deep and recent relationships with strong support across analyses and point to revised relationships between the crop species D. alata and candidate crop wild relatives. DISCUSSION Our customized panel reliably retrieves targeted loci from Dioscorea, is informative for resolving relationships in denser samplings, and is suitable for refining our understanding of the independent origins of cultivated yam species; the panel likely has broader promise for phylogenomic studies across Dioscoreales.
Collapse
Affiliation(s)
- Marybel Soto Gomez
- Department of BotanyUniversity of British Columbia6270 University BoulevardVancouverBritish ColumbiaV6T 1Z4Canada
- UBC Botanical Garden and Centre for Plant ResearchUniversity of British Columbia6804 Marine Drive SWVancouverBritish ColumbiaV6T 1Z4Canada
| | - Lisa Pokorny
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| | - Michael B. Kantar
- Department of Tropical Plant and Soil SciencesUniversity of Hawai‘i at ManoaHonoluluHawai‘i96822USA
| | - Félix Forest
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| | - Ilia J. Leitch
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| | - Barbara Gravendeel
- Naturalis Biodiversity CenterEndless FormsSylviusweg 72Leiden2333 BEThe Netherlands
- Institute Biology LeidenLeiden UniversitySylviusweg 72Leiden2333 BEThe Netherlands
- Faculty of Science and TechnologyUniversity of Applied Sciences LeidenZernikedreef 11Leiden2333 CKThe Netherlands
| | - Paul Wilkin
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| | - Sean W. Graham
- Department of BotanyUniversity of British Columbia6270 University BoulevardVancouverBritish ColumbiaV6T 1Z4Canada
- UBC Botanical Garden and Centre for Plant ResearchUniversity of British Columbia6804 Marine Drive SWVancouverBritish ColumbiaV6T 1Z4Canada
| | - Juan Viruel
- Royal Botanic GardensKew, RichmondSurreyTW9 3DSUnited Kingdom
| |
Collapse
|
18
|
Quek ZBR, Huang D. Effects of missing data and data type on phylotranscriptomic analysis of stony corals (Cnidaria: Anthozoa: Scleractinia). Mol Phylogenet Evol 2019; 134:12-23. [DOI: 10.1016/j.ympev.2019.01.012] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Revised: 01/11/2019] [Accepted: 01/17/2019] [Indexed: 01/28/2023]
|
19
|
Couvreur TLP, Helmstetter AJ, Koenen EJM, Bethune K, Brandão RD, Little SA, Sauquet H, Erkens RHJ. Phylogenomics of the Major Tropical Plant Family Annonaceae Using Targeted Enrichment of Nuclear Genes. Front Plant Sci 2019; 9:1941. [PMID: 30687347 PMCID: PMC6334231 DOI: 10.3389/fpls.2018.01941] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2018] [Accepted: 12/13/2018] [Indexed: 05/19/2023]
Abstract
Targeted enrichment and sequencing of hundreds of nuclear loci for phylogenetic reconstruction is becoming an important tool for plant systematics and evolution. Annonaceae is a major pantropical plant family with 110 genera and ca. 2,450 species, occurring across all major and minor tropical forests of the world. Baits were designed by sequencing the transcriptomes of five species from two of the largest Annonaceae subfamilies. Orthologous loci were identified. The resulting baiting kit was used to reconstruct phylogenetic relationships at two different levels using concatenated and gene tree approaches: a family wide Annonaceae analysis sampling 65 genera and a species level analysis of tribe Piptostigmateae sampling 29 species with multiple individuals per species. DNA extraction was undertaken mainly on silicagel dried leaves, with two samples from herbarium dried leaves. Our kit targets 469 exons (364,653 bp of sequence data), successfully capturing sequences from across Annonaceae. Silicagel dried and herbarium DNA worked equally well. We present for the first time a nuclear gene-based phylogenetic tree at the generic level based on 317 supercontigs. Results mainly confirm previous chloroplast based studies. However, several new relationships are found and discussed. We show significant differences in branch lengths between the two large subfamilies Annonoideae and Malmeoideae. A new tribe, Annickieae, is erected containing a single African genus Annickia. We also reconstructed a well-resolved species-level phylogenetic tree of the Piptostigmteae tribe. Our baiting kit is useful for reconstructing well-supported phylogenetic relationships within Annonaceae at different taxonomic levels. The nuclear genome is mainly concordant with plastome information with a few exceptions. Moreover, we find that substitution rate heterogeneity between the two subfamilies is also found within the nuclear compartment, and not just plastomes and ribosomal DNA as previously shown. Our results have implications for understanding the biogeography, molecular dating and evolution of Annonaceae.
Collapse
Affiliation(s)
| | | | - Erik J. M. Koenen
- Institute of Systematic Botany, University of Zurich, Zurich, Switzerland
| | - Kevin Bethune
- IRD, UMR DIADE, Univ. Montpellier, Montpellier, France
| | - Rita D. Brandão
- Maastricht Science Programme, Maastricht University, Maastricht, Netherlands
| | - Stefan A. Little
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech, Université-Paris Saclay, Orsay, France
| | - Hervé Sauquet
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech, Université-Paris Saclay, Orsay, France
- National Herbarium of New South Wales (NSW), Royal Botanic Gardens and Domain Trust, Sydney, NSW, Australia
| | - Roy H. J. Erkens
- Maastricht Science Programme, Maastricht University, Maastricht, Netherlands
| |
Collapse
|
20
|
Villaverde T, Pokorny L, Olsson S, Rincón-Barrado M, Johnson MG, Gardner EM, Wickett NJ, Molero J, Riina R, Sanmartín I. Bridging the micro- and macroevolutionary levels in phylogenomics: Hyb-Seq solves relationships from populations to species and above. New Phytol 2018; 220:636-650. [PMID: 30016546 DOI: 10.1111/nph.15312] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Accepted: 06/04/2018] [Indexed: 05/20/2023]
Abstract
Reconstructing phylogenetic relationships at the micro- and macroevoutionary levels within the same tree is problematic because of the need to use different data types and analytical frameworks. We test the power of target enrichment to provide phylogenetic resolution based on DNA sequences from above species to within populations, using a large herbarium sampling and Euphorbia balsamifera (Euphorbiaceae) as a case study. Target enrichment with custom probes was combined with genome skimming (Hyb-Seq) to sequence 431 low-copy nuclear genes and partial plastome DNA. We used supermatrix, multispecies-coalescent approaches, and Bayesian dating to estimate phylogenetic relationships and divergence times. Euphorbia balsamifera, with a disjunct Rand Flora-type distribution at opposite sides of Africa, comprises three well-supported subspecies: western Sahelian sepium is sister to eastern African-southern Arabian adenensis and Macaronesian-southwest Moroccan balsamifera. Lineage divergence times support Late Miocene to Pleistocene diversification and climate-driven vicariance to explain the Rand Flora pattern. We show that probes designed using genomic resources from taxa not directly related to the focal group are effective in providing phylogenetic resolution at deep and shallow evolutionary levels. Low capture efficiency in herbarium samples increased the proportion of missing data but did not bias estimation of phylogenetic relationships or branch lengths.
Collapse
Affiliation(s)
- Tamara Villaverde
- Real Jardín Botánico (RJB-CSIC), Plaza de Murillo 2, 28014, Madrid, Spain
| | - Lisa Pokorny
- Comparative Plant and Fungal Biology Department, Royal Botanic Gardens, Kew, Richmond, TW9 3DS, UK
| | - Sanna Olsson
- Department of Forest Ecology and Genetics, INIA Forest Research Centre (INIA-CIFOR), Ctra. de la Coruña km. 7.5, 28040, Madrid, Spain
| | | | - Matthew G Johnson
- Department of Biological Sciences, Texas Tech University, 2901 Main St, Lubbock, TX, 79409-43131, USA
- Department of Plant Science and Conservation, Chicago Botanical Garden, 1000 Lake Cook Road, Glencoe, IL, 60022, USA
| | | | - Norman J Wickett
- Department of Plant Science and Conservation, Chicago Botanical Garden, 1000 Lake Cook Road, Glencoe, IL, 60022, USA
- Program in Plant Biology and Conservation, Northwestern University, 2205 Tech Drive, Evanston, IL, 60208, USA
| | - Julià Molero
- Laboratori de Botànica, Departament de Biologia, Sanitat i Medi Ambient, Facultat de Farmàcia, Universitat de Barcelona, 08028, Barcelona, Spain
| | - Ricarda Riina
- Real Jardín Botánico (RJB-CSIC), Plaza de Murillo 2, 28014, Madrid, Spain
| | - Isabel Sanmartín
- Real Jardín Botánico (RJB-CSIC), Plaza de Murillo 2, 28014, Madrid, Spain
| |
Collapse
|
21
|
Andermann T, Cano Á, Zizka A, Bacon C, Antonelli A. SECAPR-a bioinformatics pipeline for the rapid and user-friendly processing of targeted enriched Illumina sequences, from raw reads to alignments. PeerJ 2018; 6:e5175. [PMID: 30023140 PMCID: PMC6047508 DOI: 10.7717/peerj.5175] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Accepted: 06/16/2018] [Indexed: 11/20/2022] Open
Abstract
Evolutionary biology has entered an era of unprecedented amounts of DNA sequence data, as new sequencing technologies such as Massive Parallel Sequencing (MPS) can generate billions of nucleotides within less than a day. The current bottleneck is how to efficiently handle, process, and analyze such large amounts of data in an automated and reproducible way. To tackle these challenges we introduce the Sequence Capture Processor (SECAPR) pipeline for processing raw sequencing data into multiple sequence alignments for downstream phylogenetic and phylogeographic analyses. SECAPR is user-friendly and we provide an exhaustive empirical data tutorial intended for users with no prior experience with analyzing MPS output. SECAPR is particularly useful for the processing of sequence capture (synonyms: target or hybrid enrichment) datasets for non-model organisms, as we demonstrate using an empirical sequence capture dataset of the palm genus Geonoma (Arecaceae). Various quality control and plotting functions help the user to decide on the most suitable settings for even challenging datasets. SECAPR is an easy-to-use, free, and versatile pipeline, aimed to enable efficient and reproducible processing of MPS data for many samples in parallel.
Collapse
Affiliation(s)
- Tobias Andermann
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Ángela Cano
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Department of Botany and Plant Biology, University of Geneva, Geneva, Switzerland
| | - Alexander Zizka
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Christine Bacon
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
| | - Alexandre Antonelli
- Department of Biological and Environmental Sciences, University of Gothenburg, Gothenburg, Sweden
- Gothenburg Global Biodiversity Centre, Gothenburg, Sweden
- Gothenburg Botanical Garden, Gothenburg, Sweden
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, United States of America
| |
Collapse
|
22
|
Aubriot X, Knapp S, Syfert MM, Poczai P, Buerki S. Shedding new light on the origin and spread of the brinjal eggplant (Solanum melongena L.) and its wild relatives. Am J Bot 2018; 105:1175-1187. [PMID: 30091787 DOI: 10.1002/ajb2.1133] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Accepted: 05/04/2018] [Indexed: 05/11/2023]
Abstract
PREMISE OF THE STUDY While brinjal eggplant (Solanum melongena L.) is the second most important solanaceous fruit crop, we lack firm knowledge of its evolutionary relationships. This in turn limits efficient use of crop wild relatives in eggplant improvement. Here, we examine the hypothesis of linear step-wise expansion of the eggplant group from Africa to Asia. METHODS We use museum collections to generate nuclear and full-plastome data for all species of the Eggplant clade. We combine a phylogenomic approach with distribution data to infer a biogeographic scenario for the clade. KEY RESULTS The Eggplant clade has Pleistocene origins in northern Africa. Dispersals to tropical Asia gave rise to Solanum insanum, the wild progenitor of the eggplant, and to African distinct lineages of widespread and southern African species. Results suggest that spread of the species to southern Africa has been recent and likely facilitated by large mammalian herbivores, such as the African elephant and impala feeding on Solanum fruit. CONCLUSIONS Rather than a linear 'Out Of Africa' sequence, our results are more consistent with an initial dispersal event into Asia, and subsequent wide dispersal and differentiation across Africa driven by large mammalian herbivores. Our evolutionary results will affect future work on eggplant domestication and affect the use of wild relatives in breeding of this increasingly important solanaceous crop.
Collapse
Affiliation(s)
- Xavier Aubriot
- Department of Life Sciences, Natural History Museum, Cromwell Road, London, SW7 5BD, England, UK
- Unité Mixte de Recherche 6553 Écosystèmes, Biodiversité, Évolution (ECOBIO), Observatoire des Sciences de l'Univers de Rennes, Centre National de la Recherche Scientifique, Université de Rennes 1, Rennes CEDEX, France
| | - Sandra Knapp
- Department of Life Sciences, Natural History Museum, Cromwell Road, London, SW7 5BD, England, UK
| | - Mindy M Syfert
- Department of Life Sciences, Natural History Museum, Cromwell Road, London, SW7 5BD, England, UK
| | - Péter Poczai
- Botany Unit, Finnish Museum of Natural History, University of Helsinki, PO Box 7, Helsinki, FI-00014, Finland
| | - Sven Buerki
- Department of Life Sciences, Natural History Museum, Cromwell Road, London, SW7 5BD, England, UK
- Department of Biological Sciences, Boise State University, 1910 University Drive, Boise, Idaho, 83725, U.S.A
| |
Collapse
|
23
|
Vatanparast M, Powell A, Doyle JJ, Egan AN. Targeting legume loci: A comparison of three methods for target enrichment bait design in Leguminosae phylogenomics. Appl Plant Sci 2018; 6:e1036. [PMID: 29732266 PMCID: PMC5895186 DOI: 10.1002/aps3.1036] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2017] [Accepted: 02/22/2018] [Indexed: 05/19/2023]
Abstract
PREMISE OF THE STUDY The development of pipelines for locus discovery has spurred the use of target enrichment for plant phylogenomics. However, few studies have compared pipelines from locus discovery and bait design, through validation, to tree inference. We compared three methods within Leguminosae (Fabaceae) and present a workflow for future efforts. METHODS Using 30 transcriptomes, we compared Hyb-Seq, MarkerMiner, and the Yang and Smith (Y&S) pipelines for locus discovery, validated 7501 baits targeting 507 loci across 25 genera via Illumina sequencing, and inferred gene and species trees via concatenation- and coalescent-based methods. RESULTS Hyb-Seq discovered loci with the longest mean length. MarkerMiner discovered the most conserved loci with the least flagged as paralogous. Y&S offered the most parsimony-informative sites and putative orthologs. Target recovery averaged 93% across taxa. We optimized our targeted locus set based on a workflow designed to minimize paralog/ortholog conflation and thus present 423 loci for legume phylogenomics. CONCLUSIONS Methods differed across criteria important for phylogenetic marker development. We recommend Hyb-Seq as a method that may be useful for most phylogenomic projects. Our targeted locus set is a resource for future, community-driven efforts to reconstruct the legume tree of life.
Collapse
Affiliation(s)
- Mohammad Vatanparast
- Department of BotanyNational Museum of Natural HistorySmithsonian InstitutionP.O. Box 37012, MRC 166WashingtonDC20560USA
- Present address:
Forest, Nature, and Biomass SectionDepartment of Geosciences and Natural Resource ManagementRolighedsvej 23, 1958 Frederiksberg C., University of CopenhagenDenmark
| | - Adrian Powell
- Section of Plant Breeding and GeneticsSchool of Integrated Plant SciencesCornell University512 Mann LibraryIthacaNew York14853USA
- Present address:
Boyce Thompson Institute533 Tower RoadIthacaNew York14853USA
| | - Jeff J. Doyle
- Section of Plant Breeding and GeneticsSchool of Integrated Plant SciencesCornell University512 Mann LibraryIthacaNew York14853USA
| | - Ashley N. Egan
- Department of BotanyNational Museum of Natural HistorySmithsonian InstitutionP.O. Box 37012, MRC 166WashingtonDC20560USA
| |
Collapse
|