1
|
Liu D, Cui J, Liu Y, Niu M, Wang F, Zhao Q, Cai B, Zhang H, Wei J. Ultraconserved elements from transcriptome and genome data provide insight into the phylogenomics of Sternorrhyncha (Insecta: Hemiptera). Cladistics 2024. [PMID: 38808591 DOI: 10.1111/cla.12585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Revised: 04/30/2024] [Accepted: 05/07/2024] [Indexed: 05/30/2024] Open
Abstract
Sternorrhyncha, one of the four major suborders of Hemiptera, is a phytophagous taxon inclusive of nearly 18 000 described species. The phylogenetic relationships within the taxon and the earliest-branching lineage of its infraorders remain incompletely understood. This study attempted to illuminate the phylogenetic relationships within Sternorrhyncha through the use of maximum likelihood, Bayesian inference and maximum parsimony analyses, employing ultraconserved element (UCE) data from 39 genomic and 62 transcriptomic datasets and thereby representing most families within the taxon. The probe set Hemiptera 2.7Kv1 was used to recover a total of 2731 UCE loci: from 547 to 1699 (with an average of 1084) across all genomic datasets and from 108 to 849 (with an average of 329) across all transcriptomic datasets. All three types of phylogenetic analyses employed in this study produced robust statistical support for Sternorrhyncha being a monophyletic group. The different methods of phylogenetic analysis produced inconsistent descriptions of topological structure at the infraorder level: while maximum likelihood and Bayesian inference analyses produced strong statistical evidence (100%) indicating the clade Psylloidea + Aleyrodoidea to be a sister of the clade Aphidoidea (Aphidomorpha) + Coccoidea (Coccomorpha), the maximum parsimony analysis failed to recover a similar result. Our results also provide detail on the phylogenetic relationships within each infraorder. This study presents the first use of UCE data to investigate the phylogeny of Sternorrhyncha. It also shows the viability of amalgamating genomic and transcriptomic data in studies of phylogenetic relationships, potentially highlighting a resource-efficient approach for future inquiries into diverse taxa through the integration of varied data sources.
Collapse
Affiliation(s)
- Dajun Liu
- College of Plant Protection, Shanxi Agricultural University, Jinzhong, Shanxi, 030801, China
- Department of Biology, Xinzhou Normal University, Xinzhou, Shanxi, 034000, China
| | - Jinyu Cui
- College of Plant Protection, Shanxi Agricultural University, Jinzhong, Shanxi, 030801, China
| | - Yubo Liu
- College of Plant Protection, Shanxi Agricultural University, Jinzhong, Shanxi, 030801, China
| | - Minmin Niu
- College of Plant Protection, Shanxi Agricultural University, Jinzhong, Shanxi, 030801, China
| | - Fang Wang
- Ministry of Education Key Laboratory of Molecular and Cellular Biology, Hebei Collaborative Innovation Center for Eco-Environment, Hebei Key Laboratory of Animal Physiology, Biochemistry and Molecular Biology, College of Life Sciences, Hebei Normal University, Shijiazhuang, Hebei, 050024, China
| | - Qing Zhao
- College of Plant Protection, Shanxi Agricultural University, Jinzhong, Shanxi, 030801, China
| | - Bo Cai
- Post-Entry Quarantine Station for Tropical Plant, Haikou Customs District, No. 9 West Haixiu Road, Haikou, 570311, China
| | - Hufang Zhang
- College of Plant Protection, Shanxi Agricultural University, Jinzhong, Shanxi, 030801, China
- Department of Biology, Xinzhou Normal University, Xinzhou, Shanxi, 034000, China
| | - Jiufeng Wei
- College of Plant Protection, Shanxi Agricultural University, Jinzhong, Shanxi, 030801, China
| |
Collapse
|
2
|
Gilman IS, Smith JAC, Holtum JAM, Sage RF, Silvera K, Winter K, Edwards EJ. The CAM lineages of planet Earth. ANNALS OF BOTANY 2023; 132:627-654. [PMID: 37698538 PMCID: PMC10799995 DOI: 10.1093/aob/mcad135] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 01/09/2023] [Accepted: 09/11/2023] [Indexed: 09/13/2023]
Abstract
BACKGROUND AND SCOPE The growth of experimental studies of crassulacean acid metabolism (CAM) in diverse plant clades, coupled with recent advances in molecular systematics, presents an opportunity to re-assess the phylogenetic distribution and diversity of species capable of CAM. It has been more than two decades since the last comprehensive lists of CAM taxa were published, and an updated survey of the occurrence and distribution of CAM taxa is needed to facilitate and guide future CAM research. We aimed to survey the phylogenetic distribution of these taxa, their diverse morphology, physiology and ecology, and the likely number of evolutionary origins of CAM based on currently known lineages. RESULTS AND CONCLUSIONS We found direct evidence (in the form of experimental or field observations of gas exchange, day-night fluctuations in organic acids, carbon isotope ratios and enzymatic activity) for CAM in 370 genera of vascular plants, representing 38 families. Further assumptions about the frequency of CAM species in CAM clades and the distribution of CAM in the Cactaceae and Crassulaceae bring the currently estimated number of CAM-capable species to nearly 7 % of all vascular plants. The phylogenetic distribution of these taxa suggests a minimum of 66 independent origins of CAM in vascular plants, possibly with dozens more. To achieve further insight into CAM origins, there is a need for more extensive and systematic surveys of previously unstudied lineages, particularly in living material to identify low-level CAM activity, and for denser sampling to increase phylogenetic resolution in CAM-evolving clades. This should allow further progress in understanding the functional significance of this pathway by integration with studies on the evolution and genomics of CAM in its many forms.
Collapse
Affiliation(s)
- Ian S Gilman
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
| | | | - Joseph A M Holtum
- College of Science and Engineering, James Cook University, Townsville, Queensland, Australia
| | - Rowan F Sage
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - Katia Silvera
- Smithsonian Tropical Research Institute, Balboa, Ancón, Panama
- Department of Botany & Plant Sciences, University of California, Riverside, CA, USA
| | - Klaus Winter
- Smithsonian Tropical Research Institute, Balboa, Ancón, Panama
| | - Erika J Edwards
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
| |
Collapse
|
3
|
Shah AP, Travadi T, Sharma S, Pandit R, Joshi C, Joshi M. Comprehensive analysis using DNA metabarcoding, SCAR marker based PCR assay, and HPLC unveils the adulteration in Brahmi herbal products. Mol Biol Rep 2023; 50:7605-7618. [PMID: 37532919 DOI: 10.1007/s11033-023-08653-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Accepted: 06/28/2023] [Indexed: 08/04/2023]
Abstract
BACKGROUND Brahmi is one of the important nootropic botanicals, widely sold in the market, with the name "Brahmi'' being used to describe both Bacopa monnieri and Centella asiatica species. The Brahmi herbal products market is expanding; hence, economically motivated adulteration is highly prevalent. METHODS AND RESULTS This study aimed to develop DNA-based methods, including SCAR marker-based PCR and metabarcoding, to authenticate Brahmi herbal products and compare these methods with HPLC. These methods have been validated using mock controls (in-house blended formulations). All targeted plant species in mock controls were detected successfully with all three methods, whereas, in market samples, only 22.2%, 55.6%, and 50.0% were found positive for Brahmi by PCR assay, DNA metabarcoding, and HPLC, respectively. Metabarcoding can detect the presence of non-labeled plants together with targeted species, which is an advantage over PCR assay or HPLC. CONCLUSION SCAR marker-based PCR is a rapid and cost-effective method for detecting the presence of B. monnieri and C. asiatica. However, in this study, the success rate of PCR amplification was relatively low because the primers targeted either RAPD or ITS-based SCAR markers. HPLC assay, although an alternative, was unable to detect the presence of other botanicals, just like the SCAR marker-based PCR assay. On the other hand, metabarcoding can be utilized to identify the target plants, even in very small quantities, while also providing simulated identification of other botanicals. This study successfully addressed the need for quality control of Brahmi herbal products and provided the first-time report of DNA metabarcoding for such products.
Collapse
Affiliation(s)
- Abhi P Shah
- Gujarat Biotechnology Research Centre (GBRC), Department of Science and Technology, Government of Gujarat, Gandhinagar, India
| | - Tasnim Travadi
- Gujarat Biotechnology Research Centre (GBRC), Department of Science and Technology, Government of Gujarat, Gandhinagar, India
| | - Sonal Sharma
- Gujarat Biotechnology Research Centre (GBRC), Department of Science and Technology, Government of Gujarat, Gandhinagar, India
| | - Ramesh Pandit
- Gujarat Biotechnology Research Centre (GBRC), Department of Science and Technology, Government of Gujarat, Gandhinagar, India
| | - Chaitanya Joshi
- Gujarat Biotechnology Research Centre (GBRC), Department of Science and Technology, Government of Gujarat, Gandhinagar, India
| | - Madhvi Joshi
- Gujarat Biotechnology Research Centre (GBRC), Department of Science and Technology, Government of Gujarat, Gandhinagar, India.
| |
Collapse
|
4
|
Staples TL, Mayfield MM, England JR, Dwyer JM. Drivers of Acacia and Eucalyptus growth rate differ in strength and direction in restoration plantings across Australia. ECOLOGICAL APPLICATIONS : A PUBLICATION OF THE ECOLOGICAL SOCIETY OF AMERICA 2022; 32:e2636. [PMID: 35404495 PMCID: PMC9539508 DOI: 10.1002/eap.2636] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Revised: 01/24/2022] [Accepted: 02/18/2022] [Indexed: 05/31/2023]
Abstract
Functional traits are proxies for a species' ecology and physiology and are often correlated with plant vital rates. As such they have the potential to guide species selection for restoration projects. However, predictive trait-based models often only explain a small proportion of plant performance, suggesting that commonly measured traits do not capture all important ecological differences between species. Some residual variation in vital rates may be evolutionarily conserved and captured using taxonomic groupings alongside common functional traits. We tested this hypothesis using growth rate data for 17,299 trees and shrubs from 80 species of Eucalyptus and 43 species of Acacia, two hyper-diverse and co-occurring genera, collected from 497 neighborhood plots in 137 Australian mixed-species revegetation plantings. We modeled relative growth rates of individual plants as a function of environmental conditions, species-mean functional traits, and neighbor density and diversity, across a moisture availability gradient. We then assessed whether the strength and direction of these relationships differed between the two genera. We found that the inclusion of genus-specific relationships offered a significant but modest improvement to model fit (1.6%-1.7% greater R2 than simpler models). More importantly, almost all correlates of growth rate differed between Eucalyptus and Acacia in strength, direction, or how they changed along the moisture gradient. These differences mapped onto physiological differences between the genera that were not captured solely by measured functional traits. Our findings suggest taxonomic groupings can capture or mediate variation in plant performance missed by common functional traits. The inclusion of taxonomy can provide a more nuanced understanding of how functional traits interact with abiotic and biotic conditions to drive plant performance, which may be important for constructing trait-based frameworks to improve restoration outcomes.
Collapse
Affiliation(s)
- Timothy L. Staples
- School of Biological SciencesThe University of QueenslandBrisbaneQueenslandAustralia
- CSIRO Land and Water, EcoSciences PrecinctDutton ParkQueenslandAustralia
| | - Margaret M. Mayfield
- School of Biological SciencesThe University of QueenslandBrisbaneQueenslandAustralia
| | | | - John M. Dwyer
- School of Biological SciencesThe University of QueenslandBrisbaneQueenslandAustralia
- CSIRO Land and Water, EcoSciences PrecinctDutton ParkQueenslandAustralia
| |
Collapse
|
5
|
The validity evaluation of different 16srRNA gene primers for helicobacter detection urgently requesting to design new specific primers. Sci Rep 2022; 12:10737. [PMID: 35750699 PMCID: PMC9232570 DOI: 10.1038/s41598-022-14600-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 06/09/2022] [Indexed: 11/30/2022] Open
Abstract
Molecular diagnosis of helicobacters by PCR is simpler, more accurate, and feasible compared to other diagnostic methods. Validity and accuracy are highly dependent on the PCR primer design, diffusion time, and mutation rate of helicobacters. This study aimed to design 16srRNA -specific primers for Helicobacter spp. and H. pylori. Application of comparative statistical analysis of the diagnostic utility of the most available 16srRNA genus-specific primers. The new primers were designed using bioinformatics tools (MAFFT MSA and Gblocks command line). A comparative study was applied on nine genus-specific 16srRNA primers in comparison to the ConsH using in silico and laboratory evaluation. The results demonstrated that the best specificity and sensitivity of the primers designed for this study compared to other primers. The comparative study revealed that the heminested outer/inner primers were the worst. Although H276, 16srRNA(a), HeliS/Heli-nest, and Hcom had acceptable diagnostic utility, false positive and false negative results were obtained. Specificity testing on clinical samples indicated a surprising result; that H. pylori was not the sole enemy that we were looking for, but the Non-Helicobacter pylori Helicobacters should be considered as a real risk prognostic for gastric diseases, consequently, a specific diagnosis and treatment should be developed. This study concluded that our designed primers were the most specific and sensitive in comparison with other primers. In addition, in silico evaluation is not accurate enough for primer assessment and that the laboratory evaluation is mandatory.
Collapse
|
6
|
Baker WJ, Bailey P, Barber V, Barker A, Bellot S, Bishop D, Botigué LR, Brewer G, Carruthers T, Clarkson JJ, Cook J, Cowan RS, Dodsworth S, Epitawalage N, Françoso E, Gallego B, Johnson MG, Kim JT, Leempoel K, Maurin O, McGinnie C, Pokorny L, Roy S, Stone M, Toledo E, Wickett NJ, Zuntini AR, Eiserhardt WL, Kersey PJ, Leitch IJ, Forest F. A Comprehensive Phylogenomic Platform for Exploring the Angiosperm Tree of Life. Syst Biol 2021; 71:301-319. [PMID: 33983440 PMCID: PMC8830076 DOI: 10.1093/sysbio/syab035] [Citation(s) in RCA: 65] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 05/06/2021] [Accepted: 05/08/2021] [Indexed: 12/22/2022] Open
Abstract
The tree of life is the fundamental biological roadmap for navigating the evolution and properties of life on Earth, and yet remains largely unknown. Even angiosperms (flowering plants) are fraught with data gaps, despite their critical role in sustaining terrestrial life. Today, high-throughput sequencing promises to significantly deepen our understanding of evolutionary relationships. Here, we describe a comprehensive phylogenomic platform for exploring the angiosperm tree of life, comprising a set of open tools and data based on the 353 nuclear genes targeted by the universal Angiosperms353 sequence capture probes. The primary goals of this article are to (i) document our methods, (ii) describe our first data release, and (iii) present a novel open data portal, the Kew Tree of Life Explorer (https://treeoflife.kew.org). We aim to generate novel target sequence capture data for all genera of flowering plants, exploiting natural history collections such as herbarium specimens, and augment it with mined public data. Our first data release, described here, is the most extensive nuclear phylogenomic data set for angiosperms to date, comprising 3099 samples validated by DNA barcode and phylogenetic tests, representing all 64 orders, 404 families (96\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
}{}$\%$\end{document}) and 2333 genera (17\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
}{}$\%$\end{document}). A “first pass” angiosperm tree of life was inferred from the data, which totaled 824,878 sequences, 489,086,049 base pairs, and 532,260 alignment columns, for interactive presentation in the Kew Tree of Life Explorer. This species tree was generated using methods that were rigorous, yet tractable at our scale of operation. Despite limitations pertaining to taxon and gene sampling, gene recovery, models of sequence evolution and paralogy, the tree strongly supports existing taxonomy, while challenging numerous hypothesized relationships among orders and placing many genera for the first time. The validated data set, species tree and all intermediates are openly accessible via the Kew Tree of Life Explorer and will be updated as further data become available. This major milestone toward a complete tree of life for all flowering plant species opens doors to a highly integrated future for angiosperm phylogenomics through the systematic sequencing of standardized nuclear markers. Our approach has the potential to serve as a much-needed bridge between the growing movement to sequence the genomes of all life on Earth and the vast phylogenomic potential of the world’s natural history collections. [Angiosperms; Angiosperms353; genomics; herbariomics; museomics; nuclear phylogenomics; open access; target sequence capture; tree of life.]
Collapse
Affiliation(s)
- William J Baker
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Paul Bailey
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Vanessa Barber
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Abigail Barker
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Sidonie Bellot
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - David Bishop
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Laura R Botigué
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,Centre for Research in Agricultural Genomics, Campus UAB, Edifici CRAG, Bellaterra Cerdanyola del Vallès, 08193 Barcelona, Spain
| | - Grace Brewer
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Tom Carruthers
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - James J Clarkson
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Jeffrey Cook
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Robyn S Cowan
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Steven Dodsworth
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,School of Life Sciences, University of Bedfordshire, University Square, Luton LU1 3JU, United Kingdom
| | | | - Elaine Françoso
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Berta Gallego
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Matthew G Johnson
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
| | - Jan T Kim
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,Department of Computer Science, School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield, Hertfordshire, AL10 9AB, United Kingdom
| | - Kevin Leempoel
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Olivier Maurin
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | | | - Lisa Pokorny
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,Centre for Plant Biotechnology and Genomics (CBGP) UPM-INIA, 28223 Pozuelo de Alarcón (Madrid), Spain
| | - Shyamali Roy
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Malcolm Stone
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Eduardo Toledo
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Norman J Wickett
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL 60022, USA
| | | | - Wolf L Eiserhardt
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom.,Department of Biology, Aarhus University, 8000 Aarhus C, Denmark
| | - Paul J Kersey
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Ilia J Leitch
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, United Kingdom
| |
Collapse
|
7
|
Folk RA, Kates HR, LaFrance R, Soltis DE, Soltis PS, Guralnick RP. High-throughput methods for efficiently building massive phylogenies from natural history collections. APPLICATIONS IN PLANT SCIENCES 2021; 9:e11410. [PMID: 33680581 PMCID: PMC7910806 DOI: 10.1002/aps3.11410] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 12/20/2020] [Indexed: 05/10/2023]
Abstract
PREMISE Large phylogenetic data sets have often been restricted to small numbers of loci from GenBank, and a vetted sampling-to-sequencing phylogenomic protocol scaling to thousands of species is not yet available. Here, we report a high-throughput collections-based approach that empowers researchers to explore more branches of the tree of life with numerous loci. METHODS We developed an integrated Specimen-to-Laboratory Information Management System (SLIMS), connecting sampling and wet lab efforts with progress tracking at each stage. Using unique identifiers encoded in QR codes and a taxonomic database, a research team can sample herbarium specimens, efficiently record the sampling event, and capture specimen images. After sampling in herbaria, images are uploaded to a citizen science platform for metadata generation, and tissue samples are moved through a simple, high-throughput, plate-based herbarium DNA extraction and sequencing protocol. RESULTS We applied this sampling-to-sequencing workflow to ~15,000 species, producing for the first time a data set with ~50% taxonomic representation of the "nitrogen-fixing clade" of angiosperms. DISCUSSION The approach we present is appropriate at any taxonomic scale and is extensible to other collection types. The widespread use of large-scale sampling strategies repositions herbaria as accessible but largely untapped resources for broad taxonomic sampling with thousands of species.
Collapse
Affiliation(s)
- Ryan A. Folk
- Department of Biological SciencesMississippi State UniversityMississippi StateMississippiUSA
| | - Heather R. Kates
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFloridaUSA
| | - Raphael LaFrance
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFloridaUSA
| | - Douglas E. Soltis
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFloridaUSA
- Department of BiologyUniversity of FloridaGainesvilleFloridaUSA
- Genetics InstituteUniversity of FloridaGainesvilleFloridaUSA
- Biodiversity InstituteUniversity of FloridaGainesvilleFloridaUSA
| | - Pamela S. Soltis
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFloridaUSA
- Genetics InstituteUniversity of FloridaGainesvilleFloridaUSA
- Biodiversity InstituteUniversity of FloridaGainesvilleFloridaUSA
| | - Robert P. Guralnick
- Florida Museum of Natural HistoryUniversity of FloridaGainesvilleFloridaUSA
- Biodiversity InstituteUniversity of FloridaGainesvilleFloridaUSA
| |
Collapse
|
8
|
McCraney WT, Thacker CE, Alfaro ME. Supermatrix phylogeny resolves goby lineages and reveals unstable root of Gobiaria. Mol Phylogenet Evol 2020; 151:106862. [DOI: 10.1016/j.ympev.2020.106862] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2019] [Revised: 05/06/2020] [Accepted: 05/21/2020] [Indexed: 01/04/2023]
|
9
|
Ali FS, Ismail M, Aly W. DNA barcoding to characterize biodiversity of freshwater fishes of Egypt. Mol Biol Rep 2020; 47:5865-5877. [PMID: 32661870 DOI: 10.1007/s11033-020-05657-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Accepted: 07/08/2020] [Indexed: 12/16/2022]
Abstract
The current study represents the first molecular characterization of freshwater fish species in Egypt from two major fish resources; the River Nile and Lake Nasser. A total of 160 DNA barcodes using a 655-bp-long fragment of the mitochondrial cytochrome oxidase subunit I (COI) gene were generated from 37 species belonging to 32 genera that represent 15 families from nine orders. The studied species were identified using different molecular-based identification approaches, in addition to the morphological identification, including neighbor-joining (NJ) trees, Barcode Index Number, and Automatic Barcode Gap Discovery (ABGD). The average genetic divergence based on the Kimura two-parameter model (K2P) within orders, families, genera, and species were 0.175, 0.067, 0.02, and 0.008, respectively. The minimum and maximum K2P distance-based genetic divergences were 0.0 and 0.154, respectively. Nucleotide diversity (π) varied among families and ranged between 0.0% for families Malapteruridae, Auchenoglanididae, Schilbeidae, Anguillidae, Centropomidae and Tetraodontidae and 17% for family Cyprinidae. The current study cautions against the lack of species coverage at public databases which limits the accurate identification of newly surveyed species and recommends that multiple methods are encouraged for accurate species identification. The findings of the current study also support that COI barcode enabled effective fish species identification in River Nile and Lake Nasser. Moreover, the results of the current study will establish a comprehensive DNA barcode library for freshwater fishes along the River Nile in Egypt. Egyptian freshwater fish DNA barcodes will contribute substantially to future efforts in monitoring, conservation, and management of fisheries in Egypt.
Collapse
Affiliation(s)
- Fawzia S Ali
- Genetics Laboratory, Aquaculture Division, National Institute of Oceanography and Fisheries (NIOF), Cairo, Egypt.
| | - Mohamed Ismail
- Genetics Department, Faculty of Agriculture, Menoufia University, Shibin El-Kom, Egypt
| | - Walid Aly
- Fisheries Division, National Institute of Oceanography and Fisheries (NIOF), Cairo, Egypt
| |
Collapse
|
10
|
Dagallier LMJ, Janssens SB, Dauby G, Blach‐Overgaard A, Mackinder BA, Droissart V, Svenning J, Sosef MSM, Stévart T, Harris DJ, Sonké B, Wieringa JJ, Hardy OJ, Couvreur TLP. Cradles and museums of generic plant diversity across tropical Africa. THE NEW PHYTOLOGIST 2020; 225:2196-2213. [PMID: 31665816 PMCID: PMC7027791 DOI: 10.1111/nph.16293] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Accepted: 10/20/2019] [Indexed: 05/27/2023]
Abstract
Determining where species diversify (cradles) and persist (museums) over evolutionary time is fundamental to understanding the distribution of biodiversity and for conservation prioritization. Here, we identify cradles and museums of angiosperm generic diversity across tropical Africa, one of the most biodiverse regions on Earth. Regions containing nonrandom concentrations of young (neo-) and old (paleo-) endemic taxa were identified using distribution data of 1719 genera combined with a newly generated time-calibrated mega-phylogenetic tree. We then compared the identified regions with the current network of African protected areas (PAs). At the generic level, phylogenetic diversity and endemism are mainly concentrated in the biogeographically complex region of Eastern Africa. We show that mountainous areas are centres of both neo- and paleo-endemism. By contrast, the Guineo-Congolian lowland rain forest region is characterized by widespread and old lineages. We found that the overlap between centres of phylogenetic endemism and PAs is high (> 85%). We show the vital role played by mountains acting simultaneously as cradles and museums of tropical African plant biodiversity. By contrast, lowland rainforests act mainly as museums for generic diversity. Our study shows that incorporating large-scale taxonomically verified distribution datasets and mega-phylogenies lead to an improved understanding of tropical plant biodiversity evolution.
Collapse
Affiliation(s)
| | | | - Gilles Dauby
- AMAP, IRD, CIRAD, CNRS, INRAUniversity of MontpellierBd de la Lironde34398MontpellierFrance
| | - Anne Blach‐Overgaard
- Center for Biodiversity Dynamics in a Changing World (BIOCHANGE)Department of BioscienceAarhus UniversityNy Munkegade 114DK‐8000Aarhus CDenmark
- Section for Ecoinformatics and BiodiversityDepartment of BioscienceAarhus UniversityNy Munkegade 114DK‐8000Aarhus CDenmark
| | | | - Vincent Droissart
- AMAP, IRD, CIRAD, CNRS, INRAUniversity of MontpellierBd de la Lironde34398MontpellierFrance
| | - Jens‐Christian Svenning
- Center for Biodiversity Dynamics in a Changing World (BIOCHANGE)Department of BioscienceAarhus UniversityNy Munkegade 114DK‐8000Aarhus CDenmark
- Section for Ecoinformatics and BiodiversityDepartment of BioscienceAarhus UniversityNy Munkegade 114DK‐8000Aarhus CDenmark
| | | | - Tariq Stévart
- Meise Botanic GardenNieuwelaan 38BE‐1860MeiseBelgium
- Herbarium et Bibliothèque de Botanique AfricaineUniversité Libre de BruxellesBoulevard du TriompheB‐1050BruxellesBelgium
- Africa & Madagascar DepartmentMissouri Botanical GardenSt LouisMO631109USA
| | - David J. Harris
- Royal Botanic Garden Edinburgh20A Inverleith RowEdinburghEH3 5LRUK
| | - Bonaventure Sonké
- Laboratoire de Botanique systématique et d'ÉcologieDépartement des Sciences BiologiquesÉcole Normale SupérieureUniversité de Yaoundé IBP 047YaoundéCameroon
| | - Jan J. Wieringa
- Naturalis Biodiversity CenterDarwinweg 22333 CRLeidenthe Netherlands
| | - Olivier J. Hardy
- Evolutionary Biology and EcologyFaculté des SciencesUniversité Libre de BruxellesAv. F.D. Roosevelt 501050BrusselsBelgium
| | - Thomas L. P. Couvreur
- DIADE, IRDUniversity of Montpellier911 Avenue Agropolis34394MontpellierFrance
- Naturalis Biodiversity CenterDarwinweg 22333 CRLeidenthe Netherlands
| |
Collapse
|
11
|
Abstract
It has long been appreciated that analyses of genomic data (e.g., whole genome sequencing or sequence capture) have the potential to reveal the tree of life, but it remains challenging to move from sequence data to a clear understanding of evolutionary history, in part due to the computational challenges of phylogenetic estimation using genome-scale data. Supertree methods solve that challenge because they facilitate a divide-and-conquer approach for large-scale phylogeny inference by integrating smaller subtrees in a computationally efficient manner. Here, we combined information from sequence capture and whole-genome phylogenies using supertree methods. However, the available phylogenomic trees had limited overlap so we used taxon-rich (but not phylogenomic) megaphylogenies to weave them together. This allowed us to construct a phylogenomic supertree, with support values, that included 707 bird species (~7% of avian species diversity). We estimated branch lengths using mitochondrial sequence data and we used these branch lengths to estimate divergence times. Our time-calibrated supertree supports radiation of all three major avian clades (Palaeognathae, Galloanseres, and Neoaves) near the Cretaceous-Paleogene (K-Pg) boundary. The approach we used will permit the continued addition of taxa to this supertree as new phylogenomic data are published, and it could be applied to other taxa as well.
Collapse
|
12
|
Johnson MG, Pokorny L, Dodsworth S, Botigué LR, Cowan RS, Devault A, Eiserhardt WL, Epitawalage N, Forest F, Kim JT, Leebens-Mack JH, Leitch IJ, Maurin O, Soltis DE, Soltis PS, Wong GKS, Baker WJ, Wickett NJ. A Universal Probe Set for Targeted Sequencing of 353 Nuclear Genes from Any Flowering Plant Designed Using k-Medoids Clustering. Syst Biol 2019; 68:594-606. [PMID: 30535394 PMCID: PMC6568016 DOI: 10.1093/sysbio/syy086] [Citation(s) in RCA: 205] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Revised: 11/29/2018] [Accepted: 12/03/2018] [Indexed: 01/31/2023] Open
Abstract
Sequencing of target-enriched libraries is an efficient and cost-effective method for obtaining DNA sequence data from hundreds of nuclear loci for phylogeny reconstruction. Much of the cost of developing targeted sequencing approaches is associated with the generation of preliminary data needed for the identification of orthologous loci for probe design. In plants, identifying orthologous loci has proven difficult due to a large number of whole-genome duplication events, especially in the angiosperms (flowering plants). We used multiple sequence alignments from over 600 angiosperms for 353 putatively single-copy protein-coding genes identified by the One Thousand Plant Transcriptomes Initiative to design a set of targeted sequencing probes for phylogenetic studies of any angiosperm group. To maximize the phylogenetic potential of the probes, while minimizing the cost of production, we introduce a k-medoids clustering approach to identify the minimum number of sequences necessary to represent each coding sequence in the final probe set. Using this method, 5-15 representative sequences were selected per orthologous locus, representing the sequence diversity of angiosperms more efficiently than if probes were designed using available sequenced genomes alone. To test our approximately 80,000 probes, we hybridized libraries from 42 species spanning all higher-order groups of angiosperms, with a focus on taxa not present in the sequence alignments used to design the probes. Out of a possible 353 coding sequences, we recovered an average of 283 per species and at least 100 in all species. Differences among taxa in sequence recovery could not be explained by relatedness to the representative taxa selected for probe design, suggesting that there is no phylogenetic bias in the probe set. Our probe set, which targeted 260 kbp of coding sequence, achieved a median recovery of 137 kbp per taxon in coding regions, a maximum recovery of 250 kbp, and an additional median of 212 kbp per taxon in flanking non-coding regions across all species. These results suggest that the Angiosperms353 probe set described here is effective for any group of flowering plants and would be useful for phylogenetic studies from the species level to higher-order groups, including the entire angiosperm clade itself.
Collapse
Affiliation(s)
- Matthew G Johnson
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL 60022, USA
| | - Lisa Pokorny
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Steven Dodsworth
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- School of Life Sciences, University of Bedfordshire, University Square, Luton LU1 3JU, UK
| | - Laura R Botigué
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- Centre for Research in Agricultural Genomics, Campus UAB, Edifici CRAG, Bellaterra Cerdanyola del Vallès, 08193 Barcelona, Spain
| | - Robyn S Cowan
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Alison Devault
- Arbor Biosciences, 5840 Interface Dr, Suite 101, Ann Arbor, MI 48103, USA
| | - Wolf L Eiserhardt
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
- Department of Bioscience, Aarhus University, 8000 Aarhus C, Denmark
| | - Niroshini Epitawalage
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Félix Forest
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Jan T Kim
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - James H Leebens-Mack
- Department of Plant Biology, University of Georgia, 2502 Miller Plant Sciences, Athens, GA 30602, USA
| | - Ilia J Leitch
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Olivier Maurin
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Douglas E Soltis
- Department of Biology, University of Florida, 220 Bartram Hall, Gainesville, FL 32611-8525, USA
- Florida Museum of Natural History, University of Florida, 3215 Hull Road, Gainesville, FL 32611-2710, USA
| | - Pamela S Soltis
- Department of Biology, University of Florida, 220 Bartram Hall, Gainesville, FL 32611-8525, USA
- Florida Museum of Natural History, University of Florida, 3215 Hull Road, Gainesville, FL 32611-2710, USA
| | - Gane Ka-shu Wong
- BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
- Department of Medicine, University of Alberta, Edmonton, AB T6G 2E1, Canada
| | - William J Baker
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Norman J Wickett
- Plant Science and Conservation, Chicago Botanic Garden, 1000 Lake Cook Road, Glencoe, IL 60022, USA
- Program in Plant Biology and Conservation, Northwestern University, 2205 Tech Drive, Evanston, IL 60208, USA
| |
Collapse
|
13
|
Nelsen MP, Ree RH, Moreau CS. Ant-plant interactions evolved through increasing interdependence. Proc Natl Acad Sci U S A 2018; 115:12253-12258. [PMID: 30420513 PMCID: PMC6275543 DOI: 10.1073/pnas.1719794115] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Ant-plant interactions are diverse and abundant and include classic models in the study of mutualism and other biotic interactions. By estimating a time-scaled phylogeny of more than 1,700 ant species and a time-scaled phylogeny of more than 10,000 plant genera, we infer when and how interactions between ants and plants evolved and assess their macroevolutionary consequences. We estimate that ant-plant interactions originated in the Mesozoic, when predatory, ground-inhabiting ants first began foraging arboreally. This served as an evolutionary precursor to the use of plant-derived food sources, a dietary transition that likely preceded the evolution of extrafloral nectaries and elaiosomes. Transitions to a strict, plant-derived diet occurred in the Cenozoic, and optimal models of shifts between strict predation and herbivory include omnivory as an intermediate step. Arboreal nesting largely evolved from arboreally foraging lineages relying on a partially or entirely plant-based diet, and was initiated in the Mesozoic, preceding the evolution of domatia. Previous work has suggested enhanced diversification in plants with specialized ant-associated traits, but it appears that for ants, living and feeding on plants does not affect ant diversification. Together, the evidence suggests that ants and plants increasingly relied on one another and incrementally evolved more intricate associations with different macroevolutionary consequences as angiosperms increased their ecological dominance.
Collapse
Affiliation(s)
- Matthew P Nelsen
- Department of Science and Education, Integrative Research Center, Field Museum of Natural History, Chicago, IL 60605
| | - Richard H Ree
- Department of Science and Education, Integrative Research Center, Field Museum of Natural History, Chicago, IL 60605
| | - Corrie S Moreau
- Department of Science and Education, Integrative Research Center, Field Museum of Natural History, Chicago, IL 60605
| |
Collapse
|
14
|
Raclariu AC, Ţebrencu CE, Ichim MC, Ciupercǎ OT, Brysting AK, de Boer H. What's in the box? Authentication of Echinacea herbal products using DNA metabarcoding and HPTLC. PHYTOMEDICINE : INTERNATIONAL JOURNAL OF PHYTOTHERAPY AND PHYTOPHARMACOLOGY 2018; 44:32-38. [PMID: 29895490 DOI: 10.1016/j.phymed.2018.03.058] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Revised: 02/18/2018] [Accepted: 03/21/2018] [Indexed: 05/26/2023]
Abstract
BACKGROUND Differences in regulatory policies between countries as well as a lack of appropriate standardized methods for the authentication and quality control of herbal products directly impact their quality and safety. Echinacea products are among the top-selling herbal products in Europe and the United States with indications for a broad range of ailments. The increased use of Echinacea species has led to concerns about adulterated products resulting from challenges in morphology-based identification, due to overlapping morphological variation, frequent hybridization between species, and deliberate adulteration. PURPOSE This study addressed the need for a novel analytical strategy in the authentication of herbal products. METHODS A combination of high performance thin layer chromatography (HPTLC) and DNA metabarcoding was employed. Fifty-three Echinacea herbal products marketed across Europe were tested to evaluate the accuracy of these methods in plant identification and their potential for detecting substitutes, adulterants and other unreported plant constituents. RESULTS HPTLC provides high resolution in the detection of Echinacea phytochemical target compounds, but does not offer information on the other species within the product. Alternatively, we showed that the limitation of HPTLC in detecting non-targeted species can be overcome by the complementary use of DNA metabarcoding. Using DNA metabarcoding, Echinacea species were detected in 34 out of the 38 retained products (89%), but with a lack of discriminatory resolution at the species level due to the low level of molecular divergence within the Echinacea genus. All of the tested herbal products showed considerable discrepancies between ingredients listed on the label and the ones detected using DNA metabarcoding, registering an overall ingredient fidelity of only 43%. CONCLUSION The results confirm that DNA metabarcoding can be used to test for the presence of Echinacea species and simultaneously to detect other species present in even highly processed and multi-ingredient herbal products.
Collapse
Affiliation(s)
- Ancuta Cristina Raclariu
- Natural History Museum, University of Oslo, P.O. Box 1172 Blindern, 0318 Oslo, Norway; Stejarul Research Centre for Biological Sciences, National Institute of Research and Development for Biological Sciences, Alexandru cel Bun Street, 6, 610004 Piatra Neamt, Romania
| | - Carmen Elena Ţebrencu
- Research and Processing Center for Medicinal Plants Plantavorel S.A., Cuza Voda Street, 46, 610019, Piatra Neamt, Romania; Academy of Romanian Scientists, Splaiul Independentei, 54, 050094, Bucharest, Romania
| | - Mihael Cristin Ichim
- Stejarul Research Centre for Biological Sciences, National Institute of Research and Development for Biological Sciences, Alexandru cel Bun Street, 6, 610004 Piatra Neamt, Romania
| | - Oana Teodora Ciupercǎ
- Research and Processing Center for Medicinal Plants Plantavorel S.A., Cuza Voda Street, 46, 610019, Piatra Neamt, Romania
| | - Anne Krag Brysting
- Department of Biosciences, Centre for Ecological and Evolutionary Synthesis (CEES), University of Oslo, P.O. Box 1066 Blindern, 0316 Oslo, Norway
| | - Hugo de Boer
- Natural History Museum, University of Oslo, P.O. Box 1172 Blindern, 0318 Oslo, Norway.
| |
Collapse
|
15
|
Eiserhardt WL, Antonelli A, Bennett DJ, Botigué LR, Burleigh JG, Dodsworth S, Enquist BJ, Forest F, Kim JT, Kozlov AM, Leitch IJ, Maitner BS, Mirarab S, Piel WH, Pérez-Escobar OA, Pokorny L, Rahbek C, Sandel B, Smith SA, Stamatakis A, Vos RA, Warnow T, Baker WJ. A roadmap for global synthesis of the plant tree of life. AMERICAN JOURNAL OF BOTANY 2018; 105:614-622. [PMID: 29603138 DOI: 10.1002/ajb2.1041] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Accepted: 11/08/2017] [Indexed: 06/08/2023]
Abstract
Providing science and society with an integrated, up-to-date, high quality, open, reproducible and sustainable plant tree of life would be a huge service that is now coming within reach. However, synthesizing the growing body of DNA sequence data in the public domain and disseminating the trees to a diverse audience are often not straightforward due to numerous informatics barriers. While big synthetic plant phylogenies are being built, they remain static and become quickly outdated as new data are published and tree-building methods improve. Moreover, the body of existing phylogenetic evidence is hard to navigate and access for non-experts. We propose that our community of botanists, tree builders, and informaticians should converge on a modular framework for data integration and phylogenetic analysis, allowing easy collaboration, updating, data sourcing and flexible analyses. With support from major institutions, this pipeline should be re-run at regular intervals, storing trees and their metadata long-term. Providing the trees to a diverse global audience through user-friendly front ends and application development interfaces should also be a priority. Interactive interfaces could be used to solicit user feedback and thus improve data quality and to coordinate the generation of new data. We conclude by outlining a number of steps that we suggest the scientific community should take to achieve global phylogenetic synthesis.
Collapse
Affiliation(s)
- Wolf L Eiserhardt
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
- Department of Bioscience, Aarhus University, Ny Munkegade 116, 8000, Aarhus C, Denmark
| | - Alexandre Antonelli
- Gothenburg Global Biodiversity Centre, Box 461, 405 30, Gothenburg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, 405 30, Gothenburg, Sweden
- Gothenburg Botanical Garden, Carl Skottsbergs Gata 22B, SE-413 19, Gothenburg, Sweden
| | - Dominic J Bennett
- Gothenburg Global Biodiversity Centre, Box 461, 405 30, Gothenburg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, 405 30, Gothenburg, Sweden
- Gothenburg Botanical Garden, Carl Skottsbergs Gata 22B, SE-413 19, Gothenburg, Sweden
| | | | | | | | - Brian J Enquist
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721, USA
- The Santa Fe Institute, Santa Fe, NM, 87501, USA
| | - Félix Forest
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
| | - Jan T Kim
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
| | - Alexey M Kozlov
- Scientific Computing Group, Heidelberg Institute for Theoretical Studies, 69118, Heidelberg, Germany
| | - Ilia J Leitch
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
| | - Brian S Maitner
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721, USA
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, University of California, San Diego, San Diego, CA, 92093, USA
| | - William H Piel
- Yale-NUS College, 16 College Avenue West, Singapore, 138527, Republic of Singapore
| | | | - Lisa Pokorny
- Royal Botanic Gardens, Kew, TW9 3AE, Richmond, Surrey, UK
| | - Carsten Rahbek
- Center for Macroecology, Evolution and Climate, University of Copenhagen, Universitetsparken 15, DK-2100, Copenhagen O, Denmark
- Imperial College London, Silwood Park, Buckhurst Road, Ascot, Berkshire, SL5 7PY, UK
| | - Brody Sandel
- Department of Biology, Santa Clara University, Santa Clara, CA, 95053, USA
| | - Stephen A Smith
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Alexandros Stamatakis
- Scientific Computing Group, Heidelberg Institute for Theoretical Studies, 69118, Heidelberg, Germany
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, 76128, Karlsruhe, Germany
| | - Rutger A Vos
- Naturalis Biodiversity Center, P.O. Box 9517, 2300RA, Leiden, The Netherlands
- Institute of Biology Leiden, P.O. Box 9505, 2300RA, Leiden, The Netherlands
| | - Tandy Warnow
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | | |
Collapse
|
16
|
Smith SA, Brown JW. Constructing a broadly inclusive seed plant phylogeny. AMERICAN JOURNAL OF BOTANY 2018; 105:302-314. [PMID: 29746720 DOI: 10.1002/ajb2.1019] [Citation(s) in RCA: 349] [Impact Index Per Article: 58.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Accepted: 10/19/2017] [Indexed: 05/03/2023]
Abstract
PREMISE OF THE STUDY Large phylogenies can help shed light on macroevolutionary patterns that inform our understanding of fundamental processes that shape the tree of life. These phylogenies also serve as tools that facilitate other systematic, evolutionary, and ecological analyses. Here we combine genetic data from public repositories (GenBank) with phylogenetic data (Open Tree of Life project) to construct a dated phylogeny for seed plants. METHODS We conducted a hierarchical clustering analysis of publicly available molecular data for major clades within the Spermatophyta. We constructed phylogenies of major clades, estimated divergence times, and incorporated data from the Open Tree of Life project, resulting in a seed plant phylogeny. We estimated diversification rates, excluding those taxa without molecular data. We also summarized topological uncertainty and data overlap for each major clade. KEY RESULTS The trees constructed for Spermatophyta consisted of 79,881 and 353,185 terminal taxa; the latter included the Open Tree of Life taxa for which we could not include molecular data from GenBank. The diversification analyses demonstrated nested patterns of rate shifts throughout the phylogeny. Data overlap and inference uncertainty show significant variation throughout and demonstrate the continued need for data collection across seed plants. CONCLUSIONS This study demonstrates a means for combining available resources to construct a dated phylogeny for plants. However, this approach is an early step and more developments are needed to add data, better incorporating underlying uncertainty, and improve resolution. The methods discussed here can also be applied to other major clades in the tree of life.
Collapse
Affiliation(s)
- Stephen A Smith
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, 48109, USA
| | - Joseph W Brown
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, 48109, USA
| |
Collapse
|
17
|
Pease JB, Brown JW, Walker JF, Hinchliff CE, Smith SA. Quartet Sampling distinguishes lack of support from conflicting support in the green plant tree of life. AMERICAN JOURNAL OF BOTANY 2018; 105:385-403. [PMID: 29746719 DOI: 10.1002/ajb2.1016] [Citation(s) in RCA: 105] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 09/05/2017] [Indexed: 05/21/2023]
Abstract
PREMISE OF THE STUDY Phylogenetic support has been difficult to evaluate within the green plant tree of life partly due to a lack of specificity between conflicted versus poorly informed branches. As data sets continue to expand in both breadth and depth, new support measures are needed that are more efficient and informative. METHODS We describe the Quartet Sampling (QS) method, a quartet-based evaluation system that synthesizes several phylogenetic and genomic analytical approaches. QS characterizes discordance in large-sparse and genome-wide data sets, overcoming issues of alignment sparsity and distinguishing strong conflict from weak support. We tested QS with simulations and recent plant phylogenies inferred from variously sized data sets. KEY RESULTS QS scores demonstrated convergence with increasing replicates and were not strongly affected by branch depth. Patterns of QS support from different phylogenies led to a coherent understanding of ancestral branches defining key disagreements, including the relationships of Ginkgo to cycads, magnoliids to monocots and eudicots, and mosses to liverworts. The relationships of ANA-grade angiosperms (Amborella, Nymphaeales, Austrobaileyales), major monocot groups, bryophytes, and fern families are likely highly discordant in their evolutionary histories, rather than poorly informed. QS can also detect discordance due to introgression in phylogenomic data. CONCLUSIONS Quartet Sampling is an efficient synthesis of phylogenetic tests that offers more comprehensive and specific information on branch support than conventional measures. The QS method corroborates growing evidence that phylogenomic investigations that incorporate discordance testing are warranted when reconstructing complex evolutionary histories, in particular those surrounding ANA-grade, monocots, and nonvascular plants.
Collapse
Affiliation(s)
- James B Pease
- Department of Biology, Wake Forest University, 455 Vine Street, Winston-Salem, North Carolina, 27101, USA
| | - Joseph W Brown
- Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University, Ann Arbor, Michigan, 48109, USA
| | - Joseph F Walker
- Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University, Ann Arbor, Michigan, 48109, USA
| | - Cody E Hinchliff
- Department of Biological Sciences, University of Idaho, 875 Perimeter Drive, MS 3051, Moscow, Idaho, 83844, USA
| | - Stephen A Smith
- Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University, Ann Arbor, Michigan, 48109, USA
| |
Collapse
|
18
|
Title PO, Rabosky DL. Do Macrophylogenies Yield Stable Macroevolutionary Inferences? An Example from Squamate Reptiles. Syst Biol 2018; 66:843-856. [PMID: 27821703 DOI: 10.1093/sysbio/syw102] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2015] [Accepted: 10/27/2016] [Indexed: 01/03/2023] Open
Abstract
Advances in the generation, retrieval, and analysis of phylogenetic data have enabled researchers to create phylogenies that contain many thousands of taxa. These "macrophylogenies"-large trees that typically derive from megaphylogeny, supermatrix, or supertree approaches-provide researchers with an unprecedented ability to conduct evolutionary analyses across broad phylogenetic scales. Many studies have now used these phylogenies to explore the dynamics of speciation, extinction, and phenotypic evolution across large swaths of the tree of life. These trees are characterized by substantial phylogenetic uncertainty on multiple levels, and the stability of macroevolutionary inferences from these data sets has not been rigorously explored. As a case study, we tested whether five recently published phylogenies for squamate reptiles-each consisting of more than 4000 species-yield congruent inferences about the processes that underlie variation in species richness across replicate evolutionary radiations of Australian snakes and lizards. We find discordance across the five focal phylogenies with respect to clade age and several diversification rate metrics, and in the effects of clade age on species richness. We also find that crown clade ages reported in the literature on these Australian groups are in conflict with all of the large phylogenies examined. Macrophylogenies offer an unprecedented opportunity to address evolutionary and ecological questions at broad phylogenetic scales, but accurately representing the uncertainty that is inherent to such analyses remains a critical challenge to our field. [Australia; macroevolution; macrophylogeny; squamates; time calibration.].
Collapse
Affiliation(s)
- Pascal O Title
- Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Daniel L Rabosky
- Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
19
|
The development of scientific consensus: Analyzing conflict and concordance among avian phylogenies. Mol Phylogenet Evol 2017; 116:69-77. [DOI: 10.1016/j.ympev.2017.08.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Revised: 08/03/2017] [Accepted: 08/06/2017] [Indexed: 11/22/2022]
|
20
|
Maitner BS, Boyle B, Casler N, Condit R, Donoghue J, Durán SM, Guaderrama D, Hinchliff CE, Jørgensen PM, Kraft NJ, McGill B, Merow C, Morueta‐Holme N, Peet RK, Sandel B, Schildhauer M, Smith SA, Svenning J, Thiers B, Violle C, Wiser S, Enquist BJ. The
bien r
package: A tool to access the Botanical Information and Ecology Network (BIEN) database. Methods Ecol Evol 2017. [DOI: 10.1111/2041-210x.12861] [Citation(s) in RCA: 170] [Impact Index Per Article: 24.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Brian S. Maitner
- Department of Ecology and Evolutionary Biology University of Arizona Tucson AZ USA
| | - Brad Boyle
- Department of Ecology and Evolutionary Biology University of Arizona Tucson AZ USA
| | - Nathan Casler
- National Center for Supercomputing Applications University of Illinois Urbana‐Champaign Urbana IL USA
| | - Rick Condit
- Smithsonian Tropical Research Institute Center for Tropical Forest Science Global Forest Observatory Network Panama City Panama
| | - John Donoghue
- Department of Ecology and Evolutionary Biology University of Arizona Tucson AZ USA
| | - Sandra M. Durán
- Department of Ecology and Evolutionary Biology University of Arizona Tucson AZ USA
| | - Daniel Guaderrama
- Department of Ecology and Evolutionary Biology University of Arizona Tucson AZ USA
| | - Cody E. Hinchliff
- Department of Ecology and Evolutionary Biology University of Michigan Ann Arbor MI USA
| | | | - Nathan J.B. Kraft
- Department of Ecology and Evolutionary Biology University of California Los Angeles CA USA
| | - Brian McGill
- School of Biology and Ecology University of Maine Orono ME USA
| | - Cory Merow
- Department of Ecology and Evolutionary Biology Yale University New Haven CT USA
| | - Naia Morueta‐Holme
- Department of Integrative Biology University of California Berkeley CA USA
| | - Robert K. Peet
- Department of Biology University of North Carolina Chapel Hill NC USA
| | - Brody Sandel
- Department of Biology Santa Clara University Santa Clara CA USA
| | - Mark Schildhauer
- National Center for Ecological Analysis and Synthesis Santa Barbara CA USA
| | - Stephen A. Smith
- Department of Ecology and Evolutionary Biology University of Michigan Ann Arbor MI USA
| | - Jens‐Christian Svenning
- Section for Ecoinformatics & Biodiversity Department of Bioscience Aarhus University Aarhus C Denmark
| | - Barbara Thiers
- William and Lynda Steere Herbarium at the New York Botanical Garden Bronx NY USA
| | - Cyrille Violle
- Center for Functional and Evolutionary Ecology (UMR 5175) CNRS ‐ University of Montpellier ‐ Paul Valéry University of Montpellier EPHE Montpellier France
| | | | - Brian J. Enquist
- Department of Ecology and Evolutionary Biology University of Arizona Tucson AZ USA
- The Santa Fe Institute Santa Fe NM USA
| |
Collapse
|
21
|
Seed size and its rate of evolution correlate with species diversification across angiosperms. PLoS Biol 2017; 15:e2002792. [PMID: 28723902 PMCID: PMC5536390 DOI: 10.1371/journal.pbio.2002792] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2017] [Revised: 07/31/2017] [Accepted: 06/29/2017] [Indexed: 11/19/2022] Open
Abstract
Species diversity varies greatly across the different taxonomic groups that comprise the Tree of Life (ToL). This imbalance is particularly conspicuous within angiosperms, but is largely unexplained. Seed mass is one trait that may help clarify why some lineages diversify more than others because it confers adaptation to different environments, which can subsequently influence speciation and extinction. The rate at which seed mass changes across the angiosperm phylogeny may also be linked to diversification by increasing reproductive isolation and allowing access to novel ecological niches. However, the magnitude and direction of the association between seed mass and diversification has not been assessed across the angiosperm phylogeny. Here, we show that absolute seed size and the rate of change in seed size are both associated with variation in diversification rates. Based on the largest available angiosperm phylogenetic tree, we found that smaller-seeded plants had higher rates of diversification, possibly due to improved colonisation potential. The rate of phenotypic change in seed size was also strongly positively correlated with speciation rates, providing rare, large-scale evidence that rapid morphological change is associated with species divergence. Our study now reveals that variation in morphological traits and, importantly, the rate at which they evolve can contribute to explaining the extremely uneven distribution of diversity across the ToL. Why are some groups of flowering plants extremely diverse while others are very poor in species? Are the traits of species and the rate at which they evolve important in generating this uneven distribution of biodiversity? By using the largest available phylogenetic tree of plants coupled with an unparalleled trait dataset, we analysed how seed size and its rate of change across the phylogeny are correlated with the rate of species formation. Seed size is crucial to plant evolution because it is related to adaptation to environment and influences many aspects of plant life history, including dispersal, resistance to damage and colonisation potential. We found that faster rates of seed size change were associated with faster rates of speciation, probably by fostering the appearance of reproductive barriers between lineages. We also found that smaller seeded species speciated faster than larger seeded ones. These results underscore the importance of morphological traits, and particularly their rate of evolution, in promoting species divergence across one of the largest radiations of organisms on the planet.
Collapse
|
22
|
Gao S, Yan Q, Chen L, Song Y, Li J, Fu C, Dong M. Effects of ploidy level and haplotype on variation of photosynthetic traits: Novel evidence from two Fragaria species. PLoS One 2017; 12:e0179899. [PMID: 28644876 PMCID: PMC5482484 DOI: 10.1371/journal.pone.0179899] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2017] [Accepted: 06/06/2017] [Indexed: 12/26/2022] Open
Abstract
To reveal the effects of ploidy level and haplotype on photosynthetic traits, we chose 175 genotypes of wild strawberries belonging to two haplotypes at two types of ploidy levels (diploidy and tetraploidy) and measured photosynthetic traits. Our results revealed that ploidy significantly affected the characteristics of light-response curves, CO2-response curves, and leaf gas exchange parameters, except intercellular CO2 concentration (Ci). Tetraploid species had a lower light saturation point (LSP) and CO2 saturation point (CSP), higher light compensation point (LCP), dark respiration (Rd), and CO2 compensation point (CCP) than diploid species. Furthermore, tetraploid species have lower photosynthetic capacity than diploid species, including net photosynthetic rate (Pn), stomatal conductivity (Gs), and transpiration rate (Tr). In addition, haplotype had a significant effect on LSP, CSP, Tr, and Ci as well as a significant interactive effect between ploidy and haplotype on the maximal photosynethic rate of the light-response curve and Rd. Most of the variance existed within haplotypes among individuals. These results suggest that polyploidization was the main driver for the evolution of photosynthesis with increasing ploidy level (i.e. from diploidy to tetraploidy in Fragaria species), while the origin of a chromosome could also affect the photosynthetic traits and the polyploidization effect on photosynthetic traits.
Collapse
Affiliation(s)
- Song Gao
- Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, and College of Life Sciences, Zhejiang University, Hangzhou, China
- Research Institute of Zhejiang University-Taizhou, Taizhou, China
- Zhejiang Provincial Key Laboratory of Plant Evolutionary Ecology and Conservation, Taizhou University, Taizhou, China
| | - Qiaodi Yan
- Zhejiang Provincial Key Laboratory of Plant Evolutionary Ecology and Conservation, Taizhou University, Taizhou, China
| | - Luxi Chen
- Zhejiang Provincial Key Laboratory of Plant Evolutionary Ecology and Conservation, Taizhou University, Taizhou, China
| | - Yaobin Song
- Key Laboratory of Hangzhou City for Ecosystem Protection and Restoration, and College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, China
| | - Junmin Li
- Zhejiang Provincial Key Laboratory of Plant Evolutionary Ecology and Conservation, Taizhou University, Taizhou, China
| | - Chengxin Fu
- Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, and College of Life Sciences, Zhejiang University, Hangzhou, China
| | - Ming Dong
- Key Laboratory of Hangzhou City for Ecosystem Protection and Restoration, and College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, China
| |
Collapse
|
23
|
Molina-Venegas R, Rodríguez MÁ. Revisiting phylogenetic signal; strong or negligible impacts of polytomies and branch length information? BMC Evol Biol 2017; 17:53. [PMID: 28201989 PMCID: PMC5312541 DOI: 10.1186/s12862-017-0898-y] [Citation(s) in RCA: 70] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2016] [Accepted: 01/30/2017] [Indexed: 11/10/2022] Open
Abstract
Background Inaccurate estimates of phylogenetic signal may mislead interpretations of many ecological and evolutionary processes, and hence understanding where potential sources of uncertainty may lay has become a priority for comparative studies. Importantly, the sensitivity of phylogenetic signal indices and their associated statistical tests to incompletely resolved phylogenies and suboptimal branch-length information has been only partially investigated. Methods Here, we use simulations of trait evolution along phylogenetic trees to assess whether incompletely resolved phylogenies (polytomic chronograms) and phylogenies with suboptimal branch-length information (pseudo-chronograms) could produce directional biases in significance tests (p-values) associated with Blomberg et al.’s K and Pagel’s lambda (λ) statistics, two of the most widely used indices to measure and test phylogenetic signal. Specifically, we conducted pairwise comparisons between the p-values resulted from the use of “true” chronograms and their degraded counterparts (i.e. polytomic chronograms and pseudo-chronograms), and computed the frequency with which the null hypothesis of no phylogenetic signal was accepted using “true” chronograms but rejected when using their degraded counterparts (type I bias) and vice versa (type II bias). Results We found that the use of polytomic chronograms in combination with Blomberg et al.’s K resulted in both, clearly inflated estimates of phylogenetic signal and moderate levels of type I and II biases. More importantly, pseudo-chronograms led to high rates of type I biases. In contrast, Pagel’s λ was strongly robust to either incompletely resolved phylogenies and suboptimal branch-length information. Conclusions Our results suggest that pseudo-chronograms can lead to strong overestimation of phylogenetic signal when using Blomberg et al.’s K (i.e. high rates of type I biases), while polytomies may be a minor concern given other sources of uncertainty. In contrast, Pagel’s λ seems strongly robust to either incompletely resolved phylogenies and suboptimal branch-length information. Hence, Pagel’s λ may be a more appropriate alternative over Blomberg et al.’s K to measure and test phylogenetic signal in most ecologically relevant traits when phylogenetic information is incomplete. Electronic supplementary material The online version of this article (doi:10.1186/s12862-017-0898-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Rafael Molina-Venegas
- Departamento de Ciencias de la Vida, Universidad de Alcalá, 28805, Alcalá de Henares, Madrid, Spain.
| | - Miguel Á Rodríguez
- Departamento de Ciencias de la Vida, Universidad de Alcalá, 28805, Alcalá de Henares, Madrid, Spain
| |
Collapse
|
24
|
Abstract
Pharmacovigilance of herbal medicines relies on the product label information regarding the ingredients and the adherence to good manufacturing practices along the commercialisation chain. Several studies have shown that substitution of plant species occurs in herbal medicines, and this in turn poses a challenge to herbal pharmacovigilance as adverse reactions might be due to adulterated or added ingredients. Authentication of constituents in herbal medicines using analytical chemistry methods can help detect contaminants and toxins, but are often limited or incapable of detecting the source of the contamination. Recent developments in molecular plant identification using DNA sequence data enable accurate identification of plant species from herbal medicines using defined DNA markers. Identification of multiple constituent species from compound herbal medicines using amplicon metabarcoding enables verification of labelled ingredients and detection of substituted, adulterated and added species. DNA barcoding is proving to be a powerful method to assess species composition in herbal medicines and has the potential to be used as a standard method in herbal pharmacovigilance research of adverse reactions to specific products.
Collapse
|
25
|
Spalink D, Drew BT, Pace MC, Zaborsky JG, Li P, Cameron KM, Givnish TJ, Sytsma KJ. Evolution of geographical place and niche space: Patterns of diversification in the North American sedge (Cyperaceae) flora. Mol Phylogenet Evol 2016; 95:183-95. [DOI: 10.1016/j.ympev.2015.09.028] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2015] [Revised: 09/10/2015] [Accepted: 09/17/2015] [Indexed: 11/15/2022]
|
26
|
Pennell MW, FitzJohn RG, Cornwell WK. A simple approach for maximizing the overlap of phylogenetic and comparative data. Methods Ecol Evol 2016. [DOI: 10.1111/2041-210x.12517] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- Matthew W. Pennell
- Institute for Bioinformatics and Evolutionary Studies; University of Idaho; Moscow ID 83844 USA
- Department of Zoology; Biodiversity Research Centre; University of British Columbia; Vancouver BC V6T 1Z4 Canada
| | - Richard G. FitzJohn
- Department of Biological Sciences; Macquarie University; Sydney NSW 2109 Australia
| | - William K. Cornwell
- Ecology and Evolution Research Centre; School of Biological, Earth and Environmental Sciences; University of New South Wales; Sydney NSW 2052 Australia
- Centre for Ecosystem Science; School of Biological, Earth and Environmental Sciences; University of New South Wales; Sydney NSW 2052 Australia
| |
Collapse
|
27
|
Affiliation(s)
- Sven Buerki
- Department of Life Sciences; Natural History Museum; Cromwell Road London SW7 5BD UK
| | | |
Collapse
|
28
|
Read KB, Sheehan JR, Huerta MF, Knecht LS, Mork JG, Humphreys BL. Sizing the Problem of Improving Discovery and Access to NIH-Funded Data: A Preliminary Study. PLoS One 2015. [PMID: 26207759 PMCID: PMC4514623 DOI: 10.1371/journal.pone.0132735] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Objective This study informs efforts to improve the discoverability of and access to biomedical datasets by providing a preliminary estimate of the number and type of datasets generated annually by research funded by the U.S. National Institutes of Health (NIH). It focuses on those datasets that are “invisible” or not deposited in a known repository. Methods We analyzed NIH-funded journal articles that were published in 2011, cited in PubMed and deposited in PubMed Central (PMC) to identify those that indicate data were submitted to a known repository. After excluding those articles, we analyzed a random sample of the remaining articles to estimate how many and what types of invisible datasets were used in each article. Results About 12% of the articles explicitly mention deposition of datasets in recognized repositories, leaving 88% that are invisible datasets. Among articles with invisible datasets, we found an average of 2.9 to 3.4 datasets, suggesting there were approximately 200,000 to 235,000 invisible datasets generated from NIH-funded research published in 2011. Approximately 87% of the invisible datasets consist of data newly collected for the research reported; 13% reflect reuse of existing data. More than 50% of the datasets were derived from live human or non-human animal subjects. Conclusion In addition to providing a rough estimate of the total number of datasets produced per year by NIH-funded researchers, this study identifies additional issues that must be addressed to improve the discoverability of and access to biomedical research data: the definition of a “dataset,” determination of which (if any) data are valuable for archiving and preservation, and better methods for estimating the number of datasets of interest. Lack of consensus amongst annotators about the number of datasets in a given article reinforces the need for a principled way of thinking about how to identify and characterize biomedical datasets.
Collapse
Affiliation(s)
- Kevin B. Read
- Medical Library, NYU Langone Medical Center, New York, New York, United States of America
- * E-mail:
| | - Jerry R. Sheehan
- National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Michael F. Huerta
- National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Lou S. Knecht
- National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - James G. Mork
- National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Betsy L. Humphreys
- National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
| | | |
Collapse
|