1
|
Lack of local genetic representation in one of the regions with the highest bird species richness, the Peruvian Amazonia. PLoS One 2024; 19:e0296305. [PMID: 38165899 PMCID: PMC10760656 DOI: 10.1371/journal.pone.0296305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2023] [Accepted: 12/08/2023] [Indexed: 01/04/2024] Open
Abstract
Peru ranks among the three countries with the highest bird species diversity globally and a majority of those species are found in the Peruvian Amazon. However, birds in this area are currently facing serious anthropogenic threats. Genetic and genomic methods are becoming important tools for avian biodiversity monitoring and conservation planning. Comprehensive molecular libraries that are publicly available are key to the effective deployment of these tools. We analyze the information gaps for four molecular markers in the most important genetic sequence databases, Barcode of Life Data Systems (BOLD) and NCBI GenBank, for bird species of the Peruvian Amazonia. We found that 64% of Peruvian Amazonian bird species have gene sequences for COI, 59.5% have CYTB sequences, 16.4% have 12S sequences, and only 0.6% have 18S sequences. However, these numbers decrease drastically to 4.3% for COI sequences when we only consider specimens sampled in Peru. Our data also showed that 43.8% of Peruvian Amazonian endemic species (n = 32) are missing sequences of any screened marker uploaded to GenBank or BOLD. Our results will encourage and guide efforts of the scientific community to complete reference libraries for Peruvian avian species that will be useful for future DNA-based monitoring projects that include birds.
Collapse
|
2
|
Assessing the Seasonal and Spatial Dynamics of Zooplankton through DNA Metabarcoding in a Temperate Estuary. Animals (Basel) 2023; 13:3876. [PMID: 38136913 PMCID: PMC10740712 DOI: 10.3390/ani13243876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/07/2023] [Accepted: 12/13/2023] [Indexed: 12/24/2023] Open
Abstract
Zooplankton are key components of estuarine trophic networks. However, routine monitoring is hindered by the difficulty of morphology-based identification. DNA-based methods allow us to circumvent some of these hurdles, providing precise species identifications regardless of the taxonomic expertise of the investigator or the developmental stage of the specimens. However, the process is dependent on the completeness of the reference libraries. In this study, we sought to evaluate the potential of DNA metabarcoding to assess the seasonal (summer, autumn, and early spring) and spatial dynamics of zooplankton (four locations spanning ca. 6 km) in the Lima estuary (NW Portugal). Two genetic markers were used: the cytochrome c oxidase subunit I and the V4 hypervariable region of the ribosomal 18S rRNA genes. Overall, 327 species were recovered, and both markers displayed minute overlap (7% were detected with both markers). Species richness, composition, and taxonomic distinctness were majorly influenced by the season, with a declining tendency from summer (highest number of exclusive species, n = 74) to spring. Second to season, the taxa composition was influenced by spatial variation where the most downstream site displayed the highest number of exclusive species, n = 53. A total of 16 non-indigenous species were detected using metabarcoding, but only one (Austrominus modestus) has been documented out in the estuary. In conclusion, both the seasonal and spatial gradients influenced the recovered richness, composition, and taxonomic distinctness, confirming the great aptitude of DNA metabarcoding for providing higher density monitoring and shedding new light on the composition and dynamics of complex zooplankton communities.
Collapse
|
3
|
Maximizing the reliability and the number of species assignments in metabarcoding studies using a curated regional library and a public repository. METABARCODING AND METAGENOMICS 2023. [DOI: 10.3897/mbmg.7.98539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023] Open
Abstract
Biodiversity assessments relying on DNA have increased rapidly over the last decade. However, the reliability of taxonomic assignments in metabarcoding studies is variable and affected by the reference databases and the assignment methods used. Species level assignments are usually considered as reliable using regional libraries but unreliable using public repositories. In this study, we aimed to test this assumption for metazoan species detected in the Gulf of St. Lawrence in the Northwest Atlantic. We first created a regional library (GSL-rl) by data mining COI barcode sequences from BOLD, and included a reliability ranking system for species assignments. We then estimated 1) the accuracy and precision of the public repository NCBI-nt for species assignments using sequences from the regional library and 2) compared the detection and reliability of species assignments of a metabarcoding dataset using either NCBI-nt or the regional library and popular assignment methods. With NCBI-nt and sequences from the regional library, the BLAST-LCA (least common ancestor) method was the most precise method for species assignments, but the accuracy was higher with the BLAST-TopHit method (>80% over all taxa, between 70% and 90% amongst taxonomic groups). With the metabarcoding dataset, the reliability of species assignments was greater using GSL-rl compared to NCBI-nt. However, we also observed that the total number of reliable species assignments could be maximized using both GSL-rl and NCBI-nt with different optimized assignment methods. The use of a two-step approach for species assignments, i.e., using a regional library and a public repository, could improve the reliability and the number of detected species in metabarcoding studies.
Collapse
|
4
|
Compilation, Revision, and Annotation of DNA Barcodes of Marine Invertebrate Non-Indigenous Species (NIS) Occurring in European Coastal Regions. DIVERSITY 2023. [DOI: 10.3390/d15020174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
The introduction of non-indigenous species (NIS) is one of the major threats to the integrity of European coastal ecosystems. DNA-based assessments have been increasingly adopted for monitoring NIS. However, the accuracy of DNA-based taxonomic assignments is largely dependent on the completion and reliability of DNA barcode reference libraries. As such, we aimed to compile and audit a DNA barcode reference library for marine invertebrate NIS occurring in Europe. To do so, we compiled a list of NIS using three databases: the European Alien Species Information Network (EASIN), the Information System on Aquatic Non-indigenous and Cryptogenic Species (AquaNIS), and the World Register of Introduced Marine Species (WRiMS). For each species, we retrieved the available cytochrome c oxidase subunit I (COI) mitochondrial gene sequences from the Barcode of Life Data System (BOLD) and used the Barcode, Audit & Grade System (BAGS) to check congruence between morphospecies names and Barcode Index Numbers (BINs). From the 1249 species compiled, approximately 42% had records on BOLD, among which 56% were discordant. We further analyzed these cases to determine the causes of the discordances and attributed additional annotation tags. Of the 622 discordant BINs, after revision, 35% were successfully solved, which increased the number of NIS detected in metabarcoding datasets from 12 to 16. However, a fair number of BINs remained discordant. Reliability of reference barcode records is particularly critical in the case of NIS, where erroneous identification may trigger action or inaction when not required.
Collapse
|
5
|
Abstract
In the age of global climate change and biodiversity loss there is an urgent need to provide effective and robust tools for diversity monitoring. One of the promising techniques for species identification is the use of DNA barcoding, that in Metazoa utilizes the so called 'gold-standard' gene of cytochrome c oxidase (COI). However, the success of this method relies on the existence of trustworthy barcode libraries of the species. The Barcode of Life Data System (BOLD) aims to provide barcodes for all existing organisms, and is complemented by the Barcode Index Number (BIN) system serving as a tool for potential species recognition. Here we provide an analysis of all public COI sequences available in BOLD of the diverse and ubiquitous crustacean order Amphipoda, to identify the barcode library gaps and provide recommendations for future barcoding studies. Our gap analysis of 25,702 records has shown that although 3,835 BINs (indicating putative species) were recognised by BOLD, only 10% of known amphipod species are represented by barcodes. We have identified almost equal contribution of both records (sequences) and BINs associated with freshwater and with marine realms. Three quarters of records have a complete species-level identification provided, while BINs have just 50%. Large disproportions between identification levels of BINs coming from freshwaters and the marine environment were observed, with three quarters of the former possessing a species name, and less than 40% for the latter. Moreover, the majority of BINs are represented by a very low number of sequences rendering them unreliable according to the quality control system. The geographical coverage is poor with vast areas of Africa, South America and the open ocean acting as "white gaps". Several, of the most species rich and highly abundant families of Amphipoda (e.g., Phoxocephalidae, Ampeliscidae, Caprellidae), have very poor representation in the BOLD barcode library. As a result of our study we recommend stronger effort in identification of already recognised BINs, prioritising the studies of families that are known to be important and abundant components of particular communities, and targeted sampling programs for taxa coming from geographical regions with the least knowledge.
Collapse
|
6
|
The identification of fish eggs of two species, the ovate sole Solea ovata and black porgy Acanthopagrus schlegelii. JOURNAL OF FISH BIOLOGY 2021; 99:1746-1751. [PMID: 34270090 DOI: 10.1111/jfb.14854] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 07/01/2021] [Accepted: 07/14/2021] [Indexed: 06/13/2023]
Abstract
Fish eggs of the ovate sole Solea ovata and black porgy Acanthopagrus schlegelii were identified through DNA barcoding of cytochrome c oxidase subunit I (COX1). Visual taxonomic features were achieved, and photographs of the eggs of both species at different developmental stages were reported for the first time. In addition, the dissolution of oil globules caused by ethanol as egg fixatives was observed. This result showed the importance of using formalin as egg fixatives in the case of morphometric analysis and the necessity of combining molecular and visual taxonomic method for morphological study.
Collapse
|
7
|
|
8
|
Meta-Fish-Lib: A generalised, dynamic DNA reference library pipeline for metabarcoding of fishes. JOURNAL OF FISH BIOLOGY 2021; 99:1446-1454. [PMID: 34269417 DOI: 10.1111/jfb.14852] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 07/06/2021] [Accepted: 07/12/2021] [Indexed: 06/13/2023]
Abstract
The accuracy and reliability of DNA metabarcoding analyses depend on the breadth and quality of the reference libraries that underpin them. However, there are limited options available to obtain and curate the huge volumes of sequence data that are available on public repositories such as NCBI and BOLD. Here, we provide a pipeline to download, clean and annotate mitochondrial DNA sequence data for a given list of fish species. Features of this pipeline include (a) support for multiple metabarcode markers; (b) searches on species synonyms and taxonomic name validation; (c) phylogeny assisted quality control for identification and removal of misannotated sequences; (d) automatically generated coverage reports for each new GenBank release update; and (e) citable, versioned DOIs. As an example we provide a ready-to-use curated reference library for the marine and freshwater fishes of the U.K. To augment this reference library for environmental DNA metabarcoding specifically, we generated 241 new MiFish-12S sequences for 88 U.K. marine species, and make available new primer sets useful for sequencing these. This brings the coverage of common U.K. species for the MiFish-12S fragment to 93%, opening new avenues for scaling up fish metabarcoding across wide spatial gradients. The Meta-Fish-Lib reference library and pipeline is hosted at https://github.com/genner-lab/meta-fish-lib.
Collapse
|
9
|
On the Diversity of Phyllodocida (Annelida: Errantia), with a Focus on Glyceridae, Goniadidae, Nephtyidae, Polynoidae, Sphaerodoridae, Syllidae, and the Holoplanktonic Families. DIVERSITY-BASEL 2021. [DOI: 10.3390/d13030131] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Phyllodocida is a clade of errantiate annelids characterized by having ventral sensory palps, anterior enlarged cirri, axial muscular proboscis, compound chaetae (if present) with a single ligament, and of lacking dorsolateral folds. Members of most families date back to the Carboniferous, although the earliest fossil was dated from the Devonian. Phyllodocida holds 27 well-established and morphologically homogenous clades ranked as families, gathering more than 4600 currently accepted nominal species. Among them, Syllidae and Polynoidae are the most specious polychaete groups. Species of Phyllodocida are mainly found in the marine benthos, although a few inhabit freshwater, terrestrial and planktonic environments, and occur from intertidal to deep waters in all oceans. In this review, we (1) explore the current knowledge on species diversity trends (based on traditional species concept and molecular data), phylogeny, ecology, and geographic distribution for the whole group, (2) try to identify the main knowledge gaps, and (3) focus on selected families: Alciopidae, Goniadidae, Glyceridae, Iospilidae, Lopadorrhynchidae, Polynoidae, Pontodoridae, Nephtyidae, Sphaerodoridae, Syllidae, Tomopteridae, Typhloscolecidae, and Yndolaciidae. The highest species richness is concentrated in European, North American, and Australian continental shelves (reflecting a strong sampling bias). While most data come from shallow coastal and surface environments most world oceans are clearly under-studied. The overall trends indicate that new descriptions are constantly added through time and that less than 10% of the known species have molecular barcode information available.
Collapse
|
10
|
4SpecID: Reference DNA Libraries Auditing and Annotation System for Forensic Applications. Genes (Basel) 2021; 12:genes12010061. [PMID: 33401773 PMCID: PMC7824288 DOI: 10.3390/genes12010061] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Revised: 12/21/2020] [Accepted: 12/30/2020] [Indexed: 01/01/2023] Open
Abstract
Forensic genetics is a fast-growing field that frequently requires DNA-based taxonomy, namely, when evidence are parts of specimens, often highly processed in food, potions, or ointments. Reference DNA-sequences libraries, such as BOLD or GenBank, are imperative tools for taxonomic assignment, particularly when morphology is inadequate for classification. The auditing and curation of these datasets require reliable mechanisms, preferably with automated data preprocessing. Software tools were developed to grade these datasets considering as primary criterion the number of records, which is not compliant with forensic standards, where the priority is validation from independent sources. Moreover, 4SpecID is an efficient and freely available software tool developed to audit and annotate reference libraries, specifically designed for forensic applications. Its intuitive user-friendly interface virtually accesses any database and includes specific data mining functions tuned for the widespread BOLD repositories. The built tool was evaluated in laptop MacBook and a dual-Xeon server with a large BOLD dataset (Culicidae, 36,115 records), and the best execution time to grade the dataset on the laptop was 0.28 s. Datasets of Bovidae and Felidae families were used to evaluate the quality of the tool and the relevance of independent sources validation.
Collapse
|
11
|
DNA barcoding in the Southeast Pacific marine realm: Low coverage and geographic representation despite high diversity. PLoS One 2020; 15:e0244323. [PMID: 33370342 PMCID: PMC7769448 DOI: 10.1371/journal.pone.0244323] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Accepted: 12/07/2020] [Indexed: 11/25/2022] Open
Abstract
The Southeast Pacific comprises two Large Marine Ecosystems, the Pacific Central-American Coastal and the Humboldt Current System; and is one of the less well known in the tropical subregions in terms of biodiversity. To address this, we compared DNA barcoding repositories with the marine biodiversity species for the Southeast Pacific. We obtained a checklist of marine species in the Southeast Pacific (i.e. Colombia, Ecuador, Chile, and Peru) from the Ocean Biodiversity Information System (OBIS) database and compared it with species available at the Barcoding of Life Data System (BOLD) repository. Of the 5504 species records retrieved from OBIS, 42% of them had at least one registered specimen in BOLD (including specimens around the world); however, only 4.5% of records corresponded to publicly available DNA barcodes including specimens collected from a Southeast Pacific country. The low representation of barcoded species does not vary much across the different taxonomic groups or within countries, but we observed an asymmetric distribution of DNA barcoding records for taxonomic groups along the coast, being more abundant for the Humboldt Current System than the Pacific Central-American Coastal. We observed high-level of barcode records with Barcode Index Number (BIN) incongruences, particularly for fishes (Actinopterygii = 30.27% and Elasmobranchii = 24.71%), reflecting taxonomic uncertainties for fishes, whereas for Invertebrates and Mammalia more than 85% of records were classified as data deficient or inadequate procedure for DNA barcoding. DNA barcoding is a powerful tool to study biodiversity, with a great potential to increase the knowledge of the Southeast Pacific marine biodiversity. Our results highlight the critical need for increasing taxonomic sampling effort, the number of trained taxonomic specialists, laboratory facilities, scientific collections, and genetic reference libraries.
Collapse
|
12
|
BAGS: An automated Barcode, Audit & Grade System for DNA barcode reference libraries. Mol Ecol Resour 2020; 21:573-583. [PMID: 33000878 DOI: 10.1111/1755-0998.13262] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Revised: 09/01/2020] [Accepted: 09/07/2020] [Indexed: 01/12/2023]
Abstract
Biodiversity studies greatly benefit from molecular tools, such as DNA metabarcoding, which provides an effective identification tool in biomonitoring and conservation programmes. The accuracy of species-level assignment, and consequent taxonomic coverage, relies on comprehensive DNA barcode reference libraries. The role of these libraries is to support species identification, but accidental errors in the generation of the barcodes may compromise their accuracy. Here, we present an R-based application, Barcode, Audit & Grade System (BAGS) (https://github.com/tadeu95/BAGS), that performs automated auditing and annotation of cytochrome c oxidase subunit I (COI) sequences libraries, for a given taxonomic group of animals, available in the Barcode of Life Data System (BOLD). This is followed by implementing a qualitative ranking system that assigns one of five grades (A to E) to each species in the reference library, according to the attributes of the data and congruency of species names with sequences clustered in barcode index numbers (BINs). Our goal is to allow researchers to obtain the most useful and reliable data, highlighting and segregating records according to their congruency. Different tests were performed to perceive its usefulness and limitations. BAGS fulfils a significant gap in the current landscape of DNA barcoding research tools by quickly screening reference libraries to gauge the congruence status of data and facilitate the triage of ambiguous data for posterior review. Thereby, BAGS has the potential to become a valuable addition in forthcoming DNA metabarcoding studies, in the long term contributing to globally improve the quality and reliability of the public reference libraries.
Collapse
|
13
|
Reference DNA databases for forensic species identification: Auditing algorithms. FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES 2019. [DOI: 10.1016/j.fsigss.2019.10.091] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
14
|
The conundrum of species delimitation: a genomic perspective on a mitogenetically super-variable butterfly. Proc Biol Sci 2019; 286:20191311. [PMID: 31530141 PMCID: PMC6784721 DOI: 10.1098/rspb.2019.1311] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The Palaearctic butterfly Melitaea didyma stands out as one of the most striking cases of intraspecific genetic differentiation detected in Lepidoptera: 11 partially sympatric mitochondrial lineages have been reported, displaying levels of divergence of up to 7.4%. To better understand the evolutionary processes underlying the diversity observed in mtDNA, we compared mtDNA and genome-wide SNP data using double-digest restriction site-associated DNA sequencing (ddRADseq) results from 93 specimens of M. didyma ranging from Morocco to eastern Kazakhstan. We found that, between ddRADseq and mtDNA results, there is a match only in populations that probably remained allopatric for long periods of time. Other mtDNA lineages may have resulted from introgression events and were probably affected by Wolbachia infection. The five main ddRADseq clades supported by STRUCTURE were parapatric or allopatric and showed high pairwise FST values, but some were also estimated to display various levels of gene flow. Melitaea didyma represents one of the first cases of deep mtDNA splits among European butterflies assessed by a genome-wide DNA analysis and reveals that the interpretation of patterns remains challenging even when a high amount of genomic data is available. These findings actualize the ongoing debate of species delimitation in allopatry, an issue probably of relevance to a significant proportion of global biodiversity.
Collapse
|
15
|
DNA barcode reference libraries for the monitoring of aquatic biota in Europe: Gap-analysis and recommendations for future work. THE SCIENCE OF THE TOTAL ENVIRONMENT 2019; 678:499-524. [PMID: 31077928 DOI: 10.1016/j.scitotenv.2019.04.247] [Citation(s) in RCA: 157] [Impact Index Per Article: 31.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2019] [Revised: 04/16/2019] [Accepted: 04/16/2019] [Indexed: 05/21/2023]
Abstract
Effective identification of species using short DNA fragments (DNA barcoding and DNA metabarcoding) requires reliable sequence reference libraries of known taxa. Both taxonomically comprehensive coverage and content quality are important for sufficient accuracy. For aquatic ecosystems in Europe, reliable barcode reference libraries are particularly important if molecular identification tools are to be implemented in biomonitoring and reports in the context of the EU Water Framework Directive (WFD) and the Marine Strategy Framework Directive (MSFD). We analysed gaps in the two most important reference databases, Barcode of Life Data Systems (BOLD) and NCBI GenBank, with a focus on the taxa most frequently used in WFD and MSFD. Our analyses show that coverage varies strongly among taxonomic groups, and among geographic regions. In general, groups that were actively targeted in barcode projects (e.g. fish, true bugs, caddisflies and vascular plants) are well represented in the barcode libraries, while others have fewer records (e.g. marine molluscs, ascidians, and freshwater diatoms). We also found that species monitored in several countries often are represented by barcodes in reference libraries, while species monitored in a single country frequently lack sequence records. A large proportion of species (up to 50%) in several taxonomic groups are only represented by private data in BOLD. Our results have implications for the future strategy to fill existing gaps in barcode libraries, especially if DNA metabarcoding is to be used in the monitoring of European aquatic biota under the WFD and MSFD. For example, missing species relevant to monitoring in multiple countries should be prioritized for future collaborative programs. We also discuss why a strategy for quality control and quality assurance of barcode reference libraries is needed and recommend future steps to ensure full utilisation of metabarcoding in aquatic biomonitoring.
Collapse
|
16
|
Categorization of species as native or nonnative using DNA sequence signatures without a complete reference library. ECOLOGICAL APPLICATIONS : A PUBLICATION OF THE ECOLOGICAL SOCIETY OF AMERICA 2019; 29:e01914. [PMID: 31050090 PMCID: PMC7079013 DOI: 10.1002/eap.1914] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Revised: 03/18/2019] [Accepted: 04/01/2019] [Indexed: 05/26/2023]
Abstract
New genetic diagnostic approaches have greatly aided efforts to document global biodiversity and improve biosecurity. This is especially true for organismal groups in which species diversity has been underestimated historically due to difficulties associated with sampling, the lack of clear morphological characteristics, and/or limited availability of taxonomic expertise. Among these methods, DNA sequence barcoding (also known as "DNA barcoding") and by extension, meta-barcoding for biological communities, has emerged as one of the most frequently utilized methods for DNA-based species identifications. Unfortunately, the use of DNA barcoding is limited by the availability of complete reference libraries (i.e., a collection of DNA sequences from morphologically identified species), and by the fact that the vast majority of species do not have sequences present in reference databases. Such conditions are critical especially in tropical locations that are simultaneously biodiversity rich and suffer from a lack of exploration and DNA characterization by trained taxonomic specialists. To facilitate efforts to document biodiversity in regions lacking complete reference libraries, we developed a novel statistical approach that categorizes unidentified species as being either likely native or likely nonnative based solely on measures of nucleotide diversity. We demonstrate the utility of this approach by categorizing a large sample of specimens of terrestrial insects and spiders (collected as part of the Moorea BioCode project) using a generalized linear mixed model (GLMM). Using a training data set of known endemic (n = 45) and known introduced species (n = 102), we then estimated the likely native/nonnative status for 4,663 specimens representing an estimated 1,288 species (412 identified species), including both those specimens that were either unidentified or whose endemic/introduced status was uncertain. Using this approach, we were able to increase the number of categorized specimens by a factor of 4.4 (from 794 to 3,497), and the number of categorized species by a factor of 4.8 from (147 to 707) at a rate much greater than chance (77.6% accuracy). The study identifies phylogenetic signatures of both native and nonnative species and suggests several practical applications for this approach including monitoring biodiversity and facilitating biosecurity.
Collapse
|
17
|
Fish DNA barcoding around large marine infrastructure for improved biodiversity assessment and monitoring. Mitochondrial DNA A DNA Mapp Seq Anal 2018; 29:1174-1179. [PMID: 29373939 DOI: 10.1080/24701394.2018.1431225] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Accurate species-level identification is pivotal for environmental assessments and monitoring. The PERU LNG terminal is composed of large marine infrastructure located on the central coast of Peru. Since construction, taxonomically challenging species such as drum fishes (Sciaenidae) have been attracted to the new hard-bottom habitat. We conducted a DNA barcoding study to investigate fish diversity and constructed a DNA barcode reference library. We examined 56 vouchered specimens and identified 24 unique species. Intra- and interspecific divergence estimates ranged between 0 and 0.64% and 11 and 35.5%, respectively. We assessed the efficiency of the reference library to identify 29 non-vouchered specimens. We had 82.5% efficiency by using both our reference library (n = 17) and GenBank (n = 24). We highlight the importance of implementing molecular barcoding for complementing biodiversity assessments in marine environments. This study represents a first step towards generating a comprehensive DNA barcode reference library for marine fishes in Peru.
Collapse
|
18
|
Why We Need Sustainable Networks Bridging Countries, Disciplines, Cultures and Generations for Aquatic Biomonitoring 2.0: A Perspective Derived From the DNAqua-Net COST Action. ADV ECOL RES 2018. [DOI: 10.1016/bs.aecr.2018.01.001] [Citation(s) in RCA: 89] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
|
19
|
Barcoding Atlantic Canada's mesopelagic and upper bathypelagic marine fishes. PLoS One 2017; 12:e0185173. [PMID: 28931082 PMCID: PMC5607201 DOI: 10.1371/journal.pone.0185173] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2017] [Accepted: 09/07/2017] [Indexed: 12/02/2022] Open
Abstract
DNA barcode sequences were developed from 557 mesopelagic and upper bathypelagic teleost specimens collected in waters off Atlantic Canada. Confident morphological identifications were available for 366 specimens, of 118 species and 93 genera, which yielded 328 haplotypes. Five of the species were novel to the Barcode of Life Database (BOLD). Most of the 118 species conformed to expectations of monophyly and the presence of a “barcode gap”, though some known weaknesses in existing taxonomy were confirmed and a deficiency in published keys was revealed. Of the specimens for which no firm morphological identification was available, 156 were successfully identified to species, and a further 11 to genus, using their barcode sequences and a combination of distance- and character-based methods. The remaining 24 specimens were from species for which no reference barcode is yet available or else ones confused by apparent misidentification of publicly available sequences in BOLD. Addition of the new sequences to those previously in BOLD contributed support to recent taxonomic revisions of Chiasmodon and Poromitra, while it also revealed 18 cases of potential cryptic speciation. Most of the latter appear to result from genetic divergence among populations in different ocean basins, while the general lack of strong horizontal environmental gradients within the deep sea has allowed morphology to be conserved. Other examples of divergence appear to distinguish individuals living under the sub-tropical gyre of the North Atlantic from those under that ocean’s sub-polar gyre. In contrast, the available sequences for two myctophid species, Benthosema glaciale and Notoscopelus elongatus, showed genetic structuring on finer geographic scales. The observed structure was not consistent with recent suggestions that “resident” populations of myctophids can maintain allopatry despite the mixing of ocean waters. Rather, it indicates that the very rapid speciation characteristic of the Myctophidae is both on-going and detectable using barcodes.
Collapse
|
20
|
Harnessing the Power of Genomics to Secure the Future of Seafood. Trends Ecol Evol 2017; 32:665-680. [PMID: 28818341 DOI: 10.1016/j.tree.2017.06.010] [Citation(s) in RCA: 100] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Revised: 06/16/2017] [Accepted: 06/20/2017] [Indexed: 11/15/2022]
Abstract
Best use of scientific knowledge is required to maintain the fundamental role of seafood in human nutrition. While it is acknowledged that genomic-based methods allow the collection of powerful data, their value to inform fisheries management, aquaculture, and biosecurity applications remains underestimated. We review genomic applications of relevance to the sustainable management of seafood resources, illustrate the benefits of, and identify barriers to their integration. We conclude that the value of genomic information towards securing the future of seafood does not need to be further demonstrated. Instead, we need immediate efforts to remove structural roadblocks and focus on ways that support integration of genomic-informed methods into management and production practices. We propose solutions to pave the way forward.
Collapse
|
21
|
Fish, genes and genomes: contributions to ecology, evolution and management. JOURNAL OF FISH BIOLOGY 2016; 89:2471-2478. [PMID: 27921308 DOI: 10.1111/jfb.13228] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
|