1
Calderon MS, Bustamante DE, Perez J, Fernandez-Güimac SLJ, Mendoza JE, Barboza JI, Ayala RY, Carrion JV. Diversity and functional role of bacterial microbiota in spontaneous coffee fermentation in northern Peru using shotgun metagenomics. J Food Sci 2024; 89:9692-9710. [PMID: 39636804 DOI: 10.1111/1750-3841.17583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2024] [Revised: 11/14/2024] [Accepted: 11/17/2024] [Indexed: 12/07/2024]
Abstract
Peru is the ninth-largest coffee producer and the largest organic coffee exporter worldwide. Specific modifications in the microbial consortia during fermentation control the flavor of coffee. It is still unclear how fermentation duration affects microbial communities. This study aimed to provide insights into the diversity and functional behavior of the bacterial microbiome during coffee fermentation in northern Peru using shotgun metagenomics. Accordingly, metagenomic DNA was extracted and sequenced from samples of the liquid fraction during the short fermentation process (SFP) in Amazonas (6 and 12 h) and long fermentation process (LFP) in Cajamarca (6, 12, 18, 24, and 36 h). Our findings indicate that common (e.g., Acetobacter, Lactobacillus, Leuconostoc, and Weissella) and unique (e.g., Acidiphilium and Methylobacterium) acid-tolerant bacteria from the SFP and LFP play crucial roles and have a positive impact on the sensory qualities of coffee. Specifically, the LFP from San Ignacio might be associated with the high sensory quality of coffee based on the release of catalytic, hydrolase, oxidoreductase, transferase, and transporter enzymes in the InterPro and KEGG profiles. Additionally, these bacterial microorganisms metabolize several compounds (e.g., isoleucine, betaine, galactose, tryptophan, arginine, and cobalamin) into volatile compounds, mainly in the LFP, enhancing the flavor and aroma of coffees. This characteristic suggests that the LFP has a stronger effect on coffee quality than does the SFP on the basis of bacterial diversity and functional prediction. These findings provide new perspectives on the potential biotechnological uses of autochthonous microorganisms to produce superior-quality coffee beans from northern Peru.
Affiliation(s)
- Martha S Calderon
- Instituto de Investigación en Ingeniería Ambiental (INAM), Facultad de Ingeniería Civil y Ambiental (FICIAM), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
- Instituto de Investigación para el Desarrollo Sustentable de Ceja de Selva (INDES-CES), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
- Danilo E Bustamante
- Instituto de Investigación en Ingeniería Ambiental (INAM), Facultad de Ingeniería Civil y Ambiental (FICIAM), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
- Instituto de Investigación para el Desarrollo Sustentable de Ceja de Selva (INDES-CES), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
- Jhordy Perez
- Instituto de Investigación para el Desarrollo Sustentable de Ceja de Selva (INDES-CES), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
- Samia L J Fernandez-Güimac
- Instituto de Investigación para el Desarrollo Sustentable de Ceja de Selva (INDES-CES), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
- Jani E Mendoza
- Instituto de Investigación para el Desarrollo Sustentable de Ceja de Selva (INDES-CES), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
- José I Barboza
- Instituto de Investigación para el Desarrollo Sustentable de Ceja de Selva (INDES-CES), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
- Rosmery Y Ayala
- Instituto de Investigación para el Desarrollo Sustentable de Ceja de Selva (INDES-CES), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
- Jois V Carrion
- Instituto de Investigación para el Desarrollo Sustentable de Ceja de Selva (INDES-CES), Universidad Nacional Toribio Rodríguez de Mendoza, Chachapoyas, Amazonas, Peru
2
Serrana JM, Watanabe K. Haplotype-level metabarcoding of freshwater macroinvertebrate species: A prospective tool for population genetic analysis. PLoS One 2023; 18:e0289056. [PMID: 37486933 PMCID: PMC10365294 DOI: 10.1371/journal.pone.0289056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/26/2023] [Accepted: 07/10/2023] [Indexed: 07/26/2023] Open
Abstract
Metabarcoding is a molecular-based tool capable of large quantity high-throughput species identification from bulk samples that is a faster and more cost-effective alternative to conventional DNA-sequencing approaches. Still, further exploration and assessment of the laboratory and bioinformatics strategies are required to unlock the potential of metabarcoding-based inference of haplotype information. In this study, we assessed the inference of freshwater macroinvertebrate haplotypes from metabarcoding data in a mock sample. We also examined the influence of DNA template concentration and PCR cycle on detecting true and spurious haplotypes. We tested this strategy on a mock sample containing twenty individuals from four species with known haplotypes based on the 658-bp Folmer region of the mitochondrial cytochrome c oxidase gene. We recovered fourteen zero-radius operational taxonomic units (zOTUs) of 421-bp length, with twelve zOTUs having a 100% match with the Sanger haplotype sequences. High-quality reads relatively increased with increasing PCR cycles, and the relative abundance of each zOTU was consistent for each cycle. This suggests that increasing the PCR cycles from 24 to 64 did not affect the relative abundance of each zOTU. As metabarcoding becomes more established and laboratory protocols and bioinformatic pipelines are continuously being developed, our study demonstrated the method's ability to infer intraspecific variability while highlighting the challenges that must be addressed before its eventual application for population genetic studies.
Affiliation(s)
- Joeselle M Serrana
- Center for Marine Environmental Studies, Ehime University, Matsuyama, Ehime, Japan
- Faculty of Engineering, Graduate School of Science and Engineering, Ehime University, Matsuyama, Ehime, Japan
- Kozo Watanabe
- Center for Marine Environmental Studies, Ehime University, Matsuyama, Ehime, Japan
3
Hughes MJ, Braun de Torrez EC, Buckner EA, Ober HK. Consumption of endemic arbovirus mosquito vectors by bats in the southeastern United States. J Vector Ecol 2022; 47:153-165. [PMID: 36314669 DOI: 10.52707/1081-1710-47.2.153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2021] [Accepted: 04/28/2022] [Indexed: 06/16/2023]
Abstract
Mosquitoes affect human health and well-being globally through their roles as disease-causing pathogen vectors. Utilizing genetic techniques, we conducted a large-scale dietary study of three bat species common to the southeastern U.S.A., Lasiurus seminolus (Seminole bat), Nycticeius humeralis (evening bat), and Myotis austroriparius (southeastern myotis). Through next-generation sequencing of a 180 bp portion of cytochrome oxidase subunit I (COI) of mitochondrial DNA from 180 bat guano samples, we documented consumption of 17 species of mosquitoes by bats, including six endemic arbovirus vectors. Culex quinquefasciatus, Culex coronator, Culiseta melanura, Culex salinarius, Culex erraticus, and Coquillettidia perturbans were consumed by 51.3%, 43.7%, 27.2%, 22.8%, 18.0%, and 12.7% of bats sampled, respectively. Consumption of two of these mosquito species was explained by spatial variables reflecting the prevalence of mosquito larval habitat, five were explained by bat traits (bat mass, bat species), and two were explained by these factors plus temporal variables (maximum daily temperature, time since sunset, date), making it challenging to offer specific guidance on how best to promote bats as a means of reducing arbovirus vector species. Our results show that common bat species of the southeastern U.S.A. consume endemic, but not exotic, arbovirus mosquito vectors. Future studies are needed to understand the impact of bat consumption on mosquito numbers and public health.
Affiliation(s)
- Morgan J Hughes
- Department of Wildlife Ecology and Conservation, University of Florida, Gainesville, FL, U.S.A
- Elizabeth C Braun de Torrez
- Fish and Wildlife Research Institute, Florida Fish and Wildlife Conservation Commission, Gainesville, FL, U.S.A
- Eva A Buckner
- University of Florida, Institute of Food and Agricultural Sciences, Department of Entomology and Nematology, Florida Medical Entomology Laboratory, Vero Beach, FL, U.S.A
- Holly K Ober
- Department of Wildlife Ecology and Conservation, University of Florida, Gainesville, FL, U.S.A
- Department of Forest Ecosystems and Society, Oregon State University, Corvallis, OR, U.S.A
4
Couton M, Baud A, Daguin‐Thiébaut C, Corre E, Comtet T, Viard F. High-throughput sequencing on preservative ethanol is effective at jointly examining infraspecific and taxonomic diversity, although bioinformatics pipelines do not perform equally. Ecol Evol 2021; 11:5533-5546. [PMID: 34026027 PMCID: PMC8131761 DOI: 10.1002/ece3.7453] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 01/14/2021] [Revised: 02/17/2021] [Accepted: 03/03/2021] [Indexed: 12/29/2022] Open
Abstract
High-throughput sequencing of amplicons (HTSA) has been proposed as an effective approach to evaluate taxonomic and genetic diversity at the same time. However, there are still uncertainties as to how the results produced by different bioinformatics treatments impact the conclusions drawn on biodiversity and population genetics indices. We evaluated the ability of six bioinformatics pipelines to recover taxonomic and genetic diversity from HTSA data obtained from controlled assemblages. To that end, 20 assemblages were produced using 354 colonies of Botrylloides spp., sampled in the wild in ten marinas around Brittany (France). We used DNA extracted from preservative ethanol (ebDNA) after various storage times (3, 6, and 12 months), and from a bulk of preserved specimens (bulkDNA). DNA was amplified with primers designed to target this ascidian genus. Results obtained from HTSA data were compared with Sanger sequencing on individual zooids (i.e., individual barcoding). Species identification and relative abundance determined with HTSA data from either ebDNA or bulkDNA were similar to those obtained with traditional individual barcoding. However, after 12 months of storage, the correlation between HTSA and individual-based data was lower than after shorter durations. The six bioinformatics pipelines were able to depict accurately the genetic diversity using standard population genetics indices (HS and FST), despite producing false positives and missing rare haplotypes. However, they did not perform equally and dada2 was the only pipeline able to retrieve all expected haplotypes. This study showed that ebDNA is a nondestructive alternative for both species identification and haplotype recovery, provided storage does not last more than 6 months before DNA extraction. Choosing the bioinformatics pipeline is a matter of compromise, aiming to retrieve all true haplotypes while avoiding false positives. We recommend processing HTSA data with dada2, including a chimera-removal step. Although the use of multiplexed primer sets deserves further investigation to expand taxonomic coverage in future similar studies, we showed that primers targeting a particular genus allowed this genus to be reliably analyzed within a complex community.
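The two population genetics indices named in this abstract (HS and FST) can be illustrated with a minimal sketch. It assumes the simplest textbook formulation, haplotype diversity H = 1 - Σp² without sample-size correction and FST = (HT - HS) / HT; the function names are hypothetical and this is not necessarily the estimator the authors used.

```python
from collections import Counter

def gene_diversity(haplotypes):
    """Haplotype diversity H = 1 - sum(p_i^2), with no sample-size correction."""
    n = len(haplotypes)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(haplotypes).values())

def fst(populations):
    """FST = (HT - HS) / HT.

    HS is the mean within-population diversity and HT the diversity of the
    pooled sample (assumption: populations are weighted equally for HS).
    """
    hs = sum(gene_diversity(p) for p in populations) / len(populations)
    pooled = [h for p in populations for h in p]
    ht = gene_diversity(pooled)
    return 0.0 if ht == 0 else (ht - hs) / ht
```

Each population is passed as a list of haplotype labels, one per individual (or per read, if read counts are used as a proxy).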
Affiliation(s)
- Marjorie Couton
- Sorbonne Université, CNRS, UMR 7144, Station Biologique de Roscoff, Roscoff, France
- Aurélien Baud
- Sorbonne Université, CNRS, UMR 7144, Station Biologique de Roscoff, Roscoff, France
- Erwan Corre
- Sorbonne Université, CNRS, FR 2424, Station Biologique de Roscoff, Roscoff, France
- Thierry Comtet
- Sorbonne Université, CNRS, UMR 7144, Station Biologique de Roscoff, Roscoff, France
- Frédérique Viard
- Sorbonne Université, CNRS, UMR 7144, Station Biologique de Roscoff, Roscoff, France
- ISEM, Univ Montpellier, CNRS, EPHE, IRD, Montpellier, France
5
Turon X, Antich A, Palacín C, Præbel K, Wangensteen OS. From metabarcoding to metaphylogeography: separating the wheat from the chaff. Ecol Appl 2020; 30:e02036. [PMID: 31709684 PMCID: PMC7078904 DOI: 10.1002/eap.2036] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Received: 05/13/2019] [Revised: 07/31/2019] [Accepted: 10/03/2019] [Indexed: 05/31/2023]
Abstract
Metabarcoding is by now a well-established method for biodiversity assessment in terrestrial, freshwater, and marine environments. Metabarcoding data sets are usually used for α- and β-diversity estimates, that is, interspecies (or inter-MOTU [molecular operational taxonomic unit]) patterns. However, the use of hypervariable metabarcoding markers may provide an enormous amount of intraspecies (intra-MOTU) information, mostly untapped so far. The use of cytochrome oxidase (COI) amplicons is gaining momentum in metabarcoding studies targeting eukaryote richness. COI has long been the marker of choice in population genetics and phylogeographic studies. Therefore, COI metabarcoding data sets may be used to study intraspecies patterns and phylogeographic features for hundreds of species simultaneously, opening a new field that we suggest naming metaphylogeography. The main challenge for the implementation of this approach is the separation of erroneous sequences from true intra-MOTU variation. Here, we develop a cleaning protocol based on changes in entropy of the different codon positions of the COI sequence, together with co-occurrence patterns of sequences. Using a data set of community DNA from several benthic littoral communities in the Mediterranean and Atlantic seas, we first tested by simulation on a subset of sequences a two-step cleaning approach consisting of a denoising step followed by a minimal abundance filtering. The procedure was then applied to the whole data set. We obtained a total of 563 MOTUs that were usable for phylogeographic inference. We used semiquantitative rank data instead of read abundances to perform AMOVAs and haplotype networks. Genetic variability was mainly concentrated within samples, but with an important between-seas component as well. There were intergroup differences in the amount of variability between and within communities in each sea. For two species, the results could be compared with traditional Sanger sequence data available for the same zones, giving similar patterns. Our study shows that metabarcoding data can be used to infer intra- and interpopulation genetic variability of many species at a time, providing a new method with great potential for basic biogeography, connectivity and dispersal studies, and for the more applied fields of conservation genetics, invasion genetics, and design of protected areas.
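The idea of comparing entropy across codon positions can be sketched as follows. This is only an illustration under stated assumptions, not the authors' cleaning protocol: sequences are assumed to be aligned, of equal length, and in a known reading frame, and the function names and the `frame_offset` parameter are hypothetical.

```python
import math
from collections import Counter

def column_entropy(column):
    """Shannon entropy (bits) of the nucleotides observed in one alignment column."""
    n = len(column)
    return sum(-(c / n) * math.log2(c / n) for c in Counter(column).values())

def entropy_by_codon_position(seqs, frame_offset=0):
    """Mean column entropy for codon positions 1, 2 and 3.

    frame_offset is the codon position (0-based) of the first alignment column.
    """
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned and of equal length")
    totals, counts = [0.0, 0.0, 0.0], [0, 0, 0]
    for i, column in enumerate(zip(*seqs)):
        pos = (i + frame_offset) % 3
        totals[pos] += column_entropy(column)
        counts[pos] += 1
    return [t / c if c else 0.0 for t, c in zip(totals, counts)]
```

In a protein-coding marker such as COI, genuine variation is expected to concentrate at the third codon position, whereas random sequencing errors should be spread more evenly across the three positions; a per-position entropy profile makes that contrast visible.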
Affiliation(s)
- Xavier Turon
- Department of Marine Ecology, Centre for Advanced Studies of Blanes (CEAB, CSIC), Blanes, Catalonia, Spain
- Adrià Antich
- Department of Marine Ecology, Centre for Advanced Studies of Blanes (CEAB, CSIC), Blanes, Catalonia, Spain
- Creu Palacín
- Department of Evolutionary Biology, Ecology and Environmental Sciences, and Institute of Biodiversity Research (IRBio), University of Barcelona, Barcelona, Catalonia, Spain
- Kim Præbel
- Norwegian College of Fishery Science, UiT the Arctic University of Norway, Tromsø, Norway
6
Corse E, Tougard C, Archambaud‐Suard G, Agnèse J, Messu Mandeng FD, Bilong Bilong CF, Duneau D, Zinger L, Chappaz R, Xu CC, Meglécz E, Dubut V. One-locus-several-primers: A strategy to improve the taxonomic and haplotypic coverage in diet metabarcoding studies. Ecol Evol 2019; 9:4603-4620. [PMID: 31031930 PMCID: PMC6476781 DOI: 10.1002/ece3.5063] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Received: 08/29/2018] [Revised: 02/19/2019] [Accepted: 02/25/2019] [Indexed: 12/19/2022] Open
Abstract
In diet metabarcoding analyses, insufficient taxonomic coverage of PCR primer sets generates false negatives that may dramatically distort biodiversity estimates. In this paper, we investigated the taxonomic coverage and complementarity of three cytochrome c oxidase subunit I gene (COI) primer sets based on in silico analyses and we conducted an in vivo evaluation using fecal and spider web samples from different invertivores, environments, and geographic locations. Our results underline the lack of predictability of both the coverage and complementarity of individual primer sets: (a) sharp discrepancies were observed between in silico and in vivo analyses (to the detriment of in silico analyses); (b) both coverage and complementarity depend greatly on the predator and on the taxonomic level at which prey are considered; (c) primer sets' complementarity is the greatest at fine taxonomic levels (molecular operational taxonomic units [MOTUs] and variants). We then formalized the "one-locus-several-primer-sets" (OLSP) strategy, that is, the use of several primer sets that target the same locus (here the first part of the COI gene) and the same group of taxa (here invertebrates). The proximal aim of the OLSP strategy is to minimize false negatives by increasing total coverage through multiple primer sets. We illustrate that the OLSP strategy is especially relevant from this perspective since distinct variants within the same MOTUs were not equally detected across all primer sets. Furthermore, the OLSP strategy produces largely overlapping and comparable sequences, which cannot be achieved when targeting different loci. This facilitates the use of haplotypic diversity information contained within metabarcoding datasets, for example, for phylogeography and finer analyses of prey-predator interactions.
Affiliation(s)
- Emmanuel Corse
- Aix Marseille Univ, Avignon Univ, CNRS, IRD, IMBE, Marseille, France
- Agence de Recherche pour la Biodiversité à la Réunion (ARBRE), Saint‐Leu, La Réunion, France
- Françoise D. Messu Mandeng
- Laboratory of Parasitology and Ecology, Department of Animal Biology and Physiology, University of Yaoundé I, Yaoundé, Cameroon
- Charles F. Bilong Bilong
- Laboratory of Parasitology and Ecology, Department of Animal Biology and Physiology, University of Yaoundé I, Yaoundé, Cameroon
- David Duneau
- Université Toulouse 3 Paul Sabatier, CNRS, ENSFEA, EDB (Laboratoire Évolution & Diversité Biologique), Toulouse, France
- Lucie Zinger
- Institut de Biologie de l'Ecole Normale Supérieure (IBENS), Ecole Normale Supérieure, CNRS, INSERM, PSL Research University, Paris, France
- Rémi Chappaz
- Irstea, Aix Marseille Univ, RECOVER, Aix‐en‐Provence, France
- Charles C.Y. Xu
- Redpath Museum and Department of Biology, McGill University, Montréal, Quebec, Canada
- Emese Meglécz
- Aix Marseille Univ, Avignon Univ, CNRS, IRD, IMBE, Marseille, France
- Vincent Dubut
- Aix Marseille Univ, Avignon Univ, CNRS, IRD, IMBE, Marseille, France
7
Elbrecht V, Vamos EE, Steinke D, Leese F. Estimating intraspecific genetic diversity from community DNA metabarcoding data. PeerJ 2018; 6:e4644. [PMID: 29666773 PMCID: PMC5896493 DOI: 10.7717/peerj.4644] [Citation(s) in RCA: 70] [Impact Index Per Article: 10.0] [Received: 02/10/2018] [Accepted: 03/28/2018] [Indexed: 12/18/2022] Open
Abstract
Background: DNA metabarcoding is used to generate species composition data for entire communities. However, sequencing errors in high-throughput sequencing instruments are fairly common, usually requiring reads to be clustered into operational taxonomic units (OTUs), losing information on intraspecific diversity in the process. While Cytochrome c oxidase subunit I (COI) haplotype information is limited in resolving intraspecific diversity, it is nevertheless often useful, e.g. in a phylogeographic context, helping to formulate hypotheses on taxon distribution and dispersal. Methods: This study combines sequence denoising strategies, normally applied in microbial research, with additional abundance-based filtering to extract haplotype information from freshwater macroinvertebrate metabarcoding datasets. This novel approach was added to the R package "JAMP" and can be applied to COI amplicon datasets. We tested our haplotyping method by sequencing (i) a single-species mock community composed of 31 individuals with 15 different haplotypes spanning three orders of magnitude in biomass and (ii) 18 monitoring samples each amplified with four different primer sets and two PCR replicates. Results: We detected all 15 haplotypes of the single specimens in the mock community with relaxed filtering and denoising settings. However, up to 480 additional unexpected haplotypes remained in both replicates. Rigorous filtering removes most unexpected haplotypes, but also can discard expected haplotypes mainly from the small specimens. In the monitoring samples, the different primer sets detected 177-200 OTUs, each containing an average of 2.40-3.30 haplotypes per OTU. The derived intraspecific diversity data showed population structures that were consistent between replicates and similar between primer pairs but resolution depended on the primer length. A closer look at abundant taxa in the dataset revealed various population genetic patterns, e.g. the stonefly Taeniopteryx nebulosa and the caddisfly Hydropsyche pellucidula showed a distinct north-south cline with respect to haplotype distribution, while the beetle Oulimnius tuberculatus and the isopod Asellus aquaticus displayed no clear population pattern but differed in genetic diversity. Discussion: We developed a strategy to infer intraspecific genetic diversity from bulk invertebrate metabarcoding data. It needs to be stressed that at this point this metabarcoding-informed haplotyping is not capable of capturing the full diversity present in such samples, due to variation in specimen size, primer bias and loss of sequence variants with low abundance. Nevertheless, for a high number of species intraspecific diversity was recovered, identifying potentially isolated populations and taxa for further more detailed phylogeographic investigation. While we are currently lacking large-scale metabarcoding datasets to fully take advantage of our new approach, metabarcoding-informed haplotyping holds great promise for biomonitoring efforts that not only seek information about species diversity but also underlying genetic diversity.
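The abundance-based filtering step mentioned in the Methods can be sketched in a few lines. This is a generic illustration, not the JAMP implementation: the function name, the input layout (haplotype label mapped to read count within one sample), and the 1% default threshold are all assumptions made for the example.

```python
def filter_haplotypes(read_counts, min_rel_abundance=0.01):
    """Keep haplotypes whose share of the sample's reads meets the threshold.

    read_counts: dict mapping haplotype label -> read count in one sample.
    min_rel_abundance: hypothetical cutoff (fraction of total reads).
    """
    total = sum(read_counts.values())
    if total == 0:
        return {}
    return {
        hap: count
        for hap, count in read_counts.items()
        if count / total >= min_rel_abundance
    }
```

The trade-off described in the Results shows up directly in the threshold: raising it removes more unexpected haplotypes but also drops genuine haplotypes that contribute few reads, such as those from small specimens.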
Affiliation(s)
- Vasco Elbrecht
- Aquatic Ecosystem Research, University of Duisburg-Essen, Essen, North Rhine-Westphalia, Germany
- Centre for Biodiversity Genomics, University of Guelph, Guelph, ON, Canada
- Ecaterina Edith Vamos
- Aquatic Ecosystem Research, University of Duisburg-Essen, Essen, North Rhine-Westphalia, Germany
- Dirk Steinke
- Centre for Biodiversity Genomics, University of Guelph, Guelph, ON, Canada
- Florian Leese
- Aquatic Ecosystem Research, University of Duisburg-Essen, Essen, North Rhine-Westphalia, Germany
- Centre for Water and Environmental Research (ZWU) Essen, University of Duisburg-Essen, Essen, North Rhine-Westphalia, Germany