1
|
Morsink MC, van Schaik EN, Bossers K, Duijker DA, Speksnijder AGCL. Metagenomics education in a modular CURE format positively affects students' scientific discovery perception and data analytical skills. BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION : A BIMONTHLY PUBLICATION OF THE INTERNATIONAL UNION OF BIOCHEMISTRY AND MOLECULAR BIOLOGY 2025; 53:311-320. [PMID: 39876557 DOI: 10.1002/bmb.21888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Revised: 12/24/2024] [Accepted: 01/17/2025] [Indexed: 01/30/2025]
Abstract
Targeted metagenomics is a rapidly expanding technology to analyze complex biological samples and genetic monitoring of environmental samples. In this research field, data analytical aspects play a crucial role. In order to teach targeted metagenomics data analysis, we developed a 4-week inquiry-driven modular course-based undergraduate research experience (mCURE) using publicly available Australian coral microbiome DNA sequencing data and associated metadata. Since an enormous amount of metadata was provided alongside the DNA sequencing data, groups of students were able to develop their own authentic research questions. Throughout the course, the student groups worked on these research questions and were supported with bioinformatics and statistics lessons. Additionally, practical aspects of data collection and analysis were addressed during hands-on field work on a nearby Dutch beach. Evaluation of the course indicated that the majority of students (1) achieved the intended metagenomics-based learning outcomes and (2) experienced scientific discovery while working on their research projects. In conclusion, the huge amount of data and metadata available in the coral microbiome data set facilitated the development of a strongly inquiry-driven course. Different groups of students were able to develop and conduct their own distinct microbiome research projects and our current mCURE format positively affected students' metagenomics data analytical skills and scientific discovery perception.
Collapse
Affiliation(s)
- M C Morsink
- Research Group of Environmental Metagenomics, Leiden Centre for Applied Bioscience, Leiden University of Applied Sciences, Leiden, Netherlands
| | - E N van Schaik
- Research Group of Environmental Metagenomics, Leiden Centre for Applied Bioscience, Leiden University of Applied Sciences, Leiden, Netherlands
| | - K Bossers
- Research Group of Environmental Metagenomics, Leiden Centre for Applied Bioscience, Leiden University of Applied Sciences, Leiden, Netherlands
| | - D A Duijker
- Research Group of Environmental Metagenomics, Leiden Centre for Applied Bioscience, Leiden University of Applied Sciences, Leiden, Netherlands
| | - A G C L Speksnijder
- Research Group of Environmental Metagenomics, Leiden Centre for Applied Bioscience, Leiden University of Applied Sciences, Leiden, Netherlands
| |
Collapse
|
2
|
Daly SE, Feng J, Daeschel D, Kovac J, Snyder AB. The choice of 16S rRNA gene sequence analysis impacted characterization of highly variable surface microbiota in dairy processing environments. mSystems 2024; 9:e0062024. [PMID: 39431865 PMCID: PMC11575208 DOI: 10.1128/msystems.00620-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Accepted: 09/17/2024] [Indexed: 10/22/2024] Open
Abstract
Accurate knowledge of the microbiota collected from surfaces in food processing environments is important for food quality and safety. This study assessed discrepancies in taxonomic composition and alpha and beta diversity values generated from eight different bioinformatic workflows for the analysis of 16S rRNA gene sequences extracted from the microbiota collected from surfaces in dairy processing environments. We found that the microbiota collected from environmental surfaces varied widely in density (0-9.09 log10 CFU/cm2) and Shannon alpha diversity (0.01-3.40). Consequently, depending on the sequence analysis method used, characterization of low-abundance genera (i.e., below 1% relative abundance) and the number of genera identified (114-173 genera) varied considerably. Some low-abundance genera, including Listeria, varied between the amplicon sequence variant (ASV) and operational taxonomic unit (OTU) methods. Centered log-ratio transformation inflated alpha and beta diversity values compared to rarefaction. Furthermore, the ASV method also inflated alpha and beta diversity values compared to the OTU method (P < 0.05). Therefore, for sparse, uneven, low-density data sets, the OTU method and rarefaction are better for taxonomic and ecological characterization of surface microbiota.IMPORTANCECulture-dependent environmental monitoring programs are used by the food industry to identify foodborne pathogens and spoilage biota on surfaces in food processing environments. The use of culture-independent 16S rRNA amplicon sequencing to characterize this surface microbiota has been proposed as a tool to enhance environmental monitoring. However, there is no consensus on the most suitable bioinformatic analyses to accurately capture the diverse levels and types of bacteria on surfaces in food processing environments. Here, we quantify the impact of different bioinformatic analyses on the results and interpretation of 16S rRNA amplicon sequences collected from three cultured dairy facilities in New York State. This study provides guidance for the selection of appropriate 16S rRNA analysis procedures for studying environmental microbiota in dairy processing environments.
Collapse
Affiliation(s)
- Sarah E Daly
- Department of Food Science, Cornell University, Ithaca, New York, USA
| | - Jingzhang Feng
- Department of Food Science, Cornell University, Ithaca, New York, USA
| | - Devin Daeschel
- Department of Food Science, Cornell University, Ithaca, New York, USA
| | - Jasna Kovac
- Department of Food Science, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Abigail B Snyder
- Department of Food Science, Cornell University, Ithaca, New York, USA
| |
Collapse
|
3
|
Pipes L, Nielsen R. A rapid phylogeny-based method for accurate community profiling of large-scale metabarcoding datasets. eLife 2024; 13:e85794. [PMID: 39145536 PMCID: PMC11377034 DOI: 10.7554/elife.85794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 08/14/2024] [Indexed: 08/16/2024] Open
Abstract
Environmental DNA (eDNA) is becoming an increasingly important tool in diverse scientific fields from ecological biomonitoring to wastewater surveillance of viruses. The fundamental challenge in eDNA analyses has been the bioinformatical assignment of reads to taxonomic groups. It has long been known that full probabilistic methods for phylogenetic assignment are preferable, but unfortunately, such methods are computationally intensive and are typically inapplicable to modern next-generation sequencing data. We present a fast approximate likelihood method for phylogenetic assignment of DNA sequences. Applying the new method to several mock communities and simulated datasets, we show that it identifies more reads at both high and low taxonomic levels more accurately than other leading methods. The advantage of the method is particularly apparent in the presence of polymorphisms and/or sequencing errors and when the true species is not represented in the reference database.
Collapse
Affiliation(s)
- Lenore Pipes
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Rasmus Nielsen
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
- GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
4
|
Yu X, Gu C, Guo X, Guo R, Zhu L, Qiu X, Chai J, Liu F, Feng Z. Dynamic changes of microbiota and metabolite of traditional Hainan dregs vinegar during fermentation based on metagenomics and metabolomics. Food Chem 2024; 444:138641. [PMID: 38325080 DOI: 10.1016/j.foodchem.2024.138641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 01/13/2024] [Accepted: 01/29/2024] [Indexed: 02/09/2024]
Abstract
Hainan dregs vinegar (HNDV) is a traditional fermented food in China that is renowned for its unique flavor. HNDV is one of the most popular vinegars in Southeast Asia. However, research on the microorganisms and characteristic metabolites specific to HNDV is lacking. This study investigated the changes in microbial succession, volatile flavor compounds and characteristic non-volatile flavor compounds during HNDV fermentation based on metagenomics and metabolomics. The predominant microbial genera were Lactococcus, Limosilactobacillus, Lactiplantibacillus, and Saccharomyces. Unlike traditional vinegar, l-lactic acid was identified as the primary organic acid in HNDV. Noteworthy flavor compounds specific to HNDV included 3-methylthiopropanol and dl-phenylalanine. Significant associations were observed between six predominant microorganisms and six characteristic volatile flavor compounds, as well as seven characteristic non-volatile flavor compounds. The present results contribute to the development of starter cultures and the enhancement of HNDV quality.
Collapse
Affiliation(s)
- Xiaohan Yu
- Key Laboratory of Dairy Science, Ministry of Education, College of Food Science, Northeast Agricultural University, Harbin 150030, China
| | - Chunhe Gu
- Spice and Beverage Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wanning 571533, China
| | - Xiaoxue Guo
- Key Laboratory of Dairy Science, Ministry of Education, College of Food Science, Northeast Agricultural University, Harbin 150030, China
| | - Ruijia Guo
- Key Laboratory of Dairy Science, Ministry of Education, College of Food Science, Northeast Agricultural University, Harbin 150030, China
| | - Lin Zhu
- Key Laboratory of Dairy Science, Ministry of Education, College of Food Science, Northeast Agricultural University, Harbin 150030, China
| | - Xinrong Qiu
- Key Laboratory of Dairy Science, Ministry of Education, College of Food Science, Northeast Agricultural University, Harbin 150030, China
| | - Jun Chai
- Key Laboratory of Dairy Science, Ministry of Education, College of Food Science, Northeast Agricultural University, Harbin 150030, China
| | - Fei Liu
- Key Laboratory of Dairy Science, Ministry of Education, College of Food Science, Northeast Agricultural University, Harbin 150030, China.
| | - Zhen Feng
- Spice and Beverage Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wanning 571533, China.
| |
Collapse
|
5
|
Nayman EI, Schwartz BA, Polanco FC, Firek AK, Gumabong AC, Hofstee NJ, Narasimhan G, Cickovski T, Mathee K. Microbiome depiction through user-adapted bioinformatic pipelines and parameters. J Med Microbiol 2023; 72. [PMID: 37823280 DOI: 10.1099/jmm.0.001756] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023] Open
Abstract
Introduction. The role of the microbiome in health and disease continues to be increasingly recognized. However, there is significant variability in the bioinformatic protocols for analysing genomic data. This, in part, has impeded the potential incorporation of microbiomics into the clinical setting and has challenged interstudy reproducibility. In microbial compositional analysis, there is a growing recognition for the need to move away from a one-size-fits-all approach to data processing.Gap Statement. Few evidence-based recommendations exist for setting parameters of programs that infer microbiota community profiles despite these parameters significantly impacting the accuracy of taxonomic inference.Aim. To compare three commonly used programs (DADA2, QIIME2, and mothur) and optimize them into four user-adapted pipelines for processing paired-end amplicon reads. We aim to increase the accuracy of compositional inference and help standardize microbiomic protocol.Methods. Two key parameters were isolated across four pipelines: filtering sequence reads based on a whole-number error threshold (maxEE) and truncating read ends based on a quality score threshold (QTrim). Closeness of sample inference was then evaluated using a mock community of known composition.Results. We observed that raw genomic data lost were proportionate to how stringently parameters were set. Exactly how much data were lost varied by pipeline. Accuracy of sample inference correlated with increased sequence read retention. Falsely detected taxa and unaccounted for microbial constituents were unique to pipeline and parameter. Implementation of optimized parameter values led to better approximation of the known mock community.Conclusions. Microbial compositions generated based on the 16S rRNA marker gene should be interpreted with caution. To improve microbial community profiling, bioinformatic protocols must be user-adapted. Analysis should be performed with consideration for the select target amplicon, pipelines and parameters used, and taxa of interest.
Collapse
Affiliation(s)
- Eric I Nayman
- Department of Human and Molecular Genetics, Herbert Wertheim College of Medicine, Florida International University, Miami, FL, USA
- Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL, USA
| | - Brooke A Schwartz
- Department of Human and Molecular Genetics, Herbert Wertheim College of Medicine, Florida International University, Miami, FL, USA
- Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL, USA
| | - Fantaysia C Polanco
- Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL, USA
| | - Alexandra K Firek
- Translational Glycobiology Institute, Herbert Wertheim College of Medicine, Florida International University, Miami, FL, USA
| | - Alayna C Gumabong
- Department of Human and Molecular Genetics, Herbert Wertheim College of Medicine, Florida International University, Miami, FL, USA
| | - Nolan J Hofstee
- Department of Human and Molecular Genetics, Herbert Wertheim College of Medicine, Florida International University, Miami, FL, USA
| | - Giri Narasimhan
- Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL, USA
- Biomolecular Sciences Institute, Florida International University, Miami, FL, USA
| | - Trevor Cickovski
- Bioinformatics Research Group (BioRG), Knight Foundation School of Computing and Information Sciences, Florida International University, Miami, FL, USA
| | - Kalai Mathee
- Department of Human and Molecular Genetics, Herbert Wertheim College of Medicine, Florida International University, Miami, FL, USA
- Biomolecular Sciences Institute, Florida International University, Miami, FL, USA
| |
Collapse
|
6
|
Liang X, Zhang J, Kim Y, Ho J, Liu K, Keenum I, Gupta S, Davis B, Hepp SL, Zhang L, Xia K, Knowlton KF, Liao J, Vikesland PJ, Pruden A, Heath LS. ARGem: a new metagenomics pipeline for antibiotic resistance genes: metadata, analysis, and visualization. Front Genet 2023; 14:1219297. [PMID: 37811141 PMCID: PMC10558085 DOI: 10.3389/fgene.2023.1219297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Accepted: 09/01/2023] [Indexed: 10/10/2023] Open
Abstract
Antibiotic resistance is of crucial interest to both human and animal medicine. It has been recognized that increased environmental monitoring of antibiotic resistance is needed. Metagenomic DNA sequencing is becoming an attractive method to profile antibiotic resistance genes (ARGs), including a special focus on pathogens. A number of computational pipelines are available and under development to support environmental ARG monitoring; the pipeline we present here is promising for general adoption for the purpose of harmonized global monitoring. Specifically, ARGem is a user-friendly pipeline that provides full-service analysis, from the initial DNA short reads to the final visualization of results. The capture of extensive metadata is also facilitated to support comparability across projects and broader monitoring goals. The ARGem pipeline offers efficient analysis of a modest number of samples along with affordable computational components, though the throughput could be increased through cloud resources, based on the user's configuration. The pipeline components were carefully assessed and selected to satisfy tradeoffs, balancing efficiency and flexibility. It was essential to provide a step to perform short read assembly in a reasonable time frame to ensure accurate annotation of identified ARGs. Comprehensive ARG and mobile genetic element databases are included in ARGem for annotation support. ARGem further includes an expandable set of analysis tools that include statistical and network analysis and supports various useful visualization techniques, including Cytoscape visualization of co-occurrence and correlation networks. The performance and flexibility of the ARGem pipeline is demonstrated with analysis of aquatic metagenomes. The pipeline is freely available at https://github.com/xlxlxlx/ARGem.
Collapse
Affiliation(s)
- Xiao Liang
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Jingyi Zhang
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Yoonjin Kim
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Josh Ho
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Kevin Liu
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Ishi Keenum
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Suraj Gupta
- Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Benjamin Davis
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Shannon L. Hepp
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Liqing Zhang
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Kang Xia
- School of Plant and Environmental Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Katharine F. Knowlton
- Department of Dairy Science, Virginia Polytechnic Institute and State University, Blacksburg, VaA, United States
| | - Jingqiu Liao
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Peter J. Vikesland
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Amy Pruden
- Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| | - Lenwood S. Heath
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States
| |
Collapse
|
7
|
O'Connor LM, O'Connor BA, Lim SB, Zeng J, Lo CH. Integrative multi-omics and systems bioinformatics in translational neuroscience: A data mining perspective. J Pharm Anal 2023; 13:836-850. [PMID: 37719197 PMCID: PMC10499660 DOI: 10.1016/j.jpha.2023.06.011] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 06/20/2023] [Accepted: 06/25/2023] [Indexed: 09/19/2023] Open
Abstract
Bioinformatic analysis of large and complex omics datasets has become increasingly useful in modern day biology by providing a great depth of information, with its application to neuroscience termed neuroinformatics. Data mining of omics datasets has enabled the generation of new hypotheses based on differentially regulated biological molecules associated with disease mechanisms, which can be tested experimentally for improved diagnostic and therapeutic targeting of neurodegenerative diseases. Importantly, integrating multi-omics data using a systems bioinformatics approach will advance the understanding of the layered and interactive network of biological regulation that exchanges systemic knowledge to facilitate the development of a comprehensive human brain profile. In this review, we first summarize data mining studies utilizing datasets from the individual type of omics analysis, including epigenetics/epigenomics, transcriptomics, proteomics, metabolomics, lipidomics, and spatial omics, pertaining to Alzheimer's disease, Parkinson's disease, and multiple sclerosis. We then discuss multi-omics integration approaches, including independent biological integration and unsupervised integration methods, for more intuitive and informative interpretation of the biological data obtained across different omics layers. We further assess studies that integrate multi-omics in data mining which provide convoluted biological insights and offer proof-of-concept proposition towards systems bioinformatics in the reconstruction of brain networks. Finally, we recommend a combination of high dimensional bioinformatics analysis with experimental validation to achieve translational neuroscience applications including biomarker discovery, therapeutic development, and elucidation of disease mechanisms. We conclude by providing future perspectives and opportunities in applying integrative multi-omics and systems bioinformatics to achieve precision phenotyping of neurodegenerative diseases and towards personalized medicine.
Collapse
Affiliation(s)
- Lance M. O'Connor
- College of Biological Sciences, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Blake A. O'Connor
- School of Pharmacy, University of Wisconsin, Madison, WI, 53705, USA
| | - Su Bin Lim
- Department of Biochemistry and Molecular Biology, Ajou University School of Medicine, Suwon, 16499, South Korea
| | - Jialiu Zeng
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, 308232, Singapore
| | - Chih Hung Lo
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, 308232, Singapore
| |
Collapse
|
8
|
Leis ML. An Update on the Ocular Surface Bacterial Microbiota in Small Animals. Vet Clin North Am Small Anim Pract 2023; 53:299-318. [PMID: 36813387 DOI: 10.1016/j.cvsm.2022.10.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
High-throughput sequencing (HTS) techniques have revolutionized the way we understand microbial communities in both research and clinical settings and are bringing new insights into what constitutes a healthy ocular surface (and a diseased one). As more diagnostic laboratories incorporate HTS into their technique repertoire, practitioners can expect this technology to become increasingly accessible for clinical practice, potentially becoming the new standard. However, particularly regarding ophthalmic microbiota, considerable research remains to render HTS accessible and applicable.
Collapse
Affiliation(s)
- Marina L Leis
- Western College of Veterinary Medicine, 52 Campus Drive, Saskatoon, Saskatchewan S7N 5B4, Canada.
| |
Collapse
|
9
|
Sundaray JK, Dixit S, Rather A, Rasal KD, Sahoo L. Aquaculture omics: An update on the current status of research and data analysis. Mar Genomics 2022; 64:100967. [PMID: 35779450 DOI: 10.1016/j.margen.2022.100967] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 05/26/2022] [Accepted: 06/15/2022] [Indexed: 11/28/2022]
Abstract
Aquaculture is the fast-growing agricultural sector and has the ability to meet the growing demand for protein nutritional security for future population. In future aquaculture is going to be the major source of fish proteins as capture fisheries reached at its maximum. However, several challenges need to overcome such as lack of genetically improved strains/varieties, lack of species-specific feed/functional feed, round the year availability of quality fish seed, pollution of ecosystems and increased frequencies of disease occurrence etc. In recent years, the continuous development of high throughput sequencing technology has revolutionized the biological sciences and provided necessary tools. Application of 'omics' in aquaculture research have been successfully used to resolve several productive and reproductive issues and thus ensure its sustainability and profitability. To date, high quality draft genomes of over fifty fish species have been generated and successfully used to develop large number of single nucleotide polymorphism markers (SNPs), marker panels and other genomic resources etc in several aquaculture species. Similarly, transcriptome profiling and miRNAs analysis have been used in aquaculture research to identify key transcripts and expression analysis of candidate genes/miRNAs involved in reproduction, immunity, growth, development, stress toxicology and disease. Metagenome analysis emerged as a promising scientific tool to analyze the complex genomes contained within microbial communities. Metagenomics has been successfully used in the aquaculture sector to identify novel and potential pathogens, antibiotic resistance genes, microbial roles in microcosms, microbial communities forming biofloc, probiotics etc. In the current review, we discussed application of high-throughput technologies (NGS) in the aquaculture sector.
Collapse
Affiliation(s)
- Jitendra Kumar Sundaray
- ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar 751002, Odisha, India
| | - Sangita Dixit
- Centre for Biotechnology, School of Pharmaceutical Sciences, Siksha 'O' Anusandhan University (Deemed to be University), Bhubaneswar 751003, Odisha, India
| | - Ashraf Rather
- Division of Fish Genetics and Biotechnology, College of Fisheries, Sher-e- Kashmir University of Agricultural Science and Technology, Rangil-Ganderbal 190006, Jammu and Kashmir, India
| | - Kiran D Rasal
- Fish Genetics and Biotechnology Division, ICAR-Central Institute of Fisheries Education, Versova, Mumbai 400 061, Maharastra, India
| | - Lakshman Sahoo
- ICAR-Central Institute of Freshwater Aquaculture, Kausalyaganga, Bhubaneswar 751002, Odisha, India.
| |
Collapse
|
10
|
Chiarello M, McCauley M, Villéger S, Jackson CR. Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold. PLoS One 2022; 17:e0264443. [PMID: 35202411 PMCID: PMC8870492 DOI: 10.1371/journal.pone.0264443] [Citation(s) in RCA: 75] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 02/10/2022] [Indexed: 02/07/2023] Open
Abstract
Advances in the analysis of amplicon sequence datasets have introduced a methodological shift in how research teams investigate microbial biodiversity, away from sequence identity-based clustering (producing Operational Taxonomic Units, OTUs) to denoising methods (producing amplicon sequence variants, ASVs). While denoising methods have several inherent properties that make them desirable compared to clustering-based methods, questions remain as to the influence that these pipelines have on the ecological patterns being assessed, especially when compared to other methodological choices made when processing data (e.g. rarefaction) and computing diversity indices. We compared the respective influences of two widely used methods, namely DADA2 (a denoising method) vs. Mothur (a clustering method) on 16S rRNA gene amplicon datasets (hypervariable region v4), and compared such effects to the rarefaction of the community table and OTU identity threshold (97% vs. 99%) on the ecological signals detected. We used a dataset comprising freshwater invertebrate (three Unionidae species) gut and environmental (sediment, seston) communities sampled in six rivers in the southeastern USA. We ranked the respective effects of each methodological choice on alpha and beta diversity, and taxonomic composition. The choice of the pipeline significantly influenced alpha and beta diversities and changed the ecological signal detected, especially on presence/absence indices such as the richness index and unweighted Unifrac. Interestingly, the discrepancy between OTU and ASV-based diversity metrics could be attenuated by the use of rarefaction. The identification of major classes and genera also revealed significant discrepancies across pipelines. Compared to the pipeline's effect, OTU threshold and rarefaction had a minimal impact on all measurements.
Collapse
Affiliation(s)
- Marlène Chiarello
- Department of Biology, University of Mississippi, University, MS, United States of America
| | - Mark McCauley
- Department of Biology, University of Mississippi, University, MS, United States of America
| | - Sébastien Villéger
- MARBEC, University of Montpellier, CNRS, Ifremer, IRD, Montpellier, France
| | - Colin R. Jackson
- Department of Biology, University of Mississippi, University, MS, United States of America
| |
Collapse
|
11
|
Gold Z, Curd EE, Goodwin KD, Choi ES, Frable BW, Thompson AR, Walker HJ, Burton RS, Kacev D, Martz LD, Barber PH. Improving metabarcoding taxonomic assignment: A case study of fishes in a large marine ecosystem. Mol Ecol Resour 2021; 21:2546-2564. [PMID: 34235858 DOI: 10.1111/1755-0998.13450] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 05/25/2021] [Accepted: 06/03/2021] [Indexed: 01/08/2023]
Abstract
DNA metabarcoding is an important tool for molecular ecology. However, its effectiveness hinges on the quality of reference sequence databases and classification parameters employed. Here we evaluate the performance of MiFish 12S taxonomic assignments using a case study of California Current Large Marine Ecosystem fishes to determine best practices for metabarcoding. Specifically, we use a taxonomy cross-validation by identity framework to compare classification performance between a global database comprised of all available sequences and a curated database that only includes sequences of fishes from the California Current Large Marine Ecosystem. We demonstrate that the regional database provides higher assignment accuracy than the comprehensive global database. We also document a tradeoff between accuracy and misclassification across a range of taxonomic cutoff scores, highlighting the importance of parameter selection for taxonomic classification. Furthermore, we compared assignment accuracy with and without the inclusion of additionally generated reference sequences. To this end, we sequenced tissue from 597 species using the MiFish 12S primers, adding 252 species to GenBank's existing 550 California Current Large Marine Ecosystem fish sequences. We then compared species and reads identified from seawater environmental DNA samples using global databases with and without our generated references, and the regional database. The addition of new references allowed for the identification of 16 additional native taxa representing 17.0% of total reads from eDNA samples, including species with vast ecological and economic value. Together these results demonstrate the importance of comprehensive and curated reference databases for effective metabarcoding and the need for locus-specific validation efforts.
Collapse
Affiliation(s)
- Zachary Gold
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, USA
| | - Emily E Curd
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, USA
| | - Kelly D Goodwin
- Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Stationed at Southwest Fisheries Science Center, La Jolla, California, USA
| | - Emma S Choi
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Benjamin W Frable
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Andrew R Thompson
- Southwest Fisheries Science Center, National Oceanic and Atmospheric Administration, La Jolla, California, USA
| | - Harold J Walker
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Ronald S Burton
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Dovi Kacev
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Lucas D Martz
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, California, USA
| | - Paul H Barber
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, California, USA
| |
Collapse
|
12
|
Mathon L, Valentini A, Guérin PE, Normandeau E, Noel C, Lionnet C, Boulanger E, Thuiller W, Bernatchez L, Mouillot D, Dejean T, Manel S. Benchmarking bioinformatic tools for fast and accurate eDNA metabarcoding species identification. Mol Ecol Resour 2021; 21:2565-2579. [PMID: 34002951 DOI: 10.1111/1755-0998.13430] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 05/10/2021] [Accepted: 05/11/2021] [Indexed: 12/01/2022]
Abstract
Bioinformatic analysis of eDNA metabarcoding data is a crucial step toward rigorously assessing biodiversity. Many programs are now available for each step of the required analyses, but their relative abilities at providing fast and accurate species lists have seldom been evaluated. We used simulated mock communities and real fish eDNA metabarcoding data to evaluate the performance of 13 bioinformatic programs and pipelines to retrieve fish occurrence and read abundance using the 12S mt rRNA gene marker. We used four indices to compare the outputs of each program with the simulated samples: sensitivity, F-measure, root-mean-square error (RMSE) on read relative abundances, and execution time. We found marked differences among programs only for the taxonomic assignment step, both in terms of sensitivity, F-measure and RMSE. Running time was highly different between programs for each step. The fastest programs with best indices for each step were assembled into a pipeline. We compared this pipeline to pipelines constructed from existing toolboxes (OBITools, Barque, and QIIME 2). Our pipeline and Barque obtained the best performance for all indices and appear to be better alternatives to highly used pipelines for analysing fish eDNA metabarcoding data when a complete reference database is available. Analysis on real eDNA metabarcoding data also indicated differences for taxonomic assignment and execution time only. This study reveals major differences between programs during the taxonomic assignment step. The choice of algorithm for the taxonomic assignment can have a significant impact on diversity estimates and should be made according to the objectives of the study.
Collapse
Affiliation(s)
- Laetitia Mathon
- CEFE, Univ. Montpellier, CNRS, EPHE-PSL University, IRD, Montpellier, France.,SPYGEN, Savoie Technolac, Le Bourget du Lac, France
| | | | | | - Eric Normandeau
- Université Laval, IBIS (Institut de Biologie Intégrative et des Systèmes), Québec, QC, Canada
| | - Cyril Noel
- IFREMER - IRSI - Service de Bioinformatique (SeBiMER), Plouzané, France
| | - Clément Lionnet
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, Laboratoire d'Ecologie Alpine, Grenoble, France
| | - Emilie Boulanger
- CEFE, Univ. Montpellier, CNRS, EPHE-PSL University, IRD, Montpellier, France.,MARBEC, Univ. Montpellier, CNRS, IRD, Ifremer, Montpellier, France
| | - Wilfried Thuiller
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LECA, Laboratoire d'Ecologie Alpine, Grenoble, France
| | - Louis Bernatchez
- Université Laval, IBIS (Institut de Biologie Intégrative et des Systèmes), Québec, QC, Canada
| | - David Mouillot
- MARBEC, Univ. Montpellier, CNRS, IRD, Ifremer, Montpellier, France.,Institut Universitaire de France, IUF, Paris, France
| | - Tony Dejean
- SPYGEN, Savoie Technolac, Le Bourget du Lac, France
| | - Stéphanie Manel
- CEFE, Univ. Montpellier, CNRS, EPHE-PSL University, IRD, Montpellier, France
| |
Collapse
|
13
|
Hleap JS, Littlefair JE, Steinke D, Hebert PDN, Cristescu ME. Assessment of current taxonomic assignment strategies for metabarcoding eukaryotes. Mol Ecol Resour 2021; 21:2190-2203. [PMID: 33905615 DOI: 10.1111/1755-0998.13407] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 03/08/2021] [Accepted: 04/19/2021] [Indexed: 01/04/2023]
Abstract
The effective use of metabarcoding in biodiversity science has brought important analytical challenges due to the need to generate accurate taxonomic assignments. The assignment of sequences to genus or species level is critical for biodiversity surveys and biomonitoring, but it is particularly challenging as researchers must select the approach that best recovers information on species composition. This study evaluates the performance and accuracy of seven methods in recovering the species composition of mock communities by using COI barcode fragments. The mock communities varied in species number and specimen abundance, while upstream molecular and bioinformatic variables were held constant, and using a set of COI fragments. We evaluated the impact of parameter optimization on the quality of the predictions. Our results indicate that BLAST top hit competes well with more complex approaches if optimized for the mock community under study. For example, the two machine learning methods that were benchmarked proved more sensitive to reference database heterogeneity and completeness than methods based on sequence similarity. The accuracy of assignments was impacted by both species and specimen counts (query compositional heterogeneity) which ultimately influence the selection of appropriate software. We urge researchers to: (i) use realistic mock communities to allow optimization of parameters, regardless of the taxonomic assignment method employed; (ii) carefully choose and curate the reference databases including completeness; and (iii) use QIIME, BLAST or LCA methods, in conjunction with parameter tuning to better assign taxonomy to diverse communities, especially when information on species diversity is lacking for the area under study.
Collapse
Affiliation(s)
- Jose S Hleap
- Department of Biology, McGill University, Montreal, QC, Canada.,SHARCNET, University of Guelph, Guelph, ON, Canada.,Fundacion SQUALUS, Cali, Colombia
| | - Joanne E Littlefair
- Department of Biology, McGill University, Montreal, QC, Canada.,Queen Mary University of London, London, UK
| | - Dirk Steinke
- Centre for Biodiversity Genomics & Department of Integrative Biology, University of Guelph, Guelph, ON, Canada
| | - Paul D N Hebert
- Centre for Biodiversity Genomics & Department of Integrative Biology, University of Guelph, Guelph, ON, Canada
| | | |
Collapse
|
14
|
Osman MA, Neoh HM, Ab Mutalib NS, Chin SF, Mazlan L, Raja Ali RA, Zakaria AD, Ngiu CS, Ang MY, Jamal R. Parvimonas micra, Peptostreptococcus stomatis, Fusobacterium nucleatum and Akkermansia muciniphila as a four-bacteria biomarker panel of colorectal cancer. Sci Rep 2021; 11:2925. [PMID: 33536501 PMCID: PMC7859180 DOI: 10.1038/s41598-021-82465-0] [Citation(s) in RCA: 97] [Impact Index Per Article: 24.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Accepted: 01/14/2021] [Indexed: 02/08/2023] Open
Abstract
Dysbiosis of the gut microbiome has been associated with the pathogenesis of colorectal cancer (CRC). We profiled the microbiome of gut mucosal tissues from 18 CRC patients and 18 non-CRC controls of the UKM Medical Centre (UKMMC), Kuala Lumpur, Malaysia. The results were then validated using a species-specific quantitative PCR in 40 CRC and 20 non-CRC tissues samples from the UMBI-UKMMC Biobank. Parvimonas micra, Fusobacterium nucleatum, Peptostreptococcus stomatis and Akkermansia muciniphila were found to be over-represented in our CRC patients compared to non-CRC controls. These four bacteria markers distinguished CRC from controls (AUROC = 0.925) in our validation cohort. We identified bacteria species significantly associated (cut-off value of > 5 fold abundance) with various CRC demographics such as ethnicity, gender and CRC staging; however, due to small sample size of the discovery cohort, these results could not be further verified in our validation cohort. In summary, Parvimonas micra, Fusobacterium nucleatum, Peptostreptococcus stomatis and Akkermansia muciniphila were enriched in our local CRC patients. Nevertheless, the roles of these bacteria in CRC initiation and progression remains to be investigated.
Collapse
Affiliation(s)
- Muhammad Afiq Osman
- UKM Medical Molecular Biology Institute (UMBI), UKM Medical Centre, Universiti Kebangsaan Malaysia, Jalan Yaa'cob Latiff, Bandar Tun Razak, Cheras, 56000, Kuala Lumpur, Malaysia
| | - Hui-Min Neoh
- UKM Medical Molecular Biology Institute (UMBI), UKM Medical Centre, Universiti Kebangsaan Malaysia, Jalan Yaa'cob Latiff, Bandar Tun Razak, Cheras, 56000, Kuala Lumpur, Malaysia.
| | - Nurul-Syakima Ab Mutalib
- UKM Medical Molecular Biology Institute (UMBI), UKM Medical Centre, Universiti Kebangsaan Malaysia, Jalan Yaa'cob Latiff, Bandar Tun Razak, Cheras, 56000, Kuala Lumpur, Malaysia
| | - Siok-Fong Chin
- UKM Medical Molecular Biology Institute (UMBI), UKM Medical Centre, Universiti Kebangsaan Malaysia, Jalan Yaa'cob Latiff, Bandar Tun Razak, Cheras, 56000, Kuala Lumpur, Malaysia
| | - Luqman Mazlan
- Department of Surgery, Faculty of Medicine, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| | - Raja Affendi Raja Ali
- Department of Medicine, Faculty of Medicine, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| | - Andee Dzulkarnaen Zakaria
- Department of Surgery, Faculty of Medicine, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
- Department of Surgery, School of Medical Sciences, Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia
| | - Chai Soon Ngiu
- Department of Medicine, Faculty of Medicine, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| | - Mia Yang Ang
- UKM Medical Molecular Biology Institute (UMBI), UKM Medical Centre, Universiti Kebangsaan Malaysia, Jalan Yaa'cob Latiff, Bandar Tun Razak, Cheras, 56000, Kuala Lumpur, Malaysia
| | - Rahman Jamal
- UKM Medical Molecular Biology Institute (UMBI), UKM Medical Centre, Universiti Kebangsaan Malaysia, Jalan Yaa'cob Latiff, Bandar Tun Razak, Cheras, 56000, Kuala Lumpur, Malaysia
| |
Collapse
|
15
|
Starke R, Pylro VS, Morais DK. 16S rRNA Gene Copy Number Normalization Does Not Provide More Reliable Conclusions in Metataxonomic Surveys. MICROBIAL ECOLOGY 2021; 81:535-539. [PMID: 32862246 PMCID: PMC7835310 DOI: 10.1007/s00248-020-01586-7] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Accepted: 08/24/2020] [Indexed: 05/11/2023]
Abstract
Sequencing 16S rRNA gene amplicons is the gold standard to uncover the composition of prokaryotic communities. The presence of multiple copies of this gene makes the community abundance data distorted and gene copy normalization (GCN) necessary for correction. Even though GCN of 16S data provided a picture closer to the metagenome before, it should also be compared with communities of known composition due to the fact that library preparation is prone to methodological biases. Here, we process 16S rRNA gene amplicon data from eleven simple mock communities with DADA2 and estimate the impact of GCN. In all cases, the mock community composition derived from the 16S sequencing differs from those expected, and GCN fails to improve the classification for most of the analysed communities. Our approach provides empirical evidence that GCN does not improve the 16S target sequencing analyses in real scenarios. We therefore question the use of GCN for metataxonomic surveys until a more comprehensive catalogue of copy numbers becomes available.
Collapse
Affiliation(s)
- Robert Starke
- Laboratory of Environmental Microbiology, Institute of Microbiology of the Czech Academy of Sciences, Praha, Czech Republic.
| | - Victor Satler Pylro
- Department of Biology, Federal University of Lavras-UFLA, Lavras, Minas Gerais, Brazil
| | - Daniel Kumazawa Morais
- Laboratory of Environmental Microbiology, Institute of Microbiology of the Czech Academy of Sciences, Praha, Czech Republic
- Bioinformatics Core Facility, Institute of Microbiology of the Czech Academy of Sciences, Praha, Czech Republic
| |
Collapse
|
16
|
Bailet B, Apothéloz-Perret-Gentil L, Baričević A, Chonova T, Franc A, Frigerio JM, Kelly M, Mora D, Pfannkuchen M, Proft S, Ramon M, Vasselon V, Zimmermann J, Kahlert M. Diatom DNA metabarcoding for ecological assessment: Comparison among bioinformatics pipelines used in six European countries reveals the need for standardization. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 745:140948. [PMID: 32736102 DOI: 10.1016/j.scitotenv.2020.140948] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 07/09/2020] [Accepted: 07/11/2020] [Indexed: 06/11/2023]
Abstract
Ecological assessment of lakes and rivers using benthic diatom assemblages currently requires considerable taxonomic expertise to identify species using light microscopy. This traditional approach is also time-consuming. Diatom metabarcoding is a promising alternative and there is increasing interest in using this approach for routine assessment. However, until now, analysis protocols for diatom metabarcoding have been developed and optimised by research groups working in isolation. The diversity of existing bioinformatics methods highlights the need for an assessment of the performance and comparability of results of different methods. The aim of this study was to test the correspondence of outputs from six bioinformatics pipelines currently in use for diatom metabarcoding in different European countries. Raw sequence data from 29 biofilm samples were treated by each of the bioinformatics pipelines, five of them using the same curated reference database. The outputs of the pipelines were compared in terms of sequence unit assemblages, taxonomic assignment, biotic index score and ecological assessment outcomes. The three last components were also compared to outputs from traditional light microscopy, which is currently accepted for ecological assessment of phytobenthos, as required by the Water Framework Directive. We also tested the performance of the pipelines on the two DNA markers (rbcL and 18S-V4) that are currently used by the working groups participating in this study. The sequence unit assemblages produced by different pipelines showed significant differences in terms of assigned and unassigned read numbers and sequence unit numbers. When comparing the taxonomic assignments at genus and species level, correspondence of the taxonomic assemblages between pipelines was weak. Most discrepancies were linked to differential detection or quantification of taxa, despite the use of the same reference database. Subsequent calculation of biotic index scores also showed significant differences between approaches, which were reflected in the final ecological assessment. Use of the rbcL marker always resulted in better correlation among molecular datasets and also in results closer to these generated using traditional microscopy. This study shows that decisions made in pipeline design have implications for the dataset's structure and the taxonomic assemblage, which in turn may affect biotic index calculation and ecological assessment. There is a need to define best-practice bioinformatics parameters in order to ensure the best representation of diatom assemblages. Only the use of similar parameters will ensure the compatibility of data from different working groups. The future of diatom metabarcoding for ecological assessment may also lie in the development of new metrics using, for example, presence/absence instead of relative abundance data.
Collapse
Affiliation(s)
- Bonnie Bailet
- Swedish University of Agricultural Sciences, Department of Aquatic Sciences and Assessment, PO Box 7050, SE - 750 07 Uppsala, Sweden.
| | | | - Ana Baričević
- Center for Marine Research, Ruđer Bosˇković Institute, Rovinj, Croatia.
| | - Teofana Chonova
- Research Department for Limnology, Mondsee, Faculty of Biology, University of Innsbruck, Mondsee, Austria; CARRTEL, French National Institute for Agricultural Research (INRA), University of Savoie Mont Blanc, 75 bis avenue de Corzent, 74200 Thonon-les-Bains, France.
| | - Alain Franc
- BioGeCo, French National Institute for Agricultural Research (INRA), 69 route d'Arcachon, 33610 Cesta, France.
| | - Jean-Marc Frigerio
- BioGeCo, French National Institute for Agricultural Research (INRA), 69 route d'Arcachon, 33610 Cesta, France.
| | - Martyn Kelly
- Bowburn Consultancy, 11 Monteigne Drive, Bowburn, Durham DH6 5QB, UK; School of Geography, University of Nottingham, Nottingham NG7 2RD, UK.
| | - Demetrio Mora
- Botanic Garden and Botanical Museum Berlin-Dahlem, Freie Universität Berlin, Königin-Luise-Str. 6-8, 14195 Berlin, Germany.
| | | | - Sebastian Proft
- Botanic Garden and Botanical Museum Berlin-Dahlem, Freie Universität Berlin, Königin-Luise-Str. 6-8, 14195 Berlin, Germany
| | | | - Valentin Vasselon
- AFB, Pôle R&D "ECLA", INRA, UMR CARRTEL, 75bis av. de Corzent - CS 50511, FR-74200 Thonon-les-Bains, France
| | - Jonas Zimmermann
- Botanic Garden and Botanical Museum Berlin-Dahlem, Freie Universität Berlin, Königin-Luise-Str. 6-8, 14195 Berlin, Germany.
| | - Maria Kahlert
- Swedish University of Agricultural Sciences, Department of Aquatic Sciences and Assessment, PO Box 7050, SE - 750 07 Uppsala, Sweden.
| |
Collapse
|
17
|
Straub D, Blackwell N, Langarica-Fuentes A, Peltzer A, Nahnsen S, Kleindienst S. Interpretations of Environmental Microbial Community Studies Are Biased by the Selected 16S rRNA (Gene) Amplicon Sequencing Pipeline. Front Microbiol 2020; 11:550420. [PMID: 33193131 PMCID: PMC7645116 DOI: 10.3389/fmicb.2020.550420] [Citation(s) in RCA: 125] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 10/02/2020] [Indexed: 12/11/2022] Open
Abstract
One of the major methods to identify microbial community composition, to unravel microbial population dynamics, and to explore microbial diversity in environmental samples is high-throughput DNA- or RNA-based 16S rRNA (gene) amplicon sequencing in combination with bioinformatics analyses. However, focusing on environmental samples from contrasting habitats, it was not systematically evaluated (i) which analysis methods provide results that reflect reality most accurately, (ii) how the interpretations of microbial community studies are biased by different analysis methods and (iii) if the most optimal analysis workflow can be implemented in an easy-to-use pipeline. Here, we compared the performance of 16S rRNA (gene) amplicon sequencing analysis tools (i.e., Mothur, QIIME1, QIIME2, and MEGAN) using three mock datasets with known microbial community composition that differed in sequencing quality, species number and abundance distribution (i.e., even or uneven), and phylogenetic diversity (i.e., closely related or well-separated amplicon sequences). Our results showed that QIIME2 outcompeted all other investigated tools in sequence recovery (>10 times fewer false positives), taxonomic assignments (>22% better F-score) and diversity estimates (>5% better assessment), suggesting that this approach is able to reflect the in situ microbial community most accurately. Further analysis of 24 environmental datasets obtained from four contrasting terrestrial and freshwater sites revealed dramatic differences in the resulting microbial community composition for all pipelines at genus level. For instance, at the investigated river water sites Sphaerotilus was only reported when using QIIME1 (8% abundance) and Agitococcus with QIIME1 or QIIME2 (2 or 3% abundance, respectively), but both genera remained undetected when analyzed with Mothur or MEGAN. Since these abundant taxa probably have implications for important biogeochemical cycles (e.g., nitrate and sulfate reduction) at these sites, their detection and semi-quantitative enumeration is crucial for valid interpretations. A high-performance computing conformant workflow was constructed to allow FAIR (Findable, Accessible, Interoperable, and Re-usable) 16S rRNA (gene) amplicon sequence analysis starting from raw sequence files, using the most optimal methods identified in our study. Our presented workflow should be considered for future studies, thereby facilitating the analysis of high-throughput 16S rRNA (gene) sequencing data substantially, while maximizing reliability and confidence in microbial community data analysis.
Collapse
Affiliation(s)
- Daniel Straub
- Microbial Ecology, Center for Applied Geoscience, Department of Geosciences, University of Tübingen, Tübingen, Germany
- Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
| | - Nia Blackwell
- Microbial Ecology, Center for Applied Geoscience, Department of Geosciences, University of Tübingen, Tübingen, Germany
| | - Adrian Langarica-Fuentes
- Microbial Ecology, Center for Applied Geoscience, Department of Geosciences, University of Tübingen, Tübingen, Germany
| | - Alexander Peltzer
- Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
| | - Sven Nahnsen
- Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
| | - Sara Kleindienst
- Microbial Ecology, Center for Applied Geoscience, Department of Geosciences, University of Tübingen, Tübingen, Germany
| |
Collapse
|
18
|
Microbiota composition in bilateral healthy breast tissue and breast tumors. Cancer Causes Control 2020; 31:1027-1038. [PMID: 32844256 DOI: 10.1007/s10552-020-01338-5] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Accepted: 08/13/2020] [Indexed: 12/15/2022]
Abstract
PURPOSE Previous reports suggest that a complex microbiome exists within the female human breast that might contribute to breast cancer etiology. The purpose of this pilot study was to assess the variation in microbiota composition by breast side (left versus right) within individual women and compare the microbiota of normal and breast tumor tissue between women. We aimed to determine whether microbiota composition differs between these groups and whether certain bacterial taxa may be associated with breast tumors. METHODS Bilateral normal breast tissue samples (n = 36) were collected from ten women who received routine mammoplasty procedures. Archived breast tumor samples (n = 10) were obtained from a biorepository. DNA was extracted, amplified, and sequenced. Microbiota data were analyzed using QIIME and RStudio. RESULTS The most abundant phyla in both tumor and normal tissues were Bacteroidetes, Firmicutes, Proteobacteria, and Actinobacteria. There were statistically significant differences in the relative abundance of various bacterial taxa between groups. Alpha diversity (Simpson's index) was significantly higher in normal compared to tumor samples (0.968 vs. 0.957, p = 0.022). Based on unweighted UniFrac measures, breast tumor samples clustered distinctly from normal samples (R2 = 0.130; p = 0.01). Microbiota composition in normal samples clustered within women (R2 = 0.394; p = 0.01) and by breast side (left or right) within a woman (R2 = 0.189; p = 0.03). CONCLUSION Significant differences in diversity between tumor and normal tissue and in composition between women and between breasts of the same woman were identified. These results warrant further research to investigate the relationship between microbiota and breast cancer.
Collapse
|
19
|
Kibegwa FM, Bett RC, Gachuiri CK, Stomeo F, Mujibi FD. A Comparison of Two DNA Metagenomic Bioinformatic Pipelines While Evaluating the Microbial Diversity in Feces of Tanzanian Small Holder Dairy Cattle. BIOMED RESEARCH INTERNATIONAL 2020; 2020:2348560. [PMID: 32382536 PMCID: PMC7195676 DOI: 10.1155/2020/2348560] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 02/17/2020] [Accepted: 02/27/2020] [Indexed: 12/05/2022]
Abstract
Analysis of shotgun metagenomic data generated from next generation sequencing platforms can be done through a variety of bioinformatic pipelines. These pipelines employ different sets of sophisticated bioinformatics algorithms which may affect the results of this analysis. In this study, we compared two commonly used pipelines for shotgun metagenomic analysis: MG-RAST and Kraken 2, in terms of taxonomic classification, diversity analysis, and usability using their primarily default parameters. Overall, the two pipelines detected similar abundance distributions in the three most abundant taxa Proteobacteria, Firmicutes, and Bacteroidetes. Within bacterial domain, 497 genera were identified by both pipelines, while an additional 694 and 98 genera were solely identified by Kraken 2 and MG-RAST, respectively. 933 species were detected by the two algorithms. Kraken 2 solely detected 3550 species, while MG-RAST identified 557 species uniquely. For archaea, Kraken 2 generated 105 and 236 genera and species, respectively, while MG-RAST detected 60 genera and 88 species. 54 genera and 72 species were commonly detected by the two methods. Kraken 2 had a quicker analysis time (~4 hours) while MG-RAST took approximately 2 days per sample. This study revealed that Kraken 2 and MG-RAST generate comparable results and that a reliable high-level overview of sample is generated irrespective of the pipeline selected. However, Kraken 2 generated a more accurate taxonomic identification given the higher number of "Unclassified" reads in MG-RAST. The observed variations at the genus level show that a main restriction is using different databases for classification of the metagenomic data. The results of this research indicate that a more inclusive and representative classification of microbiomes may be achieved through creation of the combined pipelines.
Collapse
Affiliation(s)
| | | | | | - Francesca Stomeo
- Biosciences eastern and central Africa-International Livestock Research Institute (BecA-ILRI) Hub, Nairobi, Kenya
| | | |
Collapse
|
20
|
Vuorio K, Mäki A, Salmi P, Aalto SL, Tiirola M. Consistency of Targeted Metatranscriptomics and Morphological Characterization of Phytoplankton Communities. Front Microbiol 2020; 11:96. [PMID: 32117126 PMCID: PMC7016081 DOI: 10.3389/fmicb.2020.00096] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Accepted: 01/16/2020] [Indexed: 11/13/2022] Open
Abstract
The composition of phytoplankton community is the basis for environmental monitoring and assessment of the ecological status of aquatic ecosystems. Community composition studies of phytoplankton have been based on time-consuming and expertise-demanding light microscopy analyses. Molecular methods have the potential to replace microscopy, but the high copy number variation of ribosomal genes and the lack of universal primers for simultaneous amplification of prokaryotic and eukaryotic genes complicate data interpretation. In this study, we used our previously developed directional primer-independent high-throughput sequencing (HTS) approach to analyze 16S and 18S rRNA community structures. Comparison of 83 boreal lake samples showed that the relative abundances of eukaryotic phytoplankton at class level and prokaryotic cyanobacteria at order level were consistent between HTS and microscopy results. At the genus level, the results had low correspondence, mainly due to lack of sequences in the reference library. HTS was superior to identify genera that are extensively represented in the reference databases but lack specific morphological characteristics. Targeted metatranscriptomics proved to be a feasible method to complement the microscopy analysis. The metatranscriptomics can also be applied without linking the sequences to taxonomy. However, direct indexing of the sequences to their environmental indicator values needs collections of more comprehensive sample sets, as long as the coverage of molecular barcodes of eukaryotic species remains insufficient.
Collapse
Affiliation(s)
- Kristiina Vuorio
- Freshwater Centre, Finnish Environment Institute (SYKE), Helsinki, Finland
| | - Anita Mäki
- Department of Biological and Environmental Sciences, Nanoscience Center, University of Jyväskylä, Jyväskylä, Finland
| | - Pauliina Salmi
- Department of Biological and Environmental Sciences, Nanoscience Center, University of Jyväskylä, Jyväskylä, Finland
| | - Sanni L Aalto
- Department of Biological and Environmental Sciences, Nanoscience Center, University of Jyväskylä, Jyväskylä, Finland
| | - Marja Tiirola
- Department of Biological and Environmental Sciences, Nanoscience Center, University of Jyväskylä, Jyväskylä, Finland
| |
Collapse
|
21
|
Onywera H, Meiring TL. Comparative analyses of Ion Torrent V4 and Illumina V3-V4 16S rRNA gene metabarcoding methods for characterization of cervical microbiota: taxonomic and functional profiling. SCIENTIFIC AFRICAN 2020. [DOI: 10.1016/j.sciaf.2020.e00278] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
|
22
|
Mullis MM, Rambo IM, Baker BJ, Reese BK. Diversity, Ecology, and Prevalence of Antimicrobials in Nature. Front Microbiol 2019; 10:2518. [PMID: 31803148 PMCID: PMC6869823 DOI: 10.3389/fmicb.2019.02518] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Accepted: 10/18/2019] [Indexed: 12/15/2022] Open
Abstract
Microorganisms possess a variety of survival mechanisms, including the production of antimicrobials that function to kill and/or inhibit the growth of competing microorganisms. Studies of antimicrobial production have largely been driven by the medical community in response to the rise in antibiotic-resistant microorganisms and have involved isolated pure cultures under artificial laboratory conditions neglecting the important ecological roles of these compounds. The search for new natural products has extended to biofilms, soil, oceans, coral reefs, and shallow coastal sediments; however, the marine deep subsurface biosphere may be an untapped repository for novel antimicrobial discovery. Uniquely, prokaryotic survival in energy-limited extreme environments force microbial populations to either adapt their metabolism to outcompete or produce novel antimicrobials that inhibit competition. For example, subsurface sediments could yield novel antimicrobial genes, while at the same time answering important ecological questions about the microbial community.
Collapse
Affiliation(s)
- Megan M. Mullis
- Department of Life Sciences, Texas A&M University Corpus Christi, Corpus Christi, TX, United States
| | - Ian M. Rambo
- Department of Marine Science, University of Texas Marine Science Institute, Port Aransas, TX, United States
| | - Brett J. Baker
- Department of Marine Science, University of Texas Marine Science Institute, Port Aransas, TX, United States
| | - Brandi Kiel Reese
- Department of Life Sciences, Texas A&M University Corpus Christi, Corpus Christi, TX, United States
| |
Collapse
|
23
|
Doster E, Rovira P, Noyes NR, Burgess BA, Yang X, Weinroth MD, Linke L, Magnuson R, Boucher C, Belk KE, Morley PS. A Cautionary Report for Pathogen Identification Using Shotgun Metagenomics; A Comparison to Aerobic Culture and Polymerase Chain Reaction for Salmonella enterica Identification. Front Microbiol 2019; 10:2499. [PMID: 31736924 PMCID: PMC6838018 DOI: 10.3389/fmicb.2019.02499] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 10/16/2019] [Indexed: 12/19/2022] Open
Abstract
This study was conducted to compare aerobic culture, polymerase chain reaction (PCR), lateral flow immunoassay (LFI), and shotgun metagenomics for identification of Salmonella enterica in feces collected from feedlot cattle. Samples were analyzed in parallel using all four tests. Results from aerobic culture and PCR were 100% concordant and indicated low S. enterica prevalence (3/60 samples positive). Although low S. enterica prevalence restricted formal statistical comparisons, LFI and deep metagenomic sequencing results were discordant with these results. Specifically, metagenomic analysis using k-mer-based classification against the RefSeq database indicated that 11/60 of samples contained sequence reads that matched to the S. enterica genome and uniquely identified this species of bacteria within the sample. However, further examination revealed that plasmid sequences were often included with bacterial genomic sequence data submitted to NCBI, which can lead to incorrect taxonomic classification. To circumvent this classification problem, we separated all plasmid sequences included in bacterial RefSeq genomes and reassigned them to a unique taxon so that they would not be uniquely associated with specific bacterial species such as S. enterica. Using this revised database and taxonomic structure, we found that only 6/60 samples contained sequences specific for S. enterica, suggesting increased relative specificity. Reads identified as S. enterica in these six samples were further evaluated using BLAST and NCBI's nr/nt database, which identified that only 2/60 samples contained reads exclusive to S. enterica chromosomal genomes. These two samples were culture- and PCR-negative, suggesting that even deep metagenomic sequencing suffers from lower sensitivity and specificity in comparison to more traditional pathogen detection methods. Additionally, no sample reads were taxonomically classified as S. enterica with two other metagenomic tools, Metagenomic Intra-species Diversity Analysis System (MIDAS) and Metagenomic Phylogenetic Analysis 2 (MetaPhlAn2). This study re-affirmed that the traditional techniques of aerobic culture and PCR provide similar results for S. enterica identification in cattle feces. On the other hand, metagenomic results are highly influenced by the classification method and reference database employed. These results highlight the nuances of computational detection of species-level sequences within short-read metagenomic sequence data, and emphasize the need for cautious interpretation of such results.
Collapse
Affiliation(s)
- Enrique Doster
- Department of Microbiology, Immunology and Pathology, Colorado State University, Fort Collins, CO, United States
| | - Pablo Rovira
- Instituto Nacional de Investigacion Agropecuaria, Treinta y Tres, Uruguay
| | - Noelle R. Noyes
- Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, United States
| | - Brandy A. Burgess
- Department of Population Health, University of Georgia, Athens, GA, United States
| | - Xiang Yang
- Department of Animal Science, University of California, Davis, Davis, CA, United States
| | - Margaret D. Weinroth
- Department of Animal Sciences, Colorado State University, Fort Collins, CO, United States
| | - Lyndsey Linke
- Department of Clinical Sciences, Colorado State University, Fort Collins, CO, United States
| | - Roberta Magnuson
- Department of Clinical Sciences, Colorado State University, Fort Collins, CO, United States
| | - Christina Boucher
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL, United States
| | - Keith E. Belk
- Department of Animal Sciences, Colorado State University, Fort Collins, CO, United States
| | - Paul S. Morley
- Veterinary Education, Research, and Outreach Center, West Texas A&M University, Canyon, TX, United States
| |
Collapse
|
24
|
Kaszubinski SF, Pechal JL, Schmidt CJ, Jordan HR, Benbow ME, Meek MH. Evaluating Bioinformatic Pipeline Performance for Forensic Microbiome Analysis *,†,‡. J Forensic Sci 2019; 65:513-525. [PMID: 31657871 DOI: 10.1111/1556-4029.14213] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Revised: 09/01/2019] [Accepted: 09/19/2019] [Indexed: 11/26/2022]
Abstract
Microbial communities have potential evidential utility for forensic applications. However, bioinformatic analysis of high-throughput sequencing data varies widely among laboratories. These differences can potentially affect microbial community composition and downstream analyses. To illustrate the importance of standardizing methodology, we compared analyses of postmortem microbiome samples using several bioinformatic pipelines, varying minimum library size or minimum number of sequences per sample, and sample size. Using the same input sequence data, we found that three open-source bioinformatic pipelines, MG-RAST, mothur, and QIIME2, had significant differences in relative abundance, alpha-diversity, and beta-diversity, despite the same input data. Increasing minimum library size and sample size increased the number of low-abundant and infrequent taxa detected. Our results show that bioinformatic pipeline and parameter choice affect results in important ways. Given the growing potential application of forensic microbiology to the criminal justice system, continued research on standardizing computational methodology will be important for downstream applications.
Collapse
Affiliation(s)
- Sierra F Kaszubinski
- Department of Integrative Biology, Michigan State University, East Lansing, MI, 48824
| | - Jennifer L Pechal
- Department of Entomology, Michigan State University, East Lansing, MI, 48824
| | - Carl J Schmidt
- Wayne County Medical Examiner's Office, Detroit, MI, 48207.,Department of Pathology, University of Michigan, Ann Arbor, MI, 48109
| | - Heather R Jordan
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, 39762
| | - Mark E Benbow
- Department of Entomology, Michigan State University, East Lansing, MI, 48824.,Ecology, Evolutionary Biology and Behavior Program, Michigan State University, East Lansing, MI, 48824.,Michigan Department of Osteopathic Medical Specialties, Michigan State University, East Lansing, MI, 48824
| | - Mariah H Meek
- Department of Integrative Biology, Michigan State University, East Lansing, MI, 48824.,Michigan Department of Osteopathic Medical Specialties, Michigan State University, East Lansing, MI, 48824.,AgBio Research, Michigan State University, East Lansing, MI, 48824
| |
Collapse
|
25
|
The Impact of Bioinformatics Pipelines on Microbiota Studies: Does the Analytical "Microscope" Affect the Biological Interpretation? Microorganisms 2019; 7:microorganisms7100393. [PMID: 31561435 PMCID: PMC6843237 DOI: 10.3390/microorganisms7100393] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 09/24/2019] [Accepted: 09/24/2019] [Indexed: 11/16/2022] Open
Abstract
Targeted metagenomics is the solution of choice to reveal differential microbial profiles (defined by richness, diversity and composition) as part of case-control studies. It is well documented that each data processing step may have the potential to introduce bias in the results. However, selecting a bioinformatics pipeline to analyze high-throughput sequencing data from A to Z remains one of the critical considerations in a case-control microbiota study design. Consequently, the aim of this study was to assess whether the same biological conclusions regarding human gut microbiota composition and diversity could be reached using different bioinformatics pipelines. In this work, we considered four pipelines (mothur, QIIME, kraken and CLARK) with different versions and databases, and examined their impact on the outcome of metagenetic analysis of Ion Torrent 16S sequencing data. We re-analyzed a case-control study evaluating the impact of the colonization of the intestinal protozoa Blastocystis sp. on the human gut microbial profile. Although most pipelines reported the same trends in this case-control study, we demonstrated how the use of different pipelines affects the biological conclusions that can be drawn. Targeted metagenomics must therefore rather be considered as a profiling tool to obtain a broad sense of the variations of the microbiota, rather than an accurate identification tool.
Collapse
|
26
|
Rajan SK, Lindqvist M, Brummer RJ, Schoultz I, Repsilber D. Phylogenetic microbiota profiling in fecal samples depends on combination of sequencing depth and choice of NGS analysis method. PLoS One 2019; 14:e0222171. [PMID: 31527871 PMCID: PMC6748435 DOI: 10.1371/journal.pone.0222171] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Accepted: 08/22/2019] [Indexed: 12/12/2022] Open
Abstract
The human gut microbiota is well established as an important factor in health and disease. Fecal sample microbiota are often analyzed as a proxy for gut microbiota, and characterized with respect to their composition profiles. Modern approaches employ whole genome shotgun next-generation sequencing as the basis for these analyses. Sequencing depth as well as choice of next-generation sequencing data analysis method constitute two main interacting methodological factors for such an approach. In this study, we used 200 million sequence read pairs from one fecal sample for comparing different taxonomy classification methods, using default and custom-made reference databases, at different sequencing depths. A mock community data set with known composition was used for validating the classification methods. Results suggest that sequencing beyond 60 million read pairs does not seem to improve classification. The phylogeny prediction pattern, when using the default databases and the consensus database, appeared to be similar for all three methods. Moreover, these methods predicted rather different species. We conclude that the choice of sequencing depth and classification method has important implications for taxonomy composition prediction. A multi-method-consensus approach for robust gut microbiota NGS analysis is recommended.
Collapse
Affiliation(s)
| | | | | | - Ida Schoultz
- School of Medical Sciences, Örebro University, Örebro, Sweden
| | - Dirk Repsilber
- School of Medical Sciences, Örebro University, Örebro, Sweden
- * E-mail:
| |
Collapse
|
27
|
Mitchell AL, Scheremetjew M, Denise H, Potter S, Tarkowska A, Qureshi M, Salazar GA, Pesseat S, Boland MA, Hunter F, ten Hoopen P, Alako B, Amid C, Wilkinson DJ, Curtis TP, Cochrane G, Finn RD. EBI Metagenomics in 2017: enriching the analysis of microbial communities, from sequence reads to assemblies. Nucleic Acids Res 2019; 46:D726-D735. [PMID: 29069476 PMCID: PMC5753268 DOI: 10.1093/nar/gkx967] [Citation(s) in RCA: 123] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Accepted: 10/12/2017] [Indexed: 01/16/2023] Open
Abstract
EBI metagenomics (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the analysis and archiving of sequence data derived from the microbial populations found in a particular environment. Over the past two years, EBI metagenomics has increased the number of datasets analysed 10-fold. In addition to increased throughput, the underlying analysis pipeline has been overhauled to include both new or updated tools and reference databases. Of particular note is a new workflow for taxonomic assignments that has been extended to include assignments based on both the large and small subunit RNA marker genes and to encompass all cellular micro-organisms. We also describe the addition of metagenomic assembly as a new analysis service. Our pilot studies have produced over 2400 assemblies from datasets in the public domain. From these assemblies, we have produced a searchable, non-redundant protein database of over 50 million sequences. To provide improved access to the data stored within the resource, we have developed a programmatic interface that provides access to the analysis results and associated sample metadata. Finally, we have integrated the results of a series of statistical analyses that provide estimations of diversity and sample comparisons.
Collapse
Affiliation(s)
- Alex L Mitchell
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- To whom correspondence should be addressed. Tel: +44 1223 494652; . Correspondence may also be addressed to Robert D. Finn. Tel: +44 1223 492678;
| | - Maxim Scheremetjew
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Hubert Denise
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Simon Potter
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Aleksandra Tarkowska
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matloob Qureshi
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gustavo A Salazar
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sebastien Pesseat
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Miguel A Boland
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fiona M I Hunter
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Petra ten Hoopen
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Blaise Alako
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Clara Amid
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Darren J Wilkinson
- School of Mathematics & Statistics, Newcastle University, Newcastle upon Tyne NE1 7RU, UK
| | - Thomas P Curtis
- School of Civil Engineering and Geosciences, Newcastle University, Newcastle upon Tyne NE1 7RU, UK
| | - Guy Cochrane
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Robert D Finn
- EMBL-EBI European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- To whom correspondence should be addressed. Tel: +44 1223 494652; . Correspondence may also be addressed to Robert D. Finn. Tel: +44 1223 492678;
| |
Collapse
|
28
|
Breitwieser FP, Lu J, Salzberg SL. A review of methods and databases for metagenomic classification and assembly. Brief Bioinform 2019; 20:1125-1136. [PMID: 29028872 PMCID: PMC6781581 DOI: 10.1093/bib/bbx120] [Citation(s) in RCA: 294] [Impact Index Per Article: 49.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 08/22/2017] [Indexed: 12/13/2022] Open
Abstract
Microbiome research has grown rapidly over the past decade, with a proliferation of new methods that seek to make sense of large, complex data sets. Here, we survey two of the primary types of methods for analyzing microbiome data: read classification and metagenomic assembly, and we review some of the challenges facing these methods. All of the methods rely on public genome databases, and we also discuss the content of these databases and how their quality has a direct impact on our ability to interpret a microbiome sample.
Collapse
Affiliation(s)
| | | | - Steven L Salzberg
- Corresponding author: Steven L. Salzberg, Center for Computational Biology, Johns Hopkins University, 1900 E. Monument St., Baltimore, MD, 21205, USA. E-mail:
| |
Collapse
|
29
|
Abstract
Microbiome research has grown rapidly over the past decade, with a proliferation of new methods that seek to make sense of large, complex data sets. Here, we survey two of the primary types of methods for analyzing microbiome data: read classification and metagenomic assembly, and we review some of the challenges facing these methods. All of the methods rely on public genome databases, and we also discuss the content of these databases and how their quality has a direct impact on our ability to interpret a microbiome sample.
Collapse
|
30
|
Metataxonomics of Tunisian phosphogypsum based on five bioinformatics pipelines: Insights for bioremediation. Genomics 2019; 112:981-989. [PMID: 31220587 DOI: 10.1016/j.ygeno.2019.06.014] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2019] [Accepted: 06/15/2019] [Indexed: 11/23/2022]
Abstract
Phosphogypsum (PG) is an acidic by-product from the phosphate fertilizer industry and it is characterized by a low nutrient availability and the presence of radionuclides and heavy metals which pose a serious problem in its management. Here, we have applied Illumina MiSeq sequencing technology and five bioinformatics pipelines to explore the phylogenetic communities in Tunisian PG. Taking One Codex as a reference method, we present the results of 16S-rDNA-gene-based metataxonomics abundances with four other alternative bioinformatics pipelines (MetaGenome Rapid Annotation using Subsystem Technology (MG-RAST), mothur, MICrobial Community Analysis (MICCA) and Quantitative Insights into Microbial Ecology (QIIME)), when analyzing the Tunisian PG. Importantly, based on 16S rDNA datasets, the functional capabilities of microbial communities of PG were deciphered. They suggested the presence of PG autochthonous bacteria valorizable into (1) removal of radioactive elements and toxic heavy metals, (2) promotion of plant growth, (3) oxidation and (4) reduction of sulfate. These bacteria can be explored further for applications in the bioremediation of by-products, like PG, by different processes.
Collapse
|
31
|
Jagadeesan B, Gerner-Smidt P, Allard MW, Leuillet S, Winkler A, Xiao Y, Chaffron S, Van Der Vossen J, Tang S, Katase M, McClure P, Kimura B, Ching Chai L, Chapman J, Grant K. The use of next generation sequencing for improving food safety: Translation into practice. Food Microbiol 2019; 79:96-115. [PMID: 30621881 PMCID: PMC6492263 DOI: 10.1016/j.fm.2018.11.005] [Citation(s) in RCA: 160] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Revised: 10/27/2018] [Accepted: 11/13/2018] [Indexed: 01/06/2023]
Abstract
Next Generation Sequencing (NGS) combined with powerful bioinformatic approaches are revolutionising food microbiology. Whole genome sequencing (WGS) of single isolates allows the most detailed comparison possible hitherto of individual strains. The two principle approaches for strain discrimination, single nucleotide polymorphism (SNP) analysis and genomic multi-locus sequence typing (MLST) are showing concordant results for phylogenetic clustering and are complementary to each other. Metabarcoding and metagenomics, applied to total DNA isolated from either food materials or the production environment, allows the identification of complete microbial populations. Metagenomics identifies the entire gene content and when coupled to transcriptomics or proteomics, allows the identification of functional capacity and biochemical activity of microbial populations. The focus of this review is on the recent use and future potential of NGS in food microbiology and on current challenges. Guidance is provided for new users, such as public health departments and the food industry, on the implementation of NGS and how to critically interpret results and place them in a broader context. The review aims to promote the broader application of NGS technologies within the food industry as well as highlight knowledge gaps and novel applications of NGS with the aim of driving future research and increasing food safety outputs from its wider use.
Collapse
Affiliation(s)
- Balamurugan Jagadeesan
- Nestlé Research, Nestec Ltd, Route du Jorat 57, Vers-chez-les-Blanc, CH-1000, Lausanne 26, Switzerland.
| | - Peter Gerner-Smidt
- Centers for Disease Control and Prevention, MS-CO-3, 1600 Clifton Road, 30329-4027, Atlanta, USA
| | - Marc W Allard
- US Food and Drug Administration, 5001 Campus Drive, College Park, MD, 02740, USA
| | - Sébastien Leuillet
- Institut Mérieux, Mérieux NutriSciences, 3 route de la Chatterie, 44800, Saint Herblain, France
| | - Anett Winkler
- Cargill Deutschland GmbH, Cerestarstr. 2, 47809, Krefeld, Germany
| | - Yinghua Xiao
- Arla Innovation Center, Agro Food Park 19, 8200, Aarhus, Denmark
| | - Samuel Chaffron
- Laboratoire des Sciences du Numérique de Nantes (LS2N), CNRS UMR 6004 - Université de Nantes, 2 rue de la Houssinière, 44322, Nantes, France
| | - Jos Van Der Vossen
- The Netherlands Organisation for Applied Scientific Research, TNO, Utrechtseweg 48, 3704 HE, Zeist, NL, the Netherlands
| | - Silin Tang
- Mars Global Food Safety Center, Yanqi Economic Development Zone, 101407, Beijing, China
| | - Mitsuru Katase
- Fuji Oil Co., Ltd., Sumiyoshi-cho 1, Izumisano Osaka, 598-8540, Japan
| | - Peter McClure
- Mondelēz International, Linden 3, Bournville Lane, B30 2LU, Birmingham, United Kingdom
| | - Bon Kimura
- Tokyo University of Marine Science & Technology, Konan 4-5-7, Minato-ku, Tokyo, 108-8477, Japan
| | - Lay Ching Chai
- Institute of Biological Sciences, Faculty of Science, University of Malaya, 50603, Kuala Lumpur, Malaysia
| | - John Chapman
- Unilever Research & Development, Postbus, 114, 3130 AC, Vlaardingen, the Netherlands
| | - Kathie Grant
- Gastrointestinal Bacteria Reference Unit, National Infection Service, Public Health England, 61 Colindale Avenue, London, NW9 5EQ, United Kingdom.
| |
Collapse
|
32
|
Effects of Nanoparticles on Plant Growth-Promoting Bacteria in Indian Agricultural Soil. AGRONOMY-BASEL 2019. [DOI: 10.3390/agronomy9030140] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Soil bacteria are some of the key players affecting plant productivity. Soil today is exposed to emerging contaminants like metal engineered nanoparticles. The objective of this study was to evaluate the toxicological effects of silver and zinc oxide nanoparticles on bacteria classified as plant growth-promoting bacteria. Three types of bacteria—nitrogen fixers, phosphate solubilizers, and biofilm formers—were exposed to engineered nanoparticles. Initially, the effect of silver and zinc oxide nanoparticles was determined on pure cultures of the bacteria. These nanoparticles were then applied to soil to assess changes in composition of bacterial communities. Impacts of the nanoparticles were analyzed using Illumina MiSeq sequencing of 16S rRNA genes. In the soil used, relative abundances of the dominant and agriculturally significant phyla, namely, Proteobacteria, Actinobacteria, and Firmicutes, were altered in the presence of silver nanoparticles. Silver nanoparticles changed the abundance of the three phyla by 25 to 45%. Zinc oxide nanoparticles showed negligible effects at the phylum level. Thus, silver nanoparticles may impact bacterial communities in soil, and this in turn may influence processes carried out by soil bacteria.
Collapse
|
33
|
Gardner PP, Watson RJ, Morgan XC, Draper JL, Finn RD, Morales SE, Stott MB. Identifying accurate metagenome and amplicon software via a meta-analysis of sequence to taxonomy benchmarking studies. PeerJ 2019; 7:e6160. [PMID: 30631651 PMCID: PMC6322486 DOI: 10.7717/peerj.6160] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2017] [Accepted: 11/14/2018] [Indexed: 01/26/2023] Open
Abstract
Metagenomic and meta-barcode DNA sequencing has rapidly become a widely-used technique for investigating a range of questions, particularly related to health and environmental monitoring. There has also been a proliferation of bioinformatic tools for analysing metagenomic and amplicon datasets, which makes selecting adequate tools a significant challenge. A number of benchmark studies have been undertaken; however, these can present conflicting results. In order to address this issue we have applied a robust Z-score ranking procedure and a network meta-analysis method to identify software tools that are consistently accurate for mapping DNA sequences to taxonomic hierarchies. Based upon these results we have identified some tools and computational strategies that produce robust predictions.
Collapse
Affiliation(s)
- Paul P Gardner
- Biomolecular Interactions Centre, School of Biological Sciences, University of Canterbury, Christchurch, New Zealand.,Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | - Renee J Watson
- Biomolecular Interactions Centre, School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Xochitl C Morgan
- Department of Microbiology and Immunology, University of Otago, Dunedin, New Zealand
| | - Jenny L Draper
- Institute of Environmental Science and Research, Porirua, New Zealand
| | - Robert D Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | - Sergio E Morales
- Department of Microbiology and Immunology, University of Otago, Dunedin, New Zealand
| | - Matthew B Stott
- Biomolecular Interactions Centre, School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| |
Collapse
|
34
|
Dubey AK, Uppadhyaya N, Nilawe P, Chauhan N, Kumar S, Gupta UA, Bhaduri A. LogMPIE, pan-India profiling of the human gut microbiome using 16S rRNA sequencing. Sci Data 2018; 5:180232. [PMID: 30375992 PMCID: PMC6207063 DOI: 10.1038/sdata.2018.232] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2018] [Accepted: 08/17/2018] [Indexed: 02/06/2023] Open
Abstract
The "Landscape Of Gut Microbiome - Pan-India Exploration", or LogMPIE study, is the first large-scale, nationwide record of the Indian gut microbiome. The primary objective of the study was to identify and map the Indian gut microbiome baseline. This observational study was conducted across 14 geographical locations in India. Enrolled subjects were uniformly distributed across geographies (north, east, west and south) and body mass index (obese and non-obese). Furthermore, factors influencing the microbiome, such as age and physical activity, were also considered in the study design. The LogMPIE study recorded data from 1004 eligible subjects and reported 993 unique microorganisms across the Indian microbiome diaspora. The data not only map the Indian gut microbiome baseline but also function as a useful resource to study, analyse and identify signatures characterizing the physiological dispositions of the subjects. Furthermore, they provide insight into the unique features describing the Indian microbiome. The data are open and may be accessed from the European Nucleotide Archive (ENA) portal of the European Bioinformatics Institute (https://www.ebi.ac.uk/ena/data/view/PRJEB25642).
Collapse
Affiliation(s)
- Ashok Kumar Dubey
- Innovation Center, Tata Chemicals Ltd, Ambedveth, Pune, Maharashtra, 412111, India
| | - Niyati Uppadhyaya
- Innovation Center, Tata Chemicals Ltd, Ambedveth, Pune, Maharashtra, 412111, India
| | - Pravin Nilawe
- Thermo Fisher Scientific, Invitrogen BioServices India Pvt Ltd, Mumbai, Maharashtra, 400076, India
| | - Neeraj Chauhan
- Thermo Fisher Scientific, Life Science Solutions, Gurgaon, Haryana, 122016, India
| | - Santosh Kumar
- JSS Medical Research India Pvt. Ltd, Faridabad, Haryana, 121003, India
| | | | - Anirban Bhaduri
- Innovation Center, Tata Chemicals Ltd, Ambedveth, Pune, Maharashtra, 412111, India
| |
Collapse
|
35
|
Analysis of sequencing strategies and tools for taxonomic annotation: Defining standards for progressive metagenomics. Sci Rep 2018; 8:12034. [PMID: 30104688 PMCID: PMC6089906 DOI: 10.1038/s41598-018-30515-5] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Accepted: 07/24/2018] [Indexed: 12/30/2022] Open
Abstract
Metagenomics research has recently thrived due to DNA sequencing technologies improvement, driving the emergence of new analysis tools and the growth of taxonomic databases. However, there is no all-purpose strategy that can guarantee the best result for a given project and there are several combinations of software, parameters and databases that can be tested. Therefore, we performed an impartial comparison, using statistical measures of classification for eight bioinformatic tools and four taxonomic databases, defining a benchmark framework to evaluate each tool in a standardized context. Using in silico simulated data for 16S rRNA amplicons and whole metagenome shotgun data, we compared the results from different software and database combinations to detect biases related to algorithms or database annotation. Using our benchmark framework, researchers can define cut-off values to evaluate the expected error rate and coverage for their results, regardless the score used by each software. A quick guide to select the best tool, all datasets and scripts to reproduce our results and benchmark any new method are available at https://github.com/Ales-ibt/Metagenomic-benchmark. Finally, we stress out the importance of gold standards, database curation and manual inspection of taxonomic profiling results, for a better and more accurate microbial diversity description.
Collapse
|
36
|
De León-Lorenzana AS, Delgado-Balbuena L, Domínguez-Mendoza CA, Navarro-Noya YE, Luna-Guido M, Dendooven L. Soil Salinity Controls Relative Abundance of Specific Bacterial Groups Involved in the Decomposition of Maize Plant Residues. Front Ecol Evol 2018. [DOI: 10.3389/fevo.2018.00051] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
37
|
Blank C, Easterly C, Gruening B, Johnson J, Kolmeder CA, Kumar P, May D, Mehta S, Mesuere B, Brown Z, Elias JE, Hervey WJ, McGowan T, Muth T, Nunn B, Rudney J, Tanca A, Griffin TJ, Jagtap PD. Disseminating Metaproteomic Informatics Capabilities and Knowledge Using the Galaxy-P Framework. Proteomes 2018; 6:proteomes6010007. [PMID: 29385081 PMCID: PMC5874766 DOI: 10.3390/proteomes6010007] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Revised: 01/26/2018] [Accepted: 01/26/2018] [Indexed: 01/12/2023] Open
Abstract
The impact of microbial communities, also known as the microbiome, on human health and the environment is receiving increased attention. Studying translated gene products (proteins) and comparing metaproteomic profiles may elucidate how microbiomes respond to specific environmental stimuli, and interact with host organisms. Characterizing proteins expressed by a complex microbiome and interpreting their functional signature requires sophisticated informatics tools and workflows tailored to metaproteomics. Additionally, there is a need to disseminate these informatics resources to researchers undertaking metaproteomic studies, who could use them to make new and important discoveries in microbiome research. The Galaxy for proteomics platform (Galaxy-P) offers an open source, web-based bioinformatics platform for disseminating metaproteomics software and workflows. Within this platform, we have developed easily-accessible and documented metaproteomic software tools and workflows aimed at training researchers in their operation and disseminating the tools for more widespread use. The modular workflows encompass the core requirements of metaproteomic informatics: (a) database generation; (b) peptide spectral matching; (c) taxonomic analysis and (d) functional analysis. Much of the software available via the Galaxy-P platform was selected, packaged and deployed through an online metaproteomics "Contribution Fest" undertaken by a unique consortium of expert software developers and users from the metaproteomics research community, who have co-authored this manuscript. These resources are documented on GitHub and freely available through the Galaxy Toolshed, as well as a publicly accessible metaproteomics gateway Galaxy instance. These documented workflows are well suited for the training of novice metaproteomics researchers, through online resources such as the Galaxy Training Network, as well as hands-on training workshops. Here, we describe the metaproteomics tools available within these Galaxy-based resources, as well as the process by which they were selected and implemented in our community-based work. We hope this description will increase access to and utilization of metaproteomics tools, as well as offer a framework for continued community-based development and dissemination of cutting edge metaproteomics software.
Collapse
Affiliation(s)
- Clemens Blank
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg im Breisgau, Germany.
| | - Caleb Easterly
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Bjoern Gruening
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg im Breisgau, Germany.
| | - James Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Carolin A Kolmeder
- Institute of Biotechnology, University of Helsinki, 00014 Helsinki, Finland.
| | - Praveen Kumar
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Damon May
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
| | - Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Bart Mesuere
- Computational Biology Group, Ghent University, Krijgslaan 281, B-9000 Ghent, Belgium.
| | - Zachary Brown
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Joshua E Elias
- Department of Chemical & Systems Biology, Stanford University, Stanford, CA 94305, USA.
| | - W Judson Hervey
- Center for Bio/Molecular Science & Engineering, Naval Research Laboratory, Washington, DC 20375, USA.
| | - Thomas McGowan
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Thilo Muth
- Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, 13353 Berlin, Germany.
| | - Brook Nunn
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
| | - Joel Rudney
- Department of Diagnostic and Biological Sciences, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Alessandro Tanca
- Porto Conte Ricerche Science and Technology Park of Sardinia, 07041 Alghero, Italy.
| | - Timothy J Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA.
| |
Collapse
|
38
|
Gut Dysbiosis and Muscle Aging: Searching for Novel Targets against Sarcopenia. Mediators Inflamm 2018; 2018:7026198. [PMID: 29686533 PMCID: PMC5893006 DOI: 10.1155/2018/7026198] [Citation(s) in RCA: 117] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Revised: 11/28/2017] [Accepted: 12/05/2017] [Indexed: 12/12/2022] Open
Abstract
Advanced age is characterized by several changes, one of which is the impairment of the homeostasis of intestinal microbiota. These alterations critically influence host health and have been associated with morbidity and mortality in older adults. “Inflammaging,” an age-related chronic inflammatory process, is a common trait of several conditions, including sarcopenia. Interestingly, imbalanced intestinal microbial community has been suggested to contribute to inflammaging. Changes in gut microbiota accompanying sarcopenia may be attenuated by supplementation with pre- and probiotics. Although muscle aging has been increasingly recognized as a biomarker of aging, the pathophysiology of sarcopenia is to date only partially appreciated. Due to its development in the context of the age-related inflammatory milieu, several studies favor the hypothesis of a tight connection between sarcopenia and inflammaging. However, conclusive evidence describing the signaling pathways involved has not yet been produced. Here, we review the current knowledge of the changes in intestinal microbiota that occur in advanced age with a special emphasis on findings supporting the idea of a modulation of muscle physiology through alterations in gut microbial composition and activity.
Collapse
|
39
|
Auffret MD, Stewart R, Dewhurst RJ, Duthie CA, Rooke JA, Wallace RJ, Freeman TC, Snelling TJ, Watson M, Roehe R. Identification, Comparison, and Validation of Robust Rumen Microbial Biomarkers for Methane Emissions Using Diverse Bos Taurus Breeds and Basal Diets. Front Microbiol 2018; 8:2642. [PMID: 29375511 PMCID: PMC5767246 DOI: 10.3389/fmicb.2017.02642] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 12/19/2017] [Indexed: 01/04/2023] Open
Abstract
Previous shotgun metagenomic analyses of ruminal digesta identified some microbial information that might be useful as biomarkers to select cattle that emit less methane (CH4), which is a potent greenhouse gas. It is known that methane production (g/kgDMI) and to an extent the microbial community is heritable and therefore biomarkers can offer a method of selecting cattle for low methane emitting phenotypes. In this study a wider range of Bos Taurus cattle, varying in breed and diet, was investigated to determine microbial communities and genetic markers associated with high/low CH4 emissions. Digesta samples were taken from 50 beef cattle, comprising four cattle breeds, receiving two basal diets containing different proportions of concentrate and also including feed additives (nitrate or lipid), that may influence methane emissions. A combination of partial least square analysis and network analysis enabled the identification of the most significant and robust biomarkers of CH4 emissions (VIP > 0.8) across diets and breeds when comparing all potential biomarkers together. Genes associated with the hydrogenotrophic methanogenesis pathway converting carbon dioxide to methane, provided the dominant biomarkers of CH4 emissions and methanogens were the microbial populations most closely correlated with CH4 emissions and identified by metagenomics. Moreover, these genes grouped together as confirmed by network analysis for each independent experiment and when combined. Finally, the genes involved in the methane synthesis pathway explained a higher proportion of variation in CH4 emissions by PLS analysis compared to phylogenetic parameters or functional genes. These results confirmed the reproducibility of the analysis and the advantage to use these genes as robust biomarkers of CH4 emissions. Volatile fatty acid concentrations and ratios were significantly correlated with CH4, but these factors were not identified as robust enough for predictive purposes. Moreover, the methanotrophic Methylomonas genus was found to be negatively correlated with CH4. Finally, this study confirmed the importance of using robust and applicable biomarkers from the microbiome as a proxy of CH4 emissions across diverse production systems and environments.
Collapse
Affiliation(s)
- Marc D. Auffret
- Scotland's Rural College, Future Farming System (FFS), Edinburgh, United Kingdom
| | - Robert Stewart
- Edinburgh Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh, Edinburgh, United Kingdom
| | - Richard J. Dewhurst
- Scotland's Rural College, Future Farming System (FFS), Edinburgh, United Kingdom
| | - Carol-Anne Duthie
- Scotland's Rural College, Future Farming System (FFS), Edinburgh, United Kingdom
| | - John A. Rooke
- Scotland's Rural College, Future Farming System (FFS), Edinburgh, United Kingdom
| | - Robert J. Wallace
- Rowett Institute of Nutrition and Health, University of Aberdeen, Aberdeen, United Kingdom
| | - Tom C. Freeman
- Division of Genetics and Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh, Edinburgh, United Kingdom
| | - Timothy J. Snelling
- Rowett Institute of Nutrition and Health, University of Aberdeen, Aberdeen, United Kingdom
| | - Mick Watson
- Edinburgh Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh, Edinburgh, United Kingdom
- Division of Genetics and Genomics, The Roslin Institute and R(D)SVS, University of Edinburgh, Edinburgh, United Kingdom
| | - Rainer Roehe
- Scotland's Rural College, Future Farming System (FFS), Edinburgh, United Kingdom
| |
Collapse
|
40
|
Greay TL, Gofton AW, Paparini A, Ryan UM, Oskam CL, Irwin PJ. Recent insights into the tick microbiome gained through next-generation sequencing. Parasit Vectors 2018; 11:12. [PMID: 29301588 PMCID: PMC5755153 DOI: 10.1186/s13071-017-2550-5] [Citation(s) in RCA: 118] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Accepted: 11/21/2017] [Indexed: 02/06/2023] Open
Abstract
The tick microbiome comprises communities of microorganisms, including viruses, bacteria and eukaryotes, and is being elucidated through modern molecular techniques. The advent of next-generation sequencing (NGS) technologies has enabled the genes and genomes within these microbial communities to be explored in a rapid and cost-effective manner. The advantages of using NGS to investigate microbiomes surpass the traditional non-molecular methods that are limited in their sensitivity, and conventional molecular approaches that are limited in their scalability. In recent years the number of studies using NGS to investigate the microbial diversity and composition of ticks has expanded. Here, we provide a review of NGS strategies for tick microbiome studies and discuss the recent findings from tick NGS investigations, including the bacterial diversity and composition, influential factors, and implications of the tick microbiome.
Collapse
Affiliation(s)
- Telleasha L Greay
- Vector and Waterborne Pathogens Research Group, School of Veterinary and Life Sciences, Murdoch University, Perth, WA, Australia.
| | - Alexander W Gofton
- Vector and Waterborne Pathogens Research Group, School of Veterinary and Life Sciences, Murdoch University, Perth, WA, Australia
| | - Andrea Paparini
- Vector and Waterborne Pathogens Research Group, School of Veterinary and Life Sciences, Murdoch University, Perth, WA, Australia
| | - Una M Ryan
- Vector and Waterborne Pathogens Research Group, School of Veterinary and Life Sciences, Murdoch University, Perth, WA, Australia
| | - Charlotte L Oskam
- Vector and Waterborne Pathogens Research Group, School of Veterinary and Life Sciences, Murdoch University, Perth, WA, Australia
| | - Peter J Irwin
- Vector and Waterborne Pathogens Research Group, School of Veterinary and Life Sciences, Murdoch University, Perth, WA, Australia
| |
Collapse
|
41
|
Free A, McDonald MA, Pagaling E. Diversity-Function Relationships in Natural, Applied, and Engineered Microbial Ecosystems. ADVANCES IN APPLIED MICROBIOLOGY 2018; 105:131-189. [PMID: 30342721 DOI: 10.1016/bs.aambs.2018.07.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The connection between ecosystem function and taxonomic diversity has been of interest and relevance to macroecologists for decades. After many years of lagging behind due to the difficulty of assigning both taxonomy and function to poorly distinguishable microscopic cells, microbial ecology now has access to a suite of powerful molecular tools which allow its practitioners to generate data relating to diversity and function of a microbial community on an unprecedented scale. Instead, the problem facing today's microbial ecologists is coupling the ease of generation of these datasets with the formulation and testing of workable hypotheses relating the diversity and function of environmental, host-associated, and engineered microbial communities. Here, we review the current state of knowledge regarding the links between taxonomic alpha- and beta-diversity and ecosystem function, comparing our knowledge in this area to that obtained by macroecologists who use more traditional techniques. We consider the methodologies that can be applied to study these properties and how successful they are at linking function to diversity, using examples from the study of model microbial ecosystems, methanogenic bioreactors (anaerobic digesters), and host-associated microbiota. Finally, we assess ways in which our newly acquired understanding might be used to manipulate diversity in ecosystems of interest in order to improve function for the benefit of us or the environment in general through the provision of ecosystem services.
Collapse
Affiliation(s)
- Andrew Free
- School of Biological Sciences, The University of Edinburgh, Edinburgh, United Kingdom
| | - Michael A McDonald
- School of Biological Sciences, The University of Edinburgh, Edinburgh, United Kingdom
| | - Eulyn Pagaling
- The James Hutton Institute, Craigiebuckler, Aberdeen, United Kingdom
| |
Collapse
|
42
|
Panetta JL, Šíma R, Calvani NED, Hajdušek O, Chandra S, Panuccio J, Šlapeta J. Reptile-associated Borrelia species in the goanna tick (Bothriocroton undatum) from Sydney, Australia. Parasit Vectors 2017; 10:616. [PMID: 29262840 PMCID: PMC5738880 DOI: 10.1186/s13071-017-2579-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Accepted: 12/06/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Knowledge on the capacity of Australian ticks to carry Borrelia species is currently limited or missing. To evaluate the potential of ticks to carry bacterial pathogens and their DNA, it is imperative to have a robust workflow that maximises recovery of bacterial DNA within ticks in order to enable accurate identification. By exploiting the bilateral anatomical symmetry of ticks, we were able to directly compare two DNA extraction methods for 16S rRNA gene diversity profiling and pathogen detection. We aimed to assess which combination of DNA extraction and 16S rRNA hypervariable region enables identification of the greatest bacterial diversity, whilst minimising bias, and providing the greatest capacity for the identification of Borrelia spp. RESULTS We collected Australian endemic ticks (Bothriocroton undatum), isolated DNA from equal tick halves using two commercial DNA extraction methods and sequenced samples using V1-V3 and V3-V4 16S rRNA gene diversity profiling assays. Two distinct Borrelia spp. operational taxonomic units (OTUs) were detected using the V1-V3 16S rRNA hypervariable region and matching Borrelia spp. sequences were obtained using a conventional nested-PCR. The tick 16S rRNA gene diversity profile was dominated by Rickettsia spp. (98-99%), while the remaining OTUs belonged to Proteobacteria (51-81%), Actinobacteria (6-30%) and Firmicutes (2-7%). Multiple comparisons tests demonstrated biases in each of the DNA extraction kits towards different bacterial taxa. CONCLUSIONS Two distinct Borrelia species belonging to the reptile-associated Borrelia group were identified. Our results show that the method of DNA extraction can promote bias in the final microbiota identified. We determined an optimal DNA extraction method and 16S rRNA gene diversity profile assay that maximises detection of Borrelia species.
Collapse
Affiliation(s)
- Jessica L. Panetta
- Sydney School of Veterinary Science, Faculty of Science, University of Sydney, Sydney, NSW 2006 Australia
| | - Radek Šíma
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 37005 České Budějovice, Czech Republic
| | - Nichola E. D. Calvani
- Sydney School of Veterinary Science, Faculty of Science, University of Sydney, Sydney, NSW 2006 Australia
| | - Ondřej Hajdušek
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 37005 České Budějovice, Czech Republic
| | - Shona Chandra
- Sydney School of Veterinary Science, Faculty of Science, University of Sydney, Sydney, NSW 2006 Australia
| | - Jessica Panuccio
- Sydney School of Veterinary Science, Faculty of Science, University of Sydney, Sydney, NSW 2006 Australia
| | - Jan Šlapeta
- Sydney School of Veterinary Science, Faculty of Science, University of Sydney, Sydney, NSW 2006 Australia
| |
Collapse
|
43
|
Neves ALA, Li F, Ghoshal B, McAllister T, Guan LL. Enhancing the Resolution of Rumen Microbial Classification from Metatranscriptomic Data Using Kraken and Mothur. Front Microbiol 2017; 8:2445. [PMID: 29270165 PMCID: PMC5725470 DOI: 10.3389/fmicb.2017.02445] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Accepted: 11/24/2017] [Indexed: 12/23/2022] Open
Abstract
The advent of next generation sequencing and bioinformatics tools have greatly advanced our knowledge about the phylogenetic diversity and ecological role of microbes inhabiting the mammalian gut. However, there is a lack of information on the evaluation of these computational tools in the context of the rumen microbiome as these programs have mostly been benchmarked on real or simulated datasets generated from human studies. In this study, we compared the outcomes of two methods, Kraken (mRNA based) and a pipeline developed in-house based on Mothur (16S rRNA based), to assess the taxonomic profiles (bacteria and archaea) of rumen microbial communities using total RNA sequencing of rumen fluid collected from 12 cattle with differing feed conversion ratios (FCR). Both approaches revealed a similar phyla distribution of the most abundant taxa, with Bacteroidetes, Firmicutes, and Proteobacteria accounting for approximately 80% of total bacterial abundance. For bacterial taxa, although 69 genera were commonly detected by both methods, an additional 159 genera were exclusively identified by Kraken. Kraken detected 423 species, while Mothur was not able to assign bacterial sequences to the species level. For archaea, both methods generated similar results only for the abundance of Methanomassiliicoccaceae (previously referred as RCC), which comprised more than 65% of the total archaeal families. Taxon R4-41B was exclusively identified by Mothur in the rumen of feed efficient bulls, whereas Kraken uniquely identified Methanococcaceae in inefficient bulls. Although Kraken enhanced the microbial classification at the species level, identification of bacteria or archaea in the rumen is limited due to a lack of reference genomes for the rumen microbiome. The findings from this study suggest that the development of the combined pipelines using Mothur and Kraken is needed for a more inclusive and representative classification of microbiomes.
Collapse
Affiliation(s)
- Andre L A Neves
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Fuyong Li
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Bibaswan Ghoshal
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Tim McAllister
- Lethbridge Research Centre, Agriculture and Agri-Food Canada, Lethbridge, AB, Canada
| | - Le L Guan
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
44
|
Hugerth LW, Andersson AF. Analysing Microbial Community Composition through Amplicon Sequencing: From Sampling to Hypothesis Testing. Front Microbiol 2017; 8:1561. [PMID: 28928718 PMCID: PMC5591341 DOI: 10.3389/fmicb.2017.01561] [Citation(s) in RCA: 169] [Impact Index Per Article: 21.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2017] [Accepted: 08/02/2017] [Indexed: 12/20/2022] Open
Abstract
Microbial ecology as a scientific field is fundamentally driven by technological advance. The past decade's revolution in DNA sequencing cost and throughput has made it possible for most research groups to map microbial community composition in environments of interest. However, the computational and statistical methodology required to analyse this kind of data is often not part of the biologist training. In this review, we give a historical perspective on the use of sequencing data in microbial ecology and restate the current need for this method; but also highlight the major caveats with standard practices for handling these data, from sample collection and library preparation to statistical analysis. Further, we outline the main new analytical tools that have been developed in the past few years to bypass these caveats, as well as highlight the major requirements of common statistical practices and the extent to which they are applicable to microbial data. Besides delving into the meaning of select alpha- and beta-diversity measures, we give special consideration to techniques for finding the main drivers of community dissimilarity and for interaction network construction. While every project design has specific needs, this review should serve as a starting point for considering what options are available.
Collapse
Affiliation(s)
- Luisa W Hugerth
- Department of Molecular, Tumour and Cell Biology, Centre for Translational Microbiome Research, Karolinska InstitutetSolna, Sweden.,Division of Gene Technology, Science for Life Laboratory, School of Biotechnology, KTH Royal Institute of TechnologySolna, Sweden
| | - Anders F Andersson
- Division of Gene Technology, Science for Life Laboratory, School of Biotechnology, KTH Royal Institute of TechnologySolna, Sweden
| |
Collapse
|
45
|
Sperling JL, Silva-Brandão K, Brandão M, Lloyd V, Dang S, Davis C, Sperling F, Magor K. Comparison of bacterial 16S rRNA variable regions for microbiome surveys of ticks. Ticks Tick Borne Dis 2017; 8:453-461. [DOI: 10.1016/j.ttbdis.2017.02.002] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2016] [Revised: 01/30/2017] [Accepted: 02/01/2017] [Indexed: 01/23/2023]
|