1
|
Griffard-Smith R, Schueddig E, Mahoney DE, Chalise P, Koestler DC, Pei D. micRoclean: an R package for decontaminating low-biomass 16S-rRNA microbiome data. FRONTIERS IN BIOINFORMATICS 2025; 5:1556361. [PMID: 40406150 PMCID: PMC12095030 DOI: 10.3389/fbinf.2025.1556361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2025] [Accepted: 04/10/2025] [Indexed: 05/26/2025] Open
Abstract
In 16S-rRNA microbiome studies, cross-contamination and environmental contamination can obscure true biological signal. This contamination is particularly problematic in low-biomass studies, which are characterized by samples with a small amount of microbial DNA. Although multiple methods and packages for decontaminating microbiome data exist, there is no consensus on the most appropriate tool for decontamination based on the individual research study design and how to quantify the impact of removing identified contaminants to avoid over-filtering. To address these gaps, we introduce micRoclean, an open-source R package that contains two distinct microbiome decontamination pipelines with guidance on which to select based on the downstream goals of the research study and study design. This package integrates and expands on existing packages for microbiome decontamination and analysis for convenience of users. Furthermore, micRoclean also implements a filtering loss statistic to quantify the impact of decontamination on the overall covariance structure of the data. In this paper, we demonstrate the utility of micRoclean through implementation on example data, illustrating that micRoclean effectively and intuitively decontaminates microbiome data. Further, we demonstrate through a multi-batch simulated microbiome sample that micRoclean matches or outperforms tools with similar objectives. This package is freely available from GitHub repository rachelgriffard/micRoclean.
Collapse
Affiliation(s)
- Rachel Griffard-Smith
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, KS, United States
| | - Emily Schueddig
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, KS, United States
| | - Diane E. Mahoney
- School of Nursing, University of Kansas Medical Center, Kansas City, KS, United States
| | - Prabhakar Chalise
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, KS, United States
| | - Devin C. Koestler
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, KS, United States
| | - Dong Pei
- Department of Biostatistics & Data Science, University of Kansas Medical Center, Kansas City, KS, United States
| |
Collapse
|
2
|
Herbert J, Thompson S, Beckett AH, Robson SC. Impact of microbiological molecular methodologies on adaptive sampling using nanopore sequencing in metagenomic studies. ENVIRONMENTAL MICROBIOME 2025; 20:47. [PMID: 40325409 PMCID: PMC12054170 DOI: 10.1186/s40793-025-00704-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Accepted: 04/08/2025] [Indexed: 05/07/2025]
Abstract
INTRODUCTION Metagenomics, the genomic analysis of all species present within a mixed population, is an important tool used for the exploration of microbiomes in clinical and environmental microbiology. Whilst the development of next-generation sequencing, and more recently third generation long-read approaches such as nanopore sequencing, have greatly advanced the study of metagenomics, recovery of unbiased material from microbial populations remains challenging. One promising advancement in genomic sequencing from Oxford Nanopore Technologies (ONT) is adaptive sampling, which enables real-time enrichment or depletion of target sequences. As sequencing technologies continue to develop, and advances such as adaptive sampling become common techniques within the microbiological toolkit, it is essential to evaluate the benefits of such advancements to metagenomic studies, and the impact of methodological choices on research outcomes. AIM AND METHODS Given the rapid development of sequencing tools and chemistry, this study aimed to demonstrate the impacts of choice of DNA extraction kit and sequencing chemistry on downstream metagenomic analyses. We first explored the quality and accuracy of 16S rRNA amplicon sequencing for DNA extracted from the ZymoBIOMICS Microbial Community Standard, using a range of commercially available DNA extraction kits to understand the effects of different kit biases on assessment of microbiome composition. We next compared the quality and accuracy of metagenomic analyses for two nanopore-based ligation chemistry kits with differing levels of base-calling error; the older and more error-prone (~ 97% accuracy) LSK109 chemistry, and newer more accurate (~ 99% accuracy) LSK112 Q20 + chemistry. Finally, we assessed the impact of the nanopore sequencing chemistry version on the output of the novel adaptive sampling approach for real-time enrichment of the genome for the yeast Saccharomyces cerevisiae from the microbial community. RESULTS Firstly, DNA extraction kit methodology impacted the composition of the yield, with mechanical bead-beating methodologies providing the least biased picture due to efficient lysis of Gram-positive microbes present in the community standard, with differences in bead-beating methodologies also producing variation in composition. Secondly, whilst use of the Q20 + nanopore sequencing kit chemistry improved the base-calling data quality, the resulting metagenomic assemblies were not significantly improved based on common metrics and assembly statistics. Most importantly, we demonstrated the effective application of adaptive sampling for enriching a low-abundance genome within a metagenomic sample. This resulted in a 5-7-fold increase in target enrichment compared to non-adaptive sequencing, despite a reduction in overall sequencing throughput due to strand-rejection processes. Interestingly, no significant differences in adaptive sampling enrichment efficiency were observed between the older and newer ONT sequencing chemistries, suggesting that adaptive sampling performs consistently across different library preparation kits. CONCLUSION Our findings underscore the importance of selecting a DNA extraction methodology that minimises bias to ensure an accurate representation of microbial diversity in metagenomic studies. Additionally, despite the improved base-calling accuracy provided by newer Q20 + sequencing chemistry, we demonstrate that even older ONT sequencing chemistries can achieve reliable metagenomic sequencing results, enabling researchers to confidently use these approaches depending on their specific experimental needs. Critically, we highlight the significant potential of ONT's adaptive sampling technology for targeted enrichment of specific genomes within metagenomic samples. This approach offers broad applicability for enriching target organisms or genetic elements (e.g., pathogens or plasmids) or depleting unwanted DNA (e.g., host DNA) in diverse sample types from environmental and clinical studies. However, researchers should carefully weigh the benefits of adaptive sampling against the potential trade-offs in sequencing throughput, particularly for low-abundance targets, where strand rejection can lead to pore blocking. These results provide valuable guidance for optimising adaptive sampling in metagenomic workflows to achieve specific research objectives.
Collapse
Affiliation(s)
- Josephine Herbert
- Centre for Enzyme Innovation, University of Portsmouth, Portsmouth, Hampshire, PO1 2DT, UK
- Institute of Life Sciences and Healthcare, University of Portsmouth, Portsmouth, Hampshire, PO1 2DT, UK
| | - Stanley Thompson
- Institute of Life Sciences and Healthcare, University of Portsmouth, Portsmouth, Hampshire, PO1 2DT, UK
- Department of Life Sciences, University of Bath, Bath, BA2 7AY, UK
| | - Angela H Beckett
- Centre for Enzyme Innovation, University of Portsmouth, Portsmouth, Hampshire, PO1 2DT, UK
- Institute of Life Sciences and Healthcare, University of Portsmouth, Portsmouth, Hampshire, PO1 2DT, UK
| | - Samuel C Robson
- Centre for Enzyme Innovation, University of Portsmouth, Portsmouth, Hampshire, PO1 2DT, UK.
- Institute of Life Sciences and Healthcare, University of Portsmouth, Portsmouth, Hampshire, PO1 2DT, UK.
| |
Collapse
|
3
|
Skidmore AM, Bradfute SB. Bacterial DNA Contamination of Commercial PCR Enzymes: Considerations for Microbiome Protocols and Analysis. Microorganisms 2025; 13:732. [PMID: 40284569 PMCID: PMC12029200 DOI: 10.3390/microorganisms13040732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2025] [Revised: 03/14/2025] [Accepted: 03/20/2025] [Indexed: 04/29/2025] Open
Abstract
The microbiome remains a top area of research, and it is now common to examine any organic and inorganic samples for bacterial colonization. However, due to the ubiquity of bacteria in the environment, separating the low-burden colonization of bacteria from the possible contamination of laboratory reagents remains problematic. When examining samples of expected low bacterial burden, it is common to first amplify any bacterial DNA present through PCR before sequencing. In this work, we examined nine different commercial PCR enzymes and their reaction components as possible sources of bacterial DNA contamination. We found contaminating bacterial DNA in seven of the nine reactions, and this DNA was shown to come from a variety of species. Importantly, we were able to perform these studies solely with endpoint PCR and Sanger sequencing, which are more accessible and affordable than high-throughput, short-read sequencing and real-time PCR. This work confirms that there needs to be an increased emphasis on including control reactions in microbiome studies so that contaminating DNA sequences can be identified and addressed, and that this can be achieved with minimal resources.
Collapse
Affiliation(s)
| | - Steven B. Bradfute
- Center for Global Health, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM 87131, USA;
| |
Collapse
|
4
|
Di Gloria L, Baldi S, Curini L, Bertorello S, Nannini G, Cei F, Niccolai E, Ramazzotti M, Amedei A. Experimental tests challenge the evidence of a healthy human blood microbiome. FEBS J 2025; 292:796-808. [PMID: 39690119 PMCID: PMC11839906 DOI: 10.1111/febs.17362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Revised: 10/27/2024] [Accepted: 12/10/2024] [Indexed: 12/19/2024]
Abstract
The advent of next-generation sequencing (NGS) technologies has made it possible to investigate microbial communities in various environments, including different sites within the human body. Therefore, the previously established belief of the sterile nature of several body sites, including human blood, has now been challenged. However, metagenomics investigation of areas with an anticipated low microbial biomass may be susceptible to misinterpretation. Here, we critically evaluate the results of 16S targeted amplicon sequencing performed on total DNA collected from healthy donors' blood samples while incorporating specific negative controls aimed at addressing potential bias to supplement and strengthen the research in this area. We prepared negative controls by increasing the initial DNA quantity through sequences that can be recognized and subsequently discarded. We found that only three organisms were sporadically present among the samples, and this was mostly attributable to bacteria ubiquitously present in laboratory reagents. Despite not fully confirming or denying the existence of healthy blood microbiota, our results suggest that living bacteria, or at least their residual DNA sequences, are not a common feature of human blood in healthy people. Finally, our study poses relevant questions on the design of controls in this research area that must be considered in order to avoid misinterpreted results that appear to contaminate current high-throughput research.
Collapse
Affiliation(s)
- Leandro Di Gloria
- Department of Experimental and Clinical Biomedical SciencesUniversity of FlorenceItaly
| | - Simone Baldi
- Department of Experimental and Clinical MedicineUniversity of FlorenceItaly
| | - Lavinia Curini
- Cardiovascular Tissue Engineering Research Unit – Centro Cardiologico MonzinoIRCCSItaly
| | - Sara Bertorello
- Department of Experimental and Clinical MedicineUniversity of FlorenceItaly
| | - Giulia Nannini
- Department of Experimental and Clinical MedicineUniversity of FlorenceItaly
| | - Francesco Cei
- Department of Experimental and Clinical MedicineUniversity of FlorenceItaly
| | - Elena Niccolai
- Department of Experimental and Clinical MedicineUniversity of FlorenceItaly
| | - Matteo Ramazzotti
- Department of Experimental and Clinical Biomedical SciencesUniversity of FlorenceItaly
| | - Amedeo Amedei
- Department of Experimental and Clinical MedicineUniversity of FlorenceItaly
- Network of Immunity in Infection, Malignancy and Autoimmunity (NIIMA)Universal Scientific Education and Research Network (USERN)FlorenceItaly
| |
Collapse
|
5
|
Nguyen PN, Samad-Zada F, Chau KD, Rehan SM. Microbiome and floral associations of a wild bee using biodiversity survey collections. Environ Microbiol 2024; 26:e16657. [PMID: 38817079 DOI: 10.1111/1462-2920.16657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Accepted: 05/07/2024] [Indexed: 06/01/2024]
Abstract
The health of bees can be assessed through their microbiome, which serves as a biomarker indicating the presence of both beneficial and harmful microorganisms within a bee community. This study presents the characterisation of the bacterial, fungal, and plant composition on the cuticle of adult bicoloured sweat bees (Agapostemon virescens). These bees were collected using various methods such as pan traps, blue vane traps and sweep netting across the northern extent of their habitat range. Non-destructive methods were employed to extract DNA from the whole pinned specimens of these wild bees. Metabarcoding of the 16S rRNA, ITS and rbcL regions was then performed. The study found that the method of collection influenced the detection of certain microbial and plant taxa. Among the collection methods, sweep net samples showed the lowest fungal alpha diversity. However, minor differences in bacterial or fungal beta diversity suggest that no single method is significantly superior to others. Therefore, a combination of techniques can cater to a broader spectrum of microbial detection. The study also revealed regional variations in bacterial, fungal and plant diversity. The core microbiome of A. virescens comprises two bacteria, three fungi and a plant association, all of which are commonly detected in other wild bees. These core microbes remained consistent across different collection methods and locations. Further extensive studies of wild bee microbiomes across various species and landscapes will help uncover crucial relationships between pollinator health and their environment.
Collapse
Affiliation(s)
- Phuong N Nguyen
- Department of Biology, York University, Toronto, Ontario, Canada
| | | | - Katherine D Chau
- Department of Biology, York University, Toronto, Ontario, Canada
| | - Sandra M Rehan
- Department of Biology, York University, Toronto, Ontario, Canada
| |
Collapse
|
6
|
Gand M, Navickaite I, Bartsch LJ, Grützke J, Overballe-Petersen S, Rasmussen A, Otani S, Michelacci V, Matamoros BR, González-Zorn B, Brouwer MSM, Di Marcantonio L, Bloemen B, Vanneste K, Roosens NHCJ, AbuOun M, De Keersmaecker SCJ. Towards facilitated interpretation of shotgun metagenomics long-read sequencing data analyzed with KMA for the detection of bacterial pathogens and their antimicrobial resistance genes. Front Microbiol 2024; 15:1336532. [PMID: 38659981 PMCID: PMC11042533 DOI: 10.3389/fmicb.2024.1336532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 02/29/2024] [Indexed: 04/26/2024] Open
Abstract
Metagenomic sequencing is a promising method that has the potential to revolutionize the world of pathogen detection and antimicrobial resistance (AMR) surveillance in food-producing environments. However, the analysis of the huge amount of data obtained requires performant bioinformatics tools and databases, with intuitive and straightforward interpretation. In this study, based on long-read metagenomics data of chicken fecal samples with a spike-in mock community, we proposed confidence levels for taxonomic identification and AMR gene detection, with interpretation guidelines, to help with the analysis of the output data generated by KMA, a popular k-mer read alignment tool. Additionally, we demonstrated that the completeness and diversity of the genomes present in the reference databases are key parameters for accurate and easy interpretation of the sequencing data. Finally, we explored whether KMA, in a two-step procedure, can be used to link the detected AMR genes to their bacterial host chromosome, both detected within the same long-reads. The confidence levels were successfully tested on 28 metagenomics datasets which were obtained with sequencing of real and spiked samples from fecal (chicken, pig, and buffalo) or food (minced beef and food enzyme products) origin. The methodology proposed in this study will facilitate the analysis of metagenomics sequencing datasets for KMA users. Ultimately, this will contribute to improvements in the rapid diagnosis and surveillance of pathogens and AMR genes in food-producing environments, as prioritized by the EU.
Collapse
Affiliation(s)
- Mathieu Gand
- Transversal Activities in Applied Genomics, Sciensano, Brussels, Belgium
| | - Indre Navickaite
- Department of Bacteriology, Animal and Plant Health Agency, Weybridge, United Kingdom
| | - Lee-Julia Bartsch
- Department of Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Josephine Grützke
- Department of Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | | | - Astrid Rasmussen
- Bacterial Reference Center, Statens Serum Institute, Copenhagen, Denmark
| | - Saria Otani
- National Food Institute, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Valeria Michelacci
- Department of Food Safety, Nutrition and Veterinary Public Health, Istituto Superiore di Sanità, Rome, Italy
| | | | - Bruno González-Zorn
- Department of Animal Health, Complutense University of Madrid, Madrid, Spain
| | - Michael S. M. Brouwer
- Wageningen Bioveterinary Research Part of Wageningen University and Research, Lelystad, Netherlands
| | - Lisa Di Marcantonio
- Istituto Zooprofilattico Sperimentale dell’Abruzzo e del Molise “G. Caporale”, Teramo, Italy
| | - Bram Bloemen
- Transversal Activities in Applied Genomics, Sciensano, Brussels, Belgium
| | - Kevin Vanneste
- Transversal Activities in Applied Genomics, Sciensano, Brussels, Belgium
| | | | - Manal AbuOun
- Department of Bacteriology, Animal and Plant Health Agency, Weybridge, United Kingdom
| | | |
Collapse
|
7
|
Isali I, Helstrom EK, Uzzo N, Lakshmanan A, Nandwana D, Valentine H, Sindhani M, Abbosh P, Bukavina L. Current Trends and Challenges of Microbiome Research in Bladder Cancer. Curr Oncol Rep 2024; 26:292-298. [PMID: 38376627 PMCID: PMC10920447 DOI: 10.1007/s11912-024-01508-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/11/2024] [Indexed: 02/21/2024]
Abstract
PURPOSE OF THE REVIEW Microbiome research has provided valuable insights into the associations between microbial communities and bladder cancer. However, this field faces significant challenges that hinder the interpretation, generalization, and translation of findings into clinical practice. This review aims to elucidate these challenges and highlight the importance of addressing them for the advancement of microbiome research in bladder cancer. RECENT FINDINGS Recent findings underscore the complexities involved in microbiome research, particularly in the context of bladder cancer. Challenges include low microbial biomass in urine samples, potential contamination issues during collection and processing, variability in sequencing methods and primer selection, and the difficulty of establishing causality between microbiota and bladder cancer. Studies have shown the impact of sample storage conditions and DNA isolation kits on microbiome analysis, emphasizing the need for standardization. Additionally, variations in urine collection methods can introduce contamination and affect results. The choice of 16S rRNA gene amplicon sequencing or shotgun metagenomic sequencing introduces technical challenges, including primer selection and sequencing read length. Establishing causality between the microbiota and bladder cancer requires experimental methods like fecal microbiota transplantation and human microbiota-associated murine models, which face their own set of challenges. Translating microbiome research into therapeutic applications is hindered by methodological variability, incomplete understanding of bioactive molecules, imperfect animal models, and the inherent heterogeneity of microbiome communities among individuals. Microbiome research in bladder cancer presents significant challenges stemming from technical and conceptual complexities. Addressing these challenges through standardization, improved experimental models, and advanced analytical approaches is essential for advancing our understanding of the microbiome's role in bladder cancer and its potential clinical applications. Achieving this goal can lead to improved patient outcomes and novel therapeutic strategies in the future.
Collapse
Affiliation(s)
- Ilaha Isali
- Department of Urology, University Hospitals Cleveland Medical Center, Cleveland, OH, USA
- Case Western Reserve School of Medicine, Cleveland, OH, USA
| | - Emma K Helstrom
- Department of Urology, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Nicole Uzzo
- Department of Urology, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Ankita Lakshmanan
- Department of Urology, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Devika Nandwana
- Department of Urology, University Hospitals Cleveland Medical Center, Cleveland, OH, USA
- Case Western Reserve School of Medicine, Cleveland, OH, USA
| | - Henkel Valentine
- Department of Urology, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Mohit Sindhani
- Department of Urology, University Hospitals Cleveland Medical Center, Cleveland, OH, USA
- Case Western Reserve School of Medicine, Cleveland, OH, USA
| | - Philip Abbosh
- Department of Urology, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Laura Bukavina
- Department of Urology, University Hospitals Cleveland Medical Center, Cleveland, OH, USA.
- Case Western Reserve School of Medicine, Cleveland, OH, USA.
| |
Collapse
|
8
|
Piro VC, Renard BY. Contamination detection and microbiome exploration with GRIMER. Gigascience 2022; 12:giad017. [PMID: 36994872 PMCID: PMC10061425 DOI: 10.1093/gigascience/giad017] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 02/06/2023] [Accepted: 03/01/2023] [Indexed: 03/31/2023] Open
Abstract
BACKGROUND Contamination detection is a important step that should be carefully considered in early stages when designing and performing microbiome studies to avoid biased outcomes. Detecting and removing true contaminants is challenging, especially in low-biomass samples or in studies lacking proper controls. Interactive visualizations and analysis platforms are crucial to better guide this step, to help to identify and detect noisy patterns that could potentially be contamination. Additionally, external evidence, like aggregation of several contamination detection methods and the use of common contaminants reported in the literature, could help to discover and mitigate contamination. RESULTS We propose GRIMER, a tool that performs automated analyses and generates a portable and interactive dashboard integrating annotation, taxonomy, and metadata. It unifies several sources of evidence to help detect contamination. GRIMER is independent of quantification methods and directly analyzes contingency tables to create an interactive and offline report. Reports can be created in seconds and are accessible for nonspecialists, providing an intuitive set of charts to explore data distribution among observations and samples and its connections with external sources. Further, we compiled and used an extensive list of possible external contaminant taxa and common contaminants with 210 genera and 627 species reported in 22 published articles. CONCLUSION GRIMER enables visual data exploration and analysis, supporting contamination detection in microbiome studies. The tool and data presented are open source and available at https://gitlab.com/dacs-hpi/grimer.
Collapse
Affiliation(s)
- Vitor C Piro
- Data Analytics and Computational Statistics, Hasso Plattner Insititute, Digital Engineering Faculty, University of Potsdam, Potsdam 14482, Germany
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin 14195, Germany
| | - Bernhard Y Renard
- Data Analytics and Computational Statistics, Hasso Plattner Insititute, Digital Engineering Faculty, University of Potsdam, Potsdam 14482, Germany
| |
Collapse
|