1
|
Closing the gap: Oxford Nanopore Technologies R10 sequencing allows comparable results to Illumina sequencing for SNP-based outbreak investigation of bacterial pathogens. J Clin Microbiol 2024; 62:e0157623. [PMID: 38441926 PMCID: PMC11077942 DOI: 10.1128/jcm.01576-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 02/09/2024] [Indexed: 03/08/2024] Open
Abstract
Whole-genome sequencing has become the method of choice for bacterial outbreak investigation, with most clinical and public health laboratories currently routinely using short-read Illumina sequencing. Recently, long-read Oxford Nanopore Technologies (ONT) sequencing has gained prominence and may offer advantages over short-read sequencing, particularly with the recent introduction of the R10 chemistry, which promises much lower error rates than the R9 chemistry. However, limited information is available on its performance for bacterial single-nucleotide polymorphism (SNP)-based outbreak investigation. We present an open-source workflow, Prokaryotic Awesome variant Calling Utility (PACU) (https://github.com/BioinformaticsPlatformWIV-ISP/PACU), for constructing SNP phylogenies using Illumina and/or ONT R9/R10 sequencing data. The workflow was evaluated using outbreak data sets of Shiga toxin-producing Escherichia coli and Listeria monocytogenes by comparing ONT R9 and R10 with Illumina data. The performance of each sequencing technology was evaluated not only separately but also by integrating samples sequenced by different technologies/chemistries into the same phylogenomic analysis. Additionally, the minimum sequencing time required to obtain accurate phylogenetic results using nanopore sequencing was evaluated. PACU allowed accurate identification of outbreak clusters for both species using all technologies/chemistries, but ONT R9 results deviated slightly more from the Illumina results. ONT R10 results showed trends very similar to Illumina, and we found that integrating data sets sequenced by either Illumina or ONT R10 for different isolates into the same analysis produced stable and highly accurate phylogenomic results. The resulting phylogenies for these two outbreaks stabilized after ~20 hours of sequencing for ONT R9 and ~8 hours for ONT R10. This study provides a proof of concept for using ONT R10, either in isolation or in combination with Illumina, for rapid and accurate bacterial SNP-based outbreak investigation.
Collapse
|
2
|
Comparison of 6 DNA extraction methods for isolation of high yield of high molecular weight DNA suitable for shotgun metagenomics Nanopore sequencing to detect bacteria. BMC Genomics 2023; 24:438. [PMID: 37537550 PMCID: PMC10401787 DOI: 10.1186/s12864-023-09537-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 07/27/2023] [Indexed: 08/05/2023] Open
Abstract
BACKGROUND Oxford Nanopore Technologies (ONT) offers an accessible platform for long-read sequencing, which improves the reconstruction of genomes and helps to resolve complex genomic contexts, especially in the case of metagenome analysis. To take the best advantage of long-read sequencing, DNA extraction methods must be able to isolate pure high molecular weight (HMW) DNA from complex metagenomics samples, without introducing any bias. New methods released on the market, and protocols developed at the research level, were specifically designed for this application and need to be assessed. RESULTS In this study, with different bacterial cocktail mixes, analyzed as pure or spiked in a synthetic fecal matrix, we evaluated the performances of 6 DNA extraction methods using various cells lysis and purification techniques, from quick and easy, to more time-consuming and gentle protocols, including a portable method for on-site application. In addition to the comparison of the quality, quantity and purity of the extracted DNA, the performance obtained when doing Nanopore sequencing on a MinION flow cell was also tested. From the obtained results, the Quick-DNA HMW MagBead Kit (Zymo Research) was selected as producing the best yield of pure HMW DNA. Furthermore, this kit allowed an accurate detection, by Nanopore sequencing, of almost all the bacterial species present in a complex mock community. CONCLUSION Amongst the 6 tested methods, the Quick-DNA HMW MagBead Kit (Zymo Research) was considered as the most suitable for Nanopore sequencing and would be recommended for bacterial metagenomics studies using this technology.
Collapse
|
3
|
CamPype: an open-source workflow for automated bacterial whole-genome sequencing analysis focused on Campylobacter. BMC Bioinformatics 2023; 24:291. [PMID: 37474912 PMCID: PMC10357626 DOI: 10.1186/s12859-023-05414-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Accepted: 07/14/2023] [Indexed: 07/22/2023] Open
Abstract
BACKGROUND The rapid expansion of Whole-Genome Sequencing has revolutionized the fields of clinical and food microbiology. However, its implementation as a routine laboratory technique remains challenging due to the growth of data at a faster rate than can be effectively analyzed and critical gaps in bioinformatics knowledge. RESULTS To address both issues, CamPype was developed as a new bioinformatics workflow for the genomics analysis of sequencing data of bacteria, especially Campylobacter, which is the main cause of gastroenteritis worldwide making a negative impact on the economy of the public health systems. CamPype allows fully customization of stages to run and tools to use, including read quality control filtering, read contamination, reads extension and assembly, bacterial typing, genome annotation, searching for antibiotic resistance genes, virulence genes and plasmids, pangenome construction and identification of nucleotide variants. All results are processed and resumed in an interactive HTML report for best data visualization and interpretation. CONCLUSIONS The minimal user intervention of CamPype makes of this workflow an attractive resource for microbiology laboratories with no expertise in bioinformatics as a first line method for bacterial typing and epidemiological analyses, that would help to reduce the costs of disease outbreaks, or for comparative genomic analyses. CamPype is publicly available at https://github.com/JoseBarbero/CamPype .
Collapse
|
4
|
Transforming Shiga toxin-producing Escherichia coli surveillance through whole genome sequencing in food safety practices. Front Microbiol 2023; 14:1204630. [PMID: 37520372 PMCID: PMC10381951 DOI: 10.3389/fmicb.2023.1204630] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 06/22/2023] [Indexed: 08/01/2023] Open
Abstract
Introduction Shiga toxin-producing Escherichia coli (STEC) is a gastrointestinal pathogen causing foodborne outbreaks. Whole Genome Sequencing (WGS) in STEC surveillance holds promise in outbreak prevention and confinement, in broadening STEC epidemiology and in contributing to risk assessment and source attribution. However, despite international recommendations, WGS is often restricted to assist outbreak investigation and is not yet fully implemented in food safety surveillance across all European countries, in contrast to for example in the United States. Methods In this study, WGS was retrospectively applied to isolates collected within the context of Belgian food safety surveillance and combined with data from clinical isolates to evaluate its benefits. A cross-sector WGS-based collection of 754 strains from 1998 to 2020 was analyzed. Results We confirmed that WGS in food safety surveillance allows accurate detection of genomic relationships between human cases and strains isolated from food samples, including those dispersed over time and geographical locations. Identifying these links can reveal new insights into outbreaks and direct epidemiological investigations to facilitate outbreak management. Complete WGS-based isolate characterization enabled expanding epidemiological insights related to circulating serotypes, virulence genes and antimicrobial resistance across different reservoirs. Moreover, associations between virulence genes and severe disease were determined by incorporating human metadata into the data analysis. Gaps in the surveillance system were identified and suggestions for optimization related to sample centralization, harmonizing isolation methods, and expanding sampling strategies were formulated. Discussion This study contributes to developing a representative WGS-based collection of circulating STEC strains and by illustrating its benefits, it aims to incite policymakers to support WGS uptake in food safety surveillance.
Collapse
|
5
|
Retrospective surveillance of viable Bacillus cereus group contaminations in commercial food and feed vitamin B 2 products sold on the Belgian market using whole-genome sequencing. Front Microbiol 2023; 14:1173594. [PMID: 37415815 PMCID: PMC10321352 DOI: 10.3389/fmicb.2023.1173594] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 06/01/2023] [Indexed: 07/08/2023] Open
Abstract
Bacillus cereus is a spore-forming bacterium that occurs as a contaminant in food and feed, occasionally resulting in food poisoning through the production of various toxins. In this study, we retrospectively characterized viable B. cereus sensu lato (s.l.) isolates originating from commercial vitamin B2 feed and food additives collected between 2016 and 2022 by the Belgian Federal Agency for the Safety of the Food Chain from products sold on the Belgian market. In total, 75 collected product samples were cultured on a general medium and, in case of bacterial growth, two isolates per product sample were collected and characterized using whole-genome sequencing (WGS) and subsequently characterized in terms of sequence type (ST), virulence gene profile, antimicrobial resistance (AMR) gene profile, plasmid content, and phylogenomic relationships. Viable B. cereus was identified in 18 of the 75 (24%) tested products, resulting in 36 WGS datasets, which were classified into eleven different STs, with ST165 (n = 10) and ST32 (n = 8) being the most common. All isolates carried multiple genes encoding virulence factors, including cytotoxin K-2 (52.78%) and cereulide (22.22%). Most isolates were predicted to be resistant to beta-lactam antibiotics (100%) and fosfomycin (88.89%), and a subset was predicted to be resistant to streptothricin (30.56%). Phylogenomic analysis revealed that some isolates obtained from different products were closely related or even identical indicating a likely common origin, whereas for some products the two isolates obtained did not show any close relationship to each other or other isolates found in other products. This study reveals that potentially pathogenic and drug-resistant B. cereus s.l. can be present in food and feed vitamin B2 additives that are commercially available, and that more research is warranted to assess whether their presence in these types of products poses a threat to consumers.
Collapse
|
6
|
Bacterial community development and diversity during the first year of production in a new salmon processing plant. Food Microbiol 2023; 109:104138. [DOI: 10.1016/j.fm.2022.104138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 08/26/2022] [Accepted: 09/06/2022] [Indexed: 11/26/2022]
|
7
|
Establishing a national reference laboratory for antimicrobial resistance using a whole-genome sequencing framework: Nigeria's experience. MICROBIOLOGY (READING, ENGLAND) 2022; 168. [PMID: 35980376 DOI: 10.1099/mic.0.001208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Whole-genome sequencing (WGS) is finding important applications in the surveillance of antimicrobial resistance (AMR), providing the most granular data and broadening the scope of niches and locations that can be surveilled. A common but often overlooked application of WGS is to replace or augment reference laboratory services for AMR surveillance. WGS has supplanted traditional strain subtyping in many comprehensive reference laboratories and is now the gold standard for rapidly ruling isolates into or out of suspected outbreak clusters. These and other properties give WGS the potential to serve in AMR reference functioning where a reference laboratory did not hitherto exist. In this perspective, we describe how we have employed a WGS approach, and an academic-public health system collaboration, to provide AMR reference laboratory services in Nigeria, as a model for leapfrogging to national AMR surveillance.
Collapse
|
8
|
The Notable Achievements and the Prospects of Bacterial Pathogen Genomics. Microorganisms 2022; 10:microorganisms10051040. [PMID: 35630482 PMCID: PMC9148168 DOI: 10.3390/microorganisms10051040] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 05/04/2022] [Accepted: 05/16/2022] [Indexed: 02/04/2023] Open
Abstract
Throughout the entirety of human history, bacterial pathogens have played an important role and even shaped the fate of civilizations. The application of genomics within the last 27 years has radically changed the way we understand the biology and evolution of these pathogens. In this review, we discuss how the short- (Illumina) and long-read (PacBio, Oxford Nanopore) sequencing technologies have shaped the discipline of bacterial pathogen genomics, in terms of fundamental research (i.e., evolution of pathogenicity), forensics, food safety, and routine clinical microbiology. We have mined and discuss some of the most prominent data/bioinformatics resources such as NCBI pathogens, PATRIC, and Pathogenwatch. Based on this mining, we present some of the most popular sequencing technologies, hybrid approaches, assemblers, and annotation pipelines. A small number of bacterial pathogens are of very high importance, and we also present the wealth of the genomic data for these species (i.e., which ones they are, the number of antimicrobial resistance genes per genome, the number of virulence factors). Finally, we discuss how this discipline will probably be transformed in the near future, especially by transitioning into metagenome-assembled genomes (MAGs), thanks to long-read sequencing.
Collapse
|
9
|
PulseNet International Survey on the Implementation of Whole Genome Sequencing in Low and Middle-Income Countries for Foodborne Disease Surveillance. Foodborne Pathog Dis 2022; 19:332-340. [PMID: 35325576 PMCID: PMC10863729 DOI: 10.1089/fpd.2021.0110] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
PulseNet International (PNI) is a global network of 88 countries who work together through their regional and national public health laboratories to track foodborne disease around the world. The vision of PNI is to implement globally standardized surveillance using whole genome sequencing (WGS) for real-time identification and subtyping of foodborne pathogens to strengthen preparedness and response and lower the burden of disease. Several countries in North America and Europe have experienced significant benefits in disease mitigation after implementing WGS. To broaden the routine use of WGS around the world, challenges and barriers must be overcome. We conducted this study to determine the challenges and barriers countries are encountering in their attempts to implement WGS and to identify how PNI can provide support to improve and become a better integrated system overall. A survey was designed with a set of qualitative questions to capture the status, challenges, barriers, and successes of countries in the implementation of WGS and was administered to laboratories in Africa, Asia-Pacific, Latin America and the Caribbean, and Middle East. One-third of respondents do not use WGS, and only 8% reported using WGS for routine, real-time surveillance. The main barriers for implementation of WGS were lack of funding, gaps in expertise, and training, especially for data analysis and interpretation. Features of an ideal system to facilitate implementation and global surveillance were identified as an all-in-one software that is free, accessible, standardized and validated. This survey highlights the minimal use of WGS for foodborne disease surveillance outside the United States, Canada, and Europe to date. Although funding remains a major barrier to WGS-based surveillance, critical gaps in expertise and availability of tools must be overcome. Opportunities to seek sustainable funding, provide training, and identify solutions for a globally standardized surveillance platform will accelerate implementation of WGS worldwide.
Collapse
|
10
|
Rapid Identification and Source Tracing of a Salmonella Typhimurium Outbreak in China by Metagenomic and Whole-Genome Sequencing. Foodborne Pathog Dis 2022; 19:259-265. [PMID: 35420907 DOI: 10.1089/fpd.2021.0072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Salmonella spp. are among the most prevalent foodborne pathogens. Rapid identification of etiologic agents during foodborne outbreaks is of great importance. In this study, we report a traceback investigation of a Salmonella outbreak in China. Metagenomic sequencing of suspected food samples was performed on MinION and MiSeq platforms. Real-time nanopore sequencing analysis identified reads belonging to the Enterobacteriaceae family. MiSeq sequencing identified 63 reads specifically mapped to Salmonella. Conventional methods including quantitative-PCR and culture-based isolation confirmed as Salmonella enterica serovar Typhimurium. The foodborne outbreak of Salmonella Typhimurium was further recognized by whole-genome sequencing and pulsed-field gel electrophoresis analysis. Our study demonstrates the ability of metagenomic sequencing to rapidly identify enteric pathogens directly from food samples. These results highlight the capacity of metagenomic sequencing to deliver actionable information rapidly and to expedite the tracing and identification of etiologic agents during foodborne outbreaks.
Collapse
|
11
|
Whole Genome Sequencing Provides an Added Value to the Investigation of Staphylococcal Food Poisoning Outbreaks. Front Microbiol 2021; 12:750278. [PMID: 34795649 PMCID: PMC8593433 DOI: 10.3389/fmicb.2021.750278] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 10/04/2021] [Indexed: 12/13/2022] Open
Abstract
Through staphylococcal enterotoxin (SE) production, Staphylococcus aureus is a common cause of food poisoning. Detection of staphylococcal food poisoning (SFP) is mostly performed using immunoassays, which, however, only detect five of 27 SEs described to date. Polymerase chain reactions are, therefore, frequently used in complement to identify a bigger arsenal of SE at the gene level (se) but are labor-intensive. Complete se profiling of isolates from different sources, i.e., food and human cases, is, however, important to provide an indication of their potential link within foodborne outbreak investigation. In addition to complete se gene profiling, relatedness between isolates is determined with more certainty using pulsed-field gel electrophoresis, Staphylococcus protein A gene typing and other methods, but these are shown to lack resolution. We evaluated how whole genome sequencing (WGS) can offer a solution to these shortcomings. By WGS analysis of a selection of S. aureus isolates, including some belonging to a confirmed foodborne outbreak, its added value as the ultimate multiplexing method was demonstrated. In contrast to PCR-based se gene detection for which primers are sometimes shown to be non-specific, WGS enabled complete se gene profiling with high performance, provided that a database containing reference sequences for all se genes was constructed and employed. The custom compiled database and applied parameters were made publicly available in an online user-friendly interface. As an all-in-one approach with high resolution, WGS additionally allowed inferring correct isolate relationships. The different DNA extraction kits that were tested affected neither se gene profiling nor relatedness determination, which is interesting for data sharing during SFP outbreak investigation. Although confirming the production of enterotoxins remains important for SFP investigation, we delivered a proof-of-concept that WGS is a valid alternative and/or complementary tool for outbreak investigation.
Collapse
|
12
|
Towards Real-Time and Affordable Strain-Level Metagenomics-Based Foodborne Outbreak Investigations Using Oxford Nanopore Sequencing Technologies. Front Microbiol 2021; 12:738284. [PMID: 34803953 PMCID: PMC8602914 DOI: 10.3389/fmicb.2021.738284] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 10/13/2021] [Indexed: 11/18/2022] Open
Abstract
The current routine laboratory practices to investigate food samples in case of foodborne outbreaks still rely on attempts to isolate the pathogen in order to characterize it. We present in this study a proof of concept using Shiga toxin-producing Escherichia coli spiked food samples for a strain-level metagenomics foodborne outbreak investigation method using the MinION and Flongle flow cells from Oxford Nanopore Technologies, and we compared this to Illumina short-read-based metagenomics. After 12 h of MinION sequencing, strain-level characterization could be achieved, linking the food containing a pathogen to the related human isolate of the affected patient, by means of a single-nucleotide polymorphism (SNP)-based phylogeny. The inferred strain harbored the same virulence genes as the spiked isolate and could be serotyped. This was achieved by applying a bioinformatics method on the long reads using reference-based classification. The same result could be obtained after 24-h sequencing on the more recent lower output Flongle flow cell, on an extract treated with eukaryotic host DNA removal. Moreover, an alternative approach based on in silico DNA walking allowed to obtain rapid confirmation of the presence of a putative pathogen in the food sample. The DNA fragment harboring characteristic virulence genes could be matched to the E. coli genus after sequencing only 1 h with the MinION, 1 h with the Flongle if using a host DNA removal extraction, or 5 h with the Flongle with a classical DNA extraction. This paves the way towards the use of metagenomics as a rapid, simple, one-step method for foodborne pathogen detection and for fast outbreak investigation that can be implemented in routine laboratories on samples prepared with the current standard practices.
Collapse
|
13
|
Evaluation of WGS performance for bacterial pathogen characterization with the Illumina technology optimized for time-critical situations. Microb Genom 2021; 7:000699. [PMID: 34739368 PMCID: PMC8743554 DOI: 10.1099/mgen.0.000699] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 09/30/2021] [Indexed: 12/29/2022] Open
Abstract
Whole genome sequencing (WGS) has become the reference standard for bacterial outbreak investigation and pathogen typing, providing a resolution unattainable with conventional molecular methods. Data generated with Illumina sequencers can however only be analysed after the sequencing run has finished, thereby losing valuable time during emergency situations. We evaluated both the effect of decreasing overall run time, and also a protocol to transfer and convert intermediary files generated by Illumina sequencers enabling real-time data analysis for multiple samples part of the same ongoing sequencing run, as soon as the forward reads have been sequenced. To facilitate implementation for laboratories operating under strict quality systems, extensive validation of several bioinformatics assays (16S rRNA species confirmation, gene detection against virulence factor and antimicrobial resistance databases, SNP-based antimicrobial resistance detection, serotype determination, and core genome multilocus sequence typing) for three bacterial pathogens (Mycobacterium tuberculosis , Neisseria meningitidis , and Shiga-toxin producing Escherichia coli ) was performed by evaluating performance in function of the two most critical sequencing parameters, i.e. read length and coverage. For the majority of evaluated bioinformatics assays, actionable results could be obtained between 14 and 22 h of sequencing, decreasing the overall sequencing-to-results time by more than half. This study aids in reducing the turn-around time of WGS analysis by facilitating a faster response in time-critical scenarios and provides recommendations for time-optimized WGS with respect to required read length and coverage to achieve a minimum level of performance for the considered bioinformatics assay(s), which can also be used to maximize the cost-effectiveness of routine surveillance sequencing when response time is not essential.
Collapse
|
14
|
ON-rep-seq as a rapid and cost-effective alternative to whole-genome sequencing for species-level identification and strain-level discrimination of Listeria monocytogenes contamination in a salmon processing plant. Microbiologyopen 2021; 10:e1246. [PMID: 34964295 PMCID: PMC8591450 DOI: 10.1002/mbo3.1246] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 10/19/2021] [Indexed: 12/28/2022] Open
Abstract
Identification, source tracking, and surveillance of food pathogens are crucial factors for the food-producing industry. Over the last decade, the techniques used for this have moved from conventional enrichment methods, through species-specific detection by PCR to sequencing-based methods, whole-genome sequencing (WGS) being the ultimate method. However, using WGS requires the right infrastructure, high computational power, and bioinformatics expertise. Therefore, there is a need for faster, more cost-effective, and more user-friendly methods. A newly developed method, ON-rep-seq, combines the classical rep-PCR method with nanopore sequencing, resulting in a highly discriminating set of sequences that can be used for species identification and also strain discrimination. This study is essentially a real industry case from a salmon processing plant. Twenty Listeria monocytogenes isolates were analyzed both by ON-rep-seq and WGS to identify and differentiate putative L. monocytogenes from a routine sampling of processing equipment and products, and finally, compare the strain-level discriminatory power of ON-rep-seq to different analyzing levels delivered from the WGS data. The analyses revealed that among the isolates tested there were three different strains. The isolates of the most frequently detected strain (n = 15) were all detected in the problematic area in the processing plant. The strain level discrimination done by ON-rep-seq was in full accordance with the interpretation of WGS data. Our findings also demonstrate that ON-rep-seq may serve as a primary screening method alternative to WGS for identification and strain-level differentiation for surveillance of potential pathogens in a food-producing environment.
Collapse
|
15
|
Characterization of Genetically Modified Microorganisms Using Short- and Long-Read Whole-Genome Sequencing Reveals Contaminations of Related Origin in Multiple Commercial Food Enzyme Products. Foods 2021; 10:2637. [PMID: 34828918 PMCID: PMC8624754 DOI: 10.3390/foods10112637] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 10/22/2021] [Accepted: 10/28/2021] [Indexed: 12/02/2022] Open
Abstract
Despite their presence being unauthorized on the European market, contaminations with genetically modified (GM) microorganisms have repeatedly been reported in diverse commercial microbial fermentation produce types. Several of these contaminations are related to a GM Bacillus velezensis used to synthesize a food enzyme protease, for which genomic characterization remains currently incomplete, and it is unknown whether these contaminations have a common origin. In this study, GM B. velezensis isolates from multiple food enzyme products were characterized by short- and long-read whole-genome sequencing (WGS), demonstrating that they harbor a free recombinant pUB110-derived plasmid carrying antimicrobial resistance genes. Additionally, single-nucleotide polymorphism (SNP) and whole-genome based comparative analyses showed that the isolates likely originate from the same parental GM strain. This study highlights the added value of a hybrid WGS approach for accurate genomic characterization of GMM (e.g., genomic location of the transgenic construct), and of SNP-based phylogenomic analysis for source-tracking of GMM.
Collapse
|
16
|
Global population structure and genotyping framework for genomic surveillance of the major dysentery pathogen, Shigella sonnei. Nat Commun 2021; 12:2684. [PMID: 33976138 PMCID: PMC8113504 DOI: 10.1038/s41467-021-22700-4] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 03/23/2021] [Indexed: 01/20/2023] Open
Abstract
Shigella sonnei is the most common agent of shigellosis in high-income countries, and causes a significant disease burden in low- and middle-income countries. Antimicrobial resistance is increasingly common in all settings. Whole genome sequencing (WGS) is increasingly utilised for S. sonnei outbreak investigation and surveillance, but comparison of data between studies and labs is challenging. Here, we present a genomic framework and genotyping scheme for S. sonnei to efficiently identify genotype and resistance determinants from WGS data. The scheme is implemented in the software package Mykrobe and tested on thousands of genomes. Applying this approach to analyse >4,000 S. sonnei isolates sequenced in public health labs in three countries identified several common genotypes associated with increased rates of ciprofloxacin resistance and azithromycin resistance, confirming intercontinental spread of highly-resistant S. sonnei clones and demonstrating the genomic framework can facilitate monitoring the spread of resistant clones, including those that have recently emerged, at local and global scales.
Collapse
|
17
|
Validation strategy of a bioinformatics whole genome sequencing workflow for Shiga toxin-producing Escherichia coli using a reference collection extensively characterized with conventional methods. Microb Genom 2021; 7:mgen000531. [PMID: 33656437 PMCID: PMC8190621 DOI: 10.1099/mgen.0.000531] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 01/25/2021] [Indexed: 12/13/2022] Open
Abstract
Whole genome sequencing (WGS) enables complete characterization of bacterial pathogenic isolates at single nucleotide resolution, making it the ultimate tool for routine surveillance and outbreak investigation. The lack of standardization, and the variation regarding bioinformatics workflows and parameters, however, complicates interoperability among (inter)national laboratories. We present a validation strategy applied to a bioinformatics workflow for Illumina data that performs complete characterization of Shiga toxin-producing Escherichia coli (STEC) isolates including antimicrobial resistance prediction, virulence gene detection, serotype prediction, plasmid replicon detection and sequence typing. The workflow supports three commonly used bioinformatics approaches for the detection of genes and alleles: alignment with blast+, kmer-based read mapping with KMA, and direct read mapping with SRST2. A collection of 131 STEC isolates collected from food and human sources, extensively characterized with conventional molecular methods, was used as a validation dataset. Using a validation strategy specifically adopted to WGS, we demonstrated high performance with repeatability, reproducibility, accuracy, precision, sensitivity and specificity above 95 % for the majority of all assays. The WGS workflow is publicly available as a 'push-button' pipeline at https://galaxy.sciensano.be. Our validation strategy and accompanying reference dataset consisting of both conventional and WGS data can be used for characterizing the performance of various bioinformatics workflows and assays, facilitating interoperability between laboratories with different WGS and bioinformatics set-ups.
Collapse
|
18
|
Impact of DNA extraction on whole genome sequencing analysis for characterization and relatedness of Shiga toxin-producing Escherichia coli isolates. Sci Rep 2020; 10:14649. [PMID: 32887913 PMCID: PMC7474065 DOI: 10.1038/s41598-020-71207-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Accepted: 08/11/2020] [Indexed: 01/28/2023] Open
Abstract
Whole genome sequencing (WGS) has proven to be the ultimate tool for bacterial isolate characterization and relatedness determination. However, standardized and harmonized workflows, e.g. for DNA extraction, are required to ensure robust and exchangeable WGS data. Data sharing between (inter)national laboratories is essential to support foodborne pathogen control, including outbreak investigation. This study evaluated eight commercial DNA preparation kits for their potential influence on: (i) DNA quality for Nextera XT library preparation; (ii) MiSeq sequencing (data quality, read mapping against plasmid and chromosome references); and (iii) WGS data analysis, i.e. isolate characterization (serotyping, virulence and antimicrobial resistance genotyping) and phylogenetic relatedness (core genome multilocus sequence typing and single nucleotide polymorphism analysis). Shiga toxin-producing Escherichia coli (STEC) was selected as a case study. Overall, data quality and inferred phylogenetic relationships between isolates were not affected by the DNA extraction kit choice, irrespective of the presence of confounding factors such as EDTA in DNA solution buffers. Nevertheless, completeness of STEC characterization was, although not substantially, influenced by the plasmid extraction performance of the kits, especially when using Nextera XT library preparation. This study contributes to addressing the WGS challenges of standardizing protocols to support data portability and to enable full exploitation of its potential.
Collapse
|
19
|
A Practical Method to Implement Strain-Level Metagenomics-Based Foodborne Outbreak Investigation and Source Tracking in Routine. Microorganisms 2020; 8:E1191. [PMID: 32764329 PMCID: PMC7463776 DOI: 10.3390/microorganisms8081191] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 07/31/2020] [Accepted: 08/01/2020] [Indexed: 12/13/2022] Open
Abstract
The management of a foodborne outbreak depends on the rapid and accurate identification of the responsible food source. Conventional methods based on isolation of the pathogen from the food matrix and target-specific real-time polymerase chain reactions (qPCRs) are used in routine. In recent years, the use of whole genome sequencing (WGS) of bacterial isolates has proven its value to collect relevant information for strain characterization as well as tracing the origin of the contamination by linking the food isolate with the patient's isolate with high resolution. However, the isolation of a bacterial pathogen from food matrices is often time-consuming and not always successful. Therefore, we aimed to improve outbreak investigation by developing a method that can be implemented in reference laboratories to characterize the pathogen in the food vehicle without its prior isolation and link it back to human cases. We tested and validated a shotgun metagenomics approach by spiking food pathogens in specific food matrices using the Shiga toxin-producing Escherichia coli (STEC) as a case study. Different DNA extraction kits and enrichment procedures were investigated to obtain the most practical workflow. We demonstrated the feasibility of shotgun metagenomics to obtain the same information as in ISO/TS 13136:2012 and WGS of the isolate in parallel by inferring the genome of the contaminant and characterizing it in a shorter timeframe. This was achieved in food samples containing different E. coli strains, including a combination of different STEC strains. For the first time, we also managed to link individual strains from a food product to isolates from human cases, demonstrating the power of shotgun metagenomics for rapid outbreak investigation and source tracking.
Collapse
|